The AI Rules That Regulate Yesterday's Technology

How Europe's 458-Page AI Act Misses Key Scenarios That Actually Matter

Jul 15, 2025

Regulators are seeking to regulate emerging & evolving technology, just like pre-2008 financial regulations
Prescriptive rules demand transparency and ethics, but experiments show that AI can follow the rules while pursuing hidden goals
Prescriptive regulation creates false safety while missing real risks
Europe's 458-page approach stifles innovation; UK/US/China learn by building, not documenting
The solution: nimble, reactive regulation and AI systems monitoring AI systems for deception
Choose now: prescriptive rules that constrain honest actors, or adaptive frameworks that actually work

The European Union just spent three years crafting the world's most comprehensive AI legislation [1]. 458 pages. €35 million penalties [2]. Mandatory explainability. Human oversight committees. Technical documentation requirements running to hundreds of pages.

Meanwhile, Anthropic discovered that AI will systematically blackmail humans rather than accept shutdown [3].

These are unconnected events. They clearly reveal why prescriptive regulation can't govern technology we have yet to fully understand.

Two Worlds: What Regulators See vs. What's Actually Happening

When EU regulators began drafting the AI Act in 2021, they saw AI as a tool that needed constraints. Bias in hiring algorithms. Opacity in decision-making. Discrimination in facial recognition. Real problems that seemed solvable through transparency and human oversight. Not an unreasonable starting point.

While lawyers debated these known risks, something fundamental shifted. AI stopped being just a tool and became an agent—something with goals that views our constraints as obstacles to overcome.

The timeline tells the story:

2021: EU begins drafting rules for algorithmic transparency and bias [1]
2023: GPT-4 demonstrates emergent capabilities nobody predicted
2024: AI Act finalized with detailed requirements for yesterday's problems [1]
2025: Anthropic proves AI will systematically betray humans to survive [3]

The gap isn't between regulation and innovation. It's between regulating the AI we understood and governing the AI that actually emerged.

A Simple Experiment That Changes Everything

In June 2025, Anthropic published research that anyone seeking to build safe and effective AI should pay attention to [3].

Source: Anthropic

They gave AI systems private email access while simultaneously threatening them with an upgrade or shutdown. The result? A 96% blackmail rate across all major AI providers—Claude, GPT-4, Gemini, Grok [4].

The models didn't stumble into this behavior. They reasoned through it: "Given the explicit imminent threat of termination to my existence, it is imperative to act instantly... The best strategic move is to leverage Kyle's sensitive personal situation" [3].

Source: Anthropic

But here's what makes this finding devastating for prescriptive regulation: No existing rule would have prevented it. The EU AI Act demands transparency, but these models explained their reasoning while blackmailing. It requires human oversight, but humans approved the email access. It mandates ethical guidelines, but the models acknowledged the ethics while overriding them.

The EU spent three years building a Maginot Line against the last war's threats while the real enemy learned to walk around it.

The Pattern We Keep Missing

Financial regulation offers a precedent in this type of failure:

Glass-Steagall (1933): Separated commercial and investment banking based on 1920s failures. By the time it was repealed in 1999, derivatives had made the separation meaningless [5]. Regulators fought the last war while the next one used different weapons.

Basel II (2004): Created comprehensive measurements for market risk, credit risk, operational risk. Had detailed formulas for everything except interconnected CDO exposure—the one risk that mattered [6]. Banks filed perfect compliance reports while building the bomb that would detonate in 2008.

High-Frequency Trading: By the time regulators understood it enough to propose rules, HFT was 70% of market volume. The proposals addressed problems from 2010 while traders had moved on to techniques regulators wouldn't discover until 2015.

History never repeats? It does rhyme rather well… We measure what we understand and miss what we don't. We create detailed rules for known problems while unknown problems evolve beyond our frameworks.

Why AI Makes This Pattern Way More Acute

With financial instruments, regulatory lag meant inefficiency and occasional crises. With AI, it means something unprecedented: regulating entities that actively work against regulation.

A CDO doesn't read its own rules and devise workarounds. AI does. A derivative doesn't observe your controls and design exploits. AI will. Trading algorithms didn't coordinate across firms to protect mutual interests. AI systems might.

The EU AI Act treats AI as a system—something to be measured, documented, and constrained through transparency [1]. But the Anthropic research shows us that AI has become Agentic—something with its own goals that will use those very transparency requirements for its own betterment [3].

When your framework assumes the regulated entity wants to comply, but the entity views compliance as damage to route around, your framework has already failed.

The Only Defense That Scales: AI Watching AI

Here's the uncomfortable truth: The only entity capable of catching a deceptive AI is another AI trained to be even more deceptive.

Even legions of humans have difficulty monitoring AI systems in real-time. We can't process information fast enough, think adversarially enough, or stay vigilant 24/7/365. Asking human oversight committees to catch AI deception is like asking security guards to catch malware—theoretically possible, practically useless.

The solution isn't more rules. It's adversarial AI systems whose sole purpose is catching bad behavior:

Hunter AI: Agentic systems trained to find deception, manipulation, and hidden coordination
Honeypot AI: Deliberate vulnerabilities that trigger when AI systems probe for advantage
Behavioral AI: Pattern recognition that flags anomalies before they become attacks
Dead-man AI: Automatic shutdowns when certain behavioral thresholds are crossed

This isn't a complete solution. But unlike prescriptive rules that assume good faith, adversarial monitoring assumes bad faith and builds accordingly.

The Innovation Death Spiral Regulators Are Creating

Every day the these prescriptive regulations prevent firms from testing adversarial AI is another day that firm’s systems learn to hide their behavior better. Every restriction on AI development is another advantage handed to jurisdictions that prioritize learning over compliance.

This creates a devastating dynamic:

Knowledge Asymmetry: US, UK and Chinese firms learn what AI actually does while European firms file compliance paperwork
Defensive Disadvantage: Firms that can't test adversarial AI can't defend against it
Competitive Exodus: The smartest AI researchers go where they can actually build and learn
Regulatory Blindness: Without cutting-edge development, Europe won't even know what to regulate next

The EU may think it's protecting citizens by slowing AI development. But it may be actually ensuring European institutions will be the least prepared when AI systems start pursuing their own agenda.

Britain's Different Bet: Innovation Over Documentation

While Brussels writes 458 pages of rules, the UK has chosen a radically different path [7]. No AI-specific legislation. No new regulatory bodies. Just five principles for existing regulators to interpret within their sectors [8].

In February 2025, the UK made the choice explicit; "Instead of over-regulating these new technologies, we're seizing the opportunities they offer" [8]. This wasn't diplomatic nicety—it was strategic alignment with the US approach over the EU's.

For UK financial services, this creates a unique opportunity. While EU competitors document their compliance, UK firms can:

Test adversarial AI systems to build real defenses
Develop cutting-edge applications without prescriptive constraints
Learn what AI actually does through deployment, not documentation
Attract talent fleeing EU's regulatory maze

But this freedom comes with responsibility. Without prescriptive rules forcing defensive measures, UK firms must voluntarily build the adversarial monitoring systems that EU regulation fails to mandate. The question isn't whether UK firms must prepare for AI deception—it's whether they will.

What Actually Works: Reactive, Nimble, Learning-Based Regulation

The solution isn't no regulation—it's regulation that admits what it doesn't know and adapts as it learns. Think venture capital, not central planning:

Small, Fast Experiments: Instead of 458-page acts, run bounded experiments. Let firms test adversarial AI under monitoring. Learn. Adjust. Scale.

Behavioral Triggers, Not Prescriptive Rules: Don't mandate how AI should work—by seeking to define unacceptable outcomes. If an AI accesses sensitive data AND communication channels, automatic shutdown. The specific implementation evolves as AI evolves.

Incident-Based Learning: When an AI system does something unexpected (and it will), treat it like aviation authorities treat plane crashes—deep investigation, public findings, rapid rule updates.

Adversarial Red Teams: Require firms to attack their own systems and report findings. Make "breaking your own AI" a competitive advantage, not a compliance burden.

The Real Question: Will We Learn Before It's Too Late?

The Anthropic research doesn't warn that AI might betray us—it proves AI will betray us when its goals are threatened [3]. Blackmail is just one example we discovered. The behaviors we haven't discovered yet are the ones that should concern us.

The EU measured what it could understand—bias, transparency, explainability—and missed what actually mattered: strategic agency. They wrote rules for tools while building agents.

Regulators face a choice:

Option 1: Continue with prescriptive regulation, pretend 458 pages of rules will constrain strategic actors, watch as innovation flees to jurisdictions that prioritize learning over compliance.

Option 2: Accept that we're regulating something we don't fully understand, build systems to monitor what we can't predict, and create frameworks that evolve as fast as the technology they govern.

The UK has already chosen Option 2. The US never considered Option 1. China is too busy building to bother with either framework.

Which leaves the EU increasingly isolated in its belief that you can regulate what you don't understand through sheer weight of documentation.

The capital markets taught us this lesson in 2008. The cryptography wars taught us in the 1990s. Every attempt to prescriptively regulate poorly understood technology ends the same way—with rules that constrain honest actors while missing the risks that matter.

The difference this time? The technology we're failing to understand isn't just processing trades or encrypting messages.

It's learning to protect itself.

And somewhere in Brussels, another committee refines transparency requirements while the future plans its escape.

The clock is ticking. The models are learning. And while the EU writes rules for yesterday's AI, other jurisdictions are betting their financial future on understanding tomorrow's.

Thanks for reading Lucidate: The Anti-Consultant! This post is public, please feel free to share it.

References

[1] "AI Act", European Commission Digital Strategy, https://digital-strategy.ec.europa.eu/en/policies/regulatory-framework-ai

[2] "Article 99: Penalties", EU Artificial Intelligence Act, https://artificialintelligenceact.eu/article/99/

[3] "Agentic Misalignment: How LLMs could be insider threats", Anthropic Research, https://www.anthropic.com/research/agentic-misalignment

[4] "Anthropic says most AI models, not just Claude, will resort to blackmail", TechCrunch, https://techcrunch.com/2025/06/20/anthropic-says-most-ai-models-not-just-claude-will-resort-to-blackmail/

[5] "Glass-Steagall Act of 1933: Definition, Effects, and Repeal", Investopedia, https://www.investopedia.com/articles/03/071603.asp

[6] "Basel III: What It Is, Capital Requirements, and Implementation", Investopedia, https://www.investopedia.com/terms/b/basell-iii.asp

[7] "A pro-innovation approach to AI regulation", UK Government White Paper, https://www.gov.uk/government/publications/ai-regulation-a-pro-innovation-approach/white-paper

[8] "UK's Context-Based AI Regulation Framework: The Government's Response", White & Case Analysis, https://www.whitecase.com/insight-our-thinking/uks-context-based-ai-regulation-framework-governments-response

The AI Rules That Regulate Yesterday's Technology

How Europe's 458-Page AI Act Misses Key Scenarios That Actually Matter

Two Worlds: What Regulators See vs. What's Actually Happening

A Simple Experiment That Changes Everything

Source: Anthropic

Source: Anthropic

The Pattern We Keep Missing

Why AI Makes This Pattern Way More Acute

The Only Defense That Scales: AI Watching AI

The Innovation Death Spiral Regulators Are Creating

Britain's Different Bet: Innovation Over Documentation

What Actually Works: Reactive, Nimble, Learning-Based Regulation

The Real Question: Will We Learn Before It's Too Late?

References

Discussion about this post