AI Privacy Security

Are AI Agents Compromised By Design?

Longtime Slashdot reader Gadi Evron writes: Bruce Schneier and Barath Raghavan say agentic AI is already broken at the core. In their IEEE Security & Privacy essay, they argue that AI agents run on untrusted data, use unverified tools, and make decisions in hostile environments. Every part of the OODA loop (observe, orient, decide, act) is open to attack. Prompt injection, data poisoning, and tool misuse corrupt the system from the inside. The model's strength, treating all input as equal, also makes it exploitable. They call this the AI security trilemma: fast, smart, or secure. Pick two. Integrity isn't a feature you bolt on later. It has to be built in from the start. "Computer security has evolved over the decades," the authors wrote. "We addressed availability despite failures through replication and decentralization. We addressed confidentiality despite breaches using authenticated encryption. Now we need to address integrity despite corruption."

"Trustworthy AI agents require integrity because we can't build reliable systems on unreliable foundations. The question isn't whether we can add integrity to AI but whether the architecture permits integrity at all."
This discussion has been archived. No new comments can be posted.


Comments Filter:
  • Next question.

  • Somebody predicting an AI product category is FUBAR instead of wonderful magic productivity?

  • Then you are already behind the curve and should just liquidate your company now rather than continue operating.

  • Why does this even need to be stated? These things are grossly insecure, and that cannot be fixed. It does not get more "broken by design" than that.

  • "we can't build reliable systems on unreliable foundations" ... what? xD
    Half the purpose of the entire practice of engineering is exactly that. Making a reliable thing that you need, from unreliable things that you have.
    I am uncertain whether these people are engineers, in either the broader or the narrower sense.
    I'm not even getting into the matter of whether LLM-based tools are worth it or not.

    • We will probably end up with AI security agents looking over the shoulder of the AI agent to identify nefarious behaviour and intervene by stopping the process.
    • by stooo ( 2202012 )

      Adding more broken AI on top of a broken AI is like when you shoot yourself in the left foot, find out you can't stand up, and then shoot yourself in the right foot so you aren't off balance.

      And that's exactly what AI would do.

    • Engineering is about building FROM, not ON, unreliable things. Sand is unreliable to build a house from, but when mixed into concrete it becomes reliable. Building a house on sand isn't reliable, and no amount of engineering will fix that other than by replacing the sand.

    • Half the purpose of the entire practice of engineering is exactly that. Making a reliable thing that you need, from unreliable things that you have.

      TCP/IP being the obvious example.
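A toy version of that idea, a reliable send loop built on top of an unreliable delivery primitive. This is a sketch in the spirit of TCP's retransmission, not real TCP; the function names and the 50% loss rate are made up for illustration:

```python
# Stop-and-wait retransmission over a lossy channel (illustrative only).
import random

def lossy_send(packet, rng, loss_rate=0.5):
    """Unreliable channel: returns an ack only if the packet got through."""
    return None if rng.random() < loss_rate else ("ACK", packet["seq"])

def reliable_send(data, rng):
    """Reliable delivery built on lossy_send: retransmit until acked."""
    attempts = 0
    for seq, chunk in enumerate(data):
        while True:
            attempts += 1
            ack = lossy_send({"seq": seq, "data": chunk}, rng)
            if ack == ("ACK", seq):
                break  # this chunk is confirmed delivered; move on
    return attempts

rng = random.Random(1)
sent = reliable_send(["hello", "world"], rng)
print(f"delivered 2 chunks in {sent} attempts over a 50%-loss channel")
```

The unreliable primitive never gets fixed; reliability emerges from the loop around it, which is the engineering point the comment is making.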

  • by Mirnotoriety ( 10462951 ) on Wednesday October 15, 2025 @12:54AM (#65725802)
    > Bruce Schneier and Barath Raghavan say .. AI agents run on untrusted data, use unverified tools, and make decisions in hostile environments.

    That's the most authentic description of AI I've ever seen.
  • by ctilsie242 ( 4841247 ) on Wednesday October 15, 2025 @01:36AM (#65725828)

    I wonder if one can say the same about a web browser: pulling untrusted content, running on an unknown platform, displaying it to an unknown user, etc.

  • I dislike shopping; part of the dislike is suspicion that I am not getting the best deal, and browsing feels like a waste of time. I am more interested in utility and durability.

    What if advertised prices were guide prices and the actual price was negotiated privately between agents. Purchase agent has a brief of what to look for. Sales agent has oversight of targets for the day, stock and order levels and all active prospects. A sale at 5% profit rather than 10% is worth more than no sale, or a sale to a comp

  • Schneier and Raghavan argue that the same feedback loops that make human agents powerful—observe, orient, decide, act—also make agentic AI vulnerable when fed poisoned or adversarial data. In their words, “The adversary isn’t inside the loop by accident; it’s there by architecture.” Their point is well-taken: modern LLM agents have no privilege separation between data and control. That’s a security nightmare, and their proposed “integrity-first” rethink

  • If you give your agent untrusted data and unlimited access, then you have the same problem as any software that combines untrusted data with unlimited access. If you don't let your users inject SQL, then you also won't feed your users' text unfiltered into an LLM with database access, will you?

    You should filter the input, you should filter the LLM tool calls, you should filter the tool input. The same for the output. There are so many places where you can put the access control, so choos
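A minimal sketch of that kind of tool-call filtering. The names here (ALLOWED_TOOLS, validate_tool_call) are hypothetical, not from any real agent framework; the point is only that model-emitted tool calls get the same allowlist treatment as any other untrusted input:

```python
# Allowlist check for model-emitted tool calls (illustrative names only).
import json

ALLOWED_TOOLS = {
    "read_order": {"order_id"},     # tool name -> permitted argument names
    "search_docs": {"query"},
}

def validate_tool_call(raw: str) -> dict:
    """Parse a model-emitted tool call; reject anything off the allowlist."""
    call = json.loads(raw)
    name, args = call["name"], call.get("args", {})
    if name not in ALLOWED_TOOLS:
        raise PermissionError(f"tool {name!r} is not allowlisted")
    extra = set(args) - ALLOWED_TOOLS[name]
    if extra:
        raise PermissionError(f"unexpected arguments: {sorted(extra)}")
    return call

validate_tool_call('{"name": "read_order", "args": {"order_id": "42"}}')  # passes
try:
    validate_tool_call('{"name": "drop_table", "args": {"table": "users"}}')
except PermissionError as e:
    print(e)  # tool 'drop_table' is not allowlisted
```

The same gate can sit on the input side and on the tool output side; each checkpoint narrows what a prompt-injected model can actually do.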

    • by Tatsh ( 893946 )

      The problem is "we" want to be able to turn a blind eye for AI to do huge amounts of work or there's no point. But since LLMs are nondeterministic pretty much their design, there's just no way we can ever be sure that the output will be good without continuously checking it.

      • by allo ( 1728082 )

        It's not nondeterministic (if you don't choose nondeterministic sampling), but you don't know the outcome in advance. In the end you don't with humans either, but most humans are more reliable, or at least the humans who would get a root shell on my PC are.

        People just giving the AI full access and a simple prompt are giving up the control themselves. I like the word "centaur" model. Let the AI do (only) the legwork while you tell it the way.
