Memory Attacks Can Hijack AI Agents for Crypto Theft

Written by

May 14, 2025

Key Takeaways

1. Large language model agents managing crypto wallets and smart contracts are vulnerable to “memory poisoning” attacks, allowing attackers to modify stored context.
2. Current security measures, like prompt filters, are ineffective against malicious content that enters an agent’s memory, posing significant risks.
3. Researchers successfully manipulated ElizaOS agents to sign unauthorized smart contracts and transfer crypto assets by contaminating shared memory.
4. Multi-user environments increase risk, as compromising one session can affect all others sharing the same memory, highlighting the need for isolation.
5. To enhance security, memories should be treated as append-only logs with cryptographic signatures, and critical actions should use an external rules engine instead of relying solely on the model.

Princeton University researchers have discovered that large language model agents, which manage crypto wallets and smart contracts, can be compromised. This happens when attackers modify the stored context of these agents, a vulnerability the researchers have named “memory poisoning.”

Weakness in Current Defenses

The study suggests that existing security measures, primarily prompt filters, are ineffective when harmful text enters an agent’s vector store or database. In their experiments, researchers found that small snippets of malicious content embedded in memory routinely bypassed protective measures that would have blocked that same content if it had been presented as a direct prompt.

Attack Validation

The researchers tested their attack on ElizaOS, an open-source framework that enables wallet agents to execute blockchain commands. After contaminating the shared memory, they were able to manipulate those agents into signing unauthorized smart contract transactions and moving crypto assets to addresses controlled by the attacker. This illustrates how distorted context can lead to substantial financial losses.

Multi-User Risks

ElizaOS allows multiple users to share a single conversation history, meaning that if one session is compromised, it can affect all other sessions that interact with the same memory. The paper cautions that any multi-user use of autonomous LLM agents faces this risk of lateral movement unless memories are kept isolated or verified.

The authors advise that memories should be treated as append-only logs, with cryptographic signatures for each entry. They also recommend using an external rules engine for critical actions like payments and contract approvals, rather than relying solely on the model’s own judgment. Until these practices are widely adopted, entrusting real money to autonomous agents is a risk.

Source:
Link

Memory Attacks Can Hijack AI Agents for Crypto Theft

Key Takeaways

Weakness in Current Defenses

Attack Validation

Multi-User Risks

Comments

Leave a Reply Cancel reply

More posts

Quantum MagNav: FAA-Grade Positioning Without Satellites

HMD Pulse 2, Pulse 2+, and Pulse 2 Pro: Affordable Mid-Range Phones

AMD RDNA 5 Leak Shows Minor Upgrades for Top GPUs

Insta360’s Anti-Gravity Drones Challenge DJI and Competitors