Survey of safe interfaces, threat models, and standards for LLM-driven agents that act on blockchains

January 8, 20268 min

Overview

Decision SnapshotNeeds Validation

The survey compiles a broad, recent literature base and practical artifacts; it is immediately useful for system design but is a synthesis rather than a single validated implementation.

Citations0

Evidence Strength0.75

Confidence0.82

Risk Signals12

Trust Signals

Findings with numeric evidence: 5/5

Findings with evidence refs: 5/5

Results with explicit delta: 0/4

Reproducibility

Status: Code + data available

Open source: Partial

At A Glance

Cost impact: 65%

Production readiness: 40%

Novelty: 55%

Authors

Saad Alqithami

Links

Abstract / PDF

Why It Matters For Business

Connecting AI agents to blockchains can automate treasury, trading and governance but also creates irreversible financial and governance risk; invest in intent typing, policy gates, hardened custody, and MEV-aware execution before granting any agent signing power.

Who Should Care

Summary TLDR

This paper surveys how LLM-driven autonomous agents can safely interact with public blockchains. The authors reviewed 3,270 records and synthesized 317 relevant works, coded 85 systems in depth, and compared 20+ platforms across custody, policy, observability, and execution. They propose a five-part integration taxonomy (read-only → multi-agent workflows), a threat model focused on prompt injection, MEV and key compromise, and a practical roadmap: standardize Transaction Intent Schema (TIS) and Policy Decision Record (PDR) plus reproducible benchmarks and evaluation checklists.

Problem Statement

Connecting probabilistic, tool-using AI agents to immutable blockchains creates unique risks: irreversible financial loss, bearer-key authorization, adversarial transaction ordering (MEV), and cross-chain complexity. We need standardized, auditable interfaces and enforcement layers so agents can plan and act without exposing users and protocols to unacceptable economic or governance risk.

Main Contribution

A five-part taxonomy of agent→chain integration patterns from read-only analytics to multi-agent workflows.

A threat model tailored to agent-driven transaction pipelines, mapping attack classes across observe→verify stages.

Key Findings

Systematic literature review found 317 relevant works from 3,270 records.

Numbers317 studies selected from 3,270 records

Practical UseUse this consolidated literature as the evidence baseline when designing agent-to-chain interfaces and not rely on scattered blog posts or vendor docs.

Evidence RefSection 2.4 PRISMA-style selection (Identification->Inclusion)

85 systems were coded in depth and 20+ representative systems were analyzed across 13 dimensions.

Numbers85 coded; 20+ in capability matrix

Practical UseAdopt the capability dimensions (custody, policy, preview, simulation, observability, recovery) as a checklist when evaluating platforms.

Evidence RefSection 2.5 and Table 6

Results

MetricValueBaselineDeltaSplit / DatasetEvidenceEvidence Ref
Records screened3,270Identification and screening counts in PRISMA-style flowSection 2.4
Studies included in qualitative synthesis317Final qualitative corpus sizeSection 2.4

What To Try In 7 Days

Run a small pilot: agent generates TIS-style intents but humans sign every transaction.

Add preflight simulation to any production transaction pipeline (forked state test before sign).

Audit wallet flows: move from raw keys to session keys or smart-account modules with strict spend limits.

Agent Features

Memory
short-term context windowsretrieval-augmented state from indexers (cached world model)
Planning
outcome-first intent planningsimulation-driven plan scoringMEV-aware route selection
Tool Use
function callingRPC/indexer toolingoracle-augmented data fetch
Frameworks
MCP (Model Context Protocol)UTCP-like tool-call standardsERC-4337 smart accounts
Is Agentic

Yes

Architectures
chain-of-thought / ReAct-style plannertool-augmented LLMs (function calling)multi-agent proposer/verifier/executor
Collaboration
quorum-based approvalrole-specialized agent workflows

Optimization Features

Token Efficiency
intent summarization to reduce action complexity
Infra Optimization
bundlers and paymasters for ERC-4337MPC/TEE signing to remove keys from agent host
Model Optimization
not applicable (survey)
System Optimization
preflight simulation and canonical intent hashingpolicy checking off-chain before signing
Inference Optimization
MEV-aware routing and solver selectionprivate orderflow to reduce public mempool exposure

Reproducibility

Code AvailableYes
Data AvailableYes
Open Source StatusPartial
LicenseUnknown

Risks & Boundaries

Limitations

Focuses on public, permissionless blockchains (mainly Ethereum/EVM); private chains get limited coverage.

Emphasizes LLM-based agents; other agent paradigms (pure RL) receive limited treatment.

When Not To Use

For fully autonomous high-frequency trading without hardened MPC or TEE custody.

When target execution venues do not support account abstraction or intent-based settlement.

Failure Modes

Prompt injection steering the planner to malicious intents.

Tool/data-plane spoofing (compromised RPC or oracle) causing unsafe decisions.

Core Entities

Models

GPT-3ReActChain-of-ThoughtToolformerGorillaNexusRaven

Metrics

PIR (prompt injection resistance)MEV leakageAccuracy

Datasets

AndroidWorldAndroidLabAgentClinicMLGym

Benchmarks

AndroidWorldAndroidLabAgentClinicMLGym

Context Entities

Models

LLM agents (tool-using architectures)

Metrics

production_readinessevidence_strength

Benchmarks

proposed safety/economic evaluation suite (this paper)