A systematic, practitioner-focused map of 193 multi-agent security threats and how 16 frameworks cover them

Overview

Decision SnapshotNeeds Validation

The paper offers a comprehensive taxonomy and a clear cross-framework comparison that practitioners can act on, but many recommendations are governance-level and require engineering follow‑through to operationalize.

Citations0

Evidence Strength0.60

Confidence0.85

Risk Signals9

Trust Signals

Findings with numeric evidence: 5/5

Findings with evidence refs: 5/5

Results with explicit delta: 0/6

Reproducibility

Status: No open assets linked

Open source: No

At A Glance

Cost impact: 70%

Production readiness: 60%

Novelty: 60%

Authors

Tam Nguyen, Moses Ndebugre, Dheeraj Arremsetty

Links

Abstract / PDF

Why It Matters For Business

Multi‑agent systems amplify security and cost risks (data leaks, tool abuse, resource exhaustion) and current frameworks leave blind spots; companies must combine frameworks and add technical controls to avoid regulatory, financial, and operational loss.

Who Should Care

CTO Product Manager ML Engineer Engineering Lead Data Scientist Founder

Summary TLDR

This paper builds a technical knowledge base of production multi-agent AI systems, uses generative-AI-assisted threat modeling plus expert review to derive a taxonomy of 193 distinct agentic threats across nine categories, and scores 16 security/governance frameworks against every threat. No framework covers most multi-agent gaps: OWASP ASI leads at 65.3% coverage, CDAO GenAI covers development/ops well, and non-determinism and data-leakage channels are the worst‑covered domains. The work is a practical guide to picking frameworks and prioritizing defenses for real multi-agent deployments.

Problem Statement

Multi-agent AI systems (agents that share memory, delegate tools, and coordinate) create new, behavioral attack surfaces not covered well by existing AI or infrastructure frameworks. Practitioners lack a systematic threat taxonomy and cross-framework coverage data to guide secure architecture and tool choice.

Main Contribution

A 193-item taxonomy of security threats unique to production multi-agent AI systems across nine categories.

A reproducible four-phase method: deep system knowledge base, generative-AI-assisted threat modeling, per-threat survey planning, and cross-framework scoring.

Key Findings

Multi-agent threat taxonomy contains 193 distinct, agent-specific threats.

Numbers193 threat items across 9 categories

Practical UseTreat multi-agent security as a distinct domain: use per-threat mapping rather than broad AI checklists when designing defenses.

Evidence RefAbstract, Sec. I-III

Survey evaluated 16 security frameworks against every threat item.

Numbers16 frameworks scored on 193 items

Practical UseFramework choice materially changes covered risks; pick frameworks by lifecycle phase and category coverage rather than brand.

Evidence RefAbstract, Sec. VI

Results

Metric	Value	Baseline	Delta	Split / Dataset	Evidence	Evidence Ref
Threat taxonomy size	193 items	—	—	—	Composed from generative-AI-assisted modeling and expert validation	Sec. I-III
Frameworks surveyed	16 frameworks	—	—	—	Cross-framework scoring across production and governance frameworks	Abstract, Sec. V-VI

What To Try In 7 Days

Inventory agent surfaces: list agents, shared memories, tool registries, and vector stores.

Run a quick gap matrix: map your controls vs the paper's nine categories and flag non-determinism and data‑leakage gaps.

Add short-term mitigations: per-agent cryptographic identity, signed tool manifests, and per-call least-privilege enforcement.

Agent Features

Memory

episodic memory (per-session history)semantic memory (knowledge bases, RAG)working memory / scratchpadKV cache and attention caches

Planning

hierarchical planning (HTN)MCTS (Monte Carlo Tree Search)self-consistency and majority votingreflection/critic loops

Tool Use

function-calling APIsweb agents and scrapersdatabase tools (SQL, vector DBs)cloud/SaaS APIsplugin ecosystems

Frameworks

LangChainAutoGenLangGraphCrewAISemantic KernelNVIDIA NeMo

Is Agentic

Yes

Architectures

hierarchical agentsplan-and-executereActswarm/multi-agent ensemblessupervisor-worker orchestration

Collaboration

inter-agent messaging (GroupChat, AgentCards)orchestrator-based delegationpeer trust and reputationshared vector stores

Optimization Features

Token Efficiency

context window tuningcompression/summarizationcache re-use

Infra Optimization

autoscaling policiesload-balancer routingedge model caching

Model Optimization

quantization (INT8/FP16)TensorRT engine builddynamic batching

System Optimization

MIG GPU partitionsmulti-instance GPU sharingcontainer-level caching

Training Optimization

LoRAfew-shot and fine-tuning pipelinescurriculum and replay buffers

Inference Optimization

dynamic batching (Triton)speculative decodingreplica routing

Reproducibility

Code AvailableNo

Data AvailableNo

Open Source StatusNo

LicenseUnknown

Risks & Boundaries

Limitations

Rapidly evolving field: coverage reflects state at publication and needs frequent updates.

Framework scoring mixes governance and technical controls; operational applicability varies by org.

When Not To Use

For simple single-agent chatbots without tool access or persistent memory—overkill.

If you need low-latency, single-model microservices where traditional infra controls suffice.

Failure Modes

Applying a single framework and assuming full coverage leaves blind spots (non-determinism, planning).

Relying only on governance checklists without runtime controls causes detection gaps during streaming and stochastic execution.

Core Entities

Models

GPT-3GPT-4NVIDIA NeMo

Metrics

coverage fraction (framework vs taxonomy)mean per-category framework scoreOWASP ASI coverage %

Benchmarks

ST-WebAgentBenchAgentBench

Context Entities

Models

ReAct-style LLMsRAG retrieversvision-language models (NeVA)Whisper

Metrics

coverage (%)mean framework score per categorynumber of threat items unaddressed

Datasets

enterprise RAG corporaevaluation benchmark datasets referenced in frameworks

Benchmarks

Pass@K-style reliability metricsCuP (Completion Under Policies) in ST-WebAgentBench

Overview

Trust Signals

Reproducibility

At A Glance

Authors

Links

Why It Matters For Business

Who Should Care

Summary TLDR

Problem Statement

Main Contribution

Key Findings

Multi-agent threat taxonomy contains 193 distinct, agent-specific threats.

Survey evaluated 16 security frameworks against every threat item.

Results

What To Try In 7 Days

Agent Features

Optimization Features

Reproducibility

Risks & Boundaries

Limitations

When Not To Use

Failure Modes

Core Entities

Models

Metrics

Benchmarks

Context Entities

Models

Metrics

Datasets

Benchmarks

You May Also Want to Read

Chemistry foundation models power structure-focused multimodal RAG inside hierarchical multi-agent workflows

Key finding

Argues that 'agentic' buzzwords mostly rebrand decades-old agent and multi-agent research

Key finding

TRiSM: practical trust, risk and security controls for LLM-based multi-agent systems

Key finding

A dynamic town simulation that tests LLM agents on doing tasks while following local cultural norms

Key finding

A process-aware, auditable multi-agent evaluator that produces more stable, human-aligned scores than a single LLM judge

Key finding