A systematic, practitioner-focused map of 193 multi-agent security threats and how 16 frameworks cover them

March 9, 20268 min

Overview

Decision SnapshotNeeds Validation

The paper offers a comprehensive taxonomy and a clear cross-framework comparison that practitioners can act on, but many recommendations are governance-level and require engineering follow‑through to operationalize.

Citations0

Evidence Strength0.60

Confidence0.85

Risk Signals9

Trust Signals

Findings with numeric evidence: 5/5

Findings with evidence refs: 5/5

Results with explicit delta: 0/6

Reproducibility

Status: No open assets linked

Open source: No

At A Glance

Cost impact: 70%

Production readiness: 60%

Novelty: 60%

Authors

Tam Nguyen, Moses Ndebugre, Dheeraj Arremsetty

Links

Abstract / PDF

Why It Matters For Business

Multi‑agent systems amplify security and cost risks (data leaks, tool abuse, resource exhaustion) and current frameworks leave blind spots; companies must combine frameworks and add technical controls to avoid regulatory, financial, and operational loss.

Who Should Care

Summary TLDR

This paper builds a technical knowledge base of production multi-agent AI systems, uses generative-AI-assisted threat modeling plus expert review to derive a taxonomy of 193 distinct agentic threats across nine categories, and scores 16 security/governance frameworks against every threat. No framework covers most multi-agent gaps: OWASP ASI leads at 65.3% coverage, CDAO GenAI covers development/ops well, and non-determinism and data-leakage channels are the worst‑covered domains. The work is a practical guide to picking frameworks and prioritizing defenses for real multi-agent deployments.

Problem Statement

Multi-agent AI systems (agents that share memory, delegate tools, and coordinate) create new, behavioral attack surfaces not covered well by existing AI or infrastructure frameworks. Practitioners lack a systematic threat taxonomy and cross-framework coverage data to guide secure architecture and tool choice.

Main Contribution

A 193-item taxonomy of security threats unique to production multi-agent AI systems across nine categories.

A reproducible four-phase method: deep system knowledge base, generative-AI-assisted threat modeling, per-threat survey planning, and cross-framework scoring.

Key Findings

Multi-agent threat taxonomy contains 193 distinct, agent-specific threats.

Numbers193 threat items across 9 categories

Practical UseTreat multi-agent security as a distinct domain: use per-threat mapping rather than broad AI checklists when designing defenses.

Evidence RefAbstract, Sec. I-III

Survey evaluated 16 security frameworks against every threat item.

Numbers16 frameworks scored on 193 items

Practical UseFramework choice materially changes covered risks; pick frameworks by lifecycle phase and category coverage rather than brand.

Evidence RefAbstract, Sec. VI

Results

MetricValueBaselineDeltaSplit / DatasetEvidenceEvidence Ref
Threat taxonomy size193 itemsComposed from generative-AI-assisted modeling and expert validationSec. I-III
Frameworks surveyed16 frameworksCross-framework scoring across production and governance frameworksAbstract, Sec. V-VI

What To Try In 7 Days

Inventory agent surfaces: list agents, shared memories, tool registries, and vector stores.

Run a quick gap matrix: map your controls vs the paper's nine categories and flag non-determinism and data‑leakage gaps.

Add short-term mitigations: per-agent cryptographic identity, signed tool manifests, and per-call least-privilege enforcement.

Agent Features

Memory
episodic memory (per-session history)semantic memory (knowledge bases, RAG)working memory / scratchpadKV cache and attention caches
Planning
hierarchical planning (HTN)MCTS (Monte Carlo Tree Search)self-consistency and majority votingreflection/critic loops
Tool Use
function-calling APIsweb agents and scrapersdatabase tools (SQL, vector DBs)cloud/SaaS APIsplugin ecosystems
Frameworks
LangChainAutoGenLangGraphCrewAISemantic KernelNVIDIA NeMo
Is Agentic

Yes

Architectures
hierarchical agentsplan-and-executereActswarm/multi-agent ensemblessupervisor-worker orchestration
Collaboration
inter-agent messaging (GroupChat, AgentCards)orchestrator-based delegationpeer trust and reputationshared vector stores

Optimization Features

Token Efficiency
context window tuningcompression/summarizationcache re-use
Infra Optimization
autoscaling policiesload-balancer routingedge model caching
Model Optimization
quantization (INT8/FP16)TensorRT engine builddynamic batching
System Optimization
MIG GPU partitionsmulti-instance GPU sharingcontainer-level caching
Training Optimization
LoRAfew-shot and fine-tuning pipelinescurriculum and replay buffers
Inference Optimization
dynamic batching (Triton)speculative decodingreplica routing

Reproducibility

Code AvailableNo
Data AvailableNo
Open Source StatusNo
LicenseUnknown

Risks & Boundaries

Limitations

Rapidly evolving field: coverage reflects state at publication and needs frequent updates.

Framework scoring mixes governance and technical controls; operational applicability varies by org.

When Not To Use

For simple single-agent chatbots without tool access or persistent memory—overkill.

If you need low-latency, single-model microservices where traditional infra controls suffice.

Failure Modes

Applying a single framework and assuming full coverage leaves blind spots (non-determinism, planning).

Relying only on governance checklists without runtime controls causes detection gaps during streaming and stochastic execution.

Core Entities

Models

GPT-3GPT-4NVIDIA NeMo

Metrics

coverage fraction (framework vs taxonomy)mean per-category framework scoreOWASP ASI coverage %

Benchmarks

ST-WebAgentBenchAgentBench

Context Entities

Models

ReAct-style LLMsRAG retrieversvision-language models (NeVA)Whisper

Metrics

coverage (%)mean framework score per categorynumber of threat items unaddressed

Datasets

enterprise RAG corporaevaluation benchmark datasets referenced in frameworks

Benchmarks

Pass@K-style reliability metricsCuP (Completion Under Policies) in ST-WebAgentBench