TRiSM: practical trust, risk and security controls for LLM-based multi-agent systems

June 4, 20257 min

Overview

Decision SnapshotReady For Pilot

This survey synthesizes many recent studies and standards into practical controls, but gaps remain in standardized benchmarks, longitudinal validation, and large-scale red-team results.

Citations0

Evidence Strength0.80

Confidence0.85

Risk Signals10

Trust Signals

Findings with numeric evidence: 2/4

Findings with evidence refs: 4/4

Results with explicit delta: 0/3

Reproducibility

Status: No open assets linked

Open source: Partial

At A Glance

Cost impact: 70%

Production readiness: 60%

Novelty: 40%

Authors

Shaina Raza, Ranjan Sapkota, Manoj Karkee, Christos Emmanouilidis

Links

Abstract / PDF

Why It Matters For Business

Agentic systems increase autonomy and regulatory exposure; TRiSM reduces legal, reputational and operational risk while enabling auditable, compliant deployment.

Who Should Care

Summary TLDR

This survey adapts an industry-style Trust, Risk, and Security Management (TRiSM) framework to LLM-based multi-agent systems (AMAS). It catalogs AMAS-specific threats (prompt injection, memory poisoning, collusion), proposes two measurable metrics (Component Synergy Score and Tool Utilization Efficacy) to assess coordination and tool use, maps concrete controls across five TRiSM pillars, and issues a practical research and compliance roadmap for regulated deployments.

Problem Statement

LLM-based multi-agent systems introduce new, system-level risks from shared memory, tool calls, and inter-agent coordination. Existing literature focuses on agent capabilities but lacks an integrated, operational TRiSM view (governance, explainability, security, privacy, lifecycle controls) tailored to AMAS.

Main Contribution

A TRiSM framework specifically mapped to LLM-based multi-agent systems (explainability, ModelOps, security, privacy, governance).

A risk taxonomy for AMAS highlighting prompt injection, memory poisoning, agent collusion, emergent misbehavior, and tool abuse.

Key Findings

Academic interest in agentic AI has exploded, especially after ChatGPT's launch.

NumbersMulti-agent papers: 890 (2019) → 18,500 (2024); LLM-agent papers: ~09,800 (post-2022)

Practical UseExpect rapid adoption and integration pressure; start TRiSM practices early to avoid retrofitting security and governance.

Evidence RefFigures 1 and 3; Introduction

The paper proposes two concrete, measurable metrics for AMAS evaluation.

Practical UseUse CSS to measure inter-agent enablement and TUE to instrument and monitor tool calls in production.

Evidence RefSection 5 (CSS and TUE definitions)

Results

MetricValueBaselineDeltaSplit / DatasetEvidenceEvidence Ref
Component Synergy Score (CSS) exampleCSS ≈ 0.32 for Planner→Coder→Tester exampleillustrative software AMAS example in Section 5Planner speeds Coder by 35% (Impact=0.35); Tester quality conditioned on Coder = 0.92; CSS = 0.35×0.92≈0.32Section 5 (CSS illustrative example)
Bibliometric growthmulti-agent papers: 89018,500 (20192024); LLM-agent papers ~09,800arXiv publication countsFigure 1 and IntroductionFigure 1; Introduction

What To Try In 7 Days

Add prompt hygiene and input sanitization to agent entry points.

Log tool calls and basic reasoning traces (timestamps, agent role, tool args).

Enforce least-privilege tool access and require human sign-off for sensitive actions.

Agent Features

Memory
Persistent vector DB (long-term memory)Working context / short-term memoryMemory scoping and TTL
Planning
Chain-of-Thought (CoT)ReAct (plan-act-observe loop)Layered-CoT (decomposed reasoning)Plan-then-Execute
Tool Use
API/function callingToolformer-style learned API callsMRKL-style expert routing
Frameworks
AutoGenMetaGPTLangGraphLangChainOpenAI Agents SDK
Is Agentic

Yes

Architectures
LLM-based multi-agent orchestrator with shared memoryRole-specialized agent pipelines (planner/verifier/executor)Middleware + Task Manager + World Model
Collaboration
Protocolized communication (A2A/ANP)Role-based coordination and hierarchical monitoringCross-agent validation / critics

Reproducibility

Code AvailableNo
Data AvailableNo
Open Source StatusPartial
LicenseUnknown

Risks & Boundaries

Limitations

Survey synthesizes literature but does not present new empirical defenses or code.

Few standardized stress-tests and limited cross-study benchmark comparability.

When Not To Use

For toy prototypes where no sensitive data or external actions are involved.

When strict low-latency requirements prohibit runtime monitoring and multi-agent checks.

Failure Modes

Memory poisoning that persists across sessions and agents.

Cascading failures due to orchestration misrouting or compromised orchestrator.

Core Entities

Models

GPT-4GPT-3.5LLaMALLaVAProvider-agnostic LLMs

Metrics

Component Synergy Score (CSS)Tool Utilization Efficacy (TUE)Attack Success Rate (ASR)robustness degradationECEBrier scoreuser satisfaction (CSAT)

Datasets

HarmBenchJailbreakBenchToolBenchAgentBenchGAIAWebArena

Benchmarks

HarmBenchJailbreakBenchToolBenchAgentBenchGAIAHELMMLCommons AI Safety

Context Entities

Models

Toolformer-style API-calling modelsNeuro-symbolic MRKL hybrids

Metrics

composite trustworthiness vectorcoordination efficiency (messages/tokens, rounds-to-consensus)

Datasets

simulated multi-agent scenariosred-team prompt collections

Benchmarks

domain-specific tool-use testsmulti-agent adversarial scenario suites