Overview
The paper synthesizes vendor docs and classic literature to produce a practical architecture; it provides detailed design patterns and checklists but no empirical benchmark or formal verification results.
Citations0
Evidence Strength0.60
Confidence0.80
Risk Signals11
Trust Signals
Findings with numeric evidence: 0/5
Findings with evidence refs: 5/5
Results with explicit delta: 0/0
Reproducibility
Status: No open assets linked
Open source: Unknown
At A Glance
Cost impact: 60%
Production readiness: 70%
Novelty: 50%
Why It Matters For Business
Agentic architectures turn LLMs into governed interfaces that can automate multi-step workflows, but they require engineering investments in governance, observability, and versioning to control risk and compliance.
Who Should Care
Summary TLDR
This survey maps how LLMs are moving from stateless prompt-response calls to goal-directed 'agentic' systems. It proposes a layered reference architecture that separates LLM cognition from control, memory, and tool execution; it catalogs multi-agent topologies and their failure modes; and it presents an enterprise hardening checklist covering governance, observability, and reproducibility. The paper synthesizes vendor materials (LangChain, Kore.ai, TrueFoundry, ZenML, Salesforce) and classical agent theory to argue that production-grade agents need typed tool contracts, detailed traces, and enforced runtime policies.
Problem Statement
Stateless prompt-response patterns fail for long tasks, external tool effects, and regulated environments. Engineers patch models with brittle scaffolding. The paper asks: what software primitives define practical agentic systems, how do single-agent loops scale to multi-agent topologies, and what governance and reliability controls are required for production?
Main Contribution
A governed reference architecture that separates LLM reasoning from control, memory, tooling, and governance.
A taxonomy of multi-agent topologies (orchestrator-worker, router-solver, hierarchical, swarm) with mapped failure modes and mitigations.
Key Findings
Production agent systems should separate cognition (LLM) from execution and enforce typed tool interfaces.
Observability via detailed traces (prompts, tool calls, model/tool versions) is essential to debug and govern agents.
What To Try In 7 Days
Add structured tracing for a single agent loop (prompts, tool calls, model/tool versions).
Define and register one typed tool with schema validation and sandboxed execution.
Enforce a simple budgeted autonomy policy (max steps, max tool calls, fallback to human approval).
Agent Features
Memory
Planning
Tool Use
Frameworks
Is Agentic
Yes
Architectures
Collaboration
Optimization Features
Token Efficiency
Infra Optimization
System Optimization
Inference Optimization
Reproducibility
Risks & Boundaries
Limitations
Analysis relies heavily on vendor ‘grey literature’ and platform descriptions rather than controlled experiments.
No empirical benchmarks or performance measurements are provided for the proposed architectures.
When Not To Use
If you need formal, provable guarantees today (paper offers architecture, not formal verification).
For hard real-time or safety-critical control where low-latency deterministic loops are mandatory.
Failure Modes
silent worker failures (missing heartbeats / no ACK)
misrouting or classifier errors sending tasks to wrong solvers

