A configurable multi-agent framework that adds persona trees and a skill-backed cognitive architecture to make LLM agents act more human in場

Overview

Decision SnapshotNeeds Validation

The idea is practical for prototyping social scenarios but evidence is limited to a single classroom use-case with GPT-3.5 and expert-coded annotations.

Citations5

Evidence Strength0.50

Confidence0.82

Risk Signals9

Trust Signals

Findings with numeric evidence: 4/4

Findings with evidence refs: 4/4

Results with explicit delta: 1/4

Reproducibility

Status: No open assets linked

Open source: Partial

At A Glance

Cost impact: 50%

Production readiness: 40%

Novelty: 60%

Authors

Shi Jinxin, Zhao Jiabao, Wang Yilei, Wu Xingjiao, Li Jiawen, He Liang

Links

Abstract / PDF

Why It Matters For Business

CGMI lets product teams simulate social workflows (training, UX, game NPCs, edtech) with more realistic agent behavior by adding persona trees and memory-driven planning.

Who Should Care

Product Manager ML Engineer Engineering Lead Founder Data Scientist

Summary TLDR

CGMI is a framework that combines a tree-structured persona model, a cognitive architecture (declarative/procedural/working memory + skill library), and auxiliary general/supervisory agents to simulate realistic multi-agent social scenes. The paper demonstrates CGMI on a virtual classroom built with GPT-3.5-turbo-16k. Results show teacher-led discourse proportions and that persona-driven selection produces more realistic student responses than random selection. The system is presented as a research platform—not a production product—and the authors plan to open-source it.

Problem Statement

LLM-based agents can act in roles but tend to forget role settings, produce shallow content, and lack structured memory and coordination. The paper asks how to give agents stable personalities, deeper domain-aware reasoning, and realistic multi-agent communication for social simulations.

Main Contribution

A tree-structured persona model for assigning, testing, and restoring agent traits to keep roles stable across long dialogues.

A cognitive architecture (working, declarative, procedural memories) plus a configurable skill library that uses Chain-of-Thought and Chain-of-Action to form and retrieve domain knowledge.

Key Findings

Teacher utterances dominated classroom discourse in simulated lessons.

NumbersTeacher behavior averaged 61.23% of discourse (across C1–C3).

Practical UseUse CGMI to study or prototype teacher-led scenarios because simulated teachers naturally control lesson flow, matching typical classrooms.

Evidence RefTable 1 (FIAS analysis)

Persona-aware selection yields notably different answer patterns than random choice.

NumbersWillingness agent recommended Emily 9 times vs random 3 times (others redistributed).

Practical UsePrefer persona-driven selection for realistic participation modeling—it concentrates answers in plausible students instead of spreading them arbitrarily.

Evidence RefFigure 6 (answer willingness comparison)

Results

Metric	Value	Baseline	Delta	Split / Dataset	Evidence	Evidence Ref
Teacher discourse proportion (overall)	61.23%	—	—	Three simulated lessons (C1–C3)	FIAS annotations by two experts	Table 1
Student discourse proportion (facilitated by teacher prompts)	23.53%	—	—	Three simulated lessons (C1–C3)	FIAS annotations by two experts	Table 1

What To Try In 7 Days

Build a small CGMI demo: 1 teacher + 3 students + 1 supervisor using GPT-3.5 and a simple persona tree.

Compare random vs persona-based selection for role assignment and measure engagement (counts of responses).

Add a tiny skill library (3 domain prompts) to let one agent reflect and plan across two short sessions.

Agent Features

Memory

Working memory (short-term)Declarative memory (facts)Procedural memory (skills/actions)Skill library (configurable domain knowledge)

Planning

Reflection modulePlanning moduleChain-of-Thought (CoT) for declarative memoryChain-of-Action (CoA) for procedural memory

Tool Use

supervisory agentsassistant agentsconsistency checker agent

Frameworks

CGMI

Is Agentic

Yes

Architectures

Tree-structured persona modelCognitive architecture (working/declarative/procedural memory)

Collaboration

Multi-agent coordination with supervisory arbitrationPersona-based answer selectionRole + general agent binding

Reproducibility

Code AvailableNo

Data AvailableNo

Open Source StatusPartial

LicenseUnknown

Risks & Boundaries

Limitations

Evaluation is limited to a single classroom domain and GPT-3.5-turbo-16k; generality is untested.

Quantitative claims rely on expert annotation and selected examples rather than large-scale user studies.

When Not To Use

High-stakes or safety-critical decision systems (medical, legal).

Production agents requiring rigorous provable correctness or compliance.

Failure Modes

LLM may still produce superficial or off-role outputs despite persona trees (persona forgetting).

Supervisory or assistant agents could mis-route actions and break the intended interaction flow.

A configurable multi-agent framework that adds persona trees and a skill-backed cognitive architecture to make LLM agents act more human in場

Overview

Trust Signals

Reproducibility

At A Glance

Authors

Links

Why It Matters For Business

Who Should Care

Summary TLDR

Problem Statement

Main Contribution

Key Findings

Teacher utterances dominated classroom discourse in simulated lessons.

Persona-aware selection yields notably different answer patterns than random choice.

Results

What To Try In 7 Days

Agent Features

Reproducibility

Risks & Boundaries

Limitations

When Not To Use

Failure Modes

Core Entities

Models

Metrics

Overview

Trust Signals

Reproducibility

At A Glance

Authors

Links

Why It Matters For Business

Who Should Care

Summary TLDR

Problem Statement

Main Contribution

Key Findings

Teacher utterances dominated classroom discourse in simulated lessons.

Persona-aware selection yields notably different answer patterns than random choice.

Results

What To Try In 7 Days

Agent Features

Reproducibility

Risks & Boundaries

Limitations

When Not To Use

Failure Modes

Core Entities

Models

Metrics

You May Also Want to Read

Chemistry foundation models power structure-focused multimodal RAG inside hierarchical multi-agent workflows

Key finding

Argues that 'agentic' buzzwords mostly rebrand decades-old agent and multi-agent research

Key finding

TRiSM: practical trust, risk and security controls for LLM-based multi-agent systems

Key finding

A dynamic town simulation that tests LLM agents on doing tasks while following local cultural norms

Key finding

A process-aware, auditable multi-agent evaluator that produces more stable, human-aligned scores than a single LLM judge

Key finding