A configurable multi-agent framework that adds persona trees and a skill-backed cognitive architecture to make LLM agents act more human in場

August 24, 20237 min

Overview

Decision SnapshotNeeds Validation

The idea is practical for prototyping social scenarios but evidence is limited to a single classroom use-case with GPT-3.5 and expert-coded annotations.

Citations5

Evidence Strength0.50

Confidence0.82

Risk Signals9

Trust Signals

Findings with numeric evidence: 4/4

Findings with evidence refs: 4/4

Results with explicit delta: 1/4

Reproducibility

Status: No open assets linked

Open source: Partial

At A Glance

Cost impact: 50%

Production readiness: 40%

Novelty: 60%

Authors

Shi Jinxin, Zhao Jiabao, Wang Yilei, Wu Xingjiao, Li Jiawen, He Liang

Links

Abstract / PDF

Why It Matters For Business

CGMI lets product teams simulate social workflows (training, UX, game NPCs, edtech) with more realistic agent behavior by adding persona trees and memory-driven planning.

Who Should Care

Summary TLDR

CGMI is a framework that combines a tree-structured persona model, a cognitive architecture (declarative/procedural/working memory + skill library), and auxiliary general/supervisory agents to simulate realistic multi-agent social scenes. The paper demonstrates CGMI on a virtual classroom built with GPT-3.5-turbo-16k. Results show teacher-led discourse proportions and that persona-driven selection produces more realistic student responses than random selection. The system is presented as a research platform—not a production product—and the authors plan to open-source it.

Problem Statement

LLM-based agents can act in roles but tend to forget role settings, produce shallow content, and lack structured memory and coordination. The paper asks how to give agents stable personalities, deeper domain-aware reasoning, and realistic multi-agent communication for social simulations.

Main Contribution

A tree-structured persona model for assigning, testing, and restoring agent traits to keep roles stable across long dialogues.

A cognitive architecture (working, declarative, procedural memories) plus a configurable skill library that uses Chain-of-Thought and Chain-of-Action to form and retrieve domain knowledge.

Key Findings

Teacher utterances dominated classroom discourse in simulated lessons.

NumbersTeacher behavior averaged 61.23% of discourse (across C1–C3).

Practical UseUse CGMI to study or prototype teacher-led scenarios because simulated teachers naturally control lesson flow, matching typical classrooms.

Evidence RefTable 1 (FIAS analysis)

Persona-aware selection yields notably different answer patterns than random choice.

NumbersWillingness agent recommended Emily 9 times vs random 3 times (others redistributed).

Practical UsePrefer persona-driven selection for realistic participation modeling—it concentrates answers in plausible students instead of spreading them arbitrarily.

Evidence RefFigure 6 (answer willingness comparison)

Results

MetricValueBaselineDeltaSplit / DatasetEvidenceEvidence Ref
Teacher discourse proportion (overall)61.23%Three simulated lessons (C1–C3)FIAS annotations by two expertsTable 1
Student discourse proportion (facilitated by teacher prompts)23.53%Three simulated lessons (C1–C3)FIAS annotations by two expertsTable 1

What To Try In 7 Days

Build a small CGMI demo: 1 teacher + 3 students + 1 supervisor using GPT-3.5 and a simple persona tree.

Compare random vs persona-based selection for role assignment and measure engagement (counts of responses).

Add a tiny skill library (3 domain prompts) to let one agent reflect and plan across two short sessions.

Agent Features

Memory
Working memory (short-term)Declarative memory (facts)Procedural memory (skills/actions)Skill library (configurable domain knowledge)
Planning
Reflection modulePlanning moduleChain-of-Thought (CoT) for declarative memoryChain-of-Action (CoA) for procedural memory
Tool Use
supervisory agentsassistant agentsconsistency checker agent
Frameworks
CGMI
Is Agentic

Yes

Architectures
Tree-structured persona modelCognitive architecture (working/declarative/procedural memory)
Collaboration
Multi-agent coordination with supervisory arbitrationPersona-based answer selectionRole + general agent binding

Reproducibility

Code AvailableNo
Data AvailableNo
Open Source StatusPartial
LicenseUnknown

Risks & Boundaries

Limitations

Evaluation is limited to a single classroom domain and GPT-3.5-turbo-16k; generality is untested.

Quantitative claims rely on expert annotation and selected examples rather than large-scale user studies.

When Not To Use

High-stakes or safety-critical decision systems (medical, legal).

Production agents requiring rigorous provable correctness or compliance.

Failure Modes

LLM may still produce superficial or off-role outputs despite persona trees (persona forgetting).

Supervisory or assistant agents could mis-route actions and break the intended interaction flow.

Core Entities

Models

gpt-3.5-turbo-16k

Metrics

FIAS interaction category proportionsanswer recommendation counts (per student)