Overview
InfoNav gives clear multi-turn guidance that boosts small-network reasoning; mixed memory and recursive communication matter most when retrieval over many messages is required.
Citations1
Evidence Strength0.70
Confidence0.80
Risk Signals10
Trust Signals
Findings with numeric evidence: 4/4
Findings with evidence refs: 4/4
Results with explicit delta: 2/8
Reproducibility
Status: Code + data available
Open source: Yes
At A Glance
Cost impact: 60%
Production readiness: 50%
Novelty: 70%
Why It Matters For Business
iAgents lets one agent per user coordinate across private data without centralizing it, enabling multi-user scheduling, concierge and workflow automation—but expect higher token costs and privacy trade-offs.
Who Should Care
Summary TLDR
This paper defines information asymmetry for multi-agent systems (each agent only sees its user's private data) and proposes iAgents: a system of one agent per user that proactively requests and exchanges only necessary human information. Two core ideas: InfoNav, an explicit plan that tracks which facts (rationales) are unknown and guides multi-turn questions; and Mixed Memory, combining exact-span 'Clear Memory' with embedding-based 'Fuzzy Memory' for retrieval. The authors release InformativeBench (5 datasets) and show GPT-4 achieves ~50% on average while smaller LLMs perform worse. Ablations show InfoNav is critical for small-network reasoning and mixed memory + recursive communication is
Problem Statement
Multi-agent systems assume shared context but real human collaborations are asymmetric: each agent only sees its user's private information. That breaks coordination. The challenge is to enable agents to acquire and exchange needed facts without centralizing private data, while scaling retrieval over many messages and keeping multi-turn communication focused.
Main Contribution
Formulate the problem of information asymmetry in multi-agent collaboration and shift focus from a single shared virtual entity to agents that mirror users.
Propose iAgents: integrates InfoNav (plan-driven communication) and Mixed Memory (Clear + Fuzzy) to retrieve and exchange human information without centralizing all data.
Key Findings
GPT-4-backed iAgents solved many tasks but performance varies strongly by dataset difficulty.
iAgents scaled to a large simulated social network and retrieved many messages during runs.
Results
| Metric | Value | Baseline | Delta | Split / Dataset | Evidence | Evidence Ref |
|---|---|---|---|---|---|---|
| Schedule Easy (precision) | 56.67% | — | — | Schedule Easy | GPT-4 row in Table 1 | Table 1 |
| Schedule Medium (precision) | 51.00% | — | — | Schedule Medium | GPT-4 row in Table 1 | Table 1 |
What To Try In 7 Days
Prototype InfoNav prompts on a small 4–6 person calendar use case to test multi-turn info exchange.
Build a mixed memory of exact spans + session summaries and compare retrieval quality.
Run InformativeBench (NP or ScheduleEasy) with your preferred LLM to measure baseline accuracy and token cost.
Agent Features
Memory
Planning
Tool Use
Frameworks
Is Agentic
Yes
Architectures
Collaboration
Optimization Features
Token Efficiency
Reproducibility
Code URLs
Data URLs
Risks & Boundaries
Limitations
Privacy vs. utility trade-off: stronger privacy restrictions noticeably reduce accuracy.
High token and latency cost: experiments report ~30,000 tokens per task.
When Not To Use
When absolute local-only privacy is required (L3 level) and edge models cannot match performance.
For tiny tasks where centralizing data is simpler and cheaper.
Failure Modes
Agents hallucinate 'fake solved' rationales and pass incorrect facts into consensus.
Pretrained model priors override user-provided evidence, leading to prior-distraction errors.

