Use LLM agents and a fishbowl discussion to simulate participatory urban planning and improve resident satisfaction

February 27, 20247 min

Overview

Decision SnapshotNeeds Validation

Proof-of-concept results on two real regions and multiple runs show promise, but the method omits costs, ownership, and other practical constraints; more engineering and human integration needed for production.

Citations11

Evidence Strength0.60

Confidence0.78

Risk Signals9

Trust Signals

Findings with numeric evidence: 5/5

Findings with evidence refs: 5/5

Results with explicit delta: 6/6

Reproducibility

Status: No open assets linked

Open source: No

At A Glance

Cost impact: 50%

Production readiness: 30%

Novelty: 60%

Authors

Zhilun Zhou, Yuming Lin, Depeng Jin, Yong Li

Links

Abstract / PDF

Why It Matters For Business

Simulated multi-agent LLM planning can surface local needs early, reducing time and rehearsal costs before engaging humans; it helps test many “what-if” land-use options quickly while keeping service coverage competitive.

Who Should Care

Summary TLDR

This paper builds a multi-agent system of LLMs that simulates a planner plus thousands of resident agents to produce land-use plans. Residents are role-played from census distributions and discuss via a fishbowl mechanism (inner/outer circles). The planner (GPT-4 vision) proposes an initial map, residents discuss in rounds, discussion is summarized, and the planner revises the plan. On two Beijing regions the method raises need-aware metrics (Satisfaction and Inclusion) substantially vs baselines while keeping service/access metrics competitive. The setup omits costs, ownership, and many real-world constraints.

Problem Statement

Traditional participatory planning is slow, costly, and hard to scale to thousands of residents. How can we simulate many stakeholders cheaply and efficiently so planners can create land-use plans that actually reflect diverse residents' needs?

Main Contribution

A multi-agent LLM framework that role-plays a planner and many residents to simulate participatory urban planning.

A fishbowl discussion mechanism (inner/outer circles + summaries) to scale resident discussion and limit context length.

Key Findings

Simulated participatory planning raised resident Satisfaction to 0.787 on HLG.

NumbersSatisfaction 0.787 (HLG) vs 0.708 (best baseline DRL)

Practical UseUse LLM-based resident role-play plus discussion to increase the fraction of resident needs met within 500m in simulated plans; expect roughly +11% on this benchmark.

Evidence RefTable 2

Inclusion for marginalized groups improved to 0.773 on HLG.

NumbersInclusion 0.773 (HLG) vs 0.716 (DRL)

Practical UseThe method better incorporates marginalized groups' needs than automated baselines, so it can surface minority needs in early-stage planning simulations.

Evidence RefTable 2

Results

MetricValueBaselineDeltaSplit / DatasetEvidenceEvidence Ref
Satisfaction0.787DRL 0.708 (best baseline)+11.2%HLGTable 2 shows Satisfaction 0.787 for Ours vs 0.708 for DRLTable 2
Inclusion0.773DRL 0.716+8.0%HLGTable 2 shows Inclusion 0.773 for Ours vs 0.716 for DRLTable 2

What To Try In 7 Days

Run a small pilot: create 100 resident agents from local demographics and role-play 1 community with GPT-4 vision and gpt-3.5 residents.

Use 3 fishbowl rounds and compare Satisfaction/Inclusion vs a planner-only baseline.

Produce summaries after each round to keep context short and reuse them in prompts.

Agent Features

Memory
Short-term discussion summaryRound-by-round history aggregation
Planning
Planning with LLMsTask DecompositionCommunity-level revision
Tool Use
Multimodal map inputPrompt-based role-play
Frameworks
Inner/outer fishbowlRole-play prompts
Is Agentic

Yes

Architectures
GPT-4-visionGPT-3.5
Collaboration
Fishbowl discussionSequential community revision

Optimization Features

Token Efficiency
Use summaries to limit token growth
Inference Optimization
Reduce context by summarizing rounds

Reproducibility

Code AvailableNo
Data AvailableNo
Open Source StatusNo
LicenseUnknown

Risks & Boundaries

Limitations

Does not model ownership, development cost, or regulatory constraints.

Land-use types and requirements are simplified to eight categories.

When Not To Use

For legally binding or final planning decisions that require ownership/cost modeling.

When transparent, auditable decision chains are required without prompt engineering.

Failure Modes

LLM-generated residents may hallucinate unrealistic needs or locations.

Prompt bias can skew which resident concerns are surfaced.

Core Entities

Models

gpt-4-vision-previewgpt-3.5-turbo-1106

Metrics

ServiceEcologySatisfactionInclusion

Datasets

Huilongguan (HLG)Dahongmen (DHM)