Overview
The framework is a well-documented prototype that reliably generates editable early-stage BIM models for many prompts. Before production deployment it needs expanded tool coverage, structural and regulatory rules, and stronger spatial reasoning.
Citations: 6
Evidence Strength: 0.80
Confidence: 0.85
Risk Signals: 10
Trust Signals
Findings with numeric evidence: 4/4
Findings with evidence refs: 4/4
Results with explicit delta: 0/4
Reproducibility
Status: Code + data available
Open source: Partial
At A Glance
Cost impact: 60%
Production readiness: 60%
Novelty: 60%
Why It Matters For Business
Text2BIM lets designers describe early-stage buildings in plain language and get editable BIM models, reducing manual modeling effort and speeding concept-to-BIM workflows while preserving the ability to refine results in standard BIM tools.
Summary TLDR
Text2BIM is a multi-agent system that converts natural-language design requests into executable Python code that calls high-level BIM tool functions. Four agents (Instruction Enhancer, Architect, Programmer, Reviewer) cooperate, run the code in a sandboxed interpreter, and use Solibri rule-checking feedback to fix problems iteratively. On 25 test prompts (3 runs each, 534 IFC models in total), modern LLMs produced editable BIM models with internal layouts and semantics, and automated quality was high (most Solibri rule pass rates ≳0.95). The system is a feasibility prototype for early-stage, editable BIM generation, not a finished production tool: its toolset is limited and it covers architectural and structural rules only partially.
Problem Statement
Creating editable, semantically rich BIM models from plain text is hard because Text-to-3D methods produce surface geometry without BIM semantics. Designers must still learn complex authoring tools. The paper asks: can LLMs be orchestrated to generate executable modeling code and iterate with deterministic rule checks to produce native BIM models aligned with user intent?
Main Contribution
A code-centric multi-agent framework (Instruction Enhancer, Architect, Programmer, Reviewer) that converts text into Python code invoking high-level BIM tool functions to create native BIM models.
A rule-based model-checking loop (Solibri with 30 rules) that feeds back deterministic issues to agents so models are iteratively fixed.
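The two contributions above can be sketched together as one control loop. This is a minimal illustration, not the authors' implementation: the function names (`run_pipeline`, `enhance`, `plan`, `program`, `check`, `review`), the `LoopResult` type, and the `max_iters` cap are all hypothetical, and the stub agents stand in for LLM calls and for the Solibri checker.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class LoopResult:
    code: str                 # final generated modeling code
    issues: List[str]         # issues still open after the last check
    iterations: int           # number of Reviewer passes that were run
    history: List[int] = field(default_factory=list)  # issue count per check


def run_pipeline(
    prompt: str,
    enhance: Callable[[str], str],
    plan: Callable[[str], dict],
    program: Callable[[dict], str],
    check: Callable[[str], List[str]],
    review: Callable[[str, List[str]], str],
    max_iters: int = 3,       # assumed cap; not taken from the source
) -> LoopResult:
    """Enhance -> plan -> program, then check/review until clean or capped."""
    code = program(plan(enhance(prompt)))
    issues = check(code)
    history = [len(issues)]
    iterations = 0
    while issues and iterations < max_iters:
        code = review(code, issues)   # Reviewer patches the code
        issues = check(code)          # deterministic re-check
        history.append(len(issues))
        iterations += 1
    return LoopResult(code, issues, iterations, history)


# Stub agents (hypothetical): real agents would call an LLM or a rule checker.
def _enhance(p: str) -> str:
    return p + " (single storey, rectangular footprint)"

def _plan(brief: str) -> dict:
    return {"rooms": ["living", "kitchen"]}

def _program(layout: dict) -> str:
    return "\n".join(f"create_room('{r}')" for r in layout["rooms"])

def _check(code: str) -> List[str]:
    return [] if "add_door" in code else ["room has no door"]

def _review(code: str, issues: List[str]) -> str:
    return code + "\nadd_door('living')"


result = run_pipeline("two-room house", _enhance, _plan, _program, _check, _review)
```

Passing the agents in as callables keeps the loop itself deterministic and easy to test, while the `history` list records how the issue count changes from one check to the next.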
Key Findings
The framework produced editable IFC/BIM models for 25 diverse prompts, yielding 534 generated IFC models in total.
Automated quality (30-rule Solibri pass rate) was high: most average pass rates exceed 0.95 across prompts and LLMs.
Results
| Metric | Value | Baseline | Delta | Split / Dataset | Evidence | Evidence Ref |
|---|---|---|---|---|---|---|
| Number of generated IFC models | 534 | — | — | 25 prompts × 3 LLMs × repeated runs | Section 6.1 | Sec. 6.1 |
| Final model-checking pass rate (per-prompt average) | ≳0.95 for most LLM/prompt pairs | — | — | 30 Solibri rules | Table 4 reports per-prompt averages | Table 4 |
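As a rough illustration of how a per-prompt average pass rate could be computed, the sketch below assumes that the pass rate of one run is the share of the 30 rules that raise no issue. That definition, the function names, and the sample failure counts are assumptions for illustration; they are not figures or formulas from the paper.

```python
from statistics import mean
from typing import Iterable

TOTAL_RULES = 30  # number of Solibri rules stated in the summary


def pass_rate(failed_rules: int, total: int = TOTAL_RULES) -> float:
    """Share of rules passed in a single run (assumed definition)."""
    if not 0 <= failed_rules <= total:
        raise ValueError("failed_rules must be between 0 and total")
    return (total - failed_rules) / total


def average_pass_rate(failed_per_run: Iterable[int]) -> float:
    """Mean pass rate over the repeated runs of one prompt."""
    return mean(pass_rate(f) for f in failed_per_run)


# Made-up failure counts for three runs of one prompt, not paper data.
example_avg = average_pass_rate([0, 1, 2])
meets_threshold = example_avg >= 0.95
```

Under this assumed definition, a run may fail at most one of the 30 rules and still stay above 0.95, which gives a feel for how strict the reported threshold is.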
What To Try In 7 Days
Run the Text2BIM prototype on 5 typical early-stage briefs to gauge quality and editability.
Define a small set of high-level tool functions (create_wall, add_door, create_slab) and test LLM-generated code execution in a sandbox.
Integrate a rule-checker (e.g., Solibri) and validate a fix-loop: generate → check → patch code → re-run.
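The second step above can be prototyped without any BIM software. The sketch below is a toy stand-in under stated assumptions: the signatures of `create_wall`, `add_door`, and `create_slab`, the `Model` class, and `run_generated` are hypothetical, and the tools only record dictionaries in memory instead of creating IFC elements. Emptying `__builtins__` merely narrows the namespace; it is not a security sandbox, so untrusted code should still run in an isolated process or container.

```python
from typing import Any, Dict, List, Optional, Tuple

Point = Tuple[float, float]


class Model:
    """In-memory stand-in for a BIM model (assumption, not an IFC writer)."""
    def __init__(self) -> None:
        self.elements: List[Dict[str, Any]] = []


def make_tools(model: Model) -> Dict[str, Any]:
    """Build the high-level tool functions exposed to generated code."""
    def create_wall(start: Point, end: Point, height: float = 3.0) -> int:
        model.elements.append(
            {"type": "wall", "start": start, "end": end, "height": height})
        return len(model.elements) - 1   # element id = list index

    def create_slab(outline: List[Point], thickness: float = 0.2) -> int:
        model.elements.append(
            {"type": "slab", "outline": outline, "thickness": thickness})
        return len(model.elements) - 1

    def add_door(wall_id: int, offset: float, width: float = 0.9) -> int:
        host = model.elements[wall_id]   # IndexError if the id is unknown
        if host["type"] != "wall":
            raise ValueError("doors can only be hosted by walls")
        model.elements.append(
            {"type": "door", "host": wall_id, "offset": offset, "width": width})
        return len(model.elements) - 1

    return {"create_wall": create_wall, "create_slab": create_slab,
            "add_door": add_door}


def run_generated(code: str, model: Model) -> Optional[str]:
    """Execute generated code; return None on success or an error message."""
    namespace = {"__builtins__": {}, **make_tools(model)}
    try:
        exec(code, namespace)
    except Exception as exc:             # fed back to the code generator
        return f"{type(exc).__name__}: {exc}"
    return None


GENERATED = """
create_slab([(0, 0), (5, 0), (5, 4), (0, 4)])
w = create_wall((0, 0), (5, 0))
add_door(w, offset=1.0)
"""

model = Model()
error = run_generated(GENERATED, model)
bad_error = run_generated("add_door(99, offset=0.0)", Model())
```

Returning the exception text rather than raising lets the same string be passed back to the code-generating agent as feedback for the next attempt.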
Agent Features
Is Agentic: Yes
Risks & Boundaries
Limitations
Prototype focused on early-stage regular (non-curved) models; cannot generate advanced engineering elements (stairs, beams, columns).
Limited toolset (26 functions) constrains shape complexity and LOD.
When Not To Use
For final construction documents or structural design requiring code compliance.
When irregular/curved geometry or high LOD (>LOD200) is required.
Failure Modes
Hallucinations in long or complex prompts leading to missing or wrong coordinates.
Reviewer fixes introducing new collisions in complex models, so issue counts rise and fall across iterations (a saw-tooth pattern).
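One simple guard against the second failure mode is to log the issue count after every check and keep the best revision instead of the last one. The sketch below is an assumed mitigation, not something the source describes: `is_saw_tooth` and `best_iteration` are hypothetical helpers, and "saw-tooth" is taken here to mean any increase in issue count after an earlier decrease.

```python
from typing import List


def is_saw_tooth(counts: List[int]) -> bool:
    """True if the issue count rises again after having dropped."""
    went_down = False
    for prev, cur in zip(counts, counts[1:]):
        if cur < prev:
            went_down = True
        elif cur > prev and went_down:
            return True
    return False


def best_iteration(counts: List[int]) -> int:
    """Index of the earliest check with the fewest open issues."""
    if not counts:
        raise ValueError("counts must not be empty")
    return min(range(len(counts)), key=counts.__getitem__)


# Made-up issue counts per check, for illustration only.
oscillating = [5, 2, 4, 1, 3]
converging = [5, 3, 1, 0]

flag_oscillating = is_saw_tooth(oscillating)
flag_converging = is_saw_tooth(converging)
keep_index = best_iteration(oscillating)
```

In the oscillating example the final revision has more open issues than the one before it, so keeping the revision at `keep_index` avoids shipping a model the last Reviewer pass made worse.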

