Overview
The system clearly improves XML recall and syntactic completeness on DEXPI examples, but evaluation is small and lacks industrial datasets, so further validation is required before large-scale deployment.
Citations2
Evidence Strength0.55
Confidence0.85
Risk Signals12
Trust Signals
Findings with numeric evidence: 3/3
Findings with evidence refs: 3/3
Results with explicit delta: 2/4
Reproducibility
Status: Partial assets available
Open source: Partial
At A Glance
Cost impact: 60%
Production readiness: 40%
Novelty: 60%
Why It Matters For Business
Automating P&ID creation cuts manual drafting time and improves auditability by producing interoperable DEXPI XML and editable Visio drafts.
Who Should Care
Summary TLDR
The authors build ACPID, an "agentic" copilot that turns plain-English descriptions of piping systems into machine-readable DEXPI XML and draft Visio diagrams. The system uses a Plan-and-Execute LLM workflow to emit a compact DSL, deterministically translates that DSL to DEXPI Proteus XML, and renders visuals via the Visio API. On a small DEXPI-based test bench the copilot achieves much higher element recall (soundness 96.96%) and syntactic completeness (92.97%) than zero-shot or few-shot GPT-4-Turbo. Limits: small public test set, careful prompt design needed, and higher inference time versus single-shot methods.
Problem Statement
Creating P&ID diagrams is manual, slow, error-prone, and hard to audit. Prior ML work digitizes existing diagrams but does not generate interoperable P&ID machine formats directly from natural-language requests. The paper aims to automate subsystem-level P&ID creation from text while producing editable, interoperable DEXPI XML and draft Visio diagrams.
Main Contribution
ACPID copilot: an agentic, multi-step Plan-and-Execute system that converts natural language to a DSL and then to DEXPI Proteus XML.
Deterministic rule-based translator from the DSL to DEXPI XML plus a Visual Diagram Generator that renders draft Visio (.vsdx) outputs.
Key Findings
ACPID achieves much higher soundness than single-pass GPT-4-Turbo.
ACPID produces substantially more syntactically complete DEXPI XML.
Results
| Metric | Value | Baseline | Delta | Split / Dataset | Evidence | Evidence Ref |
|---|---|---|---|---|---|---|
| Soundness (proportion of prompted elements present) | 96.96% | Zero-shot 58.33%; Few-shot 65.90% | +31-38 pp vs baselines | Test bench: ~132 artifacts from DEXPI examples | Table 1; Soundness evaluated on elements, connections, attributes | Table 1; Evaluations section |
| Completeness (DEXPI XML syntactic completeness) | 92.97% | Zero-shot 0%; Few-shot 68.28% | +24.7 pp vs best baseline | 555 DEXPI sections from DEXPI Example | Table 2; Completeness checks for classes, IDs, positions, shape catalogue | Table 2; Evaluations section |
What To Try In 7 Days
Run ACPID on a couple of small subsystem descriptions to compare time-to-draft vs manual creation.
Convert existing simple P&ID text descriptions into DEXPI XML and open results in Visio for quick edits.
Use the DSL+rule translator idea to add deterministic checks to your diagram export pipeline.
Agent Features
Memory
Planning
Tool Use
Frameworks
Is Agentic
Yes
Architectures
Collaboration
Optimization Features
Token Efficiency
Infra Optimization
System Optimization
Inference Optimization
Reproducibility
Risks & Boundaries
Limitations
Evaluation limited to small public DEXPI examples; proprietary plant data not tested.
Rules-driven translation requires careful prompt design and can be rigid.
When Not To Use
When you need end-to-end full-plant diagrams in one shot without iterative steps.
When low-latency, real-time diagram generation is required.
Failure Modes
Missing or mis-linked connections between elements due to LLM planning errors.
Incorrect or incomplete XML attributes despite element presence.

