Agentic copilot that converts natural language into P&ID DEXPI XML and Visio drawings

December 17, 20247 min

Overview

Decision SnapshotNeeds Validation

The system clearly improves XML recall and syntactic completeness on DEXPI examples, but evaluation is small and lacks industrial datasets, so further validation is required before large-scale deployment.

Citations2

Evidence Strength0.55

Confidence0.85

Risk Signals12

Trust Signals

Findings with numeric evidence: 3/3

Findings with evidence refs: 3/3

Results with explicit delta: 2/4

Reproducibility

Status: Partial assets available

Open source: Partial

At A Glance

Cost impact: 60%

Production readiness: 40%

Novelty: 60%

Authors

Shreeyash Gowaikar, Srinivasan Iyengar, Sameer Segal, Shivkumar Kalyanaraman

Links

Abstract / PDF / Data

Why It Matters For Business

Automating P&ID creation cuts manual drafting time and improves auditability by producing interoperable DEXPI XML and editable Visio drafts.

Who Should Care

Summary TLDR

The authors build ACPID, an "agentic" copilot that turns plain-English descriptions of piping systems into machine-readable DEXPI XML and draft Visio diagrams. The system uses a Plan-and-Execute LLM workflow to emit a compact DSL, deterministically translates that DSL to DEXPI Proteus XML, and renders visuals via the Visio API. On a small DEXPI-based test bench the copilot achieves much higher element recall (soundness 96.96%) and syntactic completeness (92.97%) than zero-shot or few-shot GPT-4-Turbo. Limits: small public test set, careful prompt design needed, and higher inference time versus single-shot methods.

Problem Statement

Creating P&ID diagrams is manual, slow, error-prone, and hard to audit. Prior ML work digitizes existing diagrams but does not generate interoperable P&ID machine formats directly from natural-language requests. The paper aims to automate subsystem-level P&ID creation from text while producing editable, interoperable DEXPI XML and draft Visio diagrams.

Main Contribution

ACPID copilot: an agentic, multi-step Plan-and-Execute system that converts natural language to a DSL and then to DEXPI Proteus XML.

Deterministic rule-based translator from the DSL to DEXPI XML plus a Visual Diagram Generator that renders draft Visio (.vsdx) outputs.

Key Findings

ACPID achieves much higher soundness than single-pass GPT-4-Turbo.

NumbersACPID 96.96% vs Zero-shot 58.33% and Few-shot 65.90%

Practical UseUse an agentic Plan-and-Execute workflow to reliably include requested elements in generated DEXPI XML instead of relying on single-shot prompts.

Evidence RefTable 1, Soundness evaluation on ~132 artifacts

ACPID produces substantially more syntactically complete DEXPI XML.

NumbersACPID 92.97% completeness vs Few-shot 68.28% and Zero-shot 0%

Practical UseDeterministic DSL→DEXPI translation plus rule checks yields interoperable XML ready for downstream CAE tools.

Evidence RefTable 2, Completeness evaluation on 555 DEXPI sections

Results

MetricValueBaselineDeltaSplit / DatasetEvidenceEvidence Ref
Soundness (proportion of prompted elements present)96.96%Zero-shot 58.33%; Few-shot 65.90%+31-38 pp vs baselinesTest bench: ~132 artifacts from DEXPI examplesTable 1; Soundness evaluated on elements, connections, attributesTable 1; Evaluations section
Completeness (DEXPI XML syntactic completeness)92.97%Zero-shot 0%; Few-shot 68.28%+24.7 pp vs best baseline555 DEXPI sections from DEXPI ExampleTable 2; Completeness checks for classes, IDs, positions, shape catalogueTable 2; Evaluations section

What To Try In 7 Days

Run ACPID on a couple of small subsystem descriptions to compare time-to-draft vs manual creation.

Convert existing simple P&ID text descriptions into DEXPI XML and open results in Visio for quick edits.

Use the DSL+rule translator idea to add deterministic checks to your diagram export pipeline.

Agent Features

Memory
Short-term context via appending prior executed steps
Planning
LLM-generated execution plans (plan step list)
Tool Use
Deterministic DSL→DEXPI translationMicrosoft Visio C# API for rendering
Frameworks
PwR (Programming with Representation)
Is Agentic

Yes

Architectures
Plan-and-Execute agents
Collaboration
Human-in-the-loop editing and validation

Optimization Features

Token Efficiency
Agent edits reduce need to resend whole XML as context
Infra Optimization
Not addressed; paper notes higher inference time as trade-off
System Optimization
Rule-based deterministic translation to reduce LLM variability
Inference Optimization
Partial token savings by editing XML directly instead of sending full XML context

Reproducibility

Code AvailableNo
Data AvailableYes
Open Source StatusPartial
LicenseUnknown

Risks & Boundaries

Limitations

Evaluation limited to small public DEXPI examples; proprietary plant data not tested.

Rules-driven translation requires careful prompt design and can be rigid.

When Not To Use

When you need end-to-end full-plant diagrams in one shot without iterative steps.

When low-latency, real-time diagram generation is required.

Failure Modes

Missing or mis-linked connections between elements due to LLM planning errors.

Incorrect or incomplete XML attributes despite element presence.

Core Entities

Models

GPT-4-Turbo

Metrics

soundnesscompleteness

Datasets

DEXPI example P&IDs (DEXPI Consortium examples)

Context Entities

Datasets

DEXPI P&ID Specification 1.3