Agentic copilot that converts natural language into P&ID DEXPI XML and Visio drawings

Overview

Decision SnapshotNeeds Validation

The system clearly improves XML recall and syntactic completeness on DEXPI examples, but evaluation is small and lacks industrial datasets, so further validation is required before large-scale deployment.

Citations2

Evidence Strength0.55

Confidence0.85

Risk Signals12

Trust Signals

Findings with numeric evidence: 3/3

Findings with evidence refs: 3/3

Results with explicit delta: 2/4

Reproducibility

Status: Partial assets available

Open source: Partial

At A Glance

Cost impact: 60%

Production readiness: 40%

Novelty: 60%

Authors

Shreeyash Gowaikar, Srinivasan Iyengar, Sameer Segal, Shivkumar Kalyanaraman

Links

Abstract / PDF / Data

Why It Matters For Business

Automating P&ID creation cuts manual drafting time and improves auditability by producing interoperable DEXPI XML and editable Visio drafts.

Who Should Care

Engineering Lead ML Engineer Product Manager Founder

Summary TLDR

The authors build ACPID, an "agentic" copilot that turns plain-English descriptions of piping systems into machine-readable DEXPI XML and draft Visio diagrams. The system uses a Plan-and-Execute LLM workflow to emit a compact DSL, deterministically translates that DSL to DEXPI Proteus XML, and renders visuals via the Visio API. On a small DEXPI-based test bench the copilot achieves much higher element recall (soundness 96.96%) and syntactic completeness (92.97%) than zero-shot or few-shot GPT-4-Turbo. Limits: small public test set, careful prompt design needed, and higher inference time versus single-shot methods.

Problem Statement

Creating P&ID diagrams is manual, slow, error-prone, and hard to audit. Prior ML work digitizes existing diagrams but does not generate interoperable P&ID machine formats directly from natural-language requests. The paper aims to automate subsystem-level P&ID creation from text while producing editable, interoperable DEXPI XML and draft Visio diagrams.

Main Contribution

ACPID copilot: an agentic, multi-step Plan-and-Execute system that converts natural language to a DSL and then to DEXPI Proteus XML.

Deterministic rule-based translator from the DSL to DEXPI XML plus a Visual Diagram Generator that renders draft Visio (.vsdx) outputs.

Key Findings

ACPID achieves much higher soundness than single-pass GPT-4-Turbo.

NumbersACPID 96.96% vs Zero-shot 58.33% and Few-shot 65.90%

Practical UseUse an agentic Plan-and-Execute workflow to reliably include requested elements in generated DEXPI XML instead of relying on single-shot prompts.

Evidence RefTable 1, Soundness evaluation on ~132 artifacts

ACPID produces substantially more syntactically complete DEXPI XML.

NumbersACPID 92.97% completeness vs Few-shot 68.28% and Zero-shot 0%

Practical UseDeterministic DSL→DEXPI translation plus rule checks yields interoperable XML ready for downstream CAE tools.

Evidence RefTable 2, Completeness evaluation on 555 DEXPI sections

Results

Metric	Value	Baseline	Delta	Split / Dataset	Evidence	Evidence Ref
Soundness (proportion of prompted elements present)	96.96%	Zero-shot 58.33%; Few-shot 65.90%	+31-38 pp vs baselines	Test bench: ~132 artifacts from DEXPI examples	Table 1; Soundness evaluated on elements, connections, attributes	Table 1; Evaluations section
Completeness (DEXPI XML syntactic completeness)	92.97%	Zero-shot 0%; Few-shot 68.28%	+24.7 pp vs best baseline	555 DEXPI sections from DEXPI Example	Table 2; Completeness checks for classes, IDs, positions, shape catalogue	Table 2; Evaluations section

What To Try In 7 Days

Run ACPID on a couple of small subsystem descriptions to compare time-to-draft vs manual creation.

Convert existing simple P&ID text descriptions into DEXPI XML and open results in Visio for quick edits.

Use the DSL+rule translator idea to add deterministic checks to your diagram export pipeline.

Agent Features

Memory

Short-term context via appending prior executed steps

Planning

LLM-generated execution plans (plan step list)

Tool Use

Deterministic DSL→DEXPI translationMicrosoft Visio C# API for rendering

Frameworks

PwR (Programming with Representation)

Is Agentic

Yes

Architectures

Plan-and-Execute agents

Collaboration

Human-in-the-loop editing and validation

Optimization Features

Token Efficiency

Agent edits reduce need to resend whole XML as context

Infra Optimization

Not addressed; paper notes higher inference time as trade-off

System Optimization

Rule-based deterministic translation to reduce LLM variability

Inference Optimization

Partial token savings by editing XML directly instead of sending full XML context

Reproducibility

Code AvailableNo

Data AvailableYes

Open Source StatusPartial

LicenseUnknown

Data URLs

https://gitlab.com/dexpi/TrainingTestCases//tree/master/dexpi%201.3?ref type=heads

Risks & Boundaries

Limitations

Evaluation limited to small public DEXPI examples; proprietary plant data not tested.

Rules-driven translation requires careful prompt design and can be rigid.

When Not To Use

When you need end-to-end full-plant diagrams in one shot without iterative steps.

When low-latency, real-time diagram generation is required.

Failure Modes

Missing or mis-linked connections between elements due to LLM planning errors.

Incorrect or incomplete XML attributes despite element presence.

Core Entities

Models

GPT-4-Turbo

Metrics

soundnesscompleteness

Datasets

DEXPI example P&IDs (DEXPI Consortium examples)

Context Entities

Datasets

DEXPI P&ID Specification 1.3

Overview

Trust Signals

Reproducibility

At A Glance

Authors

Links

Why It Matters For Business

Who Should Care

Summary TLDR

Problem Statement

Main Contribution

Key Findings

ACPID achieves much higher soundness than single-pass GPT-4-Turbo.

ACPID produces substantially more syntactically complete DEXPI XML.

Results

What To Try In 7 Days

Agent Features

Optimization Features

Reproducibility

Data URLs

Risks & Boundaries

Limitations

When Not To Use

Failure Modes

Core Entities

Models

Metrics

Datasets

Context Entities

Datasets

You May Also Want to Read

Survey: Reframe LLMs as agents that plan, act, and continually learn

Key finding

Reference architecture, multi-agent taxonomy, and enterprise hardening for LLM agents

Key finding

Systematizes reusable 'agentic skills' for LLM agents, their lifecycle, design patterns, risks, and evaluation

Key finding

A closed-loop Sensing→Regulating→Correcting system that routes LLM execution by uncertainty to cut errors and API cost

Key finding

Diffusion-backed agents match accuracy but run ~30% faster and can reach up to 8× speedups in some cases

Key finding