Design agents around files + code to make them more composable, auditable, and maintainable

Overview

Decision SnapshotNeeds Validation

This is a conceptual, evidence-backed position: ideas are practical now, but the paper presents examples rather than large-scale evaluations.

Citations0

Evidence Strength0.50

Confidence0.70

Risk Signals11

Trust Signals

Findings with numeric evidence: 2/5

Findings with evidence refs: 5/5

Results with explicit delta: 0/0

Reproducibility

Status: No open assets linked

Open source: Unknown

At A Glance

Cost impact: 60%

Production readiness: 60%

Novelty: 50%

Authors

Deepak Babu Piskala

Links

Abstract / PDF

Why It Matters For Business

Treating resources as files and actions as code reduces integration work, makes agent behavior auditable, and lets teams reuse existing DevOps practices to manage agent artifacts.

Who Should Care

CTO Product Manager ML Engineer Engineering Lead Founder

Summary TLDR

This short position paper argues that a decades-old Unix idea—treat diverse resources as files—reappears as a practical design principle for autonomous AI agents. The authors trace the idea from Unix to DevOps ("everything as code") and to modern agents that use file-like memory, code as the action language, and file-based context retrieval. For practitioners the paper recommends: expose resources via a file-like namespace, let agents generate code (with sandboxing) instead of bespoke tool integrations, and use DevOps tooling for versioning and auditing. The paper is conceptual and cites industry examples (Anthropic, Cursor) and frameworks (AIGNE).

Problem Statement

Modern agents must talk to many different systems—APIs, databases, vector stores, cloud consoles, UIs—each with different interfaces. That interface proliferation makes agents hard to build, maintain, and audit. The paper argues that collapsing these interfaces into a uniform, file-like abstraction plus code execution reduces complexity and improves composability and traceability.

Main Contribution

Historical analysis connecting Unix's 'everything is a file' to DevOps and current agent design.

Argument and evidence that file-like namespaces and code-as-action simplify agent architectures.

Key Findings

Practitioners are converging on filesystem and code abstractions for agent context and actions.

Practical UseDesign agents to read/write a shared namespace of files rather than building many bespoke tool interfaces.

Evidence Refsec 4-5; Jerry Liu blog and AIGNE [5][6]

Code as an action language reduces the need for many custom tool integrations.

Numbers~5–10 tools vs 100+ MCP tools (reported example)

Practical UseLet the agent generate and execute code (SQL, Python, curl) and focus engineering effort on safe execution and sandboxing.

Evidence Refsec 5; Jerry Liu quote [5]

What To Try In 7 Days

Expose one external resource (API or DB) through a small file-backed adapter and let an agent read/write it.

Store conversation history and agent plans as versioned files in git to enable auditing.

Prototype one capability by having the agent generate code (SQL or Python) and run it inside a sandboxed runner.

Agent Features

Memory

file-backed conversation historyshared file namespace for memorysearchable files for context retrieval

Planning

plans persisted to filesspawn subagents that write/read shared files

Tool Use

code generation instead of bespoke tool APIsCLI over filesystemcode interpreterweb fetch

Frameworks

AIGNEAnthropic multi-agent systemLangChainLlamaIndex practices

Is Agentic

Yes

Architectures

file-system abstractioncode-as-actionlead agent + subagents orchestration

Collaboration

shared memory for agent coordinationparallel subagent search with file returns

Reproducibility

Code AvailableNo

Data AvailableNo

Open Source StatusUnknown

LicenseUnknown

Risks & Boundaries

Limitations

Non-text formats need parsing before use with file reads.

Very large document collections require indexing beyond simple file search.

When Not To Use

Low-latency device control where spool-and-read is too slow.

Highly specialized binary protocols that don't map to textual files.

Failure Modes

Agent writes unsafe or insecure code that is then executed.

Stale or inconsistent files cause agents to act on outdated context.

Core Entities

Models

Claude CodeCursor

Overview

Trust Signals

Reproducibility

At A Glance

Authors

Links

Why It Matters For Business

Who Should Care

Summary TLDR

Problem Statement

Main Contribution

Key Findings

Practitioners are converging on filesystem and code abstractions for agent context and actions.

Code as an action language reduces the need for many custom tool integrations.

What To Try In 7 Days

Agent Features

Reproducibility

Risks & Boundaries

Limitations

When Not To Use

Failure Modes

Core Entities

Models

You May Also Want to Read

Chemistry foundation models power structure-focused multimodal RAG inside hierarchical multi-agent workflows

Key finding

Argues that 'agentic' buzzwords mostly rebrand decades-old agent and multi-agent research

Key finding

TRiSM: practical trust, risk and security controls for LLM-based multi-agent systems

Key finding

A dynamic town simulation that tests LLM agents on doing tasks while following local cultural norms

Key finding

A process-aware, auditable multi-agent evaluator that produces more stable, human-aligned scores than a single LLM judge

Key finding