Design agents around files + code to make them more composable, auditable, and maintainable

January 16, 20266 min

Overview

Decision SnapshotNeeds Validation

This is a conceptual, evidence-backed position: ideas are practical now, but the paper presents examples rather than large-scale evaluations.

Citations0

Evidence Strength0.50

Confidence0.70

Risk Signals11

Trust Signals

Findings with numeric evidence: 2/5

Findings with evidence refs: 5/5

Results with explicit delta: 0/0

Reproducibility

Status: No open assets linked

Open source: Unknown

At A Glance

Cost impact: 60%

Production readiness: 60%

Novelty: 50%

Authors

Deepak Babu Piskala

Links

Abstract / PDF

Why It Matters For Business

Treating resources as files and actions as code reduces integration work, makes agent behavior auditable, and lets teams reuse existing DevOps practices to manage agent artifacts.

Who Should Care

Summary TLDR

This short position paper argues that a decades-old Unix idea—treat diverse resources as files—reappears as a practical design principle for autonomous AI agents. The authors trace the idea from Unix to DevOps ("everything as code") and to modern agents that use file-like memory, code as the action language, and file-based context retrieval. For practitioners the paper recommends: expose resources via a file-like namespace, let agents generate code (with sandboxing) instead of bespoke tool integrations, and use DevOps tooling for versioning and auditing. The paper is conceptual and cites industry examples (Anthropic, Cursor) and frameworks (AIGNE).

Problem Statement

Modern agents must talk to many different systems—APIs, databases, vector stores, cloud consoles, UIs—each with different interfaces. That interface proliferation makes agents hard to build, maintain, and audit. The paper argues that collapsing these interfaces into a uniform, file-like abstraction plus code execution reduces complexity and improves composability and traceability.

Main Contribution

Historical analysis connecting Unix's 'everything is a file' to DevOps and current agent design.

Argument and evidence that file-like namespaces and code-as-action simplify agent architectures.

Key Findings

Practitioners are converging on filesystem and code abstractions for agent context and actions.

Practical UseDesign agents to read/write a shared namespace of files rather than building many bespoke tool interfaces.

Evidence Refsec 4-5; Jerry Liu blog and AIGNE [5][6]

Code as an action language reduces the need for many custom tool integrations.

Numbers~510 tools vs 100+ MCP tools (reported example)

Practical UseLet the agent generate and execute code (SQL, Python, curl) and focus engineering effort on safe execution and sandboxing.

Evidence Refsec 5; Jerry Liu quote [5]

What To Try In 7 Days

Expose one external resource (API or DB) through a small file-backed adapter and let an agent read/write it.

Store conversation history and agent plans as versioned files in git to enable auditing.

Prototype one capability by having the agent generate code (SQL or Python) and run it inside a sandboxed runner.

Agent Features

Memory
file-backed conversation historyshared file namespace for memorysearchable files for context retrieval
Planning
plans persisted to filesspawn subagents that write/read shared files
Tool Use
code generation instead of bespoke tool APIsCLI over filesystemcode interpreterweb fetch
Frameworks
AIGNEAnthropic multi-agent systemLangChainLlamaIndex practices
Is Agentic

Yes

Architectures
file-system abstractioncode-as-actionlead agent + subagents orchestration
Collaboration
shared memory for agent coordinationparallel subagent search with file returns

Reproducibility

Code AvailableNo
Data AvailableNo
Open Source StatusUnknown
LicenseUnknown

Risks & Boundaries

Limitations

Non-text formats need parsing before use with file reads.

Very large document collections require indexing beyond simple file search.

When Not To Use

Low-latency device control where spool-and-read is too slow.

Highly specialized binary protocols that don't map to textual files.

Failure Modes

Agent writes unsafe or insecure code that is then executed.

Stale or inconsistent files cause agents to act on outdated context.

Core Entities

Models

Claude CodeCursor