Add intent-aware JWTs and a client shim to stop agents from misusing shared OAuth tokens

September 16, 20258 min

Overview

Decision SnapshotNeeds Validation

Design is practical and backward-compatible. Prototype results are promising, but quantitative evaluation is limited and a full performance/security analysis is pending.

Citations0

Evidence Strength0.50

Confidence0.80

Risk Signals12

Trust Signals

Findings with numeric evidence: 2/5

Findings with evidence refs: 5/5

Results with explicit delta: 2/2

Reproducibility

Status: No open assets linked

Open source: Unknown

At A Glance

Cost impact: 60%

Production readiness: 60%

Novelty: 70%

Authors

Abhishek Goswami

Links

Abstract / PDF

Why It Matters For Business

If your product runs LLM-driven multi-agent workflows, existing bearer tokens let an attacker or a compromised agent act with broad privileges. A-JWT adds per-agent intent and proof-of-possession to reduce blast radius and give cryptographic audit trails.

Who Should Care

Summary TLDR

The paper proposes Agentic JWT (A-JWT): an extension to OAuth/JWT that issues per-agent "intent tokens" plus delegation assertions and per-agent proof-of-possession keys. A client-side shim computes a checksum identity for each agent (based on prompt, tools, config), derives short-lived keys, and asks an IDP for intent-bound tokens. A prototype blocks the modeled attacks and adds sub-millisecond verification cost, enabling stronger zero-trust guarantees for multi-agent apps while remaining backwards-compatible with OAuth.

Problem Statement

Standard OAuth/JWT treats a client process as a single principal. LLM-driven multi-agent clients break that assumption: one process can host many autonomous agents whose runtime behavior (prompts, tool calls) may diverge from the original user intent. That enables replay, impersonation, prompt-injection, privilege escalation and other attacks which existing bearer tokens cannot cryptographically distinguish.

Main Contribution

A token design (A-JWT) that adds an intent token and delegation assertion to bind each API call to a user intent and workflow step.

A client shim that computes per-agent checksums, derives per-agent PoP keys, tracks workflow state, and mints intent tokens at runtime.

Key Findings

A-JWT prototype prevented the modeled attacks in the micro-service experiment.

Numbers100% of reproduced threat requests blocked

Practical UseIf you mint and verify intent tokens as proposed, the prototype shows you can deny replay, impersonation, prompt-injection and cross-agent privilege requests in similar setups.

Evidence RefVI.A; Contributions section

Intent and shim checks add negligible runtime cost in the prototype.

Numberssub-millisecond verification; optional TEE attestation <2 ms

Practical UseYou can adopt per-agent verification without large latency hits for typical API calls on commodity hardware, but watch token-minting frequency for overall throughput.

Evidence RefAbstract; III.D Integrity tiers; VI

Results

MetricValueBaselineDeltaSplit / DatasetEvidenceEvidence Ref
Threat mitigation in prototype100% of reproduced threat requests were blockedtraditional OAuth 2.0 bearer tokens (vulnerable)full mitigation in experimentmulti-agent vulnerability patcher micro-service (VI.A)Prototype reproduced STRIDE threats and denied token minting or access in After phaseVI.A; Contributions
Verification overheadsub-millisecond per request; optional attestation <2 msno intent verificationsmall added latencyprototype on commodity hardwareAuthor reports sub-millisecond overhead and <2 ms for TEE attestationAbstract; III.D Integrity tiers

What To Try In 7 Days

Run a threat inventory of agentic flows and APIs to identify critical chokepoints.

Prototype a shim that computes a simple agent checksum and sends it as a header for sensitive API calls.

Add PoP-key verification for a high-risk API and measure token-minting latency and operational cost.

Agent Features

Memory
Workflow state tracking in shim (short-term workflow context)
Planning
LLM-driven plan generation (chain-of-thought style)
Tool Use
Per-agent tool lists recorded in agent checksumAgent-specific API calls via shim
Frameworks
Agent registration + workflow registration with IDP
Is Agentic

Yes

Architectures
Orchestrator + delegate multi-agent app
Collaboration
Delegation chains encoded in tokens

Reproducibility

Code AvailableNo
Data AvailableNo
Open Source StatusUnknown
LicenseUnknown

Risks & Boundaries

Limitations

Workflow and agent registration create operational overhead and governance complexity.

Prototype is Python-specific; other languages need separate shim implementations.

When Not To Use

Single deterministic clients where existing JWT scopes are sufficient.

Low-governance environments that cannot support per-agent registration and CI/CD integration.

Failure Modes

Shim library compromise or supply-chain replacement could bypass checks if deployment integrity is broken.

Poorly defined prompt template hashing may allow prompt-injection to evade checks (false negatives).

Core Entities

Models

Large language models (LLMs) - generic

Metrics

latency (sub-millisecond verification)attack-blocking rate (100% in prototype)

Context Entities

Metrics

token minting latency (qualitative; may increase frequency)Shim integrity check (X-Shim-Checksum)