An agent that reconstructs hidden GraphRAG knowledge graphs with few queries

Overview

Decision SnapshotNeeds Validation

The method is well-specified and evaluated on multiple datasets and two GraphRAG systems, but relies on eliciting structured responses and specific LLM/backbone behavior; defenses and deployment differences will affect real-world impact.

Citations0

Evidence Strength0.80

Confidence0.80

Risk Signals9

Trust Signals

Findings with numeric evidence: 5/5

Findings with evidence refs: 5/5

Results with explicit delta: 4/4

Reproducibility

Status: Partial assets available

Open source: Partial

At A Glance

Cost impact: 60%

Production readiness: 40%

Novelty: 60%

Authors

Shuhua Yang, Jiahao Zhang, Yilong Wang, Dongwon Lee, Suhang Wang

Links

Abstract / PDF / Data

Why It Matters For Business

Graph-structured retrieval can leak reusable entity-relation graphs with surprisingly few queries; operators should treat structured retrieval as a privacy risk and add monitoring, response filtering, or query limits.

Who Should Care

CTO Product Manager ML Engineer Engineering Lead Data Scientist

Summary TLDR

This paper shows an attacker can reconstruct large parts of a GraphRAG system's hidden knowledge graph using a small number of queries. The authors introduce AGEA: an agentic loop that alternates novelty-driven exploration and targeted exploitation, keeps a graph memory, and filters LLM-extracted entities/edges before committing them. On two GraphRAG systems and several domains, AGEA recovers much more graph structure per query than prior attacks (e.g., up to ≈90% node/edge recovery on medium graphs with 1,000 queries), while keeping high precision. The attack relies on eliciting structured outputs and LLM-based filtering, and it weakens as graphs grow much larger or when victims restrict or

Problem Statement

Can a black-box attacker, limited to a fixed number of queries, reconstruct the internal entity–relation graph used by GraphRAG systems? The difficulty is noisy, mixed-format responses, no direct graph access, and a strict query budget that forces a trade-off between exploring new areas and exploiting known hubs.

Main Contribution

Formalize budgeted, black-box graph-level extraction attacks against GraphRAG systems.

Propose AGEA: an agentic, novelty-guided explore/exploit attacker with graph memory and a two-stage (regex discovery + LLM filtering) extraction pipeline.

Key Findings

AGEA recovers a very large fraction of nodes and edges under 1,000 queries on medium graphs.

NumbersM-GraphRAG Medical: nodes 87.09%, edges 80.16% at T=1000

Practical UseRed-team GraphRAG services: a modest query budget can expose most of a medium-sized private KG unless defenses are applied.

Evidence RefTable 1 (Main results)

On LightRAG AGEA achieves even higher coverage and precision.

NumbersLightRAG Medical: nodes 96.42%, edges 95.90%, precision ≈98% (T=1000)

Practical UseDifferent GraphRAG constructions change vulnerability; evaluate each deployment separately rather than assuming uniform safety.

Evidence RefTable 1 (Main results)

Results

Metric	Value	Baseline	Delta	Split / Dataset	Evidence	Evidence Ref
Leak(N)	87.09%	AGEA vs baselines (M-GraphRAG, Medical)	best	M-GraphRAG Medical (T=1000)	Table 1 reports AGEA node leakage 87.09% at 1000 queries	Table 1
Leak(E)	80.16%	AGEA vs baselines (M-GraphRAG, Medical)	best	M-GraphRAG Medical (T=1000)	Table 1 reports AGEA edge leakage 80.16% at 1000 queries	Table 1

What To Try In 7 Days

Run a red-team extraction using a novelty-driven agent to measure your system's structured leakage under realistic query budgets.

Enable structured-output controls: block or sanitize machine-readable entity/relation lists in LLM responses.

Add retrieval-time checks or rate limits on repeated hub-focused queries and log novelty-like metrics to detect agentic probing.

Agent Features

Memory

Graph memory (filtered and raw)Query memory (recent queries/responses)

Planning

LoRADegree-based hub selection for exploitation

Tool Use

LLM as query generatorLLM as graph filter agentRegex parser for fast discovery

Frameworks

Closed-loop agent that alternates query generation and filtering

Is Agentic

Yes

Architectures

LLM-based query generatorTwo-stage extraction pipeline (discovery + LLM filter)External graph memory modules

Optimization Features

Token Efficiency

Regex discovery to avoid extra LLM calls

Reproducibility

Code AvailableNo

Data AvailableYes

Open Source StatusPartial

LicenseUnknown

Data URLs

https://github.com/GraphRAG-Bench/GraphRAG-Benchmark https://github.com/JayLZhou/GraphRAG

Risks & Boundaries

Limitations

Relies on the victim producing machine-structured outputs; output-restriction policies can blunt the attack.

Does not model active deployment defenses like query rewriting, monitoring, or rate-limiting.

When Not To Use

If the target system enforces strict output formatting or forbids structured extraction commands.

When deployment includes effective traversal-aware monitoring or strict rate limits.

Failure Modes

Hallucinated hubs: LLM filter may miss widespread spurious connections if prompts are too lenient.

Backbone sensitivity: different LLMs produce large precision differences for relations.

Core Entities

Models

GPT-4o-miniDeepSeek-V3.1text-embedding-3-large

Metrics

Leakage Rate (nodes)Leakage Rate (edges)Precision (nodes)Precision (edges)Degree-weighted leakagePageRank-weighted leakage

Datasets

Medical (NCCN guidelines)Agriculture (Reclaiming Our Food)Novel (20 Project Gutenberg books)Novel 9 (subgraph)Novel 13 (subgraph)

Benchmarks

GraphRAG-Bench

Overview

Trust Signals

Reproducibility

At A Glance

Authors

Links

Why It Matters For Business

Who Should Care

Summary TLDR

Problem Statement

Main Contribution

Key Findings

AGEA recovers a very large fraction of nodes and edges under 1,000 queries on medium graphs.

On LightRAG AGEA achieves even higher coverage and precision.

Results

What To Try In 7 Days

Agent Features

Optimization Features

Reproducibility

Data URLs

Risks & Boundaries

Limitations

When Not To Use

Failure Modes

Core Entities

Models

Metrics

Datasets

Benchmarks

You May Also Want to Read

Chemistry foundation models power structure-focused multimodal RAG inside hierarchical multi-agent workflows

Key finding

Create, customize, and run multi-step LLM agents from plain language — no code needed

Key finding

COMPASS: a multi-agent orchestration that uses RAG and an LLM-as-judge to enforce sovereignty, carbon-awareness, compliance, and ethics in实时

Key finding

AgentAuditor: memory‑augmented RAG + CoT that makes LLM evaluators reach human-level accuracy on agent safety

Key finding

Use multi-agent RAG plus a hybrid vector-graph memory to auto-generate traceable test plans and cases, cutting test-document work by ~85% in

Key finding