Survey: using graph structure to make RAG more precise, concise, and context-aware

Overview

Decision SnapshotNeeds Validation

GraphRAG is a rapidly maturing approach with practical demos in industry; expect moderate engineering cost for graph construction and scalable retrieval, and clear gains in tasks where relations matter.

Citations22

Evidence Strength0.60

Confidence0.80

Risk Signals9

Trust Signals

Findings with numeric evidence: 6/6

Findings with evidence refs: 6/6

Results with explicit delta: 0/0

Reproducibility

Status: Partial assets available

Open source: Partial

At A Glance

Cost impact: 60%

Production readiness: 40%

Novelty: 60%

Authors

Boci Peng, Yun Zhu, Yongchao Liu, Xiaohe Bo, Haizhou Shi, Chuntao Hong, Yan Zhang, Siliang Tang

Links

Abstract / PDF / Code

Why It Matters For Business

GraphRAG injects relational facts into LLM outputs, reducing hallucination and shortening input prompts; this improves accuracy for QA, search, and domain workflows while leveraging existing graph databases.

Who Should Care

CTO ML Engineer Product Manager Data Scientist Founder

Summary TLDR

This paper is the first systematic survey of Graph Retrieval-Augmented Generation (GraphRAG). GraphRAG extends text-based RAG by indexing and retrieving graph elements (nodes, triples, paths, subgraphs) and converting them into formats LMs can consume. The authors organize research into three stages—G-Indexing, G-Retrieval, G-Generation—cover core methods (graph/text/vector/hybrid indexing; non-parametric/LM/GNN retrievers; graph languages and embeddings; hybrid GNN+LM generators), benchmarks, industry systems, and open challenges like scalable retrieval, dynamic graphs, multimodality, and context compression. The repo link and many literature pointers are provided for quick follow-up.

Problem Statement

Text-only RAG misses structured relations, produces redundant long context, and struggles to capture global relational context. GraphRAG aims to retrieve structured graph elements to supply relational knowledge, reduce verbosity, and improvefaithful, context-aware generation.

Main Contribution

First comprehensive survey of GraphRAG methods and applications.

Formalizes GraphRAG pipeline into three stages: G-Indexing, G-Retrieval, G-Generation.

Key Findings

GraphRAG workflow decomposes into three repeatable stages: Graph-Based Indexing, Graph-Guided Retrieval, and Graph-Enhanced Generation.

Numbers3 stages

Practical UseWhen building GraphRAG, split work into indexing, retrieval, and generation modules and optimize each independently.

Evidence RefSections 4, 5-7

Retrieval granularity strongly affects trade-offs: nodes/triples are fast but narrow; paths/subgraphs capture richer context but explode combinatorially.

Numbersgranularities: nodes, triplets, paths, subgraphs

Practical UseStart with node/triplet retrieval for latency-critical apps; use path/subgraph retrieval for multi-hop reasoning and accept higher compute cost.

Evidence RefSection 6.3

What To Try In 7 Days

Prototype a small text-attributed graph from internal docs and index it in Neo4j.

Implement a two-stage retriever: fast non-parametric seed (BFS/PCST) + LM or cross-encoder reranker.

Feed concise graph language summaries (edge table or node sequences) to an LLM prompt and compare accuracy vs. text-only RAG.

Agent Features

Memory

Graph database as long-term structured memoryCommunity summaries as compressed global context

Planning

LLM-generated reasoning pathsAgentic hop prediction to stop retrieval

Tool Use

Graph DB traversal (BFS/DFS)SPARQL/Cypher queriesLLM function-calling for retrieval

Frameworks

LlamaIndexLangChainNeo4j + NaLLMNebulaGraph GraphRAG

Architectures

GNN + LM (hybrid)GNN cascaded into LMParallel GNN + LM fusion

Collaboration

GNN encoders working with LMsMulti-stage retriever + reranker pipelines

Optimization Features

Token Efficiency

Summaries/community reports to shorten promptsGraph languages (adjacency/edge tables) to compress structure

Infra Optimization

Use vector indices (LSH) for fast nearest-neighbor lookupLeverage graph DB traversal for structural queries

Model Optimization

Prompt/prefix tuning for LMsLightweight GNN variants

System Optimization

Hybrid indexing (graph + vector + text) for latency/recall trade-offs

Training Optimization

Distant supervision for retriever pathsContrastive pretraining for passage/subgraph embeddings

Inference Optimization

Multi-stage retrieval to reduce LM callsConstrained decoding for valid KB queries

Reproducibility

Code AvailableYes

Data AvailableNo

Open Source StatusPartial

LicenseUnknown

Code URLs

https://github.com/pengboci/GraphRAG-Survey https://github.com/microsoft/graphrag https://github.com/eosphoros-ai/DB-GPT https://github.com/neo4j/NaLLM

Risks & Boundaries

Limitations

Most methods tested on small graphs; large-scale industrial graphs remain challenging.

Converting graphs into LM-input formats can produce long contexts that LMs mishandle.

When Not To Use

When your problem is pure text retrieval with no relational reasoning needs.

When you cannot afford graph construction and maintenance costs.

Failure Modes

Over-retrieval: returning huge subgraphs that exceed token limits and confuse the LM.

Poor retrieval precision when entity linking is noisy, causing wrong evidence to drive answers.

Core Entities

Models

GPT-4GPT-3LLaMALLaMA2Qwen2RoBERTaBERTSentenceBERTGCNGATGraphSAGEGraph TransformerGreaseLMENGINEG-RetrieverGNN-RAGGRAGKG-GPT

Metrics

Exact Match (EM)F1AccuracyRecallMRRHits@KBERTScoreGPT4ScoreBLEUROUGE-LNDCG@K

Datasets

GRBENCHGraphQAWebQSPWebQuestionsCWQGrailQACSQAConceptNetWikidataFreebaseDBpediaCMeKGCPubMed-KGHotpotQASTaRKCRAG

Benchmarks

GRBENCHSTaRKCRAG

Overview

Trust Signals

Reproducibility

At A Glance

Authors

Links

Why It Matters For Business

Who Should Care

Summary TLDR

Problem Statement

Main Contribution

Key Findings

GraphRAG workflow decomposes into three repeatable stages: Graph-Based Indexing, Graph-Guided Retrieval, and Graph-Enhanced Generation.

Retrieval granularity strongly affects trade-offs: nodes/triples are fast but narrow; paths/subgraphs capture richer context but explode combinatorially.

What To Try In 7 Days

Agent Features

Optimization Features

Reproducibility

Code URLs

Risks & Boundaries

Limitations

When Not To Use

Failure Modes

Core Entities

Models

Metrics

Datasets

Benchmarks

You May Also Want to Read

Turn an LLM output into a mini knowledge graph, check each fact with an NLI model, and get explainable hallucination flags

Key finding

Combine LLMs with a medical knowledge graph to get more accurate, verifiable scientific answers

Key finding

Use a personal causal graph so an LLM recommends foods that better lower your post-meal glucose

Key finding

A practical survey showing how knowledge graphs can make LLMs better at complex question answering

Key finding

MindMap: prompt LLMs with knowledge-graph evidence to produce explicit graph-style reasoning and reduce hallucination

Key finding