Use an LLM-based agent with memory, world knowledge, and a graph tool to improve zero-shot next-location prediction

Overview

Decision Snapshot: Needs Validation

The idea is practical and improves zero-shot metrics, but high LLM API cost, hallucination risk, and limited large-scale tests reduce production readiness.

Citations: 1

Evidence Strength: 0.65

Confidence: 0.87

Risk Signals: 11

Trust Signals

Findings with numeric evidence: 3/3

Findings with evidence refs: 3/3

Results with explicit delta: 3/3

Reproducibility

Status: Code + data available

Open source: Partial

At A Glance

Cost impact: 30%

Production readiness: 50%

Novelty: 70%

Authors

Jie Feng, Yuwei Du, Jie Zhao, Yong Li

Links

Abstract / PDF / Code / Data

Why It Matters For Business

AgentMove improves zero-shot location ranking across cities without retraining a local model, so companies can prototype personalized recommendations or prefetching where labeled local mobility data is scarce.

Who Should Care

ML Engineer, Product Manager, Data Scientist, CTO, Engineering Lead

Summary TLDR

AgentMove is an LLM-based agent pipeline for next-location prediction. It splits the task into three modules: a spatial-temporal memory that stores short- and long-term user behavior, a world-knowledge generator that asks an LLM to propose multi-scale urban candidates using text addresses (via OpenStreetMap), and a collective extractor that builds a global location-transition graph (NetworkX) to leverage other users' patterns. Combined results are post-processed by an LLM to output ranked location candidates. On two datasets (Foursquare + ISP) across 12 cities, AgentMove beats prior baselines on 8/12 metrics with reported improvements of about 3.3%–8.6% in many cases, shows more stable cross

Problem Statement

Deep models need lots of local mobility data and do not generalize zero-shot. LLMs can reason zero-shot but prior LLM approaches produce final outputs directly and miss domain structure: individual memory, urban structure, and population-level patterns. The paper asks: can a structured LLM agent that adds memory, explicit world knowledge, and a graph-based social view improve zero-shot next-location prediction?

Main Contribution

AgentMove: an LLM-based agentic pipeline that decomposes next-location prediction into three subproblems and recombines their outputs.

Spatial-temporal memory module with short-term, long-term memory and a dynamic user profile to capture individual habits.
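The memory module described above can be pictured with a minimal sketch. It assumes a simple in-process store; the names `Visit`, `SpatialTemporalMemory`, `short_window`, and the profile text format are illustrative assumptions, not the paper's implementation.

```python
from collections import Counter, deque
from dataclasses import dataclass, field


@dataclass
class Visit:
    poi_id: str
    hour: int  # local hour of day, 0-23


@dataclass
class SpatialTemporalMemory:
    """Hypothetical per-user memory: recent visits plus long-term counts."""
    short_window: int = 5  # assumed size of the short-term window
    short_term: deque = field(default_factory=deque)
    long_term: Counter = field(default_factory=Counter)
    hour_counts: Counter = field(default_factory=Counter)

    def add(self, visit: Visit) -> None:
        # Short-term memory keeps only the most recent visits.
        self.short_term.append(visit)
        if len(self.short_term) > self.short_window:
            self.short_term.popleft()
        # Long-term memory accumulates visit and hour-of-day counts.
        self.long_term[visit.poi_id] += 1
        self.hour_counts[visit.hour] += 1

    def profile(self, top_k: int = 3) -> str:
        """Render a plain-text summary that could be placed in an LLM prompt."""
        top = [poi for poi, _ in self.long_term.most_common(top_k)]
        recent = [v.poi_id for v in self.short_term]
        busiest = self.hour_counts.most_common(1)
        hour = busiest[0][0] if busiest else None
        return (f"Frequent places: {top}. Recent visits: {recent}. "
                f"Most active hour: {hour}.")
```

Keeping the profile as plain text is a design choice for this sketch: it lets the same summary be dropped into any prompt template without a schema.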

Key Findings

AgentMove wins most metrics versus baselines in zero-shot tests

Numbers: Best results in 8 of 12 metrics; improvements range 3.33%–8.57%

Practical Use: Use the agent pipeline to improve zero-shot next-location ranking in place-recommendation and mobility tasks without retraining local models.

Evidence Ref: Abstract, Table 1

Combining modules yields the largest gains

Numbers: +7% to +45% vs base prompt in select metrics

Practical Use: Implement memory + world knowledge + collective graph together rather than only one module for best zero-shot accuracy.

Evidence Ref: Table 2 (ablation)

Results

Metric	Value	Baseline	Delta	Split / Dataset	Evidence	Evidence Ref
Acc@1 (FSQ@Tokyo)	AgentMove 0.185	GETNext 0.205	-9.76%	FSQ@Tokyo	Table 1 shows per-city metrics	Table 1
Acc@5 (FSQ@Tokyo)	AgentMove 0.465	GETNext 0.450	+3.33%	FSQ@Tokyo	Table 1 row 'AgentMove'	Table 1
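
The Delta column appears to be the percent change of the AgentMove value relative to the baseline. The short check below reproduces both rows under that assumption; the function name `relative_delta` is illustrative.

```python
def relative_delta(value: float, baseline: float) -> float:
    """Percent change of value relative to baseline."""
    return (value - baseline) / baseline * 100.0


# Acc@1 on FSQ@Tokyo: AgentMove 0.185 vs GETNext 0.205
print(round(relative_delta(0.185, 0.205), 2))  # -9.76
# Acc@5 on FSQ@Tokyo: AgentMove 0.465 vs GETNext 0.450
print(round(relative_delta(0.465, 0.450), 2))  # 3.33
```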

What To Try In 7 Days

Build a lightweight memory store (short + long) for user sessions and generate simple profile prompts for an LLM.

Use OpenStreetMap reverse-geocoding to attach text addresses to POIs and prompt an LLM for multi-scale candidate POIs.

Construct a small location-transition graph (NetworkX) from historical logs and query k-hop neighbors as extra candidates.
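
The graph step can be tried without any dependencies. The sketch below uses plain dictionaries instead of NetworkX, and the function names and the frequency-first ordering are assumptions made for illustration. With NetworkX, a similar result could be obtained from a `DiGraph` and `single_source_shortest_path_length` with a `cutoff`.

```python
from collections import defaultdict, deque


def build_transition_graph(trajectories):
    """Count directed transitions between consecutive locations."""
    graph = defaultdict(lambda: defaultdict(int))
    for traj in trajectories:
        for src, dst in zip(traj, traj[1:]):
            graph[src][dst] += 1
    return graph


def k_hop_neighbors(graph, source, k):
    """Return locations reachable from source in 1..k transitions (BFS)."""
    seen = {source}
    frontier = deque([(source, 0)])
    reached = []
    while frontier:
        node, depth = frontier.popleft()
        if depth == k:
            continue  # do not expand beyond k hops
        # Expand the most frequent transitions first; ties break by name.
        edges = sorted(graph.get(node, {}).items(), key=lambda kv: (-kv[1], kv[0]))
        for nxt, _count in edges:
            if nxt not in seen:
                seen.add(nxt)
                reached.append(nxt)
                frontier.append((nxt, depth + 1))
    return reached


# Toy trajectories from three hypothetical users.
trajs = [["home", "cafe", "office"], ["home", "cafe", "gym"], ["home", "office"]]
g = build_transition_graph(trajs)
print(k_hop_neighbors(g, "home", 2))  # ['cafe', 'office', 'gym']
```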

Agent Features

Memory

short-term memory, long-term memory, user profile summaries

Planning

manual task decomposition into 3 subtasks; final reasoning step that merges outputs

Tool Use

NetworkX (graph queries); OpenStreetMap (address lookup); LLM APIs (GPT4o-mini, Llama variants)
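
For the address-lookup tool, one possible route is the public Nominatim reverse-geocoding endpoint for OpenStreetMap data. Whether the paper uses Nominatim specifically is an assumption here, as are the helper names and the subset of address keys. The sketch only builds the request URL and formats an address; it sends no request. The public Nominatim service has a usage policy (rate limits, identifying User-Agent) that should be checked before calling it.

```python
from urllib.parse import urlencode

NOMINATIM_REVERSE = "https://nominatim.openstreetmap.org/reverse"


def reverse_geocode_url(lat: float, lon: float) -> str:
    """Build a Nominatim reverse-geocoding request URL (no request is sent)."""
    query = urlencode({"lat": lat, "lon": lon, "format": "jsonv2"})
    return f"{NOMINATIM_REVERSE}?{query}"


def address_text(address: dict) -> str:
    """Join whichever address levels are present, from fine to coarse."""
    levels = ["road", "suburb", "city", "country"]  # assumed subset of keys
    return ", ".join(address[k] for k in levels if k in address)


print(reverse_geocode_url(35.68, 139.76))
# Example address dict shaped like the "address" field of a reverse response.
print(address_text({"road": "Chuo-dori", "city": "Tokyo", "country": "Japan"}))
```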

Frameworks

agent pipeline (planning, memory, tools, reasoning)

Is Agentic

Yes

Architectures

LLM-based agent pipeline; modular decomposition (memory / world / collective / reasoning)

Collaboration

aggregated population graph for shared patterns

Reproducibility

Code Available: Yes

Data Available: Yes

Open Source Status: Partial

License: Unknown

Code URLs

https://github.com/tsinghua-fib-lab/AgentMove

Data URLs

Foursquare (public LBSN dataset, Yang et al. 2016)

https://github.com/vonfeng/DPLink/tree/master/data (ISP GPS / DPLink reference)

Risks & Boundaries

Limitations

LLM hallucination: generated addresses/POIs can be false and need verification.

High inference cost: LLM API calls make large-scale deployment expensive.

When Not To Use

Real-time, high-throughput systems where LLM latency and cost are prohibitive.

Applications that require provable location accuracy without external validation.

Failure Modes

Hallucinated candidate POIs leading to invalid predictions.

Sparse or mismatched address lookups causing poor world-alignment.
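
One simple guard against hallucinated candidates is to keep only predictions that match a known POI catalogue. The sketch below assumes POIs are identified by string IDs and that a catalogue set is available; the function name and `top_k` default are illustrative, and this is not described in the source as part of AgentMove.

```python
def filter_valid_candidates(predicted, known_pois, top_k=5):
    """Keep predicted IDs found in known_pois, preserving order, dropping repeats."""
    seen = set()
    valid = []
    for poi in predicted:
        if poi in known_pois and poi not in seen:
            seen.add(poi)
            valid.append(poi)
    return valid[:top_k]


catalogue = {"poi_1", "poi_2", "poi_3"}  # hypothetical known POI IDs
llm_output = ["poi_2", "poi_99", "poi_2", "poi_1"]  # "poi_99" is invented
print(filter_valid_candidates(llm_output, catalogue))  # ['poi_2', 'poi_1']
```

Logging the share of dropped candidates per request gives a rough hallucination rate to monitor.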

Core Entities

Models

AgentMove, LLM-Move, LLM-ZS, LLM-Mob, GETNext, STHGCN, DeepMove

Metrics

Accuracy, NDCG@5
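
For reference, the two metrics can be computed per prediction as follows. This sketch assumes the common single-ground-truth definitions (one true next location per query), which may differ in detail from the paper's evaluation code.

```python
import math


def acc_at_k(ranked, truth, k):
    """1.0 if the true location is among the top-k ranked candidates, else 0.0."""
    return 1.0 if truth in ranked[:k] else 0.0


def ndcg_at_k(ranked, truth, k):
    """With one relevant item the ideal DCG is 1, so NDCG = 1 / log2(rank + 1)."""
    for rank, item in enumerate(ranked[:k], start=1):
        if item == truth:
            return 1.0 / math.log2(rank + 1)
    return 0.0


ranked = ["cafe", "gym", "office", "park", "mall"]
print(acc_at_k(ranked, "office", 1))   # 0.0
print(acc_at_k(ranked, "office", 5))   # 1.0
print(ndcg_at_k(ranked, "office", 5))  # 0.5
```

Dataset-level scores are the mean of these per-prediction values.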

Datasets

Foursquare (Yang et al. 2016)

ISP GPS trajectory (Shanghai) (Feng et al. 2019)

Benchmarks

Zero-shot next-location prediction on 12-city split (Foursquare) and ISP Shanghai

