Overview
The idea is practical and improves zero-shot metrics, but high LLM API cost, hallucination risk, and limited large-scale tests reduce production readiness.
Citations1
Evidence Strength0.65
Confidence0.87
Risk Signals11
Trust Signals
Findings with numeric evidence: 3/3
Findings with evidence refs: 3/3
Results with explicit delta: 3/3
Reproducibility
Status: Code + data available
Open source: Partial
At A Glance
Cost impact: 30%
Production readiness: 50%
Novelty: 70%
Why It Matters For Business
AgentMove improves zero-shot location ranking across cities without retraining a local model, so companies can prototype personalized recommendations or prefetching where labeled local mobility data is scarce.
Who Should Care
Summary TLDR
AgentMove is an LLM-based agent pipeline for next-location prediction. It splits the task into three modules: a spatial-temporal memory that stores short- and long-term user behavior, a world-knowledge generator that asks an LLM to propose multi-scale urban candidates using text addresses (via OpenStreetMap), and a collective extractor that builds a global location-transition graph (NetworkX) to leverage other users' patterns. Combined results are post-processed by an LLM to output ranked location candidates. On two datasets (Foursquare + ISP) across 12 cities, AgentMove beats prior baselines on 8/12 metrics with reported improvements of about 3.3%–8.6% in many cases, shows more stable cross
Problem Statement
Deep models need lots of local mobility data and do not generalize zero-shot. LLMs can reason zero-shot but prior LLM approaches produce final outputs directly and miss domain structure: individual memory, urban structure, and population-level patterns. The paper asks: can a structured LLM agent that adds memory, explicit world knowledge, and a graph-based social view improve zero-shot next-location prediction?
Main Contribution
AgentMove: an LLM-based agentic pipeline that decomposes next-location prediction into three subproblems and recombines their outputs.
Spatial-temporal memory module with short-term, long-term memory and a dynamic user profile to capture individual habits.
Key Findings
AgentMove wins most metrics versus baselines in zero-shot tests
Combining modules yields the largest gains
Results
| Metric | Value | Baseline | Delta | Split / Dataset | Evidence | Evidence Ref |
|---|---|---|---|---|---|---|
| Acc@1 (FSQ@Tokyo) | AgentMove 0.185 | GETNext 0.205 | -9.76% | FSQ@Tokyo | Table 1 shows per-city metrics | Table 1 |
| Acc@5 (FSQ@Tokyo) | AgentMove 0.465 | GETNext 0.450 | +3.33% | FSQ@Tokyo | Table 1 row 'AgentMove' | Table 1 |
What To Try In 7 Days
Build a lightweight memory store (short + long) for user sessions and generate simple profile prompts for an LLM.
Use OpenStreetMap reverse-geocoding to attach text addresses to POIs and prompt an LLM for multi-scale candidate POIs.
Construct a small location-transition graph (NetworkX) from historical logs and query k-hop neighbors as extra candidates.
Agent Features
Memory
Planning
Tool Use
Frameworks
Is Agentic
Yes
Architectures
Collaboration
Reproducibility
Data URLs
Risks & Boundaries
Limitations
LLM hallucination: generated addresses/POIs can be false and need verification.
High inference cost: LLM API calls make large-scale deployment expensive.
When Not To Use
Real-time, high-throughput systems where LLM latency and cost are prohibitive.
Applications that require provable location accuracy without external validation.
Failure Modes
Hallucinated candidate POIs leading to invalid predictions.
Sparse or mismatched address lookups causing poor world-alignment.

