Data Management and Algorithms Open access

Evaluating Learned Spatial Indexes

Michael Mathioudakis, Jie Yang, Sachith Pai

arXiv (Cornell University) | Jun 17, 2026

Abstract

Learned indexes improve query performance by adapting search structures to data and workload distributions. Although many learned indexes have been proposed, their trade-offs remain insufficiently understood for spatial range queries, where performance depends not only on model accuracy but also on data and query skew, layout granularity, selectivity, and storage behavior. In this work, we perform an experimental study of learned indexes for spatial range queries. We examine a representative set of indexes and address seven fundamental questions: (1) How does block size influence query latency, and what configurations yield optimal performance under varying selectivities? (2) How do skewed data and query distributions impact index performance? (3) How do indexes balance refinement and scan costs, and which designs favor one over the other? (4) How do disk-based storage conditions alter optimal block size and latency trade-offs compared to in-memory settings? (5) What are the construction costs of different indexes, and under what query volumes are these costs amortized? (6) For a given data and query workload, which index is expected to perform best? (7) Do index-selection insights learned from synthetic data generalize to real-world data distributions? To enable the analysis, we use a framework with a common storage backend, standardized query execution pipelines, and controlled variations in data and query skew. Our experiments reveal critical insights into refinement vs. scan trade-offs, the impact of block size, and the interplay between selectivity and layout effectiveness. We synthesize these findings into a workload-based decision tree for index selection and validate it on real OpenStreetMap point sets with synthetic queries, confirming that its recommendations exhibit minimal decision regret and typically yield near-optimal query performance.
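Questions (5) and (6) in the abstract rest on simple arithmetic that a short sketch can make concrete. The Python below is illustrative only: the function names, the index names, the cost figures, and the relative form of regret are assumptions made for this example, not definitions or results taken from the paper.

```python
import math


def break_even_queries(extra_build_cost, per_query_saving):
    """Number of queries after which a costlier-to-build index pays off.

    extra_build_cost: additional construction time versus a baseline index.
    per_query_saving: time saved per query versus that baseline.
    Returns None when there is no per-query saving, so the cost is never amortized.
    """
    if per_query_saving <= 0:
        return None
    return math.ceil(extra_build_cost / per_query_saving)


def decision_regret(latencies, chosen):
    """Relative regret of picking `chosen` instead of the fastest index.

    latencies: mapping from index name to measured query latency.
    This relative form is one common definition, assumed here.
    """
    best = min(latencies.values())
    return (latencies[chosen] - best) / best


# Hypothetical numbers: 120 s of extra build time, 0.5 s saved per query.
print(break_even_queries(120.0, 0.5))  # 240

# Hypothetical latencies (ms) for two made-up index names.
measured = {"index_a": 2.0, "index_b": 2.5}
print(decision_regret(measured, "index_b"))  # 0.25
```

A regret of 0 means the recommended index was also the fastest one measured; small positive values correspond to the near-optimal choices the abstract describes.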

Authors

Researchers on this paper

Michael Mathioudakis

Last author | ORCID 0000-0003-0074-3966

Jie Yang

Middle author | ORCID 0000-0003-4801-7162

Sachith Pai

First author | ORCID 0009-0004-5128-7424

Research areas

Advanced Database Systems and Queries

Geographic Information Systems Studies

Data Management and Algorithms

Citation

BibTeX

@article{Mathioudakis2026Evaluating,
  title = {Evaluating Learned Spatial Indexes},
  author = {Michael Mathioudakis and Jie Yang and Sachith Pai},
  journal = {arXiv (Cornell University)},
  year = {2026},
  doi = {10.48550/arxiv.2606.19034},
  url = {https://doi.org/10.48550/arxiv.2606.19034}
}
