Datenschutzerklärung|Data Privacy

Martin Pagel

Mi 20.01.2021 13:00-13:45 Uhr, Virtual Event: "The Case for Learned Spatial Indexes" (Varun Pandey, TU München)

DBMS research colloquium: "The Case for Learned Spatial Indexes" by Varun Pandey, TU München

Spatial data is ubiquitous. Massive amounts of data are generated every day from billions of GPS-enabled devices such as cell phones, cars, sensors, and various consumer based applications such as Uber, location-tagged posts in Facebook, Twitter, Instagram, etc. This exponential growth in spatial data has led the research community to focus on building systems and applications that can process spatial data efficiently. In the meantime, recent research has introduced learned index structures. In this work, we use techniques proposed from a state-of-the art learned multi-dimensional index structure (namely,
Flood) and apply them to five classical multi-dimensional indexes to be able to answer spatial range queries. By tuning each partitioning technique for optimal performance, we show that (i) machine learned search within a partition is faster by 11.79% to 39.51% than binary search when using filtering on one dimension, (ii) the bottleneck for tree structures is index lookup, which could potentially be improved by linearizing the indexed partitions (iii) filtering on one dimension and refining using machine learned indexes is 1.23x to 1.83x times faster than closest competitor which filters on two dimensions, and (iv) learned indexes can have a significant impact on the performance of low selectivity queries while being less effective under higher selectivities.

Varun Pandey is a PhD student in the database group at TU Munich working with Prof. Alfons Kemper and Prof. Thomas Neumann. He was the primary investigator in implementing spatial functionality in the main-memory database system, HyPer, invented at TU Munich which was later sold to Tableau. During his PhD, he primarily worked in the area of in-memory spatial query processing, and also investigated various big spatial data systems and spatial libraries that are used in modern spatial applications (and systems).

Log-in information:
If you are interested in attending the event, please contact!