New Databases papers from arxiv.org: database management, datamining, and data processing. Thank you to arXiv for use of its open access interoperability.

Joined November 2010
676 Photos and videos
Evaluating and Generating Query Workloads for High Dimensional Vector Similarity Search Matteo Ceccarello, Alexandra Levchenko, Ioana Ileana, Themis Palpanas arxiv.org/abs/2606.14511 [𝚌𝚜.𝙳𝙱] πŸ’¬This paper appeared in the proceedings of KDD 2025
1
17
PLRTune: Importance Pre-Sampling and LLM-Guided Reinforcement Learning for Automatic Database Tuning Xinyue Yang, Chen Zheng, Yaoyang Hou, Renhao Zhang, Yinyan Zhang, Heng Zhang arxiv.org/abs/2606.14312 [𝚌𝚜.𝙳𝙱]
25
Transforming Shape Schemas with Composable Property-Graph Queries (Extended Version) Philipp Seifer, Daniel HernΓ‘ndez, Ralf LΓ€mmel, Steffen Staab arxiv.org/abs/2606.14309 [𝚌𝚜.𝙳𝙱 𝚌𝚜.𝙰𝙸 𝚌𝚜.𝙻𝙾]
9
WikiKV: Schema-Evolving Path-Indexed Storage for Hierarchical Knowledge Navigation Feifei Li, Haoliang Ming, Zihan Li, Hang Liao, Xingyu Fan, Xiaoqing Wu, Chenggong Wang, Wenhui Que arxiv.org/abs/2606.14275 [𝚌𝚜.𝙳𝙱]
15
TACO: A Benchmark for Open-Domain Text-to-SQL with Ambiguous and Cross-Database Queries Chao Deng, Ju Fan, Yuyu Luo, Qinliang Xue, Meihao Fan, Yuxin Zhang, Min Zhang, Xiaofeng Jia, Jing Zhang, Xiaoyong Du arxiv.org/abs/2606.14201 [𝚌𝚜.𝙳𝙱]
40
Revisiting Filtered ANN Benchmarks: A Hardness-Controlled Benchmark Generator for Realistic Evaluation Mintaek Lim, Dogeun Kim, Minwoo Kim, Jaeyoung Do arxiv.org/abs/2606.14193 [𝚌𝚜.𝙳𝙱]
22
Vivace: Exact Temporal OLAP over Interval Histories via Independent Serverless Execution Woohyeok Park, Taeyoon Kim, Hyunjoon Kim, Kungyong Lee arxiv.org/abs/2606.14069 [𝚌𝚜.𝙳𝙱 𝚌𝚜.𝙳𝙲]
12
Towards an open registry of Earth observation instruments David Montero, CΓ©sar Aybar, Miguel D. Mahecha, Luis GΓ³mez-Chova arxiv.org/abs/2606.13923 [𝚌𝚜.𝙳𝙱 𝚌𝚜.𝙳𝙻]
12
TAHOE: Text-to-SQL with Automated Hint Optimization from Experience Zhiyi Chen, Jie Song, Peng Li arxiv.org/abs/2606.12387 [𝚌𝚜.𝙳𝙱 𝚌𝚜.𝙰𝙸]
42
Neuro-Relational Programs: Unifying Queries and Neural Computation over Structured Data Arie Soeteman, Balder ten Cate, Maurice Funk, Benny Kimelfeld, Carsten Lutz, Moritz SchΓΆnherr arxiv.org/abs/2606.11946 [𝚌𝚜.𝙳𝙱 𝚌𝚜.𝙲𝙲 𝚌𝚜.𝙻𝙢 𝚌𝚜.𝙻𝙾]
1
2
42
Efficient Graph Indexing for Interval-Aware Vector Search Siyuan Liang, Ziqi Yin, Qi Zhang, Ronghua Li, Guoren Wang, Kaiwen Xue, Daiyin Wang, Xubin Li arxiv.org/abs/2606.11789 [𝚌𝚜.𝙳𝙱]
1
79
Querying Cohesive Subgraph regarding Span-Constrained Triangles on Temporal Graphs with Dynamic Index Maintenance Chuhan Hu, Ming Zhong, Lei Li arxiv.org/abs/2606.11582 [𝚌𝚜.𝙳𝙱 𝚌𝚜.πš‚π™Έ]
1
21
LLMs Graphs: Toward Graph-Native, Synergistic AI Systems Arijit Khan, Longxu Sun, Xin Huang arxiv.org/abs/2606.11560 [𝚌𝚜.𝙳𝙱 𝚌𝚜.𝙰𝙸] πŸ’¬Accepted at PAKDD 2066 Tutorial
45
Provenance Tracking in AI Compilers through the Lens of Coalgebra Zilu Tian, Liying Liu arxiv.org/abs/2606.10937 [𝚌𝚜.𝙳𝙱 𝚌𝚜.𝙰𝙸]
1
32
Reconstructing OPC UA Address Spaces from Time-Series Databases Lukas LΓΌrzer, Hannes Unger, Stefan Huber arxiv.org/abs/2606.10663 [𝚌𝚜.𝙳𝙱] πŸ’¬Accepted at AI4IP 2026 (workshop at DEXA2026)
1
20
Determination Provenance: From Ambiguity to Algebra Joseph M. Hellerstein arxiv.org/abs/2606.10270 [𝚌𝚜.𝙳𝙱 𝚌𝚜.𝙳𝙲 𝚌𝚜.𝙻𝙾]
1
13
TSseek: Regular Expression-Based Similarity Search for Distributed Time Series Datasets Xiaoshuai Li, Khalid Alnuaim, Mohamed Y. Eltabakh, Elke A. Rundensteiner arxiv.org/abs/2606.09824 [𝚌𝚜.𝙳𝙱]
1
15
ArtiFact: A Large-Scale Multi-Modal Cultural Heritage Dataset Luciano Duarte, Olga Ovcharenko, Sebastian Schelter arxiv.org/abs/2606.09648 [𝚌𝚜.𝙳𝙱 𝚌𝚜.𝙰𝙸]
16
AeroMesa: Efficient Data Management System for Multi-Dimensional Spatio-Temporal Trajectories Yue Zhang, Zizhong Ding, Lin Sun, Haopeng Chen, Yan Jiao, Yongming Xu arxiv.org/abs/2606.09581 [𝚌𝚜.𝙳𝙱]
89
InquiTree: Evaluating AI Agents in the Scientific Inquiry Loop with Paper-Derived Research Trees Shaoyang Cui arxiv.org/abs/2606.09550 [𝚌𝚜.𝙳𝙱]
18