The Clustering Strikes Back: Building Cost-Effective and High-Performance ANNS at Scale with Helmsman
Yuchen Huang, Baiteng Ma, Yiping Sun, Yang Shi, Xiao Chen, Xiaocheng Zhong, Zhiyong Wang, Yao Hu, Erci Xu, …
arxiv.org/abs/2606.13145 [cs.IR]
💬 Accepted by OSDI'26
CFALR: Collaborative Filtering-Augmented Large Language Model for Personalized Fashion Outfit Recommendation
Yujuan Ding, Junrong Liao, Yunshan Ma, Yi Bin, Wenqi Fan, Tat-Seng Chua, Qing Li
arxiv.org/abs/2606.13001 [cs.IR cs.MM]
Charge as a Construct-Validity Factor in Chinese Legal Case Retrieval: A Cross-Benchmark Audit
Yao Liu, Tien-Ping Tan, Zhilan Liu
arxiv.org/abs/2606.12993 [cs.IR]
Trait, Not State: The Durability of Reading Identity in Social Highlighting
Kazuki Nakayashiki, Keisuke Watanabe
arxiv.org/abs/2606.12904 [cs.IR cs.CL cs.HC cs.SI]
What Limits Does Quantization Place on Dense Top-k Retrieval? A Theoretical Study
Koki Okajima, Tsukasa Yoshida
arxiv.org/abs/2606.11780 [cs.IR cs.AI cs.IT]
The Long Tail, Not the Front Page: Cold-Start Prediction of Crowd Highlight Salience
Kazuki Nakayashiki, Keisuke Watanabe
arxiv.org/abs/2606.11654 [cs.IR cs.CL cs.HC cs.SI]
Effective Reinforcement Learning for Agentic Search by Recycling Zero-Variance Queries During Training
João Coelho, João Magalhães, Bruno Martins, Chenyan Xiong
arxiv.org/abs/2606.10709 [cs.IR cs.AI]