Filter
Exclude
Time range
-
Near
🎉 Excited to share that our paper "MemoryBench: A Benchmark for Memory and Continual Learning in LLM Systems" has been accepted as a ✨SPOTLIGHT✨ (top 2.2%) paper at #ICML2026! MemoryBench is the first benchmark to test whether LLMsys is capable of continuely improving itself with user feedback in service time. It covers multiple domains, languages, and types of tasks to evaluate the #ContinualLearning abilities of LLMsys, with a particularly focus on, not just #DeclarativeMemory (e.g., facts in long context), but also #ProceduralMemory (e.g., experience learned from task practice). All the code and data are open-sourced. You can easily try or implement your own methods on MemoryBench. Also, we built a frontend interface so that you can run experiments easily even without a GPU! Feel free to try! 📄 arXiv: arxiv.org/abs/2510.17281 💻 Code: github.com/LittleDinoC/Memor… 📊 Dataset (Small): huggingface.co/datasets/THUI… 🗄️ Dataset (Full): huggingface.co/datasets/THUI…
1
21
127
7,890
#LeahRubin is giving a really interesting talk on #cogaging in the #cART era, focusing on #cognitive heterogeneity & potential #mechanisms in #declarativememory decline in men AND #women . #HPAaxis #neuroinflammation #HIVAgIng
2