PapersAnon

PapersAnon

336 Photos and videos

Tweets

Pinned Tweet

PapersAnon @papers_anon

24 Jun 2024

rentry.org/LocalModelsLinks Various links for ML and local models (not just LLMs) that's kept fairly updated. rentry.org/LocalModelsPapers ML papers I've read that I think are interesting. Also keep a text file at the top of all the abstracts for easy searching.

Local Models Related Links

/lmg/ Accelerate Guides Quick Start Guide Anon's tutorial for getting models running locally SillyTavern Guide Instructions for roleplaying via koboldcpp. Additional GNBF grammar usage LM Tuning...

rentry.org

142

25,614

PapersAnon

PapersAnon @papers_anon

May 2

ClaudePlaysPokemon dev is back with Opus 4.7 Def favorite vibe check given the very minimal harness. twitch.tv/claudeplayspokemon

385

PapersAnon

PapersAnon @papers_anon

May 16

Claude has for the first time made it through Victory Road. All hints were removed for this latest iteration as well. Also for the first time caught a legendary (Articuno)

185

PapersAnon

PapersAnon @papers_anon

Apr 3

Woosh: A Sound Effects Foundation Model From Sony AI. Optimized for sound effects with a high-quality audio encoder/decoder model, a text-audio alignment model for conditioning, as well as a text-to-audio and video-to-audio generative models. Links below

461

PapersAnon

PapersAnon @papers_anon

Apr 3

arxiv.org/abs/2604.01929 github.com/SonyResearch/Woos… Repo isn't live yet sonyresearch.github.io/Woosh… sonyresearch.github.io/Woosh… sonyresearch.github.io/Woosh… Examples Some interesting papers I keep updated rentry.org/LocalModelsPapers

300

PapersAnon

PapersAnon @papers_anon

Mar 19

Came across what could be an interesting benchmark. Old famicom game called Radical Bomber: Jurai-Kun. Asymmetrical boardgame with 1 runner and 4 chasers. Runner has the ability to bomb certain connections and limited double turns. Some special blocks too. youtube.com/watch?v=A8mPtwdT…

371

PapersAnon

PapersAnon @papers_anon

Mar 17

VoXtream2: Full-stream TTS with dynamic speaking rate control Combines a distribution matching mechanism over duration states with CFG across conditioning signals to improve controllability and synthesis quality. Runs 4 times faster than real time on a consumer GPU. Links below

934

PapersAnon

PapersAnon @papers_anon

Mar 17

arxiv.org/abs/2603.13518 herimor.github.io/voxtream2/ Page not live yet huggingface.co/herimor Will probably be posted here Some interesting papers I keep updated rentry.org/LocalModelsPapers

268

PapersAnon

PapersAnon @papers_anon

Mar 4

Speculative Speculative Decoding Draft model predicts likely verification outcomes and prepares speculations pre-emptively for them. If the actual verification outcome is in the predicted set, a speculation can be returned immediately, eliminating drafting overhead. Links below

2,404

PapersAnon

PapersAnon @papers_anon

Mar 4

arxiv.org/abs/2603.03251 github.com/tanishqkumar/ssd Repo isn't live yet Some interesting papers I keep updated rentry.org/LocalModelsPapers

Speculative Speculative Decoding

Autoregressive decoding is bottlenecked by its sequential nature. Speculative decoding has become a standard way to accelerate inference by using a fast draft model to predict upcoming tokens from...

arxiv.org

388

PapersAnon

PapersAnon @papers_anon

Mar 3

Multi-Head Low-Rank Attention Novel attention mechanism with native 4-way tensor parallelism support. At 2.9B scale achieves SOTA performance on perplexity and zero-shot common-sense reasoning benchmarks. 2.8× decoding speedup over MLA. Links below

126

12,808

PapersAnon

PapersAnon @papers_anon

Mar 3

arxiv.org/abs/2603.02188 github.com/SongtaoLiu0823/ML… huggingface.co/Soughing/MLRA Some interesting papers I keep updated rentry.org/LocalModelsPapers

756

PapersAnon

PapersAnon @papers_anon

Feb 25

Aletheia tackles FirstProof autonomously From Deepmind. Autonomously solved 6 problems (2, 5, 7, 8, 9, 10) out of 10 according to majority expert assessments; notes that experts were not unanimous on Problem 8 (only). Links below

511

PapersAnon

PapersAnon @papers_anon

Feb 25

arxiv.org/abs/2602.21201 github.com/google-deepmind/s… arxiv.org/abs/2602.05192 FirstProof challenge paper daniellitt.com/blog/2026/2/2… Interesting article about FirstProof Some interesting papers I keep updated rentry.org/LocalModelsPapers

339

PapersAnon

PapersAnon @papers_anon

Feb 20

Adam Improves Muon: Adaptive Moment Estimation with Orthogonalized Momentum Scales orthogonalized momentum using a single adaptive stepsize, preserving orthogonality while improving upon Muon at negligible additional cost. Links below

101

8,253

PapersAnon

PapersAnon @papers_anon

Feb 20

arxiv.org/abs/2602.17080 github.com/minxin-zhg/namo Some interesting papers I keep updated rentry.org/LocalModelsPapers

1,166

PapersAnon

PapersAnon @papers_anon

Feb 13

HiFloat4 Format for Language Model Inference Packs 64 4-bit elements with 32 bits of shared scaling metadata, averaging 4.5 bits per value. Achieves higher average accuracy than the state-of-the-art NVFP4 format across multiple models and diverse downstream tasks. Links below

1,713

PapersAnon

PapersAnon @papers_anon

Feb 13

arxiv.org/abs/2602.11287 Some interesting papers I keep updated rentry.org/LocalModelsPapers

467

PapersAnon

PapersAnon @papers_anon

Feb 12

MoEEdit: Efficient and Routing-Stable Knowledge Editing for Mixture-of-Experts LLMs Reparameterizes expert updates via per-expert null-space projections that keep router inputs invariant and thereby suppress routing shifts. Links below

523

PapersAnon

PapersAnon @papers_anon

Feb 12

arxiv.org/abs/2602.10965 github.com/Terence-Gu/MoEEdi… Some interesting papers I keep updated rentry.org/LocalModelsPapers

243

PapersAnon

PapersAnon @papers_anon

Feb 9

DreamDojo: A Generalist Robot World Model from Large-Scale Human Videos From Nvidia. Foundation world model that learns diverse interactions and dexterous controls from 44k hours of egocentric human videos. Links below

682

PapersAnon

PapersAnon @papers_anon

Feb 9

arxiv.org/abs/2602.06949 dreamdojo-world.github.io/ Code link not live yet Some interesting papers I keep updated rentry.org/LocalModelsPapers

326