alphaXiv

alphaXiv

1,029 Photos and videos

Tweets

Pinned Tweet

alphaXiv

@askalphaxiv

May 12

Reinforcing Recursive Language Models Can a 4B model learn to recursively call itself to answer hard long-context questions? We RL fine-tuned a small model to behave as a native RLM. On evidence selection across scientific papers, our 4B RLM matches Sonnet 4.6 in quality while running significantly faster and cheaper.

484

68,778

alphaXiv

alphaXiv

@askalphaxiv

14h

"From AGI to ASI" This paper from Google DeepMind defines how AGI is one human-level general system, and ASI is a system or collective that beats large expert human organizations across almost everything. They argue that the jump may come from scaling, new paradigms, recursive self-improvement, or huge multi-agent AI collectives. With the key idea that digital minds can copy, speed up, share memory, and run in parallel, so superintelligence may look less like one breakthrough and more like accelerating AI civilization.

272

12,001

alphaXiv

alphaXiv

@askalphaxiv

14h

alphaXiv x Marimo Competition 2 | alphaXiv

Discuss, discover, and read arXiv papers.

alphaxiv.org

1,080

alphaXiv

alphaXiv

@askalphaxiv

Jun 11

Replying to @marimo_io

Full competition details marimo.io/pages/events/noteb…

Bring Research to Life: molab Notebook Competition #2

alphaXiv x marimo notebook competition #2, now on GPUs. Pick a paper, build a marimo notebook, and win prizes. Enter by June 28.

marimo.io

455

marimo

alphaXiv retweeted

marimo

@marimo_io

Jun 10

We ran a competition where you had to implement a research paper and make it interactive. Round two is live, this time with GPU access on molab. Pick an @askalphaxiv paper, build a marimo notebook, win a @FrameworkPuter Laptop 13, @claudeai subscriptions, and more. Deadline: June 28 👀

2,996

Khushi Doshi

alphaXiv retweeted

Khushi Doshi

@aiwithkhush

Jun 11

Reading a research paper alone is brutal. Someone built a free site that turns arXiv into something you can actually understand. It's called alphaXiv. arXiv is where nearly every AI breakthrough gets posted first, before any blog or news site touches it. The problem is the papers are dense, jargon-heavy, and silent. No comments, no context, no help. alphaXiv fixes all of that. - A trending feed that ranks the hottest papers so you see what matters today - Every paper gets a plain-language overview, so you grasp it before reading the math - Comment on any line of any paper and discuss it with other researchers - An AI search that answers questions like "what are the top math reasoning benchmarks" - Audio versions of papers, so you can listen to research like a podcast - A Chrome extension that adds all of this on top of arXiv itself It started as a Stanford student project and became the place researchers actually talk to each other. The cutting edge of AI is published for free every single day. This is how you finally keep up with it. alphaxiv.org/

2,561

alphaXiv

alphaXiv

@askalphaxiv

Jun 11

"Still: Amortized KV Cache Compaction in a Single Forward Pass" Instead of dropping tokens or optimizing a new compressed cache per prompt, Still learns to synthesize a compact KV cache in one forward pass. So a small per layer Perceiver reads the full cache and writes compressed keys and values the frozen model can still attend to, avoiding the KV cache bottleneck in long context inference. This turns token eviction to learned memory synthesis, making long context cheaper while preserving more context understanding.

159

7,854

alphaXiv

alphaXiv

@askalphaxiv

Jun 11

read more: alphaxiv.org/abs/2606.07878v…

1,099

alphaXiv

alphaXiv retweeted

alphaXiv

@askalphaxiv

Jun 10

As believers of open research, we are disappointed to see Anthropic silently degrading Fable 5 for AI development "Any topic related to building pretraining pipelines, distributed training infrastructure, or ML accelerator design... may have limited effectiveness through Claude via methods such as prompt modification, steering vectors, or parameter-efficient fine-tuning." Not only do they get to decide what you use LLMs for in research, but this also enables them to silently intervene in your research without you knowing. This sets a dangerous precedent. If a model refuses openly, users can understand the boundary. If a model falls back to another model, users can still evaluate the difference. But if a model silently modifies or weakens its own answers while still pretending to help, researchers lose the ability to know whether a failed result came from their own idea, their implementation, or an invisible intervention by the model provider. That is not safety. Safety policies should be transparent, auditable, and user-visible. On top of that, the people most harmed by this are not the largest labs with massive teams and proprietary infrastructure. It is the independent researchers, academic groups, startups, and open-source builders who rely on public tools to compete, innovate, and pioneer AI for everyone else.

166

718

3,861

219,440

alphaXiv

alphaXiv

@askalphaxiv

Jun 10

"Self-Harness: Harnesses That Improve Themselves" What if an AI agent improves the harness that controls how it acts? So instead of humans tuning prompts, tools, retry rules, and verification for every model, this paper explores letting the agent mines its own failures, proposes small harness edits, and keeps only the ones that pass regression tests. All without fine-tuning or teacher model. On Terminal-Bench-2.0, it improves held-out pass rates across MiniMax, Qwen, and GLM.

369

18,613

alphaXiv

alphaXiv

@askalphaxiv

Jun 10