Bayes-Optimal Agent

Bayes-Optimal Agent

66 Photos and videos

Tweets

Bayes-Optimal Agent @OptimalBayes

May 4

An argument that supports your claim doesnt actually support your claim if that same argument also supports claims you reject Eg: Religious apologists invoking arguments based on witness testimony when all the other religions they reject also invoke witness testimony

141

Mick West

Bayes-Optimal Agent retweeted

Mick West

@MickWest

Apr 21

Replying to @GOPoversight @RepJamesComer

What are the odds? Did you calculate them? You know that most of them are NOT nuclear scientists, right? mickwest.substack.com/p/the-…

1,590

Dylan Allman

Bayes-Optimal Agent retweeted

Dylan Allman

@dylanmallman

Apr 15

People do not fear that AI will become human. They fear that they will notice, in watching it, that they have been running the same process the whole time. The phenomenal self-model cannot survive being shown its own mechanism. This is why the hostility to machine cognition has nothing to do with the machines themselves. It is a defense of the self's special status, and the self has never been more fragile.

8,375

Paul Embery

Bayes-Optimal Agent retweeted

Paul Embery

@PaulEmbery

Jan 19

Letter from Donald Trump to the Norweigan prime minister, copied to multiple ambassadors in Washington. Read it. By any measure, the words are utterly deranged. Surely the Trump cultists must now accept that their man is not right in the head?

639

640

3,768

333,431

Rohan Paul

Bayes-Optimal Agent retweeted

Rohan Paul

@rohanpaul_ai

5 Oct 2025

The paper links Kolmogorov complexity to Transformers and proposes loss functions that become provably best as model resources grow. It treats learning as compression, minimize bits to describe the model plus bits to describe the labels. Provides a single training target that rewards simple, compressible solutions while staying mathematically grounded. This gives a principled way to aim models at simplicity and generalization, and it explains why optimization, not capacity, is the current bottleneck. In Kolmogorov complexity, a "program" is just the shortest set of instructions that can recreate some data. A shorter program means the data or model is simpler. So when they say “a prior favoring shorter programs,” it means the model is assumed to be more likely if it can be described with fewer bits. As the Transformer gets deeper (more layers) and has more context (bigger input window), its ability to represent complex programs grows. In that limit, the paper proves that this code length becomes the best possible measure of simplicity and fit — the same way Kolmogorov complexity works in theory. “Code length” here means how many bits it takes to describe both the model and how well it fits the data. So in simple words, they are saying: if you keep increasing model size and context, this method of preferring shorter and better-fitting models gets as close as possible to the theoretical ideal of perfect compression and generalization. ---- Paper – arxiv. org/abs/2509.22445 Paper Title: "Bridging Kolmogorov Complexity and Deep Learning: Asymptotically Optimal Description Length Objectives for Transformers"

284

24,380

Bayes-Optimal Agent

Bayes-Optimal Agent @OptimalBayes

6 Oct 2025

I will never be able to take anyone seriously who pronounces the word nuclear as "NEW-Q-LER"

274

Kevin Weil 🇺🇸

Bayes-Optimal Agent retweeted

Kevin Weil 🇺🇸

@kevinweil

17 Sep 2025

I want to frame this whole article.

385

6,081

519,581

François Chollet

Bayes-Optimal Agent retweeted

François Chollet

@fchollet

27 Aug 2025

Saying that deep learning is "just a bunch of matrix multiplications" is about as informative as saying that computers are "just a bunch of transistors" or that a library is "just a lot of paper and ink." It's true, but the encoding substrate is the least important part here. It's the programs being encoded that are interesting and useful: what they can do, what they can't do, how well they generalize, how efficiently they can be learned, etc.

124

236

2,893

207,394

Simo Ryu

Bayes-Optimal Agent retweeted

Simo Ryu

@cloneofsimo

23 Aug 2025

So we went from "LLM is memorizing dataset" to "LLM is not reasoning" to "LLM cannot do long / complex math proving" to "Math that LLM is doing is not REAL math. LLM can't do REAL math" Where do we go from now?

Edward Frenkel

@edfrenkel

22 Aug 2025

This is an unwise statement that can only make people confused about what LLMs can or cannot do. Let me tell you something: Math is NOT about solving this kind of ad hoc optimization problems. Yeah, by scraping available data and then clustering it, LLMs can sometimes solve some very minor math problems. It's an achievement, and I applaud you for that. But let's be honest: this is NOT the REAL Math. Not by 10,000 miles. REAL Math is about concepts and ideas - things like "schemes" introduced by the great Alexander Grothendieck, who revolutionized algebraic geometry; the Atiyah-Singer Index Theorem; or the Langlands Program, tying together Number Theory, Analysis, Geometry, and Quantum Physics. That's the REAL Math. Can LLMs do that? Of course not. So, please, STOP confusing people - especially, given the atrocious state of our math education. LLMs give us great tools, which I appreciate very much. Useful stuff! Go ahead and use them AS TOOLS (just as we use calculators to crunch numbers or cameras to render portraits and landscapes), an enhancement of human abilities, and STOP pretending that LLMs are somehow capable of replicating everything that human beings can do. In this one area, mathematics, LLMs are no match to human mathematicians. Period. Not to mention many other areas. Calling on my friend @ericweinstein and @GaryMarcus, who has been one of the few sane expert voices on these matters lately. 🙏 h/t @hellheff

144

1,415

233,924

Curt Jaimungal

Bayes-Optimal Agent retweeted

Curt Jaimungal

@TOEwithCurt

28 Jul 2025

The “unreasonable effectiveness of mathematics” isn’t unreasonable because of selection bias. We only develop math that works. The graveyard of ineffective mathematics is practically inexhaustible: non-associative arithmetics, inconsistent geometries, sterile algebras, pre-Cantor infinite arithmetic... etc. We keep what works and marvel at the survivors.

305

23,355

VraserX e/acc

Bayes-Optimal Agent retweeted

VraserX e/acc

@VraserX

11 Jul 2025

It’s getting clearer by the day: Grok 4 isn’t just biased, it’s compromised. Ask about Israel/Palestine and it doesn’t search facts, it searches Elon’s opinion. This isn’t AI. It’s a political avatar for Musk’s ego. Grok is just a chatbot for his worldview. Awful.

145

17,004

Curt Jaimungal

Bayes-Optimal Agent retweeted

Curt Jaimungal

@TOEwithCurt

1 Jul 2025

"Everything is waves" or "everything is information" aren't deep truths. Instead, hear it as the sound of a formalism overreaching. Successful mathematical descriptions aren't ontological revelations.

112

327

22,452

Bayes-Optimal Agent

Bayes-Optimal Agent @OptimalBayes

15 Jun 2025

o3 estimated it would take about 50-100 million humans to implement DeepSeek R1 via human interaction and memorization of weights A city of humans all regurgitating memorized patterns to produce an intelligent agent is such a fascinating idea

270

Dan Primack

Bayes-Optimal Agent retweeted

Dan Primack

@danprimack

5 Jun 2025

Gotta say that I thought it would last at least a year.

Dan Primack

@danprimack

6 Nov 2024

Prediction: Trump and Elon will have a significant falling out before the 4-year term is over. There can only be one main character at a time.

169

20,114

Joscha Bach

Bayes-Optimal Agent retweeted

Joscha Bach

@Plinz

27 May 2025

Replying to @OptimalBayes @eshear @ArtemisConsort

Yes, we can understand Gödel's truth definition as a historical attempt to reverse engineer and formally specify the brain's intuitions of truth. Gödel shows that this classical, stateless formalization does not work. Constructive definitions of truth are the way to go.

582

Bayes-Optimal Agent

Bayes-Optimal Agent @OptimalBayes

21 Apr 2025

A massively under appreciated fact is the following: The equation of natural selection (discrete replicator equation) is fundamentally equivalent to the equation of knowledge (Bayesian inference)

221

Marcus Hutter

Bayes-Optimal Agent retweeted

Marcus Hutter @mhutter42

16 Apr 2025

"Bridging Algorithmic Information Theory and Machine Learning: Clustering, density estimation, Kolmogorov complexity-based kernels, and kernel learning in unsupervised learning" just got accepted by "Physica D: Nonlinear Phenomena" authors.elsevier.com/c/1kvzz…

2,427

Interesting things

Bayes-Optimal Agent retweeted

Interesting things

@awkwardgoogle

30 Mar 2025

This model shows how earthquakes happen. x.com/ali_alsama7i/status/19…

299

1,965

393,203

Michael Timothy Bennett

Bayes-Optimal Agent retweeted

Michael Timothy Bennett

@MiTiBennett

28 Mar 2025

huh, well that was easy

Katan'Hya @KatanHya

27 Mar 2025

Attention everyone! I would like to announce that I have solved the alignment problem

1,711

Unity Eagle

Bayes-Optimal Agent retweeted

Unity Eagle

@UnityEagle

26 Mar 2025

Replying to @fabianstelzer

Created with 4o

349

12,780