A walking contradiction: On the one hand, Daniela from Anthropic says that AI has hardly replaced any jobs so far, on the other hand, co-founder Olah warns the Pope about the disruptive effect of AI on the labor market and society.
AI agents are advancing research-level math. 🚀
I’m thrilled to share @GoogleDeepMind’s AlphaProof Nexus - an agentic framework for formal proof search powered by Gemini.
When applied to a set of open formal math problems, our agent autonomously solved:
✅ 9 open Erdős problems (including two open for 56 years!)
✅ 44 Online Encyclopedia of Integer Sequences (OEIS) problems
✅ A 15-year-old open problem in algebraic geometry ✅ A 7-year-old open question in min-max optimization
We are collaborating with mathematicians across disciplines - from combinatorics and graph theory to quantum optics. Ultimately, these results show the massive potential of even simple agentic loops powered by Gemini.
Read the paper here: arxiv.org/abs/2605.22763v1
If Codex or Claude Code keeps surprising you with usage limits, I’m building TokenBar for that.
It puts AI usage, limits, and reset times in your Mac menu bar so you don’t have to keep checking dashboards.
Free for 3 days. Cancel before charge if it’s not useful.
tokenbar.site
New CursorBench results just dropped.
Two big takeaways.
Composer 2.5 is way better than most people think.
63.2% score at $0.55 per task.
Nearly matching Opus 4.7 Max and GPT 5.5 Extra High at 20x less cost.
This is insane value.
Gemini 3.5 Flash is #10 at 49.8%.
Below GPT 5.5 Low.
Below Opus 4.7 Low.
Google's newest model can't even beat budget tier competition.
Composer 2.5 is the sleeper.
Gemini 3.5 Flash is the disappointment.
Ever hate getting ready for a run or trying to lock in and work, only to find one AirPod dead while the other is fully charged, even though you shoved both into the case the exact same way?
That’s exactly what AirPod Guard fixes. Check it out: airpodguard.com
We’re reimagining a 50-year-old interface - the mouse pointer - with AI. 🖱️
These experimental demos show how people can intuitively direct Gemini on their screens using motion, speech, and natural shorthand to get things done 🧵
Food logging breaks when the meal gets messy.
Bowls, takeout, and homemade meals are the hardest to track.
I'm building MetricSync to make that easier:
• AI nutrition tracker
• cheaper than CalAI
• 3 day free trial
Feedback welcome:
metricsync.download/
Artemis II has reached its maximum distance from Earth.
On the far side of the Moon, 252,756 miles away, Reid, Victor, Christina, and Jeremy have now traveled farther from Earth than any humans in history and now begin their journey home. Before they left, they said they hoped this mission would be forgotten, but it will be remembered as the moment people started to believe that America can once again do the near-impossible and change the world.
Congratulations to this incredible crew and the entire NASA team, our international and commercial partners, but this mission isn’t over until they’re under safe parachutes, splashing down into the Pacific.
hot take: AI coding tools are making us better developers, not lazier ones. the people who can't prompt well are the same ones who couldn't google well 5 years ago. the skill just shifted.
Intel is proud to join the Terafab project with @SpaceX, @xAI, and @Tesla to help refactor silicon fab technology.
Our ability to design, fabricate, and package ultra-high-performance chips at scale will help accelerate Terafab’s aim to produce 1 TW/year of compute to power future advances in AI and robotics.
It was fun hosting @elonmusk at Intel this past weekend!
hot take: paying for 3-4 AI subscriptions is the new "paying for streaming services you forgot about"
except worse because at least Netflix tells you when you're about to run out of episodes. these AI providers just cut you off mid-thought
Claude Mythos just dropped and my timeline is already full of people hitting rate limits. Your Mac menu bar shows battery, wifi, and time. Why not AI usage too? TokenBar tracks remaining capacity, reset countdowns, and pacing across 20 providers. One glance. $5 once. tokenbar.site
Every time a new AI model drops, the cycle is the same:
1. See benchmarks, get hyped
2. Sign up / upgrade
3. Use it heavily for 3 days
4. Hit the rate limit
5. Complain on Twitter
6. Switch to another model
7. Repeat
We are all stuck in the loop.