I've been meaning to put my thoughts together on the state of GenAI writing.
Reading Tom Bedor's blog today inspired me to write about the concept of "originality density" and why human effort is a required condition for providing value to readers.
Introducing Cohere's first open-source coding model: North Mini Code
Small & efficient, designed for agentic performance and built for community input.
To my #DevRel friends in the Netherlands 📣
We are looking for an Advocate in Amsterdam to join the Elastic team. If you are passionate about speaking, sharing technical content with the developer community, and writing code, do apply!
jobs.elastic.co/jobs/enginee…
Just a reminder that many models (including open weight ones) are pretty good, and benchmarks are difficult.
(Posting this because truncated x-axes trigger... feelings)
MiniMax M3 has landed in the Arena and has moved the Pareto frontier!
Their latest model ranks #7 for Code Arena: Frontend. Scoring 1531, it is neck and neck with GLM-5.1. It moves the Pareto frontier in its price class at $0.60 input/$2.40 output per Mtoken.
Congrats to the @MiniMax_AI team on this achievement!
So here's my live coding agent dashboard, where I track tool calls, cost, stop reasons, and read-to-edit ratios.
It should capture any major changes in my agent behaviour or quality here.
It was actually super easy to set up!
Personal update: I’ve joined @liquidai’s Post-Training team.
In this role, I’ll work closely with @maximelabonne and @paulabartabajo_ and help build efficient general-purpose AI at every scale.
While it's bittersweet to move on from working with the amazing and talented team at Elastic, I'm beyond excited for this next journey!
Today we’re releasing DeepSWE, a new standard for agentic coding benchmarks.
On public leaderboards, top models often look relatively close in capability. DeepSWE shows where they actually diverge, reflecting the realistic experience of developers in their day-to-day work.
Sources: Starbucks shut down an AI program for automating inventory counts, nine months after deploying it, after it frequently miscounted and mislabeled items (@waylon_wc / Reuters)
(Visit Techmeme dot com for the link and full context!)
Orgs are laying people off "because of AI" and I (and the entire industry) need to talk about how insane all of this is.
I build AI tools. I like AI. This isn't an anti-AI rant. This is an anti-stupidity rant.
Buckle up. Spicy takes galore!
🧵1
I suspect the erosion of humans will not progress in a linear fashion; rather, it will be a step-wise sort of deletion of jobs interspersed with periods of hyperbolic replacement.
What if you could say "meow meow meow" into your computer and find cat pictures?
What if you could search text, images, PDFs, video AND audio in one index? With one model?
I made a video to explain how it works: youtu.be/WvdYwKhjtAM
The new @JinaAI_ v5-omni embedding model capably supports 4(!) modalities while being very small.
Here's a short video that shows you what you can do with it. Full-length video coming next week.
At this point I’ve seen some version of the metaphor of AI as gym equipment that works out for you (so you don’t get anything out of the workout) multiple times. And I really think it’s one of the better comparisons out there. It’s simple, familiar, and people really get it.
“Sure, a robot can lift 600 pounds much more easily than I can — but that doesn’t much help me if I’m trying to work out. The same goes for the thinking exercise of education.”