Developer educator and advocate. Sometimes writes bad jokes poorly.

Joined November 2019
201 Photos and videos
I've been meaning to put my thoughts together on the state of GenAI writing. Reading Tom Bedor's blog today inspired me to write about the concept of "originality density" and why human effort is a required condition for providing value to readers.
2
1
2
195
(Apologies to Tom if he's on this stupid site. I couldn't find his handle.)
84
<let-them-fight.gif>
51
JP Hwang retweeted
Jun 10
Cohere Transcribe, our open-source speech recognition model, is #1 on the new @huggingface Far-Field ASR benchmark.
13
48
524
35,941
Released amongst all the #mythos / #fable hoopla today - it does sound like a useful model. I might try it on the flight home 😉
Jun 9
Introducing Cohere's first open-source coding model: North Mini Code Small & efficient, designed for agentic performance and built for community input.
9
332
JP Hwang retweeted
To my #DevRel friends in the Netherlands 📣 We are looking for an Advocate in Amsterdam to join the Elastic team. If you are passionate about speaking, sharing technical content with the developer community, and writing code do apply! jobs.elastic.co/jobs/enginee…

2
2
4
169
Just a reminder that many models (including open weight ones) are pretty good, and benchmarks are difficult. (Posting this because truncated x-axes triggers... feelings)
MiniMax M3 has landed in the Arena and has moved the Pareto frontier! Their latest model ranks #7 for Code Arena: Frontend, scoring 1531, it is neck and neck with GLM-5.1. It moves the Pareto frontier in its price class at $0.60 input/$2.40 output per Mtoken. Congrats to the @MiniMax_AI team on this achievement!
78
So here's my live, coding agent dashboard, where I track tool calls, cost, stop reasons, read-to-edit ratios. It should capture any major changes in my agent behaviour or quality here. It was actually super easy to set up!
1
4
152
I used a Pi Otel plugin from here (pi.dev/packages), and Elastic Agent Skills (github.com/elastic/agent-ski…) helped me vibe my way to a dashboard in no time. I wrote about it here: jphwang.com/posts/coding-age…
3
119
JP Hwang retweeted
Personal update: I’ve joined @liquidai’s Post-Training team. In this role, I’ll work closely @maximelabonne and @paulabartabajo_ and help build efficient general-purpose AI at every scale. While it's bittersweet to move on from working with the amazing and talented team at Elastic, I'm beyond excited for this next journey!
69
5
339
26,867
Back from a week off. Looks like it's a New Model Week again! How's everybody's experience been with @MiniMax_AI M3 so far?
Introducing MiniMax M3: The First Open-Weights Model to Combine Three Frontier Capabilities - Coding & Agentic Frontier: 59.0% SWE-Bench Pro, 66.0% Terminal Bench 2.1, 34.8% SWE-fficiency, 28.8% KernelBench Hard, 74.2% MCP Atlas - MiniMax Sparse Attention scales context to 1M - Natively Multimodal from Step Zero API: platform.minimax.io Token Plan: platform.minimax.io/subscrib… 🚀New! MiniMax Code: code.minimax.io Weights & Tech Report in ~10 Days
2
86
Would recommend
More musings after some people got upset about the word clanker. lucumr.pocoo.org/2026/5/26/c…
92
JP Hwang retweeted
Today we’re releasing DeepSWE, a new standard for agentic coding benchmarks. On public leaderboards, top models often look relatively close in capability. DeepSWE shows where they actually diverge, reflecting the realistic experience of developers in their day-to-day work.
511
742
6,053
1,951,301
Does anyone know whether the “AI” here involved an LLM at all?
Sources: Starbucks shut down an AI program for automating inventory counts, nine months after deploying it, after it frequently miscounted and mislabeled items (@waylon_wc / Reuters) (Visit Techmeme dot com for the link and full context!)
1
1
299
JP Hwang retweeted
Orgs are laying people off "because of AI" and I (and the entire industry) need to talk about how insane all of this is. I build AI tools. I like AI. This isn't an anti-AI rant. This is an anti-stupidity rant. Buckle up. Spicy takes galore! 🧵1
May 22
Replying to @bettersafetynet
I suspect the erosion of humans will not progress in a linear fashion, rather it will be a step-wise sort of deletion of jobs interspersed with period of hyperbolic replacement.
36
83
845
389,643
What if you could say "meow meow meow" into your computer and find cat pictures? What if you could search text, images, PDFs, video AND audio in one index? With one model? I made a video to explain how it works: youtu.be/WvdYwKhjtAM
3
4
704
The new @JinaAI_ v5-omni embedding model capably supports 4(!) modalities while being very small. Here's a short video that shows you what you can do with it. Full-length video coming next week.
1
3
15
1,631
JP Hwang retweeted
At this point I’ve seen some version of the metaphor of AI as gym equipment that works out for you (so you don’t get anything out of the workout) multiple times. And I really think it’s one of the better comparisons out there. It’s simple, familiar, and people really get it.
“Sure, a robot can lift 600 pounds much more easily than I can — but that doesn’t much help me if I’m trying to work out. The same goes for the thinking exercise of education.”
20
1,918
16,341
405,654