(l,y)earning

Joined July 2023
256 Photos and videos
muzz khan retweeted
36
4,979
33,608
602,498
get a glimpse of our glizzy through this cli
introducing the greptile CLI. a full greptile review for your local changes, right in your terminal. 1. npm i -g greptile 2. checkout your branch 3. greptile review 🦎
5
1
16
1,869
muzz khan retweeted
Excited to see the use of GEPA-optimized LLM judges for data filtering in MAI-Thinking-1 model's pre-training pipeline!
Super excited to announce seven new world-class MAI models today. They represent what we consider a new era in AI designed to keep you in control and on the frontier. First is our text foundation model, MAI-Thinking-1, exceptionally strong on reasoning and SWE tasks. - It’s a 35B active parameter MoE with a 256K context window. Independent human raters on Surge prefer it for overall quality in blind side-by-sides versus Sonnet 4.6, and it’s achieved 97% on AIME 2025, the key measure of its general-purpose reasoning abilities. - It's at 53% on SWE Bench Pro, placing it right alongside Opus 4.6 on one of the toughest coding benchmarks. - And since we co-designed our models with our own silicon, MAI-Thinking-1 is optimized on our MAIA 200 chip. Benchmarking head-to-head against the GB200, we see 30% better performance per dollar as well as a 1.4x performance-per-watt gain when running our MAI models on the MAIA 200 end-to-end. Next is MAI-Image-2.5 and its Flash variant. Two super strong models now at #2 on the leaderboards, surpassing the score of Nano Banana 2 on image editing. Last for now is MAI-Code-1-Flash, our new inference efficient coding model, especially tuned for VS Code and GitHub Copilot CLI. - Code-1-Flash achieves 51% on SWE Bench Pro, despite having just 5B parameters, putting it closer to Haiku in size but cheaper in cost. All of this is the foundation for Microsoft Frontier Tuning. It lets you customize our models to create custom, company-specific agents that only you control. You can make our model, your model. Your data. Your agents. Your moat. Early adopters are already seeing a difference. When we tuned our models for McKinsey’s tasks, MAI delivered the highest win rate, outperforming GPT-5.5 on quality, while being 10x lower on cost. Also really excited to be collaborating with the amazing team at Mayo Clinic to jointly train a new frontier AI model for healthcare. Our announcements today mark another milestone on the road to humanist superintelligence. You can learn more and about our other new models in our latest blog: microsoft.ai/news/building-a…
3
21
166
51,799
I hate this so much. most of these are wasted compute that basic checks and simple logic would have prevented, not to mention the fact that there were various ways to prevent this from happening in the first place... also, how are you using a computer and not maintaining a clipboard history? that’s the first place I’d check... ik alfred had it for him, but it appears that he had no clue about it. even the useless spotlight has it now.
spent my 11-hour flight back from europe working on a very long report. started as a slack message but morphed into a several pages long doc. wifi was as shitty as it gets. after finally making it home i realized that the computer had forcefully restarted. opened slack: draft was gone :( hail mary: claude pls save me, no clue how but pls try it checked APFS snapshots, time machine, slack indexeddb, write-ahead logs, service worker / http caches, local storage, app logs, hibernation image... nothing. all gone but then... it realized i have alfred installed. so it checked the clipboard snapshots alfred keeps in sqlite. sad news: alfred clipboard memory gets deleted after 24h. aggressive retention policy. however! when sqlite runs DELETE, nothing gets actually deleted. it only marks pages as reusable, but it doesn't override the physical bytes. so claude decided to do a raw-scan of the db, reverse eng alfred data format, figure out the portion containing the timestamp, stitched everything back together across overflow pages... and handed me the exact final version of my report, the last one i cmd C'd all this, in a single shot ... day 200 of "what if you had an elite hacker you can ask anything to"
2
4
371
one massive ad for raycast and learning about software x.com/peduarte/status/206144…

Replying to @giansegato
not trying to change the apps you use or anything, but raycast keeps your clipboard history for 3 months (for free)
1
2
240
muzz khan retweeted
Greptile is coming to Tokyo, Japan! 🇯🇵🦎 To celebrate, we are hosting a community Happy Hour for our users and friends. It's happening on Monday, June 8th, 7-10 pm. Get a chance to win free Greptile credits, merch, and free drinks!! Looking forward to it. :) ----------------------------------------------------------- @greptile が東京にやってきます!🗼 これを記念して、ユーザーの皆さま・お友達のためのコミュニティ・ハッピーアワーを開催します。 📅 6月8日(月) 🕖 19:00–22:00 Greptileクレジットやオリジナルグッズが当たるチャンス、さらにドリンクも無料です! 皆さまにお会いできるのを楽しみにしています😊 RSVP/参加登録: luma.com/o8z4d4nj
3
10
65
982,658
muzz khan retweeted
Sorry, I prematurely abstracted. This doesn’t usually happen.
24
27
782
39,956
new word in the training data "canonical" i have no clue what it's supposed to mean most times
3
3
296
why's everyone so triggered about everyone being an MTS now. I thought it pretty amptly describes person who does "technical" stuff as an IC, not confined to any one domain or function
1
61
tired of commands going stale like this @cursor_ai the notorious "Run in background"
2
146
muzz khan retweeted
launching 5 things: 1. multi-repo context support 2. rebuilt web app for super large orgs 3. integrations with claude/codex/devin 4. .greptile/rules files 5. rebuilt learning so greptile maintains internal docs about your company
18
10
126
22,359
i got effect pilled. didn't take very much
1
94
muzz khan retweeted
more than a quarter of all code reviewed by greptile is now written by "background agents" - completely autonomous e2e coding agents like devin. we analyzed millions of PRs written by background agents and compared their - code churn rates - revert rates - bugs per PR against human baseline. read the full post on our blog!
4
18
58
4,744
deadass, why is everbody stealing my swag??
Prompt: “Redraw the attached image in the most clumsy, scribbly, and utterly pathetic way possible. Use a white background, and make it look like it was drawn in MS Paint with a mouse. It should be vaguely similar but also not really, kind of matching but also off in a confusing, awkward way, with that low-quality pixel-by-pixel feel that really emphasizes how ridiculously bad it is. Actually, you know what, whatever, just draw it however you want.”
1
294
muzz khan retweeted
Me asking claude if devin fixed the bug greptile found
1
1
16
679
muzz khan retweeted
Apr 30
tired of this misinformation so we made a video on the truth behind the anthropic vs opencode drama
385
158
3,947
411,177
muzz khan retweeted
my coworkers have AI psychosis
7
1
107
18,360
what's better clearing out browser tabs after an exam is over, or clearing out agent tabs after a clanker writes some code
2
4
161
I ask again, can't microsoft just throw money on compute for github and fix their uptime?
84