πŸ€— AI Education | Developer Advocate @huggingface

Joined November 2022
144 Photos and videos
I just recorded a deep dive into Pi's architecture. It's a minimalist AI coding agent with a beautiful design. Here's how it actually works πŸ‘‡
26
53
667
78,236
Reminder that you can serve Kimi K2.7, Minimax M3, DeepSeek V4 on Inference Providers πŸ€— πŸ‘‰ Single API endpoint πŸ‘‰ Use the same credits for all models πŸ‘‰ No markup price (we don't make money from this) πŸ‘‰ OpenAI API compatible πŸ‘‰ Largest selection of open models
2
22
learn to improve your skills without breaking them (testing for skills) πŸ€“
Finally catching up on some blog posts I saved for later. This one is VERY interesting. I never really dedicated that much of time to skills, and that was a mistake. Easy to read, well thought out and directly actionable in your everyday tasks! huggingface.co/blog/upskill
23
I just built a quick harness that evaluates different combinations of Kimi K2.7, GPT 5.5 and Opus 4.8 Each one as architect or implementer. πŸ‘‰ Ran each combination on a recent GH issue πŸ‘‰ ⁠⁠measured cost and quality on each of the runs. 🧡 Takeaways and cost breakdown below
4
1
10
1,311
Key takeaways: πŸ‘‰ The top combinations mostly use Kimi as coder πŸ‘‰ The best quality comes from using GPT-5.5 or Opus-4.6 as planner and Kimi as the implementer πŸ‘‰ Combining the absolute SOTA models with open models shrinks your bill without sacrificing quality! πŸ€—
1
1
231
Using multiple models in the same workflow can reduce your costs 7x without losing quality. My favorite team so far: πŸ‘‰ GPT-5.5 planner Kimi K2.7 Coder Here's how you can measure the best planner-coder pairs for your workspace πŸ‘‡
2
3
46
2,882
follow @ben_burtenshaw for the best agent material 🀯
we need to get real and move fast about training our own agents. orgs, teams, and individuals all need to be improving their agent. not waiting for the next API to be released (or not). this is a practical hands-on session where I'll show you how to train (and improve) your own agent based on traces. it's supported by tutorials and a repo, starts ofs simple, and works today. over the coming weeks, I'll add more advanced classes working up to RL environments and harnesses. If you're building around training agents, jump on stream, and demo your work.
3
269
guys we need to use telegram more to talk to agents
We now support rich formatting for all chatbots. Tables, nested lists, inline media, formulas, headers and more β€” right in Telegram messages. πŸ”¨ Start building! Docs: core.telegram.org/bots/api#r…
1
186
fable 5 feels about as good to me as gpt 5.5...Β which is, like, pretty damn good πŸ‘
1
4
151
While everyone is busy with Fable 5, Cohere just dropped a local coding model that can run on your laptop. Enters North Mini Code β€” an open-source coding model that runs on a single H100. πŸ”Ή 30B params, only 3B active (MoE) πŸ”Ή Apache 2.0 licensed πŸ”Ή 256K context window πŸ”Ή On-prem deployment ready
7
2
23
1,672
Some highlights: πŸ€– 33.4 on the Artificial Analysis Coding Index, outperforming Qwen3.5 (35B), Gemma 4 (26B), and Devstral πŸš€ 2.8Γ— faster than Devstral Small 2 πŸŽ–οΈ Ranked #8 out of 127 open-weight models for output speed πŸ‘Ύ Designed for agentic workflows: sub-agents, code reviews, and architecture mapping πŸ‘¨β€πŸ’» MoE architecture: 128 experts with just 8 activated per token. Try it on your favorite coding agent: huggingface.co/CohereLabs/No…
1
2
184
Alejandro AO πŸ€— retweeted
I just recorded a deep dive into Pi's architecture. It's a minimalist AI coding agent with a beautiful design. Here's how it actually works πŸ‘‡
26
53
667
78,236
opened claude code for the first time in a while today. is this normal?? i didn't even resize the terminal window and it keeps getting all messed up. is this a common issue?
5
3
426
i got many new followers recently. hey there! πŸ€— how can i help you build better AI apps? what tutorials/courses would you like to see?
2
2
111
what's your main LLM for coding?
40% gpt-5.5
40% claude (opus,sonnet,etc)
20% open source (kimi,m3,etc)
0% other
20 votes β€’ Final results
2
2
305
great question! in Pi, sessions are stored JSONL files that form trees. each line is a new message. each message is a node. when you do `/tree` and fork the conversation. you append a new message to the JSONL with a `parent` property that points to a previous message. this lets you fork a conversation and work across different branches. and all is stored in the same session JSONL file. great design.
Replying to @_alejandroao
@grok help me better understand the pi concept of trees in its session management. What happens when you fork a new tree branch from an earlier message? Will that basically create a new session, with all the messages so far in its context? Can you then work of both tree branches independently? Or is it more like a rollback thing where it will undo any tool calls that might have happened before? Also, could you explain what the effect on kv cache would be when creating a new tree branch? Would the model need to keep two separate caches or can the first common part of the cache be reused (is that perhaps prefill cache purpose?) note, im an amateur, be gentle with me
27
2,015
Just posted this video on YouTube with chapters, and an extra section on skills parsing at the end. Link here πŸ‘‡
I just recorded a deep dive into Pi's architecture. It's a minimalist AI coding agent with a beautiful design. Here's how it actually works πŸ‘‡
3
14
1,431