AI / ML Developer Advocate | Research 👨‍💻 Data Machina AI newsletter | Community 🤗 Data Science London

Joined January 2012
845 Photos and videos
Carlos retweeted
supervision just hit 40,000 GitHub stars! it now powers over 6.5k open-source computer vision projects, including all my demos like basketball AI link: github.com/roboflow/supervis…
55
419
5,719
878,765
Carlos retweeted
/no-mistakes is here! by popular demand i've made the most impactful tool in my agentic engineering setup "no-mistakes" invocable as a skill in Claude Code, Codex et al just type "/no-mistakes" once your agent has made changes, and watch the magic unfold details below 👇
71
100
1,610
166,336
Carlos retweeted

206
458
4,850
3,473,411
Carlos retweeted
Nous Research acaba de publicar el hub de skills más completo que he visto para su agente de IA: Hermes Se llama Hermes Skills Hub y tiene 691 skills listas para instalar. No es una lista de prompts. Son skills reales que amplían lo que puede hacer tu agente. 89 integradas de serie. 81 opcionales. 521 de la comunidad. 18 categorías. Algunas de las que me han flipado: - macos-computer-use: controla el escritorio de Mac en segundo plano sin robar el cursor ni el foco del teclado - comfyui: genera imágenes, video y audio con ComfyUI directamente desde el agente - humanizer: elimina el lenguaje de IA y añade voz real al texto - popular-web-designs: 54 sistemas de diseño reales (Stripe, Linear, Vercel) como HTML/CSS - claude-code: delega el código a Claude Code CLI desde dentro del agente - manim-video: animaciones estilo 3Blue1Brown para matemáticas y algoritmos - excalidraw: diagramas de arquitectura dibujados a mano desde el agente - ascii-video: convierte cualquier video a ASCII coloreado en MP4 o GIF Y esto es solo la primera página. Hay skills para finanzas, seguridad, DevOps, MLOps, traducción, gaming y social media. Compatible con Hermes Agent, Claude Code, Codex, Cursor y OpenCode. Todo gratis. Todo open source.
36
151
1,102
59,715
Carlos retweeted
My buddy, nontechnical, went all in on AI from the start. He's now Chief AI Officer at his company, and has convinced them to buy him a Blackwell GPU ($15k) to run local agent workflows. And here I am, a decade into a career in ML, begging my CTO for more training budget.
62
66
3,777
245,461
Carlos retweeted
It's pretty annoying that Codex uses agents.md and Claude Code uses Claude.md. There should be some industry standards to this stuff?
857
94
4,060
1,193,791
Carlos retweeted
Jun 3

44
168
1,214
2,122,079
Carlos retweeted
Introducing Ideogram 4.0: the best open image model in the world. Think it. Make it. Own it. Download the weights, fine-tune on your own data, and run it on your hardware. Live on every Ideogram plan and the API today.
412
869
8,202
2,161,617
Carlos retweeted
Gemma 4 12B can now run locally on just 8GB RAM via Dynamic GGUFs. Google's new model, Gemma 4 12B Unified supports image, audio and 256K context. You can run and train the model via Unsloth Studio. GGUF: huggingface.co/unsloth/gemma… Guide: unsloth.ai/docs/models/gemma…
Meet Gemma 4 12B! A unified, encoder-free multimodal model designed to bring high-performance intelligence directly to your laptop, and released under an Apache 2.0 license. Bridging the gap between edge efficiency and advanced reasoning. Here is what’s new with Gemma 4 12B: 👇
96
380
2,819
348,954
Carlos retweeted
Jun 2
The most interesting visual AI tools today are generating the underlying source code behind the final output. This change is unlocking editability, iteration, and a feedback loop that pixel-native models can't match. And the market for visual code generation is organizing around the runtime where the artifact is rendered or executed. a16z's Yoko Li on why the next frontier of visual AI is code: a16z.news/p/the-next-frontie…
63
105
857
245,026
Carlos retweeted
The next evolution of Hermes Agent is here! Introducing Hermes Desktop: everything you love about Hermes, now native on your machine. First demoed in Jensen's GTC keynote, it's now in public preview.
1,237
1,464
12,782
5,815,844
Carlos retweeted
Jun 2

260
1,352
10,390
3,059,241
Carlos retweeted
I can’t sleep at night because my mind races with all the cool shit I could be building. AI has turned my workdays into 24 hour grind sessions. I code until I literally collapse from exhaustion 7 days a week.
119
37
451
24,453
Carlos retweeted
500-hour AI infrastructure engineering curriculum github.com/ai-infra-curricul…
2
182
1,288
119,767
Carlos retweeted
My Fine-tuning Stack for Small Language Models (2B to 15B Models) It costs me around $150 to generate a fresh dataset (~150M) and fine-tune the model. > Codex 5.5= orchestrator / operator > Deekseek v4 pro /Kimi 2.6= data gen. engine (dirt cheap) > Qwen 3.5 = best model to fine-tune (4B, 9B, 27B) > Unsloth = faster, cheaper fine-tuning framework. > Colab = Cheapest cloud GPU (A100 80GB for $0.66/hr) > G Drive = to save datasets (good codex colab integration) > Huggingface = To host datasets Models So Codex as planner & auditor, Deepseek as cheapest executor, Unsloth to fine-tune fast, Colab to get cheapest A100 GPU, Huggingface to host the fine-tuned model. Anyone can fine-tune, and run a Sonnet 4.5 level Custom model on their system.
55
96
890
37,234
Carlos retweeted
AI agents are advancing research-level math. 🚀 I’m thrilled to share @GoogleDeepMind’s AlphaProof Nexus - an agentic framework for formal proof search powered by Gemini. When applied to a set of open formal math problems, our agent autonomously solved: ✅ 9 open Erdős problems (including two open for 56 years!) ✅ 44 Online Encyclopedia of Integer Sequences (OEIS) problems ✅ A 15-year-old open problem in algebraic geometry ✅ A 7-year-old open question in min-max optimization We are collaborating with mathematicians across disciplines - from combinatorics and graph theory to quantum optics. Ultimately, these results show the massive potential of even simple agentic loops powered by Gemini. Read the paper here: arxiv.org/abs/2605.22763v1
80
242
1,509
218,261