Data Science @ETH and @Stanford | ex @Google Gemini Intern, @EPFL, @HKUST | 22

Joined June 2024
4 Photos and videos
12 Oct 2025
A pattern I like for reducing friction with LLMs is to dynamically render quick answer options or CTA buttons in the UI. Makes the experience smoother especially in domain specific settings. Haven't seen this around much, do you know of other examples?
1
96
12 Oct 2025
From my experience, ChatGPT has been trained to aggressively ask follow up questions, and I feel a single button tap might be a nice way to keep it going
53
31 Aug 2025
RLAIF w/ TRL GRPO on a single GPU. Co-locate vLLM with the trainer and reuse the same base model as a judge. Policy updates via LoRA; judge runs with base weights. No extra RM server, low latency.
3
2
13
2,185
31 Aug 2025
For GPU/API-poor setups: reuse the same vLLM process that does GRPO rollouts to batch the judge calls. Simple fast. Result: in a free-text medical diagnosis reasoning test, this setup on Llama-3.1-8B gave ~ 14% accuracy vs. the baseline (48→62%, n=300, single seed)
1
1
198
31 Aug 2025
Does TRL expect/encourage this pattern? Any gotchas? @QGallouedec
2
1
180
17 Jun 2025
having dinner with @garrytan wasn’t on my bingo card when I joined the bay 2 months ago amazing day at @ycombinator AI startup school thanks for the invite garry!
2
4
26
8,224
21 May 2025
this is insane
20 May 2025
WE CAN TALK! I spent 2 hours playing with Veo 3 @googledeepmind and it blew my mind now that it can do sound! It can talk, and this is all out of the box...
306
9 May 2025
Was great to get invited at Meta HQ to discuss our experience working with the new Llama API. A unified way to use all models from any provider is the way for developers. Performance and cost rule!
1
2
251
9 May 2025
Let him in! We gotta have a bigger Zurich delegation
All my friends are in but I'm still waiting 🥲 @garrytan @ycombinator @gustaf help me please
3
341
Clem retweeted
We're excited to announce the winners of the first LlamaCon Hackathon! These talented individuals and teams have demonstrated exceptional skill and creativity in their projects using Llama. 🥇 1st Prize: OrgLens An AI-powered expert matching system that connects you with the right professionals within your organization. By leveraging data from various sources, OrgLens creates a comprehensive knowledge graph and detailed profiles, streamlining expert matching. See their GitHub Repository: bit.ly/4k62WaO @KPJedrzejewski, @clemhus 🥈 2nd Prize: Compliance Wizards An AI-powered transaction analyzer designed to detect fraud and alert users. It uses Llama API’s multi-modality to assist fraud assessors in determining client involvement in criminal activities. See their GitHub Repository: bit.ly/3RRxiBS @SamDc73, @k_a__reem, @nicetomeetyu2, @sorhanft 🥉 3rd Prize: Llama CCTV Operator A Llama CCTV AI control room operator that identifies custom surveillance video events without model fine-tuning. It uses Llama 4’s multi-modal image understanding to assess and report predefined events. See their GitHub Repository: bit.ly/4d9UPrw @torayeff 🌟 Best Llama API Usage: Geo-ML This project uses Llama 4 Maverick and GemPy to generate 3D geological models, processing extensive geology reports into structured data for 3D representations. See their GitHub Repository: bit.ly/3GITT15 @WilliamJSDavis Please join us in congratulating these winners on their outstanding achievements! We're honored to have them as part of the Llama community. 🎉
6
3
28
4,689
8 May 2025
We won Meta's Llamacon Hackathon! Imagine engineers, PMs, or managers being able to query their org’s knowledge base in natural language: 🧠 “Who’s responsible for the normalization layer of the Llama 4 models?” 🔐 “Who’s an expert in our multi-step auth flow?” Had a blast hacking this up. Btw, we didn't use Cursor/Windsurf. I actually believe that planning ahead and understanding the exact structure of your code is key to deliver a project you own.
Replying to @cerebral_valley
🥇 1st Place: OrgLens An AI-powered expert matching system that connects you with the right professionals within your organization. By leveraging data from various sources, OrgLens creates a comprehensive knowledge graph and detailed profiles, streamlining expert matching. @KPJedrzejewski & @clemhus
1
3
244