Joined August 2022
4 Photos and videos
leezy retweeted
🌟 Is scaling parameters enough for self-evolving multimodal agents? Excited to share our new work: Towards On-Policy Data Evolution for Visual-Native Multimodal Deep Search Agents We study how data, not just models, can evolve with the current policy. πŸ§΅πŸ‘‡
2
7
20
3,082
Apr 23
I reverse-engineered Claude Code's 512K-line TypeScript codebase and rebuilt its core in ~700 lines of Python. Same agentic loop. Same tool system. Same permission model. Just 5 files. github.com/leezythu/mini-cla…
2
1
21
Apr 23
πŸ“ prompt.py β†’ System prompt (maps to prompts.ts) πŸ”’ permissions.py β†’ Safety gate (maps to permissions.ts) The README is a full architectural walkthrough β€” every function mapped back to the original CC source.

14
Apr 23
πŸ“„ cli.py β†’ REPL (maps to cli.tsx REPL.tsx) 🧠 engine.py β†’ Agentic loop (maps to QueryEngine.ts query.ts) πŸ”§ tools.py β†’ 6 core tools (maps to 40 tools in src/tools/)

8
Mar 29
AI agents burned $847K/month for one team. I built an open-source dashboard to prevent that. ⚑ TokenPulse β€” real-time cost monitoring for AI agents βœ… Per-agent budget alerts βœ… Auto-patches OpenAI & Anthropic βœ… Beautiful live dashboard github.com/leezythu/tokenpul…
1
2
68
leezy retweeted
19 Dec 2025
Proud to introduce Seed1.8, our latest generalized agent model The model achieves competitive agentic capabilities, while maintaining high LLM/VLM scores, enjoy! github.com/ByteDance-Seed/Se…
9
33
245
45,117
leezy retweeted
29 Jul 2025
Nice up-to-date survey on efficient attention mechanisms for LLMs. Always a great way to catch up on new ideas and what's coming. (bookmark it)
4
44
241
25,709
24 Dec 2024
πŸš€ Introducing FocusLLMπŸ“·: Unlock Precise Long Context Understanding by Dynamic Condensing! arxiv.org/abs/2408.11745 🌟 Small-context LLMs can process documents 100x longer than their context limit with no information loss, with only a small training budget. #LLM #LongContext
5
1
50
24 Dec 2024
πŸ“Š Results: Perfect Accuracy at 400K context length in passkey retrieval (base model context length is 4K). Superior Performance on LongBench and ∞-Bench, surpassing all baselines. πŸ†
48
24 Dec 2024
πŸ’‘ Key Insights: Divide-and-conquer approach enables short-context LLMs to handle long texts. Dynamic Condensing preserves information from every token in the context. Base model parameters are frozen, a small set of trainable parameters enables this capability.
31
24 Dec 2024
πŸ› οΈ Solution in this Paper: Dynamic Condensing: Extracts crucial info from each text chunk with dynamic prompts, ensuring no information loss. Parallel Decoding: Integrates info across chunks. Training Efficiency: Trained with less cost, performs better than baselines. 🎯
29
24 Dec 2024
πŸ” Original Problem: LLMs struggle with long texts, previous condensing methods introduce inevitable information loss. 😩
26
9 Aug 2023
Do you remember when you joined X? I do! #MyXAnniversary
27
leezy retweeted
1 Nov 2022
πŸ˜‰
29,868
59,256
883,801