Shijue Huang

Tweets

leezy retweeted

Shijue Huang @joeh310

May 13

🌟 Is scaling parameters enough for self-evolving multimodal agents? Excited to share our new work: "Towards On-Policy Data Evolution for Visual-Native Multimodal Deep Search Agents". We study how data, not just models, can evolve with the current policy. 🧵👇

leezy @leezythu

Apr 23

I reverse-engineered Claude Code's 512K-line TypeScript codebase and rebuilt its core in ~700 lines of Python. Same agentic loop. Same tool system. Same permission model. Just 5 files. github.com/leezythu/mini-cla…

GitHub - leezythu/mini-claude-code: A minimal (~700 lines Python) reimplementation of Claude Code's...

leezy @leezythu

Apr 23

📝 prompt.py → System prompt (maps to prompts.ts)
🔒 permissions.py → Safety gate (maps to permissions.ts)
The README is a full architectural walkthrough, with every function mapped back to the original CC source.

leezy @leezythu

Apr 23

📄 cli.py → REPL (maps to cli.tsx, REPL.tsx)
🧠 engine.py → Agentic loop (maps to QueryEngine.ts, query.ts)
🔧 tools.py → 6 core tools (maps to 40 tools in src/tools/)

leezy @leezythu

Mar 29

AI agents burned $847K/month for one team. I built an open-source dashboard to prevent that.
⚡ TokenPulse: real-time cost monitoring for AI agents
✅ Per-agent budget alerts
✅ Auto-patches OpenAI & Anthropic
✅ Beautiful live dashboard
github.com/leezythu/tokenpul…

leezy @leezythu

21 Dec 2025

Seed1.8, the latest generalized agent model. github.com/ByteDance-Seed/Se…

Seed-1.8/Seed-1.8-Modelcard.pdf at main · ByteDance-Seed/Seed-1.8

leezy retweeted

Yujia Qin @TsingYoga

19 Dec 2025

Proud to introduce Seed1.8, our latest generalized agent model. The model achieves competitive agentic capabilities while maintaining high LLM/VLM scores. Enjoy! github.com/ByteDance-Seed/Se…

leezy retweeted

elvis @omarsar0

29 Jul 2025

Nice up-to-date survey on efficient attention mechanisms for LLMs. Always a great way to catch up on new ideas and what's coming. (bookmark it)

leezy @leezythu

24 Dec 2024

🚀 Introducing FocusLLM📷: Unlock Precise Long Context Understanding by Dynamic Condensing! arxiv.org/abs/2408.11745 🌟 Small-context LLMs can process documents 100x longer than their context limit with no information loss, with only a small training budget. #LLM #LongContext

leezy @leezythu

24 Dec 2024

🔗 Check out our paper and code at github.com/leezythu/focusllm and revolutionize how you process long texts with LLMs!

GitHub - leezythu/FocusLLM: FocusLLM: Scaling LLM’s Context by Parallel Decoding

leezy @leezythu

24 Dec 2024

📊 Results: Perfect Accuracy at 400K context length in passkey retrieval (base model context length is 4K). Superior Performance on LongBench and ∞-Bench, surpassing all baselines. 🏆

leezy @leezythu

24 Dec 2024

💡 Key Insights:
A divide-and-conquer approach enables short-context LLMs to handle long texts.
Dynamic Condensing preserves information from every token in the context.
Base model parameters are frozen; a small set of trainable parameters enables this capability.

leezy @leezythu

24 Dec 2024

🛠️ Solution in this Paper:
Dynamic Condensing: extracts crucial info from each text chunk with dynamic prompts, ensuring no information loss.
Parallel Decoding: integrates info across chunks.
Training Efficiency: trains at lower cost and performs better than baselines. 🎯

leezy @leezythu

24 Dec 2024

🔍 Original Problem: LLMs struggle with long texts, and previous condensing methods introduce inevitable information loss. 😩

leezy @leezythu

9 Aug 2023

Do you remember when you joined X? I do! #MyXAnniversary

leezy retweeted

Elon Musk @elonmusk

1 Nov 2022

😉
