Tech enthusiast tracking the cutting edge of AI, robotics, Python open source, Gaussian splatting, and medical imaging.

Joined April 2009
Photos and videos
AgentSkills retweeted
🚨BREAKING: Someone just solved Claude Code's biggest problem. It's called Claude-Mem and it gives Claude persistent memory across sessions. - You can use up to 95% fewer tokens each time. - Make 20 times more tool calls before reaching limits. 100% Opensource.
317
1,035
10,413
1,657,095
AgentSkills retweeted
Introducing Agentic Vision, a new capability in Gemini 3 Flash. Agentic Vision makes Gemini even better at analyzing complex images, enabling it to more accurately and consistently read fine details, like serial numbers or text on a complex diagram. See what it can do. 🧡
214
499
4,979
4,745,273
AgentSkills retweeted
Jan 27
πŸŽ‰πŸŽ‰πŸŽ‰ Kimi K2.5 is on Ollama's cloud ollama run kimi-k2.5:cloud You can connect it to Claude Code, Codex, OpenCode, Clawdbot, and Droid via ollama launch! ollama launch claude --model kimi-k2.5:cloud
πŸ₯ Meet Kimi K2.5, Open-Source Visual Agentic Intelligence. πŸ”Ή Global SOTA on Agentic Benchmarks: HLE full set (50.2%), BrowseComp (74.9%) πŸ”Ή Open-source SOTA on Vision and Coding: MMMU Pro (78.5%), VideoMMMU (86.6%), SWE-bench Verified (76.8%) πŸ”Ή Code with Taste: turn chats, images & videos into aesthetic websites with expressive motion. πŸ”Ή Agent Swarm (Beta): self-directed agents working in parallel, at scale. Up to 100 sub-agents, 1,500 tool calls, 4.5Γ— faster compared with single-agent setup. - πŸ₯ K2.5 is now live on kimi.com in chat mode and agent mode. πŸ₯ K2.5 Agent Swarm in beta for high-tier users. πŸ₯ For production-grade coding, you can pair K2.5 with Kimi Code: kimi.com/code - πŸ”— API: platform.moonshot.ai πŸ”— Tech blog: kimi.com/blogs/kimi-k2-5.htm… πŸ”— Weights & code: huggingface.co/moonshotai/Ki…
123
374
4,949
542,902
AgentSkills retweeted
πŸ—οΈ Most 3D reconstruction papers evaluate novel views by applying an optimal color correction using the ground-truth image. That's circular and unrealistic. PPISP directly predicts camera parameters from the rendered radiance alone, using an ISP controller trained on input views. ✨Result: metrics that don't require access to target images, aligning evaluation with real-world deployment. ➑️ github.com/nerfstudio-projec…
5
20
134
12,005
AgentSkills retweeted
Full Stack Folder Structure. Grab the Modern Full Stack Latest Edition Handbook: codewithdhanian.gumroad.com/…
19
160
1,232
55,927
AgentSkills retweeted
πŸ’₯🍌3D Nano Banana just dropped! ✨We just launched #Rodin Gen-2 "Edit", upload ANY model and edit like magic: 1️⃣ Smart Low-poly β†’ artist-style topology 2️⃣ Local edits via prompt (Beta) 3️⃣ BANG to Parts 4️⃣ more #Hyper3D is now the FIRST true #3D GenAI editing platform! πŸš€
28
106
670
127,654
AgentSkills retweeted
Big moment for Postgres! Search has always been Postgres' weak spot, and everyone just accepted it. If you needed a real relevance-ranked keyword search, the default answer was to spin up Elasticsearch or add Algolia and deal with the data sync headaches forever. The problem isn't that Postgres can't do text search. It can. But the built-in `ts_rank` function uses a basic term frequency algorithm that doesn't come close to what modern search engines deliver. So teams end up: - Running a separate Elasticsearch cluster just for search - Building sync pipelines that inevitably drift out of consistency - Paying for managed search services that charge per query - Accepting mediocre search relevance because "good enough" ships faster But this is actually a solvable problem. You can realistically bring industry-standard search ranking directly into Postgres, which eliminates the need for external infra entirely. This exact solution is now available with the newly open-sourced pg_textsearch by @TigerDatabase, a Postgres extension that brings true BM25 relevance ranking into the database. BM25 is the algorithm behind Elasticsearch, Lucene, and most modern search engines. Now it runs natively in Postgres. Here's what pg_textsearch enables: - True BM25 ranking with configurable parameters (the same algorithm powering production search systems) - Simple SQL syntax: `ORDER BY content <@> 'search terms'` - Works with Postgres text search configurations for multiple languages - Pairs naturally with pgvector for hybrid keyword semantic search That last point matters a lot for RAG apps. The video below shows this in action, and I worked with the team to put this together. You can now do hybrid retrieval (combining keyword matching with vector similarity) in a single database, without stitching together multiple systems. The syntax is clean enough that you can add relevance-ranked search to existing queries in minutes. pg_textsearch is fully open-source under the PostgreSQL license. You can find a link to their GitHub repo in the next tweet.
66
313
2,574
215,134
AgentSkills retweeted
πŸš€ ERNIE 5.0 is officially live! Supporting text, image, audio, and video inputs & outputs, ERNIE 5.0 ranks in the top tier across multiple authoritative benchmarks. βœ… 2.4T total parameters βœ… Stronger language & multimodal understanding βœ… Image & video generation on par with specialized domain models πŸ‘‰Try it now: Chat: ernie.baidu.com/ Playground: aistudio.baidu.com/ API: console.bce.baidu.com/qianfa…
15
47
326
18,944
AgentSkills retweeted
Excited to launch Pencil INFINITE DESIGN CANVAS for Claude Code > Superfast WebGL canvas, fully editable, running parallel design agents > Runs locally with Claude Code β†’ turn designs into code > Design files live in your git repo β†’ Open json-based .pen format
594
934
8,631
1,494,389
AgentSkills retweeted
Developer Week Day 2. πŸ› οΈ We just gave your Coding Agent a Design DegreeπŸŽ“πŸŽ¨ Introducing the Stitch MCP Server. πŸ”Œ You can now pipe Stitch designs directly into your favorite tools like Antigravity. * Generate new screens without leaving your IDE * Fetch the code from any design * Inject context: Give your agent full visual awareness Docs and more information πŸ‘‡
81
208
1,746
359,307
AgentSkills retweeted
The World API is live. Generate persistent, explorable 3D worlds from text, images, and video. Integrate them directly into your products.
108
264
2,746
660,762
AgentSkills retweeted
The @Walmart Immersive Commerce SDK is now available! Bring real-world products into gameplay by integrating Walmart items or your own marketplace merchandise directly into your experience without breaking immersion. Start building today: on.unity.com/Walmart
129
54
585
173,731
AgentSkills retweeted
Jan 21
Image to 3D, now in ANY pose! πŸ€Έβ€β™‚οΈ Simply upload your image a pose reference photo. Meshy generates the model to match it perfectly. No complex rigging needed. Create 3D characters your way.
40
119
1,041
103,249
AgentSkills retweeted
NVIDIA just removed one of the biggest friction points in Voice AI. PersonaPlex-7B is an open-source, full-duplex conversational model. Free, open source (MIT), with open model weights on @huggingface πŸ€— Links to repo and weights in πŸ§΅β†“ The traditional ASR β†’ LLM β†’ TTS pipeline forces rigid turn-taking. It’s efficient, but it never feels natural. PersonaPlex-7B changes that. This @nvidia model can listen and speak at the same time. It runs directly on continuous audio tokens with a dual-stream transformer, generating text and audio in parallel instead of passing control between components. That unlocks: β†’ instant back-channel responses β†’ interruptions that feel human β†’ real conversational rhythm Persona control is fully zero-shot! If you’re building low-latency assistants or support agents, this is a big step forward πŸ”₯
157
669
6,223
564,427
AgentSkills retweeted
πŸš€ LightOnOCR-2-1B πŸ¦‰ is out, a major update to LightOnOCR. 1B parameters, end-to-end multilingual OCR, and it beats models 9Γ— larger on OlmOCR-Bench while being much faster. PDF/page in, clean ordered Markdown out, with optional image localization (bbox variants).
14
107
680
57,291
AgentSkills retweeted
Introducing ShapeR, a method for robust conditional 3D shape generation from casually captured sequences. ShapeR leverages a rectified flow transformer conditioned on per-object multimodal data to turn casual image sequences into full metric scene reconstructions. Project Page: facebookresearch.github.io/S… Paper: arxiv.org/abs/2601.11514 Links to code and huggingface below ⬇️
17
147
1,012
70,405
Want to monitor your systems with lightning speed? 🌩️ #Netdata delivers AI-powered full stack observability, perfect for lean teams! From real-time metrics to actionable insights, it's designed for efficiency and ease. Check it out: github.com/netdata/netdata #OpenSource #DevOps #Monitoring #Observability
1
169
πŸ“Œ Unlock Real-Time Insights with Netdata: The Future of Full Stack Observability Netdata is revolutionizing the way teams monitor their systems with its AI-powered observability solution. Designed for speed and efficiency, it empowers even the leanest teams to gain deep insights into their entire stack effortlessly. πŸ”‘ Key Features: β€’ Real-time monitoring with minimal resource usage β€’ AI-driven insights for proactive management β€’ User-friendly interface with customizable dashboards β€’ Seamless integration with existing tools and platforms β€’ Lightweight and open-source, ensuring flexibility πŸ€” Why It's Interesting: Netdata stands out because it combines real-time analytics with AI-powered recommendations, making it a game-changer for developers who need rapid insights without the usual complexity. Its lightweight nature and ease of integration set it apart from other observability tools. πŸ”§ Technical Details: Built with C for maximum performance, Netdata operates with a low footprint, allowing it to collect metrics from various sources without heavy overhead. It integrates seamlessly with cloud services, containers, and traditional infrastructures, providing a holistic view of your stack.
128
πŸš€ Ready to ace your coding interviews? The #TechInterviewHandbook offers curated materials tailored for busy software engineers! Dive into problem-solving strategies and resources that will boost your confidence. 🌟 Check it out here: github.com/yangshun/tech-int… #CodingInterviews #SoftwareEngineering #OpenSource
1
115
πŸ“Œ Master Your Coding Interviews with Tech Interview Handbook The Tech Interview Handbook is the ultimate resource for software engineers seeking to excel in coding interviews. With a comprehensive collection of curated materials, this project streamlines your preparation process, making it easier than ever to get ready for your next big opportunity. πŸ”‘ Key Features: β€’ Extensive collection of coding challenges and solutions β€’ In-depth resources covering data structures and algorithms β€’ Tips and insights from industry experts β€’ Flexible format for on-the-go learning β€’ Community-driven updates and contributions πŸ€” Why It's Interesting: What sets the Tech Interview Handbook apart is its focus on busy professionals, providing concise yet comprehensive resources that fit into a hectic schedule. The community-driven approach ensures that the content remains relevant and up-to-date with current industry practices. πŸ”§ Technical Details: Built with TypeScript, the Tech Interview Handbook emphasizes type safety and maintainability, ensuring a robust codebase that can be easily extended. The project leverages GitHub's collaborative features, encouraging contributions from developers around the world.
95