Sparrow Creator: Open-Source AI Doc Extraction ๐Ÿš€ | ML/Oracle Dev | @katana_ml | Try: sparrow.katanaml.io | github.com/katanaml

Joined March 2010
859 Photos and videos
Pinned Tweet
We launched Katana ML katanaml.io in 2018 and now it is time to update the website, to explain where we are now and what we do with #MachineLearning, #MLOps, and #opensource ๐Ÿš€๐Ÿš€๐Ÿš€

11 Nov 2021
We have a new website - katanaml.io It explains what we do with ML in a simple and straightforward way. It is featuring our open source product Skipper, we are using it to run #MLOps. #MachineLearning #MLOps
1
6
22
The US just blocked non-Americans from accessing Anthropic's Fable 5 and Mythos 5. Overnight. No warning. ๐ŸŒ This is exactly why I built Sparrow with local inference at its core โ€” your AI pipeline shouldn't depend on a political decision made thousands of miles away. โšก Running Mistral, Qwen or GLM, etc. locally with MLX-VLM or vLLM means zero exposure to these decisions. ๐Ÿ“„ Document goes in โ†’ structured JSON comes out. On your hardware. Offline. Always available. ๐Ÿ”’ Own your models. Own your data. Run everything locally. ๐Ÿ”— sparrow.katanaml.io ๐Ÿ”— github.com/katanaml/sparrow
2
3
236
Andrej Baranovskij retweeted
Anthropic pulling the plug on Fable and Mythos 5 at the direction of a government order is a massive wake-up call. ๐Ÿ”ฅ It highlights exactly why open, distributed AI matters. When AI is centralized, access can disappear overnight. Local models are different. No one can revoke your access. No one can pull the plug. No one can take away the weights you already own. Own your AI.
The US government, citing national security authorities, has issued an export control directive to suspend all access to Fable 5 and Mythos 5 by any foreign national, whether inside or outside the United States, including foreign national Anthropic employees. The net effect of this order is that we must abruptly disable Fable 5 and Mythos 5 for all our customers to ensure compliance. Access to all other Claude models is not affected. We apologize for this disruption to our customers. We believe this is a misunderstanding and are working to restore access as soon as possible. Read our full statement: anthropic.com/news/fable-mytโ€ฆ
24
38
312
11,952
Andrej Baranovskij retweeted
The video from @angeloskath on local agentic AI with MLX is excellent. I also hear it's one of the most viewed videos in WWDC history ๐Ÿ‘ Goes through the basics of agentic AI and how to set it all up to run locally in a very approachable and simple way. The demos are excellent and it's kind of wild that they just work now. None of this was possible or practical < 1 year ago before M5 and the recent quality bump in open weights models. And it's not done improving.
9
17
182
8,973
Integrated @MistralAI OCR as a cloud backend into Sparrow โ€” open-source document extraction platform (5.2k โญ) Two-step pipeline: Mistral OCR โ†’ structured HTML Mistral Small โ†’ JSON extraction Works alongside local backends (MLX, vLLM) โ€” same API, just a flag switch Full local or full cloud. Enterprise covers both. โญ github.com/katanaml/sparrow @MistralDevs @sophiamyang
1
4
163
Gemma 4 12B vs Ministral 14B โ€” structured table extraction, JSON schema, array output. โŒ Gemma 4 12B (8-bit and bf16): fails to return a proper JSON array โœ… Ministral 14B 8-bit: extracts all rows correctly Tested with Sparrow ๐Ÿ‘‡ ๐ŸŽฌ youtube.com/watch?v=4yFW_mmzโ€ฆ ๐ŸŒ sparrow.katanaml.io
1
1
5
186
Andrej Baranovskij retweeted
One of my personal favorite features announced at WWDC will I suspect be a sleeper hit: container machines, allowing your Mac to run a lightweight, persistent Linux environment with your home directory and repos automatically mounted: github.com/apple/container/bโ€ฆ
227
815
9,698
728,596
Andrej Baranovskij retweeted
Three MLX videos dropped at WWDC: Running agents locally by @angeloskath youtube.com/watch?v=wykPErJ8โ€ฆ Distributed inference and training by Tatiana Likhomanenko youtube.com/watch?v=CzgK02zsโ€ฆ MLX Swift by David Koski youtube.com/watch?v=KCL8f9ztโ€ฆ
10
43
363
40,268
Andrej Baranovskij retweeted
Cybertruck sunrise on 280 right now headed to San Francisco.
14
5
150
17,304
New Sparrow UI vibe coded with Claude Code is live in prod: sparrow.katanaml.io Shadcn and Next.js stack
1
1
4
174
Andrej Baranovskij retweeted
Meet Gemma 4 12B! A unified, encoder-free multimodal model designed to bring high-performance intelligence directly to your laptop, and released under an Apache 2.0 license. Bridging the gap between edge efficiency and advanced reasoning. Here is whatโ€™s new with Gemma 4 12B: ๐Ÿ‘‡
404
1,789
12,365
3,176,260
Sparrow new dashboard is mobile friendly
1
2
122
New Sparrow UI for mobile will be much better, thanks to Claude Design
1
4
222
Prince is doing incredible job and pushing new features for mlx-vlm. This is super useful, thanks ๐Ÿš€
Today we're shipping our biggest MLX-VLM release yet: v0.6.0 ...and we are raising ๐Ÿ’ธ This one's about turning your Apple devices into real local agent machines. From your desk to your pocket. What's new: โšก Speculative decoding everywhere โ€” Gemma 4 EAGLE3 DFlash, Qwen MTP, DeepSeek V4 MTP. Faster tokens, less waiting. ๐Ÿค– Agent-ready server โ€” native Anthropic /v1/messages API, stateful /v1/responses, tool calls, Codex context budgets. Plug Claude Code & Codex straight into local models. ๐Ÿ‘๏ธ New models galore โ€” DeepSeek V4, ZAYA1-VL, MiniCPM-V 4.6, LFM2 MoE, Step-3.7 Flash, Laguna more. ๐ŸŽจ Image gen & editing โ€” FLUX.2 (base klein), PrismML Bonsai. ๐Ÿ”Š Audio in โ€” Qwen3 Omni, Gemma 4 audio, base64 chat audio. ๐Ÿงฎ TurboQuant KV cache โ€” RHT-correct fast paths for leaner memory. ๐Ÿ“ฆ Modular server, better metrics, cleaner streaming. Run real agents on the hardware already in your hands. Github: github.com/Blaizzy/mlx-vlm
3
289
Building Agentic AI Pipelines for Document Analysis Two steps. Fully local. 1๏ธโƒฃ Sparrow Parse extracts structured data from bonds table โ€” Ministral 3B 14B 2๏ธโƒฃ Sparrow Instructor analyzes portfolio risk โ€” Gemma 4 31B Orchestrated with Prefect. No data leaves your machine. YouTube: youtube.com/watch?v=Sw_zzzu7โ€ฆ GitHub: github.com/katanaml/sparrow Sparrow: sparrow.katanaml.io
2
189
New Sparrow UI with shadcn/ui and next.js is coming up. Functionality is working, entering testing/fixing phase.
1
213
Building new Sparrow UI with Claude Code is going well. Implemented file upload component, added backend code with Next.js Migrating to shadcn from Sparrow Gradio UI: sparrow.katanaml.io
1
1
2
377