Co-founder & CTO @ParallelDots - Leading AI platform for retail execution monitoring for Fortune 500 brands. Previously built @DentistryAI.IIT Kharagpur
Great to see @ParallelDots featured among Brands to Watch in 2026 by @htTweets.
Building ShelfWatch from India to serve Fortune 500 consumer brands in 50 countries has been one heck of a ride. The best is ahead. 🙏
tech.hindustantimes.com/bran…
This is exactly the kind of work that makes AI practical for enterprises. When inference gets 5x cheaper at the edge with zero accuracy loss, every enterprise gets unlimited budget to experiment!
I implemented Google's TurboQuant paper (ICLR 2026) in llama.cpp with Metal kernels for Apple Silicon.
4.9× KV cache compression. Working end-to-end on M5 Max with Qwen 3.5 35B MoE and Qwopus v2 27B.
Speed needs work (unoptimized shader), compression target met.
Repo: github.com/TheTom/turboquant…
**Note**: as you'll see from the git when I saw "I" it's in conjunction with claudecode and codex. Just lots of steering and babysitting.
I implemented Google's TurboQuant paper (ICLR 2026) in llama.cpp with Metal kernels for Apple Silicon.
4.9× KV cache compression. Working end-to-end on M5 Max with Qwen 3.5 35B MoE and Qwopus v2 27B.
Speed needs work (unoptimized shader), compression target met.
Repo: github.com/TheTom/turboquant…
**Note**: as you'll see from the git when I saw "I" it's in conjunction with claudecode and codex. Just lots of steering and babysitting.
Sparse MoE is quietly becoming the architecture that makes enterprise AI agents economically viable. At ParallelDots we're seeing this firsthand where the inference cost curve is what unlocks real-world retail AI at scale, not just benchmark scores!
Three weeks ago I shared that Claude had shocked Prof. Donald Knuth by finding an odd-m construction for his open Hamiltonian decomposition problem in about an hour of guided exploration. Prof. Knuth titled the paper Claude’s Cycles.
The story didn't end there.
The updated paper shows the story got much bigger. For the base case m=3, there are exactly 11,502 Hamiltonian cycles. Of those, 996 generalize to all odd-m, and Prof. Knuth shows there are exactly 760 valid “Claude-like” decompositions in that family.
The even case, which Claude couldn’t finish, was then cracked by Dr. Ho Boon Suan using GPT-5.4 Pro to produce a 14-page proof for all even m≥8, with computational checks up to m=2000.
Soon after, Dr. Keston Aquino-Michaels used GPT Claude together to find simpler constructions for both odd and even m, by using the multi-agent workflow.
Dr. Kim Morrison also formalized Knuth’s proof of Claude’s odd-case construction in Lean.
So yes: the problem now appears fully resolved in the updated paper’s ecosystem of human AI proof assistant work!
We went from one AI solving one problem to a full mathematical ecosystem (multiple AI systems, multiple humans, formal verification) running in parallel on a problem that stumped experts for weeks.
We are living in very interesting times indeed.
Paper (updated): www-cs-faculty.stanford.edu/…
Prof. Donald Knuth opened his new paper with "Shock! Shock!"
Claude Opus 4.6 had just solved an open problem he'd been working on for weeks — a graph decomposition conjecture from The Art of Computer Programming.
He named the paper "Claude's Cycles."
31 explorations. ~1 hour. Knuth read the output, wrote the formal proof, and closed with: "It seems I'll have to revise my opinions about generative AI one of these days."
The man who wrote the bible of computer science just said that. In a paper named after an AI.
Paper: cs.stanford.edu/~knuth/paper…
Introducing TurboQuant: Our new compression algorithm that reduces LLM key-value cache memory by at least 6x and delivers up to 8x speedup, all with zero accuracy loss, redefining AI efficiency. Read the blog to learn how it achieves these results: goo.gle/4bsq2qI
This is an excellent piece on how to think about Forward Deployed Engineers (FDEs) for enterprise AI startups. My favorite part is right at the end -
“Linear services scale by adding bodies. Exponential services scale by adding capability. Both have FDEs. Only one is building something that compounds.
If the product isn’t improving, you don’t have forward deployed engineers.”
It's not enough to build software with AI, you need to be a 10x AI software engineer.
All the best teams are parallelizing their AI software use and churning out 25-50 meaningful commits / engineer / day.
The Claude Code best practices guide is an absolute goldmine of tips
Meet Ankit Narayan Singh, co-founder and CTO of ParallelDots, sharing his vision for the future of image recognition solutions and insights into ParallelDots’ innovative journey, reshaping the FMCG and retail landscape with cutting-edge technology.
#JioGenNext
While there are 1000s of angels and endless startups, there hasn’t been an easy way for either to discover each other, we at @Dzerovc thought of taking a jab at it. We created angelinvestorsinIndia.com a repository of angels in India, so founders could reach out to them
I have got requests from a lot of first-time/second time founders looking for Co-founders.
I have been connecting them over Whatsapp. Some have moved from a dating phase to final team formation.
I do have a decent pipeline of people who are starting up first or second time.
How about a meet-up for Co-Founder dating?
If this post gets 50 RTs/Quotes, we will do this in mid of March.
RT for Good Karma!
@santoshpanda@bimleshgundurao@SanketPanda10