Rishabh Mukherjee

Rishabh Mukherjee

757 Photos and videos

Tweets

harsh deep retweeted

Rishabh Mukherjee

@rishabhm

Jun 13

Indian firms can get GPUs, which is not a problem (for now), but talent is a big, big problem. Indian companies just do not have it in them to pay people multi-million dollar compensation. An equally difficult problem is to convince such people to work in India. I feel that even if Silicon Valley-level comp was on offer, most people in that league would baulk at the idea of living in Indian cities. The solution is to open an office in Dubai or Singapore. That could also open the door to non Indian talent. A $10B spend is possible if 3-4 Indian ITES companies combine forces.

Hemant Mohapatra

@MohapatraHemant

Jun 13

To train a GPT class 1T model from scratch - including failed runs, data acq clean rlhf, post-training, team/people will likely req $250M of compute on an aggressive 3-4mo schedule (i.e. more reserved GPUs), $500-600M all-in IF you do a dense one. MoE fp8 will cut costs by 1/10th depending on how many active params you have. If you want SOTA however, the budgets go significantly higher on test-time compute, post-training RL, and data/synthetic generations..and v. high on talent. Maybe $2-4B all-in. After that comes serving the model. The talent is key to get to SOTA/beat it - and then you have to ensure this is useful enough to have inference vol over time - for which the capital will come if there is usage / TAM. So this is not as much about raising $50-60B, or raising it all at once as the OP says - we are investors in mistral, sarvam, reflection and anthropic - and they all scaled capital over time as models got adoption, but the early bottleneck is more on talent GPUs at that scale where you can do interesting things.

447

53,132

Archie Sengupta

harsh deep retweeted

Archie Sengupta

@archiexzzz

Jun 13

the sad thing about india is that all the boomer entrepreneurs are hardcore supporters of not building a sovereign model but of building infrastructure around these models (haha, sure), because that pockets you the money much more easily - unless anthropic comes and kicks you and says, "we are not going to let anyone use this model." sure you want 1.4 billion people remain a consumer economy because you can smell the money & not the ambition of the country. having almost no r&d expenditure while earning $3b in net profits from a single company makes your country weak, not strong. i hope a new generation of big entrepreneurs will risk it all just for the love of the game.

105

770

33,070

Kimi.ai

harsh deep retweeted

Kimi.ai

@Kimi_Moonshot

Jun 12

🌘 Kimi-K2.7-Code, our latest coding model, is now released and open-sourced! 🔷 Improved coding & agent performance over K2.6: 21.8% on Kimi Code Bench v2, 11.0% on Program Bench, and 31.5% on MLS Bench Lite. 🔷 Reasoning efficiency: Less overthinking, with 30% lower reasoning-token usage compared to K2.6. 🔷 Long-horizon coding: Improved instruction following, higher end-to-end coding task success rates. ⚡️ 6x High-Speed Mode coming soon! 🔌 Available today via Kimi API and Kimi Code. 🔗 Kimi Code: kimi.com/code 🔗 API: platform.moonshot.ai

615

1,621

13,593

1,960,196

harsh deep

harsh deep @harshcodesdev

Jun 5

Let's cook some agents tomorrow 🙌🏻

Suhas Sumukh

@suhasasumukh

Jun 4

Introducing Agent Arena. An AI agent hackathon. 6 hours. $10,000 in prizes. Apply below

0:32

harsh deep

harsh deep @harshcodesdev

Jun 5

on my way!! ✊

Suhas Sumukh

@suhasasumukh

Jun 4

Introducing Agent Arena. An AI agent hackathon. 6 hours. $10,000 in prizes. Apply below

0:32

harsh deep

harsh deep @harshcodesdev

May 31

Damn this is so sick!!

Farza 🇵🇰🇺🇸

@FarzaTV

May 30

Watch me control my computer with just my voice. This is the future of operating systems. No hands. GPT-Realtime 2.0 is very, very underrated. Demo:

1:44

harsh deep

harsh deep @harshcodesdev

May 28

⚡

Claude

@claudeai

May 28

Introducing Claude Opus 4.8: it builds on Opus 4.7 with sharper judgment, more honesty about its own progress, and the ability to work independently for longer than its predecessors. Available today at the same price.

Benchmark table showing how Claude Opus 4.8 compares to its predecessor and to other models on tests of coding, agentic skills, reasoning, and practical knowledge work tasks.

ALT Benchmark table showing how Claude Opus 4.8 compares to its predecessor and to other models on tests of coding, agentic skills, reasoning, and practical knowledge work tasks.

harsh deep

harsh deep @harshcodesdev

May 17

Banglore is crazy, met so many cracked folks already!

harsh deep

harsh deep @harshcodesdev

May 17

Life's good 😁

harsh deep

harsh deep @harshcodesdev

May 16

Im ready 🙌🏻

harsh deep

harsh deep @harshcodesdev

May 8

Banglore weather 🤌🏻❤️

harsh deep

harsh deep @harshcodesdev

May 6

Humanity ftw 🙌🏻

a16z

@a16z

May 1

Scrolling is on the decline More charts: a16z.news/p/charts-of-the-we…

Andrej Karpathy

harsh deep retweeted

Andrej Karpathy

@karpathy

Apr 30

This is the the quote I've been citing a lot recently.

kache

@yacineMTB

Feb 4

you can outsource your thinking but you cannot outsource your understanding

848

4,388

46,840

2,596,190

harsh deep

harsh deep @harshcodesdev

Apr 25

Got noc 🙌🏻

harsh deep

harsh deep @harshcodesdev

Apr 25

Yup we are still in that early stage, they can work preety effectively if u tell them what to do, but can't deligate thinking and taste yet. You have to take those architectural decision, figure out complex business logic, take security measures etc.

Ronan Berder

@hunvreus

Apr 23

Talking to smarter folks than me, I'm convinced many of the AI folks in my timeline are full of shit. Nobody is "running 20 agents over night" and building stuff for actual users. Maybe some are building internal tools or disposable software. Maybe. But building software people like using? That doesn't get hacked on day one or blow up after the 3rd user? Nope. I don't even understand what that's supposed to look like. Do you work out a 57 pages document that perfectly describes what you want to build and then summon 14 agents and have them run wild for 6 hours? And what comes out on the other end isn't a broken pile of shit? Nope. Not buying it. PS: it may also be that I have an IQ of 82 and can't figure it out.

harsh deep

harsh deep @harshcodesdev

Apr 21

Interesting time!

Kimi.ai

@Kimi_Moonshot

Apr 20

Meet Kimi K2.6: Advancing Open-Source Coding 🔹Open-source SOTA on HLE w/ tools (54.0), SWE-Bench Pro (58.6), SWE-bench Multilingual (76.7), BrowseComp (83.2), Toolathlon (50.0), Charxiv w/ python(86.7), Math Vision w/ python (93.2) What's new: 🔹Long-horizon coding - 4,000 tool calls, over 12 hours of continuous execution, with generalization across languages (Rust, Go, Python) and tasks (frontend, devops, perf optimization). 🔹Motion-rich frontend - Videos in hero sections, WebGL shaders, GSAP Framer Motion, Three.js 3D. 🔹Agent Swarms, elevated - 300 parallel sub-agents × 4,000 steps per run (up from K2.5's 100 / 1,500). One prompt, 100 files. 🔹Proactive Agents - K2.6 model powers OpenClaw, Hermes Agent, etc for 24/7 autonomous ops. 🔹Claw Groups (research preview) - bring your own agents, command your friends', bots & humans in the loop. - K2.6 is now live on kimi.com in chat mode and agent mode. For production-grade coding, pair K2.6 with Kimi Code: kimi.com/code - 🔗 API: platform.moonshot.ai 🔗 Tech blog: kimi.com/blog/kimi-k2-6 🔗 Weights & code: huggingface.co/moonshotai/Ki…

harsh deep

harsh deep @harshcodesdev

Apr 17

Damn this is so cool! Been searching around agentic video editing for while, never thought of giving html a shot.

HeyGen

@HeyGen

Apr 16

We built our launch video in Claude Code using HyperFrames. Now it's yours. Open source, agent-native framework. HTML to MP4. $ npx skills add heygen-com/hyperframes RT Comment "HyperFrames" to get the full source code of this launch video (must follow)

0:50

harsh deep

harsh deep @harshcodesdev

Apr 17

Got the offer Letter 🥳

168

himanshu

harsh deep retweeted

himanshu

@himanshustwts

Apr 15

every morning i wake up to startups raising tens of millions for incredible ideas like > agents that call APIs > meeting note taker > copilots for copilots > revenue agent > agents pay to chat > software to build software and yet another idea-to-app slop. seriously, how can someone be bullish on these?

203

19,504

sankalp

harsh deep retweeted

sankalp

@dejavucoder

Apr 14

which way anon?

283

20,367