Product Intern @Adobe

Joined December 2022
757 Photos and videos
harsh deep retweeted
Indian firms can get GPUs, which is not a problem (for now), but talent is a big, big problem. Indian companies just do not have it in them to pay people multi-million dollar compensation. An equally difficult problem is to convince such people to work in India. I feel that even if Silicon Valley-level comp were on offer, most people in that league would baulk at the idea of living in Indian cities. The solution is to open an office in Dubai or Singapore. That could also open the door to non-Indian talent. A $10B spend is possible if 3-4 Indian ITES companies combine forces.
To train a GPT class 1T model from scratch - including failed runs, data acq clean rlhf, post-training, team/people will likely req $250M of compute on an aggressive 3-4mo schedule (i.e. more reserved GPUs), $500-600M all-in IF you do a dense one. MoE fp8 will cut costs by 1/10th depending on how many active params you have. If you want SOTA however, the budgets go significantly higher on test-time compute, post-training RL, and data/synthetic generations..and v. high on talent. Maybe $2-4B all-in. After that comes serving the model. The talent is key to get to SOTA/beat it - and then you have to ensure this is useful enough to have inference vol over time - for which the capital will come if there is usage / TAM. So this is not as much about raising $50-60B, or raising it all at once as the OP says - we are investors in mistral, sarvam, reflection and anthropic - and they all scaled capital over time as models got adoption, but the early bottleneck is more on talent GPUs at that scale where you can do interesting things.
harsh deep retweeted
the sad thing about india is that all the boomer entrepreneurs are hardcore supporters of not building a sovereign model but of building infrastructure around these models (haha, sure), because that pockets you the money much more easily - unless anthropic comes and kicks you and says, "we are not going to let anyone use this model." sure you want 1.4 billion people remain a consumer economy because you can smell the money & not the ambition of the country. having almost no r&d expenditure while earning $3b in net profits from a single company makes your country weak, not strong. i hope a new generation of big entrepreneurs will risk it all just for the love of the game.
harsh deep retweeted
🌘 Kimi-K2.7-Code, our latest coding model, is now released and open-sourced! 🔷 Improved coding & agent performance over K2.6: 21.8% on Kimi Code Bench v2, 11.0% on Program Bench, and 31.5% on MLS Bench Lite. 🔷 Reasoning efficiency: Less overthinking, with 30% lower reasoning-token usage compared to K2.6. 🔷 Long-horizon coding: Improved instruction following, higher end-to-end coding task success rates. ⚡️ 6x High-Speed Mode coming soon! 🔌 Available today via Kimi API and Kimi Code. 🔗 Kimi Code: kimi.com/code 🔗 API: platform.moonshot.ai
Let's cook some agents tomorrow 🙌🏻
Introducing Agent Arena. An AI agent hackathon. 6 hours. $10,000 in prizes. Apply below
on my way!! ✊
Introducing Agent Arena. An AI agent hackathon. 6 hours. $10,000 in prizes. Apply below
Damn this is so sick!!
Watch me control my computer with just my voice. This is the future of operating systems. No hands. GPT-Realtime 2.0 is very, very underrated. Demo:
⚡
May 28
Introducing Claude Opus 4.8: it builds on Opus 4.7 with sharper judgment, more honesty about its own progress, and the ability to work independently for longer than its predecessors. Available today at the same price.
Bangalore is crazy, met so many cracked folks already!
Life's good
I'm ready 🙌🏻
Bangalore weather 🤌🏻❤️
Humanity ftw 🙌🏻
May 1
Scrolling is on the decline. More charts: a16z.news/p/charts-of-the-we…
harsh deep retweeted
This is the quote I've been citing a lot recently.
you can outsource your thinking but you cannot outsource your understanding
Got noc 🙌🏻
Yup, we are still in that early stage. They can work pretty effectively if you tell them what to do, but you can't delegate thinking and taste yet. You have to make those architectural decisions, figure out complex business logic, take security measures, etc.
Talking to smarter folks than me, I'm convinced many of the AI folks in my timeline are full of shit. Nobody is "running 20 agents over night" and building stuff for actual users. Maybe some are building internal tools or disposable software. Maybe. But building software people like using? That doesn't get hacked on day one or blow up after the 3rd user? Nope. I don't even understand what that's supposed to look like. Do you work out a 57 pages document that perfectly describes what you want to build and then summon 14 agents and have them run wild for 6 hours? And what comes out on the other end isn't a broken pile of shit? Nope. Not buying it. PS: it may also be that I have an IQ of 82 and can't figure it out.
Interesting time!
Meet Kimi K2.6: Advancing Open-Source Coding 🔹Open-source SOTA on HLE w/ tools (54.0), SWE-Bench Pro (58.6), SWE-bench Multilingual (76.7), BrowseComp (83.2), Toolathlon (50.0), Charxiv w/ python (86.7), Math Vision w/ python (93.2) What's new: 🔹Long-horizon coding - 4,000 tool calls, over 12 hours of continuous execution, with generalization across languages (Rust, Go, Python) and tasks (frontend, devops, perf optimization). 🔹Motion-rich frontend - Videos in hero sections, WebGL shaders, GSAP Framer Motion, Three.js 3D. 🔹Agent Swarms, elevated - 300 parallel sub-agents × 4,000 steps per run (up from K2.5's 100 / 1,500). One prompt, 100 files. 🔹Proactive Agents - K2.6 model powers OpenClaw, Hermes Agent, etc for 24/7 autonomous ops. 🔹Claw Groups (research preview) - bring your own agents, command your friends', bots & humans in the loop. - K2.6 is now live on kimi.com in chat mode and agent mode. For production-grade coding, pair K2.6 with Kimi Code: kimi.com/code - 🔗 API: platform.moonshot.ai 🔗 Tech blog: kimi.com/blog/kimi-k2-6 🔗 Weights & code: huggingface.co/moonshotai/Ki…
Damn, this is so cool! Been searching around agentic video editing for a while, never thought of giving HTML a shot.
Apr 16
We built our launch video in Claude Code using HyperFrames. Now it's yours. Open source, agent-native framework. HTML to MP4. $ npx skills add heygen-com/hyperframes RT Comment "HyperFrames" to get the full source code of this launch video (must follow)
Got the offer letter 🥳
harsh deep retweeted
every morning i wake up to startups raising tens of millions for incredible ideas like
> agents that call APIs
> meeting note taker
> copilots for copilots
> revenue agent
> agents pay to chat
> software to build software
and yet another idea-to-app slop. seriously, how can someone be bullish on these?
harsh deep retweeted
which way anon?