Silicon Valley Engineer, Entrepreneur. Working on #deeplearning and #robotics. Interested in Social Issues, Venture Capital, Change (Google, Hortonworks, Apple)

Joined June 2008
Shivaji retweeted
We are partnering with @nvidia to power our frontier model training and platforms delivering customizable AI. thinkingmachines.ai/news/nvi…
A Claude subscription will be the top Christmas gift card of 2026!!
Shivaji retweeted
My information consumption is now 1/4 X, 1/4 podcast interviews of the smartest practitioners, 1/4 talking to the leading AI models, and 1/4 reading old books. The opportunity cost of anything else is far too high, and rising daily.
A decade ago I wrote a post to the respected Prime Minister of India, Narendra Modi, urging India to look at and invest in AI. I wish they had. It is painful how the country missed the signal despite all its highly paid advisers. India is far behind in the AI era.
We should be able to buy, exchange, and save tokens. Tokens are the base of intelligence and of how we operate through life. Not real estate, not cars, but tokens, because they unlock access to everything in life. GPU time is too low-level; we need a unified currency.
Tokens are the new currency. Those who can afford tokens will survive and flourish; UBI will be about the government giving everyone a set amount of tokens. Every step of life will be operated by tokens: machines, rides, and food will all be paid for in tokens. We will have token auctions.
Shivaji retweeted
In the future soon, we will be able to communicate with many intelligent animal species - can't wait to better understand what my dog 🐶 is saying! Congrats to the DolphinGemma team building on our Gemma models - the most powerful single GPU/TPU open source models out there!
Introducing DolphinGemma, an LLM fine-tuned on many years of dolphin sound data 🐬 to help advance scientific discovery. We collaborated with @dolphinproject to train a model that learns vocal patterns to predict what sound they might make next. It’s small enough (~400M params) to run directly on Pixel 9 phones used in the ocean! A very cool step toward enabling interspecies communication.
Shivaji retweeted
17 Mar 2025
Introducing Mistral Small 3.1. Multimodal, Apache 2.0, outperforms Gemma 3 and GPT 4o-mini. mistral.ai/news/mistral-smal…
23 Feb 2025
Grok3 is pretty good!!
23 Feb 2025
Are we heading toward becoming a dumber society with LLMs?
23 Feb 2025
With actual AI or AGI, capitalism in its current form will struggle to survive. We will be forced into socialism or some form of it.
23 Feb 2025
I have the privilege of working across the AI ecosystem and seeing companies evolve. The question is: how will companies keep their moats in the new AI world if all they do is integrate LLMs into their products?
Shivaji retweeted
Excited to announce SIMA, a general AI agent for games & 3D virtual settings. It marks the first time an agent has demonstrated it can follow natural-language instructions to carry out a wide range of tasks across a large array of game worlds, similar to how a human would play.
13 Mar 2024
Our research project SIMA is creating a general, natural language instructable, multi 3D game-playing AI agent. The agent can carry out a wide range of tasks in virtual worlds, making AI more adaptable, helpful & fun! dpmd.ai/sima-1
Shivaji retweeted
Introducing Gemini 1.0, our most capable and general AI model yet. Built natively to be multimodal, it’s the first step in our Gemini-era of models. Gemini is optimized in three sizes - Ultra, Pro, and Nano. Gemini Ultra’s performance exceeds current state-of-the-art results on 30 of the 32 widely-used academic benchmarks. With a score of 90.0%, Gemini Ultra is the first model to outperform human experts on MMLU. blog.google/technology/ai/go…
Shivaji retweeted
Now Gemini Pro is coming today in Bard’s biggest update yet (in English in 170 countries) with more advanced reasoning and understanding in the responses. Bard Advanced with Ultra, our most general and capable model for highly complex tasks, is coming early next year. blog.google/products/bard/go…
Shivaji retweeted
Gemini Nano is super efficient for tasks that are on-device. Android developers can sign up for an early access program for Gemini Nano via Android AICore, and Pixel 8 Pro users can already see it rolling out in features like Summarize in Recorder and Smart Reply in Gboard, with much more to come! blog.google/products/pixel/p…
Shivaji retweeted
Gemini’s reasoning capabilities mean it can understand more about a user’s intent, and use tools to generate bespoke user experiences that go beyond chat interfaces. Here’s what that looks like in action. ↓ #GeminiAI
Shivaji retweeted
6 Dec 2023
Lots of excitement about the Gemini announcement, but @GoogleCloud also announced availability of the newest TPU system today, TPU v5p. These systems are quite a bit higher performance and much more cost effective than earlier generations.
Compared to TPU v4, TPU v5p (see table image below):
o 1.67X the bfloat16 perf/chip
o ~3X the memory per chip
o Adds int8 operations at 918 TOPs/chip
o 2X the ICI network bandwidth
o Pods are 2.18X larger
So, the whole pod is 4.1 bfloat16 exaflops and 8.2 int8 exaops.
Real performance on training a GPT-3-like model is 2.8X higher per chip, and 2.1X better perf/$. cloud.google.com/blog/produc…
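The pod-level figures in the TPU v5p post can be cross-checked with a little arithmetic. The sketch below uses only the numbers quoted in the post; the function names are hypothetical, and the derived values (chips per pod, bfloat16 throughput per chip, relative cost per chip) are implied estimates from rounded inputs, not published specifications. The cost ratio further assumes that perf/$ is simply per-chip performance divided by per-chip price on the same workload.

```python
# Figures quoted in the post (TPU v5p, and gains relative to TPU v4).
INT8_TOPS_PER_CHIP = 918      # int8 tera-ops/s per chip
POD_INT8_EXAOPS = 8.2         # int8 exa-ops/s per pod
POD_BF16_EXAFLOPS = 4.1       # bfloat16 exaflops per pod
PERF_PER_CHIP_GAIN = 2.8      # GPT-3-like training speedup per chip
PERF_PER_DOLLAR_GAIN = 2.1    # perf/$ improvement


def implied_chips_per_pod() -> float:
    """Pod int8 throughput divided by per-chip int8 throughput."""
    return (POD_INT8_EXAOPS * 1e18) / (INT8_TOPS_PER_CHIP * 1e12)


def implied_bf16_tflops_per_chip() -> float:
    """Pod bfloat16 throughput spread evenly over the implied chip count."""
    return (POD_BF16_EXAFLOPS * 1e18) / implied_chips_per_pod() / 1e12


def implied_cost_per_chip_ratio() -> float:
    """Assumes perf/$ = (perf/chip) / ($/chip), so $/chip scales as the ratio."""
    return PERF_PER_CHIP_GAIN / PERF_PER_DOLLAR_GAIN


if __name__ == "__main__":
    print(f"implied chips per pod:      {implied_chips_per_pod():,.0f}")
    print(f"implied bf16 TFLOPs/chip:   {implied_bf16_tflops_per_chip():.0f}")
    print(f"implied $/chip vs. TPU v4:  {implied_cost_per_chip_ratio():.2f}x")
```

Because the pod totals are rounded to two significant figures, the implied chip count is only approximate. The bfloat16 figure comes out at half the int8 figure, which matches the 4.1 versus 8.2 pod totals.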
Shivaji retweeted
The Gemini era is here. Thrilled to launch Gemini 1.0, our most capable & general AI model. Built to be natively multimodal, it can understand many types of info. Efficient & flexible, it comes in 3 sizes each best-in-class & optimized for different uses blog.google/technology/ai/go…
Shivaji retweeted
6 Dec 2023
I’m very excited to share our work on Gemini today! Gemini is a family of multimodal models that demonstrate really strong capabilities across the image, audio, video, and text domains.
Our most-capable model, Gemini Ultra, advances the state of the art in 30 of 32 benchmarks, including 10 of 12 popular text and reasoning benchmarks, 9 of 9 image understanding benchmarks, 6 of 6 video understanding benchmarks, and 5 of 5 speech recognition and speech translation benchmarks.
Gemini Ultra is the first model to achieve human-expert performance on MMLU across 57 subjects with a score above 90%. It also achieves a new state-of-the-art score of 62.4% on the new MMMU multimodal reasoning benchmark, outperforming the previous best model by more than 5 percentage points.
Gemini was built by an awesome team of people from @GoogleDeepMind, @GoogleResearch, and elsewhere at @Google, and is one of the largest science and engineering efforts we’ve ever undertaken. As one of the two overall technical leads of the Gemini effort, along with my colleague @OriolVinyalsML, I am incredibly proud of the whole team, and we’re so excited to be sharing our work with you today!
There’s quite a lot of different material about Gemini available, starting with:
Main blog post: blog.google/technology/ai/go…
60-page technical report authored by the Gemini Team: deepmind.google/gemini/gemin…
In this thread, I’ll walk you through some of the highlights.