Google Cloud AI | formerly Together AI, Apple

Joined January 2009
TPUs power inference across the models we serve on the Gemini Enterprise Agent Platform, and enable training for DeepMind and our customers. Very excited for the launch of our eighth generation of TPUs!
Apr 22
We’re introducing our eighth generation of TPUs. This time, we’re taking a dual-chip approach: TPU 8t, optimized for training, and TPU 8i, optimized for inference. 💪 TPU 8t achieves nearly three times the compute performance per pod over our previous generation, Ironwood. ⚡ TPU 8i connects 1,152 TPUs in a single pod to deliver the massive throughput and low latency needed to concurrently run millions of agents cost-effectively. These new TPUs are a crucial part of our fully integrated AI stack, from the chips all the way up to the models, developer tools, agents and applications. By designing the hardware and software in tandem, we’re able to deliver scale and efficiency. #GoogleCloudNext


Jamie de Guerre retweeted
Build an AI app with Nano Banana Pro and Veo 3.1 that turns any location into cinematic art. Just type a city and get 3D videos of its weather, architecture, and mood in real time. 100% open-source code.
Jamie de Guerre retweeted
Geminiii
Jamie de Guerre retweeted
Introducing the first-ever Google Agent Development Kit (ADK) Community Call. Come meet the team, learn about our AI Agents roadmap and ask any questions you may have. Happening next Wednesday from 9:30 to 10:30 a.m. PT (link below)
Jamie de Guerre retweeted
Build a multi-agent home renovation team with Google ADK and Gemini 2.5 Flash Nano Banana. Just upload your room photo, design inspiration and budget to get a complete renovation plan with materials, cost, timeline, and an after image of your new room. 100% open-source code.
Jamie de Guerre retweeted
Building voice AI Agents has never been easier. New updates to the Gemini Live API with Native Audio let you build AI Agents that understand emotion, ignore background noise, and use tools like RAG, MCP & search. x.com/GoogleAIStudio/status/…

Jamie de Guerre retweeted
10 Sep 2025
🚀 LAUNCHED: The A2A protocol is now natively integrated on Vertex AI Agent Engine!

Deploying A2A with Agent Engine previously involved complex processes with separate runtimes and "glue code." I'm excited to share the native integration of the A2A protocol with Vertex AI Agent Engine, making it easier to build and deploy collaborative AI agents at scale.

TLDR:
✅ Deploy A2A agents with a single template
✅ Scale easily on a secure endpoint on Agent Engine
✅ Get a clean, reusable API for simple integration

Code and blog in 🧵
Jamie de Guerre retweeted
Introducing Open Deep Research! A fully open-source Deep Research tool that:
• writes comprehensive reports
• does multi-hop search and reasoning
• generates cover images & podcasts!
We’re releasing everything: evaluation dataset, code and blog. 🔥 Example output report 👇
Thank you @ErikaBatista, @GetLago, @oanaolt, @huggingface for hosting! Great event and incredible presentations from so many talented founders. Look forward to seeing all of these companies grow!
Energizing open source AI founder meetup at @SignalFire hosted by @ErikaBatista @byAnhtho @oanaolt @GetLago @togethercompute. Got to hear more than 30 pitches (majority female founders) with many planning to open source very cool models, datasets and apps in the coming months, let’s go!
Jamie de Guerre retweeted
These models are incredible, and a massive step forward for OSS AI. Amazing work from the @Meta team! On @togethercompute now at 350 t/s at full precision on 8B and 150 t/s on 70B. api.together.xyz/playground/…

We are thrilled to be a launch partner for Meta Llama 3. Experience Llama 3 now at up to 350 tokens per second for Llama 3 8B and up to 150 tokens per second for Llama 3 70B, running in full FP16 precision on the Together API! 🤯 together.ai/blog/together-ai…
Jamie de Guerre retweeted
New model now available on Together AI! @MistralAI's latest base model, Mixtral-8x22B! 🚀 api.together.xyz/playground/…
Jamie de Guerre retweeted
The serverless inference API @togethercompute is likely #1 in volume for OSS models (numbers coming soon!). We are also #1 on performance for almost all regimes according to the Martian leaderboard, while providing a 6,000 RPM rate limit to anyone who signs up and puts down a CC. tinyurl.com/2ceavejf
Jamie de Guerre retweeted
Three teams have been dominating the LLM game for a while: @MistralAI for SOTA LLMs 🦙 @langchain for building with LLMs 🦜 @togethercompute for serving LLMs 🚀 If you know how, you can build things really, really fast now. Brief intro and code walk-through for you 👇
This has huge potential to help generative AI scale to faster models, with longer context. Let’s go! 🚀
Announcing StripedHyena 7B: an open-source model using an architecture that goes beyond Transformers, achieving faster performance and longer context. It builds on the lessons learned in the past year designing efficient sequence modeling architectures. together.ai/blog/stripedhyen…
Jamie de Guerre retweeted
Together.ai is emerging as one of the top AI dev tools!
The latest AI market survey from @retool has some great data. Love seeing @huggingface and @langchain top the AI dev tools charts!
This thing is scary fast. Give it a try! Lots of further improvements still to come.
Announcing the fastest inference available anywhere. We released FlashAttention-2, Flash-Decoding, and Medusa as open source. Our team combined these techniques with our own optimizations and we are excited to announce the Together Inference Engine. together.ai/blog/together-in…