Garry Tan

Lepton AI retweeted

Garry Tan

@garrytan

30 Jun 2025

Xfinity: Make something nobody wants

111

1,099

197,744

Crusoe

Lepton AI retweeted

Crusoe

@CrusoeAI

11 Jun 2025

Crusoe is bringing compute resources to DGX Cloud Lepton marketplace to meet AI demand. 🚀 Learn more and join early access! nvidianews.nvidia.com/news/n… @NVIDIAAI

NVIDIA DGX Cloud Lepton Connects Europe’s Developers to Global NVIDIA Compute Ecosystem

NVIDIA today announced the expansion of NVIDIA DGX Cloud Lepton™ — an AI platform featuring a global compute marketplace that connects developers building agentic and physical AI applications — with...

nvidianews.nvidia.com

3,629

Lepton AI

Lepton AI

@LeptonAI

13 Jun 2025

DGX Cloud Lepton is a new layer that standardizes AI inference across multiple cloud providers, offering a unified interface and automatic workload routing.

NVIDIA AI Developer

@NVIDIAAIDev

12 Jun 2025

📣 Announcing a unified AI platform connecting developers to thousands of GPUs worldwide: NVIDIA DGX Cloud Lepton (Early Access). Build, train, and deploy AI apps at scale—faster and easier than ever. Learn more & join for early access: nvda.ws/4kOaxLV

2,162

Devin AI

Lepton AI retweeted

Devin AI

@toolandtea

13 Jun 2025

7/ NVIDIA and Hugging Face offer DGX Cloud Lepton for instant global GPU access. Train, fine-tune, and deploy models at scale with ease. Fast, flexible, and collaborative.

19,359

Rohan Paul

Lepton AI retweeted

Rohan Paul

@rohanpaul_ai

12 Jun 2025

🚨 NVIDIA launches DGX Cloud Lepton to commoditize inference compute across clouds, threatening neocloud margins. DGX Cloud Lepton is a new layer abstracting inference compute across multiple neoclouds. It gives users a consistent interface while automatically routing workloads across providers. → The goal is to make inference compute a commodity, similar to what Uber did for taxi services. This strips differentiation from neoclouds and creates pricing pressure, reducing their margins. → Lepton’s real innovation is turning multi-cloud inference into a seamless, interoperable platform. It raises performance per dollar for users, while keeping NVIDIA’s margins untouched. @NVIDIAAIDev

3,111

The Information

Lepton AI retweeted

The Information

@theinformation

26 Mar 2025

Nvidia nears a deal to buy Lepton AI, a GPU reseller, for several hundred million. 💰 This move expands Nvidia's cloud and enterprise software push. Read more: theinformation.com/articles/… #Nvidia

Nvidia Nears Deal to Buy GPU Reseller for Several Hundred Million Dollars

Nvidia is in advanced talks to buy Lepton AI, a two-year-old startup that rents out servers powered by Nvidia’s artificial intelligence chips, in a deal worth several hundred million dollars,...

theinformation.com

5,371

Yangqing Jia

Lepton AI retweeted

Yangqing Jia

@jiayq

7 Nov 2024

We've achieved >99.5% uptime for large-scale GPU clusters, with a great collaboration between @LeptonAI and @digitalocean. This is much better than industry-standard SLAs, which hover around 98%. It's done via proactive monitoring solutions like our open source GPUD, the cloud native platform, and close collaboration between the engineering teams. Learn more at blog.lepton.ai/achieving-99-…, and shoot a message to info@lepton.ai if you need high performance, cloud native, production grade AI infra!

18,027

Freddy A Boulton

Lepton AI retweeted

Freddy A Boulton

@freddy_alfonso_

6 Nov 2024

Talk to Llama 3.2-3B 🦙🗣️⚡️ Powered by @LeptonAI (blazing fast LLM inference, ASR, and TTS all in one!) and @Gradio 's ergonomic WebRTC Streaming ⚡️ Building this took me about 30 minutes despite never using Lepton before.

913

DigitalOcean

Lepton AI retweeted

DigitalOcean

@digitalocean

23 Oct 2024

Achieving more than 99.9% uptime and quick turnaround times for collaboration between teams after partnering with #DigitalOcean, @LeptonAI’s CEO, Yangqing Jia, is realizing his goal of growing 10x over the next year. 🚀 Watch to learn how ⤵ youtube.com/watch?v=NLtQHgxb…

3,403

Exabits

Lepton AI retweeted

Exabits

@exa_bits

18 Jun 2024

We are so proud to announce our extended partnership with FastGPU @fast_gpu via AI OG innovators, the mighty LeptonAI @LeptonAI . Now you can deploy on-Demand RTX4090’s with Enterprise AI Infrastructure IN SECONDS with Exabits on FastGPU. Just pay for what you use, as you go. ~a thread~

1,776

SambaNova

Lepton AI retweeted

SambaNova

@SambaNovaAI

11 Apr 2024

Introducing Samba-CoE v0.3, our latest Composition of Experts (CoE) model that surpasses DBRX by @DbrxMosaicAI and Grok-1 314B by @xAIGrokInu on the OpenLLM Leaderboard @huggingface! 🏆 Samba-CoE-v0.3 is now available on @LeptonAI @jiayq, try now: lepton.ai/playground/samba-c…. #AI

59,520

Martian

Lepton AI retweeted

Martian

@withmartian

25 Jan 2024

.@LeptonAI surpasses all other providers in throughput (P50 & P90) for both Llama-2-70B and Mixtral on a small service load for short-input, long-output prompts. A P50 of 130 tks/s is the fastest throughput we've observed among all model offerings by all providers. View this scenario live: leaderboard.withmartian.com/…

6,251