Stanford NLP Group

Stanford NLP Group

3 Photos and videos

Tweets

Pavel Shtykovskiy retweeted

Stanford NLP Group

@stanfordnlp

May 12

Many roughly know how a transformer works To REALLY understand modern neural LMs—MoEs, GPU tiling, kernels, RLHF, data—you need CS336 By @tatsu_hashimoto, @percyliang The 2026 edition appears on yt with ~2 weeks delay youtube.com/playlist?list=PL… Materials cs336.stanford.edu/

219

1,745

295,279

Inworld AI

Pavel Shtykovskiy retweeted

Inworld AI

@inworld_ai

May 5

Introducing Realtime TTS-2, a new generation of voice model built for realtime conversation. It is the first voice model that hears the conversation, takes natural-language voice direction, holds one voice identity across over 100 languages, and speaks like a person who is paying attention. The result is voice AI that feels as good as it sounds. Try it out: tinyurl.com/RealtimeAI Learn More: tinyurl.com/TTS-2Blog

2:07

106

163

783

323,478

Inworld AI

Pavel Shtykovskiy retweeted

Inworld AI

@inworld_ai

Jan 21

Inworld TTS-1.5 releases today. The #1 TTS on Artificial Analysis now offers realtime latency under 250ms and optimized expression and stability for user engagement, and costs half a cent per minute. Some voice models are fast, some are expressive, some are affordable. We outperform them all across the board. Production-grade realtime latency: <250ms latency for Max model, <130ms for Mini (P90 first audio) - 4x faster than before. Voice agents now respond before users notice any delay. Engagement-optimized quality: 30% more expressive to serve a wider range of personalities and 40% lower word error rates for fewer hallucinations, word cutoffs, and audio artifacts. Built for consumer-scale: Radically affordable with enhanced multilingual support (15 languages including Hindi) and enhanced voice cloning, now via API. On-prem options now available for enterprises.

105

490

285,655

Inworld AI

Pavel Shtykovskiy retweeted

Inworld AI

@inworld_ai

6 Nov 2025

Our TTS Max model just debuted at #1 on the @ArtificialAnlys leaderboard. And at $10/million characters, it’s also the most cost-efficient commercial TTS model available. Excited to keep making state-of-the-art voice more accessible. Check it out at inworld.ai/tts or through our partners @pipecat_ai and @livekit.

Inworld Voice AI: Top-Rated TTS & Voice Cloning

Leading voice AI with sub-200ms latency, instant voice cloning, emotion and non-verbal controls, multilingual support, and pricing down to $10 per million characters at scale.

inworld.ai

Artificial Analysis

@ArtificialAnlys

6 Nov 2025

Inworld TTS 1 Max is the new leader on the Artificial Analysis Speech Arena Leaderboard, surpassing MiniMax’s Speech-02 series and OpenAI’s TTS-1 series The Artificial Analysis Speech Arena ranks leading Text to Speech models based on human preferences. In the arena, users compare two pieces of generated speech side by side and select their preferred output without knowing which models created them. The speech arena includes prompts across four real-world categories of prompts: Customer Service, Knowledge Sharing, Digital Assistants, and Entertainment. Inworld TTS 1 Max and Inworld TTS 1 both support 12 languages including English, Spanish, French, Korean, and Chinese, and voice cloning from 2-15 seconds of audio. Inworld TTS 1 processes ~153 characters per second of generation time on average, with the larger model, Inworld TTS 1 Max processing ~69 characters on average. Both models also support voice tags, allowing users to add emotion, delivery style, and non-verbal sounds, such as “whispering”, “cough”, and “surprised”. Both TTS-1 and TTS-1-Max are transformer-based, autoregressive models employing LLaMA-3.2-1B and LLaMA-3.1-8B respectively as their SpeechLM backbones. See the leading models in the Speech Arena, and listen to sample clips below 🎧

139

16,169

Chieh-Hsin (Jesse) Lai

Pavel Shtykovskiy retweeted

Chieh-Hsin (Jesse) Lai

@JCJesseLai

29 Oct 2025

Tired to go back to the original papers again and again? Our monograph: a systematic and fundamental recipe you can rely on! 📘 We’re excited to release 《The Principles of Diffusion Models》— with @DrYangSong, @gimdong58085414, @mittu1204, and @StefanoErmon. It traces the core ideas that shaped diffusion modeling and explains how today’s models work, why they work, and where they’re heading. 🧵You’ll find the link and a few highlights in the thread. We’d love to hear your thoughts and join some discussions! ⚡ Stay tuned for our markdown version, where you can drop your comments!

495

2,366

857,531

Nathan Lambert

Pavel Shtykovskiy retweeted

Nathan Lambert

@natolambert

19 Aug 2025

Just signed a book deal for The RLHF Book, excited to make improvements to it this fall and get physical copies in your hands soon :) (rlhfbook dot com)

455

47,149

The NetHack Learning Environment

Pavel Shtykovskiy retweeted

The NetHack Learning Environment

@NetHack_LE

23 Jul 2025

1 43.6 Grok-4-Wiz-AI-Cha died in The Dungeons of Doom on level 1. Killed by a housecat.

Davide Paglieri @PaglieriDavide

23 Jul 2025

LLMs acing math olympiads? Cute. But BALROG is where agents fight dragons (and actual Balrogs)🐉😈 And today, Grok-4 (@grok) takes the gold 🥇 Welcome to the podium, champion!

4,499

Kevin Patrick Murphy

Pavel Shtykovskiy retweeted

Kevin Patrick Murphy

@sirbayes

9 Dec 2024

I am happy to announce that the first draft of my RL tutorial is now available. arxiv.org/abs/2412.05265

722

4,393

320,760

Aleksey Tikhonov

Pavel Shtykovskiy retweeted

Aleksey Tikhonov @altsoph

8 Dec 2024

Earlier, we with @framrus developed a humor generation method that gives human-level results on blind tests. Now, we with @SaveTheRbtz are launching HUMOR-ARENA (humor.ph34r.me/), generated humor labeling site with the models ranking, and the top of generated jokes. Blog-post: altsoph.medium.com/humor-are…

2,321

Simons Institute for the Theory of Computing

Pavel Shtykovskiy retweeted

Simons Institute for the Theory of Computing @SimonsInstitute

4 Nov 2024

simons.berkeley.edu/events/s…

155

26,821

Chelsea Finn

Pavel Shtykovskiy retweeted

Chelsea Finn

@chelseabfinn

17 Apr 2023

Want to learn about meta-learning & few-shot learning? All of the latest lecture videos for Stanford CS330 are now online! youtube.com/playlist?list=PL… New topics in Fall '22 include: - self-supervised pre-training - large scale meta-optimization - domain adaptation & generalization

Stanford CS330: Deep Multi-Task and Meta Learning I Autumn 2022

While deep learning has achieved remarkable success in many problems such as image classification, natural language processing, and speech recognition, these...

youtube.com

184

915

150,905

NeurIPS Conference

Pavel Shtykovskiy retweeted

NeurIPS Conference

@NeurIPSConf

13 Jan 2023

You can now watch the recorded material from #NeurIPS2022 online without registration at: slideslive.com/neurips-2022

NeurIPS 2022

slideslive.com

213

771

140,501

Karol Hausman

Pavel Shtykovskiy retweeted

Karol Hausman

@hausman_k

31 Oct 2022

Our 2021 CS330 (cs330.stanford.edu/fall2021) lectures are online: youtube.com/playlist?list=PL… It was a pleasure to co-teach this class with @chelseabfinn. Topics incl. meta-learning, MTL, few-shot learning, deep RL (incl. multi-task, meta, goal-conditioned, hierarchical and offline RL)

417

Sebastien Bubeck

Pavel Shtykovskiy retweeted

Sebastien Bubeck

@SebastienBubeck

30 Jun 2022

The video of my talk @EPFL_en today on Transformers and how to make sense of them is online! youtube.com/watch?v=brmidghO…

484

Soumith Chintala

Pavel Shtykovskiy retweeted

Soumith Chintala

@soumithchintala

4 Feb 2022

Fun read on why MLOps is still somewhat broken -- the engineers who build them are not users. In ML Frameworks, the authors were ML scientists -- (Py)Torch, Theano, Caffe, MXNet, Keras, Chainer, TF, etc. and that helped in design requirements accurately being in your head.

Yaroslav Bulatov

@yaroslavvb

4 Feb 2022

Bananas and ML infrastructure: I've asked around about cloud workflows, and most of the feedback had unhappiness with cloud tooling. This prompted a discussion in @chipro's MLops community -- why are MLops frameworks so bad? (1/9)

250

Yaroslav Bulatov

Pavel Shtykovskiy retweeted

Yaroslav Bulatov

@yaroslavvb

4 Feb 2022

366

Pavel Shtykovskiy

Pavel Shtykovskiy @framrus

24 Jan 2022

Nice blog post on distributed multi-GPU training of large models lilianweng.github.io/lil-log…

Sheldon Axler

Pavel Shtykovskiy retweeted

Sheldon Axler

@AxlerLinear

27 Nov 2021

Today the videos that I made to accompany my book Linear Algebra Done Right surpassed two million minutes of total viewing on YouTube. Those videos are freely available from the links at linear.axler.net/LADRvideos.…. #LinearAlgebra

494

2,713

Sebastien Bubeck

Pavel Shtykovskiy retweeted

Sebastien Bubeck

@SebastienBubeck

3 Nov 2021

Just watched an incredible talk by @AlexGDimakis at the Simons Institute, highly recommended. Their Iterative Layer Optimization technique to solve inverse problems with GANs make a LOT of sense! The empirical results on the famous blurred Obama face speak for themselves! 1/4

441

Pavel Shtykovskiy

Pavel Shtykovskiy @framrus

30 Oct 2021

Inductive Biases for Deep Learning of Higher-Level Cognition (arxiv.org/abs/2011.15091) Fantastic paper!