Jason Baldridge

Jason Baldridge

Photos and videos

Tweets

Scott Wisdom retweeted

20 May 2025

Veo 3 is here, and in addition to better visuals, it makes noises and speaks! This was a massive effort made possible by incredible passion from the whole Veo team and the many other team enabling it to launch today. Looking forward to seeing what others do with it! #veo3

0:08

230

19,764

Sundar Pichai

Scott Wisdom retweeted

Sundar Pichai

@sundarpichai

20 May 2025

Veo 3, our SOTA video generation model, has native audio generation and is absolutely mindblowing. For filmmakers creatives, we’re combining the best of Veo, Imagen and Gemini into a new filmmaking tool called Flow. Ready today for Google AI Pro and Ultra plan subscribers.

0:08

841

92,537

Google DeepMind

Scott Wisdom retweeted

Google DeepMind

@GoogleDeepMind

17 Jun 2024

We're sharing progress on our video-to-audio (V2A) generative technology. 🎥 It can add sound to silent clips that match the acoustics of the scene, accompany on-screen action, and more. Here are 4 examples - turn your sound on. 🧵🔊 dpmd.ai/v2a

0:12

347

1,479

529,219

Vivek Kumar

Scott Wisdom retweeted

Vivek Kumar @vivek_kumar

4 Oct 2023

It's so awesome to see the impact of the computational audio capabilities we developed featured in @madebygoogle 🎉 🎉 🎉 Congrats to John Hershey, @ScottTWisdom, @PGetreuer & everyone who contributed for pioneering new computational audio capabilities in Pixel8 #MadeByGoogle

Google Photos

@googlephotos

4 Oct 2023

Check out the 4 new Google Photos features coming first to Pixel 8 and 8 Pro ↓ Whether it’s noise from wind, traffic, or barking dogs, Audio Magic Eraser in Google Photos reduces distracting sounds in your video in just a few taps! 🪄

0:21

20,301

Jonathan Le Roux

Scott Wisdom retweeted

Jonathan Le Roux @JonathanLeRoux

21 Mar 2023

Sorry it took forever (I did the editing this year...): videos of all #SANE2022 talks by @TweetRupal @mhnt1580 @ScottTWisdom @tnsainath @shinjiw_at_cmu @anoopcherian @gan_chuang are finally available! Here's the essential binge-watching YouTube playlist👇 youtube.com/playlist?list=PL…

SANE 2022 @ Kendall Square

SANE 2022, a one-day event gathering researchers and students in speech and audio from the Northeast of the American continent, was held on Thursday October ...

youtube.com

4,463

Jonathan Le Roux

Scott Wisdom retweeted

Jonathan Le Roux @JonathanLeRoux

6 Oct 2022

Strong showing at #SANE2022 to learn about the latest and greatest in speech and audio research from a stellar lineup!

Efthymios Tzinis

Scott Wisdom retweeted

Efthymios Tzinis @ETzinis

1 Oct 2022

Here is a short presentation of AudioScopeV2!📢 @ScottTWisdom and I are looking forward to discussing further about open-domain on-screen sound separation and meeting you in #ECCV2022! webpage:google-research.github.io/so…... arxiv:arxiv.org/abs/2207.10141 video:youtu.be/6UgcS3NdPn8

Jonathan Le Roux

Scott Wisdom retweeted

Jonathan Le Roux @JonathanLeRoux

12 Sep 2022

Full list of speakers and talk details for #SANE2022 (Thursday 10/6, Cambridge, MA) now available! @anoopcherian @gan_chuang @mhnt1580 @TweetRupal @tnsainath @shinjiw_at_cmu @ScottTWisdom Poster & demo submissions due 9/21. Registration/Details: saneworkshop.org

SANE 2026 - Speech and Audio in the Northeast

SANE is a series of workshops gathering researchers and students in speech and audio from the Northeast of the American continent.

saneworkshop.org

Efthymios Tzinis

Scott Wisdom retweeted

Efthymios Tzinis @ETzinis

22 Jul 2022

I am 😃 that we will present AudioScopeV2 at #ECCV2022! If you want to learn about improved audio-visual attention models and calibration for on-screen sound separation check our paper w. @ScottTWisdom! project-page: google-research.github.io/so… new dataset: github.com/google-research/s…

arXiv Sound @ArxivSound

22 Jul 2022

``AudioScopeV2: Audio-Visual Attention Architectures for Calibrated Open-Domain On-Screen Sound Separation. (arXiv:2207.10141v1 [cs.SD]),'' Efthymios Tzinis, Scott Wisdom, Tal Remez, John R. Hershey, ift.tt/jOrEQWR

AK

Scott Wisdom retweeted

@_akhaliq

4 Jul 2022

Distance-Based Sound Separation abs: arxiv.org/abs/2207.00562 project page: google-research.github.io/so… With a single nearby speaker and four distant speakers, the model improves scale-invariant signal to noise ratio by 4.4 dB for near sounds and 6.8 dB for far sounds

104

Jonathan Le Roux

Scott Wisdom retweeted

Jonathan Le Roux @JonathanLeRoux

6 Jun 2022

SANE is back! Thursday, Oct. 6 in Kendall Square, Cambridge, MA. Confirmed speakers: A. Cherian @anoopcherian, C. Gan @gan_chuang, W.-N. Hsu @mhnt1580, T. Sainath @tnsainath, S. Watanabe @shinjiw_at_cmu, S. Wisdom @ScottTWisdom. More details: saneworkshop.org

SANE 2026 - Speech and Audio in the Northeast

SANE is a series of workshops gathering researchers and students in speech and audio from the Northeast of the American continent.

saneworkshop.org

Aswin Sivaraman

Scott Wisdom retweeted

Aswin Sivaraman @actuallyaswin

22 Jan 2022

Happy to see my summer work with @ScottTWisdom, Hakan Erdogan, and John Hershey was accepted for presentation at @ieeeICASSP 2022 😊 My first ICASSP paper in the books! Immensely thankful for their mentorship. Our first version can be found on arXiv at: arxiv.org/abs/2110.10739

Sundar Pichai

Scott Wisdom retweeted

Sundar Pichai

@sundarpichai

24 Jan 2022

We can learn a lot about our environment just by listening to the birds. New #GoogleAI approaches can help isolate and identify birdsongs, helping ecologists better understand food systems and forest health. 🐦 ai.googleblog.com/2022/01/se…

101

152

1,569

Eduardo Fonseca

Scott Wisdom retweeted

Eduardo Fonseca @edfonseca_

22 Oct 2021

Our paper received a #WASPAA2021 special award for *Best Audio Representation Learning Paper*: "Self-Supervised Learning from Automatically Separated Sound Scenes". 🎉🚀 paper: arxiv.org/abs/2105.02132 talk: youtu.be/Tts5vYmGwUY slides: bit.ly/3lBjAnr 👇

Yuma Koizumi

Scott Wisdom retweeted

Yuma Koizumi @yuma_koizumi

20 Oct 2021

Our DF-Conformer paper has received the “Best Speech Enhancement Paper Award” from #WASPAA2021! Yay!!

This tweet is unavailable

Eduardo Fonseca

Scott Wisdom retweeted

Eduardo Fonseca @edfonseca_

12 Oct 2021

🔊Here's the video presentation of our WASPAA21 paper: "Self-Supervised Learning from Automatically Separated Sound Scenes". Work done during an internship at Google Research. paper: arxiv.org/abs/2105.02132 video: youtu.be/Tts5vYmGwUY slides: bit.ly/3lBjAnr

Eduardo Fonseca

Scott Wisdom retweeted

Eduardo Fonseca @edfonseca_

2 Oct 2020

🔊Happy to announce FSD50K: the new open dataset of human-labeled sound events! Over 51k Freesound audio clips, totalling over 100h of audio manually labeled using 200 classes drawn from the AudioSet Ontology. Paper: arxiv.org/pdf/2010.00475.pdf Dataset: doi.org/10.5281/zenodo.40604…

239

Efthymios Tzinis

Scott Wisdom retweeted

Efthymios Tzinis @ETzinis

26 Sep 2020

I am thrilled to announce that our paper "Unsupervised Sound Separation using Mixtures of Mixtures" got accepted to #NeurIPS2020 as a #Spotlight paper!! 📢📢 All kudos to @ScottTWisdom and the rest of the Google guys! arxiv.org/pdf/2006.12701.pdf

Mirco Ravanelli

Scott Wisdom retweeted

Mirco Ravanelli @mirco_ravanelli

4 Aug 2020

We are very happy to announce that all the videos of our recent #ICML2020 workshop on self-supervised learning are now publicly available at slideslive.com/icml-2020/sel… Thanks #ICML2020 and @SlidesLive for that! @MILAMontreal #DeepLearning #AI #Speech #MachineLearning

This tweet is unavailable

126

Scott Wisdom

Scott Wisdom @ScottTWisdom

25 Jul 2020

Glad you like it, thanks for the nice summary!

Joe Antognini @joe_antognini

16 Jul 2020

I'm a bit late posting this, but a very cool paper from Scott Wisdom and collaborators (including @ETzinis) out of Google introducing "MixIT": arxiv.org/abs/2006.12701 They tackle the problem of *unsupervised* source separation! 1/10