Research scientist at @GoogleAI working on sound separation

Joined July 2011
Photos and videos
Scott Wisdom retweeted
Veo 3 is here, and in addition to better visuals, it makes noises and speaks! This was a massive effort made possible by incredible passion from the whole Veo team and the many other team enabling it to launch today. Looking forward to seeing what others do with it! #veo3
12
30
230
19,764
Scott Wisdom retweeted
Veo 3, our SOTA video generation model, has native audio generation and is absolutely mindblowing. For filmmakers creatives, we’re combining the best of Veo, Imagen and Gemini into a new filmmaking tool called Flow. Ready today for Google AI Pro and Ultra plan subscribers.
9
58
841
92,537
Scott Wisdom retweeted
We're sharing progress on our video-to-audio (V2A) generative technology. 🎥 It can add sound to silent clips that match the acoustics of the scene, accompany on-screen action, and more. Here are 4 examples - turn your sound on. 🧵🔊 dpmd.ai/v2a
90
347
1,479
529,219
Scott Wisdom retweeted
It's so awesome to see the impact of the computational audio capabilities we developed featured in @madebygoogle 🎉 🎉 🎉 Congrats to John Hershey, @ScottTWisdom, @PGetreuer & everyone who contributed for pioneering new computational audio capabilities in Pixel8 #MadeByGoogle
Check out the 4 new Google Photos features coming first to Pixel 8 and 8 Pro ↓ Whether it’s noise from wind, traffic, or barking dogs, Audio Magic Eraser in Google Photos reduces distracting sounds in your video in just a few taps! 🪄
4
16
60
20,301
Scott Wisdom retweeted
Strong showing at #SANE2022 to learn about the latest and greatest in speech and audio research from a stellar lineup!
2
3
70
Scott Wisdom retweeted
Here is a short presentation of AudioScopeV2!📢 @ScottTWisdom and I are looking forward to discussing further about open-domain on-screen sound separation and meeting you in #ECCV2022! webpage:google-research.github.io/so…... arxiv:arxiv.org/abs/2207.10141 video:youtu.be/6UgcS3NdPn8

1
2
26
Scott Wisdom retweeted
I am 😃 that we will present AudioScopeV2 at #ECCV2022! If you want to learn about improved audio-visual attention models and calibration for on-screen sound separation check our paper w. @ScottTWisdom! project-page: google-research.github.io/so… new dataset: github.com/google-research/s…

``AudioScopeV2: Audio-Visual Attention Architectures for Calibrated Open-Domain On-Screen Sound Separation. (arXiv:2207.10141v1 [cs.SD]),'' Efthymios Tzinis, Scott Wisdom, Tal Remez, John R. Hershey, ift.tt/jOrEQWR
1
3
21
Scott Wisdom retweeted
4 Jul 2022
Distance-Based Sound Separation abs: arxiv.org/abs/2207.00562 project page: google-research.github.io/so… With a single nearby speaker and four distant speakers, the model improves scale-invariant signal to noise ratio by 4.4 dB for near sounds and 6.8 dB for far sounds
24
104
Scott Wisdom retweeted
SANE is back! Thursday, Oct. 6 in Kendall Square, Cambridge, MA. Confirmed speakers: A. Cherian @anoopcherian, C. Gan @gan_chuang, W.-N. Hsu @mhnt1580, T. Sainath @tnsainath, S. Watanabe @shinjiw_at_cmu, S. Wisdom @ScottTWisdom. More details: saneworkshop.org
1
4
26
Scott Wisdom retweeted
Happy to see my summer work with @ScottTWisdom, Hakan Erdogan, and John Hershey was accepted for presentation at @ieeeICASSP 2022 😊 My first ICASSP paper in the books! Immensely thankful for their mentorship. Our first version can be found on arXiv at: arxiv.org/abs/2110.10739
2
1
36
Scott Wisdom retweeted
We can learn a lot about our environment just by listening to the birds. New #GoogleAI approaches can help isolate and identify birdsongs, helping ecologists better understand food systems and forest health. 🐦 ai.googleblog.com/2022/01/se…

101
152
1,569
Scott Wisdom retweeted
Our paper received a #WASPAA2021 special award for *Best Audio Representation Learning Paper*: "Self-Supervised Learning from Automatically Separated Sound Scenes". 🎉🚀 paper: arxiv.org/abs/2105.02132 talk: youtu.be/Tts5vYmGwUY slides: bit.ly/3lBjAnr 👇
9
16
88
Scott Wisdom retweeted
Our DF-Conformer paper has received the “Best Speech Enhancement Paper Award” from #WASPAA2021! Yay!!
2
15
77
Scott Wisdom retweeted
🔊Here's the video presentation of our WASPAA21 paper: "Self-Supervised Learning from Automatically Separated Sound Scenes". Work done during an internship at Google Research. paper: arxiv.org/abs/2105.02132 video: youtu.be/Tts5vYmGwUY slides: bit.ly/3lBjAnr
1
11
68
Scott Wisdom retweeted
🔊Happy to announce FSD50K: the new open dataset of human-labeled sound events! Over 51k Freesound audio clips, totalling over 100h of audio manually labeled using 200 classes drawn from the AudioSet Ontology. Paper: arxiv.org/pdf/2010.00475.pdf Dataset: doi.org/10.5281/zenodo.40604…
4
79
239
Scott Wisdom retweeted
I am thrilled to announce that our paper "Unsupervised Sound Separation using Mixtures of Mixtures" got accepted to #NeurIPS2020 as a #Spotlight paper!! 📢📢 All kudos to @ScottTWisdom and the rest of the Google guys! arxiv.org/pdf/2006.12701.pdf

3
11
69
Scott Wisdom retweeted
We are very happy to announce that all the videos of our recent #ICML2020 workshop on self-supervised learning are now publicly available at slideslive.com/icml-2020/sel… Thanks #ICML2020 and @SlidesLive for that! @MILAMontreal #DeepLearning #AI #Speech #MachineLearning
2
56
126
Glad you like it, thanks for the nice summary!
I'm a bit late posting this, but a very cool paper from Scott Wisdom and collaborators (including @ETzinis) out of Google introducing "MixIT": arxiv.org/abs/2006.12701 They tackle the problem of *unsupervised* source separation! 1/10
1
5