Lights and sounds are kinda my thing ✨

Joined March 2016
1,196 Photos and videos
MClem retweeted
Introducing Magenta RealTime 2 (MRT2): the live music model you can play as an instrument. MRT2 offers MIDI and prompt controls, and runs natively on a MacBook with <200ms latency. Open weights. Open source inference engine. Suite of apps and plugins. Hear what it can do and try it out for yourself below 🧵
21
95
493
109,421
MClem retweeted
Latent modulation has so much untapped potential. LFO 🤝 AI
Really proud to share something we’ve been working on for a while: Magenta RealTime 2 (MTR2), a live music model that is highly interactive (MIDI, audio, text, lots of parameters) and low-latency (~200ms end-to-end), and runs locally on a MacBook!
2
3
37
3,179
MClem retweeted
Had one of those "Oh, I'm living in the future" moments yesterday. Flying on the airplane. No internet. Playing music with hand gestures controlling a realtime LLM running locally on my laptop Person next to me thinking I'm crazy... all the while I'm having a great time 😅
14
21
199
22,980
MClem retweeted
🎉Jazzed after Day 1 of @BerkleeCollege's AI Music Summit. Many fellow speakers on #TeamHuman: small models, attribution, iteration, live perf., user-owned data, selective adoption. Stellar lineup of creators, educators, devs, CEOs & lawyers worldwide. Shows by OOD human artists
2
1
7
453
MClem retweeted
Have been doing some stem remixing work with Stable Audio 3. 📦 Medium model 🔉 init_audio holding the original audio file 😶‍🌫️ init_noise_level between 0.4-0.5 seems to be the sweet spot 🪄 Empty promps
2
8
56
14,182
MClem retweeted
We've got a new model coming out next week! We've been having a lot of fun playing with it, and I hope you will too♥️ We'll be celebrating by presenting at the AI Music Summit at Berklee and helping teams at the hackathon afterwards build some wild new musical instruments 🎸
Help build a future of AI in music that's live, interactive, and deeply human. Join Google DeepMind's Magenta team in Boston and get access to a generative model you can actually play as an instrument, and build your own AI instrument, plugin, or performance. 📍 Boston Music Technology Hackathon
1
7
53
5,064
MClem retweeted
New diffusion model music instruments? Yes, please!
Can we transform offline audio diffusion into real-time streaming interactive instruments? Yes! Presenting Live Music Diffusion Models: a new paradigm for taking your favorite open models into live performance, right on your own laptop! 🎵🎵 🧵
1
17
1,294
MClem retweeted
Using Stable Audio 3 to generate variations of an existing loop. Unconditional generation (no prompt), renoising the latents to 0.5, and just using different seeds seems to generate a nice neighbourhood around the original. Generally keeps the harmonic context and feel.
9
10
154
10,753
MClem retweeted
Experiment: Painting sound effects with @StabilityAI Stable Audio 3 1. Free-form drawing on a spectrogram-like canvas. Time on the x-axis, pitch on the y. 2. Synthesise the drawing to audio. Strokes control bandpass-filtering of a white noise source. 3. Use that audio as input for SA3 audio-to-audio pass combined with a text prompt. In Claude Desktop with a handful of MCP tools.
2
8
44
4,697
MClem retweeted
The call for the NeurIPS 2026 Creative AI Track is out! In its fourth year, NeurIPS 2026 Creative AI Track invites research papers and artworks that explore emerging applications, methods, and critiques of artificial intelligence and machine learning in art, design, and creative practice. Focusing on the theme of Agency, this year’s track asks: how agency emerges, is exercised, is negotiated, and is contested through creative practice with AI. Agency may belong to an artist, a collaborator, a model, an audience, a platform, a community, or even a larger social and technical system, and may be asserted, delegated, shared, resisted, constrained, or redistributed. Important dates: June 30: Submission Portal Opens August 3 (Anywhere on earth): Submission Deadline September 18: Decision October 23: Final Camera-Ready Submission For more information, visit: neurips.cc/Conferences/2026/…

3
34
149
30,628
MClem retweeted
The moment it feels like playing a live instrument, we surpass the early days of neural synthesis -- are we close? Novack and crew are gods -- they hath finetuned a Stable Audio Open Small into a live music diffusion model for thy pleasure
Can we transform offline audio diffusion into real-time streaming interactive instruments? Yes! Presenting Live Music Diffusion Models: a new paradigm for taking your favorite open models into live performance, right on your own laptop! 🎵🎵 🧵
2
5
32
2,392
MClem retweeted
Can we transform offline audio diffusion into real-time streaming interactive instruments? Yes! Presenting Live Music Diffusion Models: a new paradigm for taking your favorite open models into live performance, right on your own laptop! 🎵🎵 🧵
9
29
161
13,324
MClem retweeted
Stable Audio 3, explained in 5 figures. It’s a family of open-weight models for generating instrumental music and sound effects. The models are fast, support editing, and are trained on licensed and Creative Commons audio. 👾 artintech.substack.com/p/sta… 🏋️‍♂️github.com/Stability-AI/stab…
5
23
118
43,634
MClem retweeted
I’m promoting our new conversational music recommendation dataset, Reddit2Deezer, the largest real-world, grounded CMR dataset (200k–600k conversations). The tracks and albums are mapped to the Deezer API, which enables straightforward access to audio previews and rich metadata.
1
3
14
522
If your uni is open to visiting researchers, reach out to Scott! Stellar researcher and human!! 🙌
All grades in!! 14-Month Sabbatical starts NOW! 🎉 Still open to Visiting Researcher collabs -- DMs open.
1
173
MClem retweeted
Say hello to Project LYDIA Phase II! Developed in partnership with our friends at Roland Future Design Lab @RolandGlobal, we're proud to announce the next step in our journey towards neural hardware. Article: articles.roland.com/project-…
6
14
1,689
In the spirit of celebrating non-AI creative work; it’s the anniversary of this music video, in which I held my breath for 4.5 minutes in order to perform the entire song underwater in one shot. No FX, no splicing of takes. It took me 3 months of training to get my breath hold up to a stationary 5 minutes in preparation for the shoot (don’t worry, the drowning is acting, I wasn’t running out of air.) It’s a slow-paced video. But the focus on a gradual buildup of surreal dread is meant to activate mirror neurons in the watcher, the same ones that turn on when you see someone yawn. I wanted you, over the course of 4 minutes, to slowly drown with me. To experience the feeling that birthed this song in the first place. (Full thing on YT, it’s called Mabúl)
5
36
252
7,089
MClem retweeted
I guess we were just ahead of the curve... :) Lo-Fi Player share.google/WjKVM95F4QC1Mx2…
INTRODUCING CLAUDE MUSIC The $900B company @claudeai is literally streaming LOFI music on YouTube 💀
1
1
13
1,415
MClem retweeted
We're launching the agentic robotics app store today. Let's democratize AI robotics for all! 300 apps shipped. 10,000 robots in the wild. It used to take weeks from a robotics engineer to build apps, now everyone can do it in hours with ML intern or your favorite neighborhood agent! My favorite reachy mini app was built by Joel, a 78yo marketing exec who'd never coded in his life. Personally, I built an office receptionist in two hours last week. More info to start building here: huggingface.co/blog/clem/rea…
46
97
693
125,232
Very exciting work! The demos are fascinating!
1/ Excited to release MIDI-SAG! We explore Singing Accompaniment Generation (SAG) and lyrics-to-song generation in one compositional song generation pipeline. Paper: arxiv.org/pdf/2602.22029 Code: github.com/fundwotsai2001/MI… Demo: composerflow.github.io/web_r…
2
9
946