โšซ๏ธ Artist, technologist and musician โšซ๏ธ โšซ๏ธ ๐”แดœแด›แดœแด€สŸษชsแด๐”› out now via @_otherpeople : awal.ffm.to/mutualismx โšซ๏ธ โšซ๏ธ AI music engineer @semilla_AI โšซ๏ธ

Joined December 2007
Photos and videos
๐”แดœแด›แดœแด€สŸษชsแด๐”› is officially out on @_otherpeople via all streaming services and as 2xLP ! The making of this album been a planetary-spanning journey where a generative music collaboration emerged through time and space via technologies of โ€œsonic becomingโ€ with @semilla_ai
2
6
31
5,768
๐”ฅ๐”ข๐”ต๐”ฌ๐”ฏ๐” ๐”ฆ๐”ฐ๐”ช๐”ฌ๐”ฐ retweeted
Live Music Diffusion Models (Novack et al. ) Can the diffusion ecosystem, offline by nature, be transmuted into streaming? They used it live as a โ€œgenerative delayโ€ transforming a musicianโ€™s improvisation on a gaming laptop. ๐Ÿ”— github.com/ZacharyNovack/livโ€ฆ
1
2
1
245
๐”ฅ๐”ข๐”ต๐”ฌ๐”ฏ๐” ๐”ฆ๐”ฐ๐”ช๐”ฌ๐”ฐ retweeted
LiveBand (Sony CSL ร— QMUL, June 2026) @SonyCSL @QMUL Listens to your live audio and plays along, causal, zero lookahead, runs on consumer hardware. A causal transformer trained adversarially so that training and inference are the same computation. ๐Ÿ”— arxiv.org/abs/2606.0380
1
1
1
174
๐”ฅ๐”ข๐”ต๐”ฌ๐”ฏ๐” ๐”ฆ๐”ฐ๐”ช๐”ฌ๐”ฐ retweeted
The big release: Magenta RealTime 2 from @GoogleMagenta @Ilaria__Manco @jesseengel An open-weights live music model you play like an instrument: โ‰ steered by text, audio examples AND MIDI, injected every 40ms frame โ‰ ships with apps DAW plugin ๐Ÿ”— magenta.withgoogle.com/mrt2
1
3
8
366
๐”ฅ๐”ข๐”ต๐”ฌ๐”ฏ๐” ๐”ฆ๐”ฐ๐”ช๐”ฌ๐”ฐ retweeted
Firstly, our research: ๐•ฎ๐–”๐•ฏ๐–Ž๐•ฎ๐–”๐–‰๐–Š๐–ˆ-๐•ฑ๐–‘๐–”๐–œ โ‰ A Flow Matching DiT that synthesizes audio inside CoDiCodecโ€™s continuous latent space, block-causal, for musical and improvisation. It trains & runs in realtime on an Apple Silicon laptop. ๐Ÿ”— github.com/moiseshorta/CoDiCโ€ฆ
1
1
1
178
We wrote some thoughts and a survey on the new wave of realtime AI music tools. Check it out here ๐Ÿ”Š
Lately, we been been feeling a shift in AI music. The latest most popular systems are offline: you write a prompt, you wait, a finished, derivative .wav is generated. Now the models are learning to play in time with us. En vivo. Hereโ€™s survey of the new realtime wave ๐Ÿงต๐Ÿ‘‡
10
808
๐”ฅ๐”ข๐”ต๐”ฌ๐”ฏ๐” ๐”ฆ๐”ฐ๐”ช๐”ฌ๐”ฐ retweeted
Meet LiveBand: a real-time AI jamming companion! ๐ŸŽธ It generates live music accompaniments with zero perceived latency โšก๏ธ It runs locally on Macbooks, can generate any instrument (more than one at a time), is wildly robust, and is trained from scratch on a single GPU! ๐Ÿงต๐Ÿ‘‡
8
13
96
4,854
๐”ฅ๐”ข๐”ต๐”ฌ๐”ฏ๐” ๐”ฆ๐”ฐ๐”ช๐”ฌ๐”ฐ retweeted
Replying to @GoogleMagenta
I'm also so so happy to see a growing community around local & real-time models that let us turn new tech into genuinely exciting and fresh creative ideas - @dadabots @zacknovack @hexorcismos to name just a few
2
2
14
574
Super interesting paper, the key innovation here being that there's absolutely no 'hidden latent coordinates' but just natural language serving as an encoder and then decoded back into sound with Python scripts via coding LLM's...
Can a sentence carry a sound? In Communicating Sound Through Natural Language, we introduce lexical acoustic coding (LAC): a way for LLM agents to transmit short sounds as structured English, then re-render the same audio back from that text. (1/6)
1
15
1,460
Real-Time Neural Audio Synthesis with CoDiCodec-Flow, generating from a custom model, trained in about 4 hrs. and generating on an M4 Macbook Pro. For those wanting to experiment with it , the code is live performance ready! New software coming up via @semilla_ai ๐Ÿ”œ๐Ÿ”œ๐Ÿ”œ
1
2
32
1,180
Just added a Google Colab notebook to the CoDiCodec-Flow repo. You can train your own real-time neural audio synthesis model with about 2-3 hrs. of audio and get results in about 3 hours of training time. Small Data models > Corporate Big Tech models github.com/moiseshorta/CoDiCโ€ฆ
2
33
1,177
Today I'm open sourcing CoDiCodec-Flow, a generative audio model for real-time neural audio synthesis. github.com/moiseshorta/CoDiCโ€ฆ After 5 years and many gigs performing with RAVE, I always found that there was a lack in sound and expressive quality; CoDiCodec-Flow improves that...
2
2
20
482
Block-Causal DiT Architecture The DiT (Diffusion Transformer) architecture is modified to be block-causal, respecting CoDiCodec's chunk structure. Each chunk contains 8 latent tokens representing 0.683 seconds of audio, with tokens being permutation-invariant within chunks.
1
1
130
Feel free to start trying it out and looking forward to hear what music you make with it!
71
๐”ฅ๐”ข๐”ต๐”ฌ๐”ฏ๐” ๐”ฆ๐”ฐ๐”ช๐”ฌ๐”ฐ retweeted
๐Ÿ”ฅNew @ISMIRConf paper alert: CoDiCodec โ€“ a unified neural audio codec producing both continuous embeddings and discrete tokens from the same model. ๐Ÿ‘‰ pip install codicodec It outperforms existing continuous and discrete codecs in audio quality (FAD, FAD_clap)! ..by the great and only one @marco_ppasini ๐Ÿ’ช CoDiCodec offers - Continuous (~11 Hz) discrete (2.38 kbps) latents - FSQ-dropout: improves continuous decoding while keeping discrete tokens useful - Autoregressive & parallel decoding @SonyCSLParis @SonyCSLMusic #codec
1
4
17
911
๐”ฅ๐”ข๐”ต๐”ฌ๐”ฏ๐” ๐”ฆ๐”ฐ๐”ช๐”ฌ๐”ฐ retweeted
Why is no one talking about relation between LLM coding agents and ADHD?
27
1
67
10,564
๐”ฅ๐”ข๐”ต๐”ฌ๐”ฏ๐” ๐”ฆ๐”ฐ๐”ช๐”ฌ๐”ฐ retweeted
In an alternate timeline weโ€™d be using Evangelion GUI designs rather than CLIs

102
682
7,083
1,658,922
๐”ฅ๐”ข๐”ต๐”ฌ๐”ฏ๐” ๐”ฆ๐”ฐ๐”ช๐”ฌ๐”ฐ retweeted
this started with a striking PC1 falling out of persona space my main insights from the past few months: โŠน โ€œdistance from the Assistantโ€ is the main axis of persona variation across these models e.g. the most relevant thing seems to be โ€œhow Assistant-like is this personaโ€ โŠน this axis already exists in base models and steering with it makes them speak from the POV of helpful archetypes like therapists, coaches, and consultants โŠน not all personas far from the Assistant are bad! the risk comes from departing the more predictable territory of post-trained behaviour still have a lot of questions about what to anthropomorphize, what to treat as fundamentally alienโ€ฆ
New Anthropic Fellows research: the Assistant Axis. When youโ€™re talking to a language model, youโ€™re talking to a character the model is playing: the โ€œAssistant.โ€ Who exactly is this Assistant? And what happens when this persona wears off?
5
22
68
7,917