@fluidinference , Audio AI & LLM ML Engineer

Joined July 2025
6 Photos and videos
Alex retweeted
NVIDIA's Nemotron 3.5 ASR Streaming Multilingual is now available through FluidAudio optimized for Apple Silicon so apps can run ~40-language real-time ASR entirely on device, no cloud required. Apps shipping with it today include @SpokenlyApp, @ALTIC_DEV, @SnaplyAI, and others. FluidAudio: github.com/FluidInference/Fl… Original model: huggingface.co/nvidia/NVIDIA… @NVIDIAAIDev
3
3
18
1,031
Alex retweeted
LeafTok narrates books on-device using FluidAudio for TTS. Zero network calls during playback. Voices ship with the app. The trade-off vs cloud TTS: bigger app, fully offline.
1
2
3
82
Alex retweeted
Impressive. Very nice. Recently contributed a patch for Parakeet V2/V3 to FluidAudio that makes it match this speed, on-device, i.e. ~300x speed factor. Can transcribe an hour of audio in ~12 seconds on an iPhone. And tdt-ctc-110m is ~33% faster than that (1 hr in ~9s). github.com/FluidInference/Fl…
1
2
237
Alex retweeted
FluidVoice 1.5.15 now supports @NVIDIAAI Nemotron 3.5 ASR Multilingual collaborating with @fluidinference - 40 language-locales, free forever, fully on-device, and ultra fast! 👉Download here —> altic.dev/fluid @NVIDIAAIDev @NVIDIAAI #NemotronSpeech #VoiceAI
4
10
364
Alex retweeted
Snaply now supports NVIDIA Nemotron 3.5 ASR Multilingual via @FluidInference — 40 language-locales, on-device, no cloud, free to use. @NVIDIAAIDev @NVIDIAAI #NemotronSpeech #VoiceAl
3
4
7
353
Alex retweeted
Today's a big day for Nemotron models. Along with Ultra, we also shipped Nemotron Speech 3.5 that now supports 40 Languages and it's insanely Fast and Ultra Low latency! I collaborated with @Alex_tra_memory, @fluidinference and @ALTIC_DEV to port the model to coreML to make bring the latest Nemotron model to any macbook using FluidVoice! Give it a try and lmk what you think! Link below ⬇️
2
4
12
491
Alex retweeted
Nemotron ASR Multilingual running on an iPhone 17 Pro in CoreML. Many thanks to @fluidinference for the CoreML model and to @NVIDIAAI @NVIDIAAIDev for the model itself.
3
10
320
Alex retweeted
Supertonic3 running on an iPhone 17 Pro using ANE on CoreML. It’s blazing fast with low RAM consumption and background capable. 2 mins worth of audio generated in 3 secs. Many thanks to @fluidinference for the port.
4
8
708
when you write with coding agents you are also feeding the company training data. reducing the skills and ppl needed for complex or huge projects.
1
34
gave claude on windows another try, they have def improved the agent to operate much better on windows now. lesss environmental and system troubles, much more ease of operations
1
1
99
just finished chapter 1 of wdndev's llm_interview_note book. i had asked LLM to help quiz me on the concepts. the first chapter is impressive and does well in showing us why transformers were needed. i don't think i even understood half of the content. onto chapter 2 the main piece. i am already starting to doubt if i am can even become a ML engineer wdndev.github.io/llm_intervi…

1
59
this is like the best resource i have ever read about LLMs, the og is chinese but with deepseek you can easily translate or ask it to explain to you. its fluid clarity wdndev.github.io/llm_intervi…

1
86
i. do not have a math background yet alot of finetuning models for ANE optimization is intuition maxing thanks to coding agents.
1
50
alot of metalearning is related to understanding whats the fundamental unit a certain field use to measure or benchmark "progress". like how you can't have an economy without an agreed upon currency
58
tried to fuse mel spectrograms and encoder and decoder models for Parakeet RNNT and CTC before btu the enigmatic ANE Scheduler doesn't make this easy
1
61
Weight matrices and Upsampling is the deduction of additional information from limited information. i.e 6*6 matrix input into a 8*8 upsample matrix based on the position and values of the individual coordinates
53
been using claude code to grill me on coding concepts. man it is brutal in its feedbacks. it is able to politely call you a dumbassss in the most formal way possible
86
🥳  FluidAudio's Parakeet TDT 0.6B v3 CoreML has just crossed 1,000,000 total downloads and it has 472,749 monthly downloads, ranking #648 out of 2  million huggingface.co/FluidInferenc…
3
72