Omar Sanseviero

Omar Sanseviero

8 Photos and videos

Tweets

Edouard Grave retweeted

Omar Sanseviero

@osanseviero

Apr 2

Gemma 4 is here! 🧠 31B and 26B A4B for models with impressive intelligence per parameter 🤏E2B and E4B for mobile and IoT 🤗Apache 2.0 🤖Base and IT checkpoints available Available in AI Studio, Hugging Face, Ollama, Android, and your favorite OS tools 🚀Download it today!

112

899

123,029

kyutai

Edouard Grave retweeted

kyutai @kyutai_labs

6 Jun 2025

Unmute meets Moshi 🫂💖 Talk to unmute.sh!

2:05

6,224

Edouard Grave

Edouard Grave @EXGRV

6 Feb 2025

Today, we release our 🇫🇷 to 🇬🇧 simultaneous speech-to-speech translation system, called Hibiki. It runs on-device & the model, inference code and tech report are available. This is built using the same audio LLM as Moshi, showing its versatility. 🟢

kyutai @kyutai_labs

6 Feb 2025

Meet Hibiki, our simultaneous speech-to-speech translation model, currently supporting 🇫🇷➡️🇬🇧. Hibiki produces spoken and text translations of the input speech in real-time, while preserving the speaker’s voice and optimally adapting its pace based on the semantic content of the source speech. Based on objective and human evaluations, Hibiki outperforms previous systems for quality, naturalness and speaker similarity and approaches human interpreters. 🧵

1:29

1,699

Edouard Grave

Edouard Grave @EXGRV

13 Jan 2025

Excited to release a preview of Helium-1, our 2B LLM targeting edge and mobile devices. 🚀 More to come in the future: training code, support for more languages, data pipeline, tech report & more… 🟢

kyutai @kyutai_labs

13 Jan 2025

Meet Helium-1 preview, our 2B multi-lingual LLM, targeting edge and mobile devices, released under a CC-BY license. Start building with it today! huggingface.co/kyutai/helium…

6,352

kyutai

Edouard Grave retweeted

kyutai @kyutai_labs

13 Jan 2025

Meet Helium-1 preview, our 2B multi-lingual LLM, targeting edge and mobile devices, released under a CC-BY license. Start building with it today! huggingface.co/kyutai/helium…

kyutai/helium-1-preview-2b · Hugging Face

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

huggingface.co

375

58,316

Edouard Grave

Edouard Grave @EXGRV

18 Sep 2024

Local voice models FTW 🚀

kyutai @kyutai_labs

18 Sep 2024

Talking to Moshi locally on a Macbook M series (python 3.12) in 2 lines: pip install moshi_mlx python -m moshi_mlx.local_web -q 4

1,991

Edouard Grave

Edouard Grave @EXGRV

24 Jul 2024

Moshi goes to #ICML2024 in Vienna! Try the demo at moshi.chat/

1:04

9,009

Edouard Grave

Edouard Grave @EXGRV

23 Jul 2024

I am at ICML in Vienna! Let me know if you want to chat about (or to) Moshi, multimodal LLMs, Kyutai & more.

11,255

Alexandre Défossez

Edouard Grave retweeted

Alexandre Défossez @honualx

13 Dec 2023

Looking forward to discuss open research at @kyutai_labs. If you want to work on large scale multimodal LLMs, come and talk to us, this is what we look like 👇☕️

Neil Zeghidour

@neilzegh

13 Dec 2023

Look for my @kyutai_labs colleagues at #NeurIPS2023 if you want to learn more about our mission. We are recruiting permanent staff, post-docs and interns!

101

29,545

Edouard Grave

Edouard Grave @EXGRV

8 Dec 2023

✈️ I will be attending #NeurIPS2023: let me know if you want to chat about the future of LLMs, and how to democratize them. 🌐 We are also hiring members of technical staff and interns @kyutai_labs. Happy to talk about the lab and our mission.

13,401

Edouard Grave

Edouard Grave @EXGRV

17 Nov 2023

/kyutai has landed! Super excited to build this new research lab. Pure focus on research. As open as it gets.

kyutai @kyutai_labs

17 Nov 2023

Announcing Kyutai: a non-profit AI lab dedicated to open science. Thanks to Xavier Niel (@GroupeIliad), Rodolphe Saadé (@cmacgm) and Eric Schmidt (@SchmidtFutures ), we are starting with almost 300M€ of philanthropic support. Meet the team ⬇️

152

12,536

Edouard Grave

Edouard Grave @EXGRV

24 Feb 2023

Super excited by the release of LLaMA, a serie of large language models, from 7B to 65B parameters. 🎉 By training longer, LLaMA obtains GPT3 level performance with a 13B model, which can run on a single GPU. Excited to see what the research community will do with these models.

Guillaume Lample @ NeurIPS 2024

@GuillaumeLample

24 Feb 2023

Today we release LLaMA, 4 foundation models ranging from 7B to 65B parameters. LLaMA-13B outperforms OPT and GPT-3 175B on most benchmarks. LLaMA-65B is competitive with Chinchilla 70B and PaLM 540B. The weights for all models are open and available at research.facebook.com/public… 1/n

36,755

Arthur Mensch

Edouard Grave retweeted

Arthur Mensch @arthurmensch

23 Feb 2023

Replying to @itsandrewgao

Paris of course

5,228

Edouard Grave

Edouard Grave @EXGRV

25 Aug 2022

Introducing PEER, a new language model which makes text generation and editing more collaborative and controllable. It adds human in the loop, by following instructions and providing explanations. Work lead @timo_schick. Paper: arxiv.org/abs/2208.11663

Timo Schick @timo_schick

25 Aug 2022

🎉 New paper 🎉 We introduce PEER, a language model trained to incrementally write texts & collaborate w/ humans in a more natural way. It can write drafts, add suggestions, follow instructions, perform edits, correct itself & provide explanations. Link: arxiv.org/abs/2208.11663

Edouard Grave

Edouard Grave @EXGRV

8 Aug 2022

Very excited to introduce Atlas, a new retrieval augmented language model which is competitive with larger models on few-shot tasks such as question answering or fact checking. Work lead by @gizacard and @PSH_Lewis. Paper: arxiv.org/abs/2208.03299

Patrick Lewis @PSH_Lewis

8 Aug 2022

🚨We’ve been working on better retrieval-augmented models & thrilled to present Atlas, led by @gizacard @EXGRV & myself🚨 Atlas is a end2end pretrained "RAG"-like model, beats models 50x its size on fewshot QA, sets numerous SotA on knowledge-intensive NLP arxiv.org/abs/2208.03299

more replies

Edouard Grave

Edouard Grave @EXGRV

8 Aug 2022

Our model, at 11B parameters, and significantly less training compute, outperforms LLMs on 64-shot question answering ( 3 pts wrt SOTA) or 15-shot fact checking ( 5 pts wrt SOTA).

Edouard Grave

Edouard Grave @EXGRV

8 Aug 2022

Joint work with the great following team: @gizacard @PSH_Lewis @MariaLomeli_ @lucas_hosseini @Fabio_Petroni @timo_schick Jane Dwivedi-Yu @armandjoulin @riedelcastro

Zeming Lin

Edouard Grave retweeted

Zeming Lin

@ebetica

21 Jul 2022

Excited to present our work on single-sequence protein folding from a language model! By stacking a simple folding trunk and Alphafold2's structure module on top of the language model, we get accurate structure prediction in a fraction of the runtime.

Alex Rives

@alexrives

21 Jul 2022

We have trained ESMFold to predict full atomic protein structure directly from language model representations of a single sequence. Accuracy is competitive with AlphaFold on most proteins with order of magnitude faster inference. By @MetaAI Protein Team. biorxiv.org/content/10.1101/…

0:16

Edouard Grave

Edouard Grave @EXGRV

1 Jun 2022

New release of our Contriever project! It includes multi-lingual models which can perform cross-lingual retrieval (eg, retrieve English documents to answer a question in Swahili), the code to (pre-)train your own retrievers, and an updated version of the paper with new results.

Gautier Izacard @gizacard

31 May 2022

Code for Contriever is now available! Code: github.com/facebookresearch/… Paper: arxiv.org/pdf/2112.09118.pdf Additionally we trained mContriever, a state-of-the-art multilingual neural retriever, by applying a similar contrastive learning method.