Mikel Artetxe

Mikel Artetxe

Photos and videos

Tweets

fairseq retweeted

Mikel Artetxe

@artetxem

21 Dec 2021

We are releasing a family of dense and MoE language models with up to 13B and 1.1T parameters. We find that MoEs are more efficient, but the gap narrows at scale and varies greatly across domains and tasks. Paper: arxiv.org/abs/2112.10684 Models & code: github.com/pytorch/fairseq/t…

fairseq

fairseq @fairseq

21 Dec 2021

Models and code available in fairseq: github.com/pytorch/fairseq/t…

Xian Li

@xl_nlp

21 Dec 2021

🌍Few-shot learning beyond English🌏 📢 Announcing XGLMs, a series of multilingual autoregressive languages models setting new SoTA on few-shot learning and outperforming English-centric models (e.g. GPT-3). Paper: arxiv.org/abs/2112.10668 Models and code: github.com/pytorch/fairseq/t…

fairseq

fairseq @fairseq

23 Nov 2021

Mixture of experts training in fairseq is now 40% faster thanks to Microsoft's Tutel library! Blog: microsoft.com/en-us/research… Fairseq code: github.com/pytorch/fairseq/t… Tutel code: github.com/microsoft/tutel

AI at Meta

fairseq retweeted

AI at Meta

@AIatMeta

9 Sep 2021

We’re introducing GSLM, the first language model that breaks free completely of the dependence on text for training. This “textless NLP” approach learns to generate expressive speech using only raw audio recordings as input. Learn more and get the code: ai.facebook.com/blog/textles…

333

1,220

fairseq

fairseq @fairseq

9 Mar 2021

fairseq now supports CPU offloading and full parameter optimizer state sharding via fairscale's FullyShardedDataParallel module. See our tutorial to train a 13B parameter LM on 1 GPU: fb.me/fairseq_fsdp

fairseq

fairseq @fairseq

12 Nov 2020

We just released 0.10.0, which is our last significant release before 1.0.0 when we will migrate to @Hydra_Framework. Changelog: github.com/pytorch/fairseq/r…

Release v0.10.0 · facebookresearch/fairseq

It's been a long time since our last release (0.9.0) nearly a year ago! There have been numerous changes and new features added since then, which we've tried to summarize below. While this ...

github.com

Naman Goyal

fairseq retweeted

Naman Goyal @NamanGoyal21

5 May 2020

Facebook AI Research's sequence modeling library @fairseq has made it's twitter debut. Please follow for latest updates.

PyTorch

fairseq retweeted

PyTorch

@PyTorch

12 Apr 2020

Fairseq includes support for sequence to sequence learning for speech and audio recognition tasks, faster exploration and prototyping of new research ideas while offering a clear path to production. bit.ly/2WfP85X

119

PyTorch

fairseq retweeted

PyTorch

@PyTorch

30 Jul 2019

roberta = torch.hub.load('pytorch/fairseq', 'roberta.large')

AI at Meta

@AIatMeta

29 Jul 2019

Facebook #AI’s RoBERTa is a new training recipe that improves on BERT, @GoogleAI’s self-supervised method for pretraining #NLP systems. By training longer, on more data, and dropping BERT’s next-sentence prediction, RoBERTa topped the GLUE leaderboard. ai.facebook.com/blog/roberta…

340

PyTorch

fairseq retweeted

PyTorch

@PyTorch

16 Jun 2018

fairseq now supports the training of gated convolutional language models (arxiv.org/abs/1612.08083). It can train a Google Billion Word language model on 128 GPUs in less than a day.

Language Modeling with Gated Convolutional Networks

The pre-dominant approach to language modeling to date is based on recurrent neural networks. Their success on this task is often linked to their ability to capture unbounded context. In this...

arxiv.org

PyTorch

fairseq retweeted

PyTorch

@PyTorch

16 Jun 2018

FairSeq Toolkit - Major Update - Distributed Training - Transformer models (big Transformer on WMT Eng-German in < 5 hours on DGX-1) - Fast Inference: translations @ 92 sent/sec for big Transformer - Story Generation Read more at Michael Auli's post: facebook.com/photo.php?fbid=…

132

Yann LeCun

fairseq retweeted

Yann LeCun

@ylecun

18 Sep 2017

Fairseq, now in PyTorch! The open-source convolutional sequence-to-sequence engine from FAIR is now available in... fb.me/1gCPauX6V

Yann LeCun

Fairseq, now in PyTorch! The open-source convolutional sequence-to-sequence engine from FAIR is now available in PyTorch, in addition to the original implementation in (Lua)Torch. Fairseq-py is a...

facebook.com

127

301