Yann LeCun

Yann LeCun

Users
Tweets

Yann LeCun

@ylecun

2 Feb 2025

You misread. There had been multiple LLM projects within FAIR for years. Some were open sourced as research prototypes (e.g. OPT175B, Galactica, BlenderBot...). In mid-2022, FAIR started a large LLM project called Zetta, which was still going in late 2022 when ChatGPT came out. A small group at FAIR-Paris was working on theorem proving. They needed an LLM for their own purpose and thought Zetta was too big and not ready. They developed their own model, which eventually became Llama-1. What happened internally between Zetta and Llama is somewhat similar to what just happened between DeepSeek and the big US players: a small team of talented folks innovated and beat the large teams.

450

361,793

meowbooks

meowbooks

@meowbooksj

11 Jun 2023

And finally @Yampeleg. You should just watch it. PS He read OPT175B logbook so we won't have to.

4,802

田中康紀@🧑‍🔧現場主義でモコボイス開発中

田中康紀@🧑‍🔧現場主義でモコボイス開発中

@gojiteji

25 Feb 2023

OPT175Bですらまだ試せていないのにLLaMAまでてきた🦙

233

bioshok

bioshok

@bioshok3

12 Feb 2023

webbigdata.jp/post-17639/ ちなみにFlanT5の論文でOPT175Bというメタがオープンソースにした大規模言語モデルがあるがそいつも明示的にはコードで学習していないっぽい。下の画像はOPT論文の学習データセット。Githubが乗ってない。 GoogleもMETAもコード学習した言語モデル出さないのはなぜか？

2,559

optimizium.base.eth (ξ/e)

optimizium.base.eth (ξ/e) @optimizium_eth

5 Jan 2023

Replying to @DataChaz @huggingface

"Better" is not entirely true according to HELM. It is much better than OPT175B in "Toxicity" category (and better than GPT3 as well). And a little bit better in "Efficiency" and "Robustness". But much worse in "Accuracy", and "Bias".

1,597

Charly Wargnier

Charly Wargnier

@DataChaz

5 Jan 2023

Everyone's talking about #ChatGPT, but did you know there was a free, #opensource equivalent of #GPT3 called Bloom? It's better than OPT175B, and you can easily use it in Transformers or @huggingface's API inference! 🤯 Code in the screenshot's alt. 🔗 huggingface.co/bigscience/bl…

188

1,021

169,211

eliran keren

eliran keren @eliran_keren

24 Dec 2022

I'm blown away by how powerful #GPT3 and #OPT175B are! I didn't realize there was a free, #opensource version of GPT-3. Amazing what technology can do! #ChatGPT

548

Igor Brigadir 🇺🇦

Igor Brigadir 🇺🇦 @IgorBrigadir

6 Dec 2022

Replying to @EigenGender

Yeah - that definitely makes "publishing" these things more challenging. One thing i think does work very well is something like the OPT175B "Lab Logbook" publication model: x.com/PhilippHennig5/status/… this was absolutely invaluable!

bioshok

bioshok

@bioshok3

27 Oct 2022

METAオワコン雰囲気ある人にはあるが、エグいぐらいのAI技術を持っているのも一応認識した方がいい。大規模言語モデルNLLB200、OPT175b,映像生成AI MAKE A VIDEO ,トランスフォーマーモデル超えるかもなMEGA、ロボットハンド強化学習のAI PGDM,Speech2Speech翻訳 METAはメタバースをAIで加速するのでは

129

bioshok

bioshok

@bioshok3

24 Oct 2022

以下の推定によるとGPT3は10^12FLOPSで4トークン程度生成する。docs.google.com/document/d/1… とすると現在の最新のGPUでRTX4090なんかは10^14FLOPSいかないくらいはあるからGPU１個で推論計算は余裕そうなんだけど、OPT175Bは256GB VRAM要求するそうで、10台上記GPUないとダメだけどなんでだろう。気になる

Appendices to biological anchors report

These are appendices to the report “Forecasting AI with biological anchors.” The main report begins here. It is a work in progress and does not represent Open Philanthropy’s institutional view. We...

docs.google.com

日経クロステック IT

日経クロステック IT

@nikkeibpITpro

7 Jun 2022

米メタが巨大言語モデルを公開　未熟で危険な技術を世に放つ成否 #OPT175B #言語モデル #GPT3 #IT経営 #業界動向 dlvr.it/SRnj64

MultiLingual Media

MultiLingual Media

@multilingualmag

12 May 2022

Unlike many other large language models, @Meta's OPT-175B will be available for free to all researchers or institutions that request access. #languagemodels #languagemodel #largelanguagemodel #meta #tech #gpt3 #OPT175B #AI #artificialintelligence

Meta gives researchers full access to its large language model

multilingual.com

Louis-François Bouchard 🎥🤖

Louis-François Bouchard 🎥🤖

@Whats_AI

8 May 2022

An open-source model as powerful as GPT-3! - mailchi.mp/6dae87ca23a9/an-o… #ai #opt #gpt #gpt3 #opt175b #meta #metaai #openai #artificialintelligence #machinelearning #datascience

Aishatu

Aishatu @AishatuAdo

5 May 2022

🤖 Meta's new AI language model OPT-175B ‘has a higher toxicity rate” and it “appears to exhibit more stereotypical biases in almost all categories except for religion.” technologyreview.com/2022/05… #AI #OPT175B

Anna | Ink of Books

Anna | Ink of Books @Ink_of_Books

5 May 2022

Würde mich freuen, wenn ihr euch den Thread zum neuen AI Model von Meta (Facebook) anschaut, ich find dass so spannend! #OPT175B

Anna | Ink of Books @Ink_of_Books

5 May 2022

Replying to @Ink_of_Books

16/ Jetzt gehts richtig los: "evaluate the tendency of OPT-175B to respond with toxic language via the RealToxicityPrompts dataset" Ganz einfach: Die Graphik zeigt, dass OPT-175B (blau) toxischer ist, als die beiden anderen geprüften LLMs und alle toxischer antworten, wenn ...

Arthur Holland Michel

Arthur Holland Michel @WriteArthur

4 May 2022

Meta has released a huge new AI language model called OPT-175B and made it available to a broad array of researchers. It also released a technical report with some truly extraordinary findings about just how dangerous this machine can be. 🧵 #AI #OPT175B

980

3,244

Osmar Zaiane

Osmar Zaiane @ozaiane

3 May 2022

OPT-175b: Open Pre-Trained #language model with 175 billion paramaters is now available to the #research community. #NLP #MachineLearning #OPT175B ai.facebook.com/blog/democra…

Silke Hahn ✨

Silke Hahn ✨ @_SilkeHahn

3 May 2022

Schon wieder Advent? Einen Tag ist man im Konferenztunnel, schon steht das nächste große Sprachmodell in der Tür. Mein Kollege @MengeSonnentag hat eine lesenswerte Meldung zu #OPT175B von #Meta getickert 🫴 #LLM #AI #Zeitenwende

heise Developer @heisedc

3 May 2022

KI-Sprachmodell: Meta schickt den nächsten GPT-3-Herausforderer ins Rennen heise.de/news/KI-Sprachmodel… #m3_2022 #BERT