mixtral-8x7b-32kseqlen from @MistralAI. Mixture of Experts? 🤔 x.com/MistralAI/status/17331…

8 Dec 2023
magnet:?xt=urn:btih:5546272da9065eddeb6fcd7ffddeef5b75be79a7&dn=mixtral-8x7b-32kseqlen&tr=udp:/%2Fopentracker.i2p.rocks:6969/announce&tr=http:/%https://t.co/g0m9cEUz0T:80/announce RELEASE a6bbd9affe0c2725c1b7410d66833e24

New open-weights LLM from @MistralAI

params.json:
- hidden_dim / dim = 14336/4096 => 3.5X MLP expand
- n_heads / n_kv_heads = 32/8 => 4X multiquery (grouped-query attention: 4 query heads share each KV head)
- "moe" => mixture of experts 8X top 2 👀

Likely related code: github.com/mistralai/megablo…

Oddly absent: an over-rehearsed professional release video talking about a revolution in AI.

If people are wondering why there is so much AI activity right around now, it's because the biggest deep learning conference (NeurIPS) is next week.
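The params.json ratios quoted above, and what "8X top 2" means for a single token, can be sketched roughly as follows. This is a toy illustration, not Mistral's code: the key names `num_experts` and `experts_per_token`, the helper functions, and the scaling "experts" are all hypothetical, and normalizing the router scores with a softmax over only the selected experts is one common choice assumed here.

```python
import math

# Numbers quoted in the post. "num_experts" and "experts_per_token" are
# hypothetical key names for this sketch (the post only says "8X top 2").
params = {
    "dim": 4096,
    "hidden_dim": 14336,
    "n_heads": 32,
    "n_kv_heads": 8,
    "num_experts": 8,
    "experts_per_token": 2,
}

# MLP expansion factor and number of query heads sharing each KV head.
mlp_expand = params["hidden_dim"] / params["dim"]
queries_per_kv = params["n_heads"] // params["n_kv_heads"]


def top_k_route(router_logits, k):
    """Pick the k highest-scoring experts; softmax over just those scores."""
    order = sorted(range(len(router_logits)),
                   key=lambda i: router_logits[i], reverse=True)
    top = order[:k]
    peak = max(router_logits[i] for i in top)  # subtract max for stability
    exps = [math.exp(router_logits[i] - peak) for i in top]
    total = sum(exps)
    return [(i, e / total) for i, e in zip(top, exps)]


def moe_layer(x, experts, router_logits, k):
    """Weighted sum of the outputs of the k selected experts for one token."""
    out = [0.0] * len(x)
    for idx, weight in top_k_route(router_logits, k):
        y = experts[idx](x)
        out = [o + weight * yi for o, yi in zip(out, y)]
    return out


# Toy experts (assumption): expert i just scales its input by i + 1.
toy_experts = [
    (lambda x, s=i + 1: [s * v for v in x])
    for i in range(params["num_experts"])
]

toy_logits = [0.0, 1.0, 0.0, 3.0, 0.0, 0.0, 0.0, 0.0]
routes = top_k_route(toy_logits, params["experts_per_token"])
y = moe_layer([1.0, 2.0], toy_experts, toy_logits, params["experts_per_token"])
```

Only 2 of the 8 experts run for each token, so per-token compute stays close to that of a much smaller dense model even though all 8 experts' weights must be stored.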
Stability AI released StableLM: an open-source LLM available in 3B and 7B parameter sizes. RLHF and larger models coming soon.
Meta releases DINOv2, the first method for training computer vision models that uses self-supervised learning (no labeling needed) to achieve industry-standard results.
Damo releases an open-source text-to-video model with 1.7B parameters. The demo requires only 16GB of CPU RAM and 16GB of GPU RAM to run. Try it out on Hugging Face.
Researchers introduce Video-P2P: Video Editing with Cross-attention Control.