Our "Riddle of the Beginning" debate (with the great Scott Aaronson and the eminent Pria Natarayan) is online. (I remember I was a bit flummoxed by how some of my interlocutors seem actually no longer curious about this question...) iai.tv/video/the-riddle-of-t…
Ilya Sutskever gave a full talk on "Sequence to sequence learning with neural networks"
It's one of the most important talks in 2024
Here are some of my favourite insights from the talk 🧵
Cohere just dropped Command R7B
The smallest, fastest, state-of-the-art enterprise-grade LLM.
The best part? It’s Open Weights!
Here’s everything you need to know 🧵
(Cohere Partner)
Amazon's new Nova models give #CCaaS providers another option as they look to right-size models for optimal performance and cost. More in the comments.
I read Google's paper about their quantum computer so you don't have to.
They claim to have ran a quantum computation in 5 minutes that would take a normal computer 10^25 years.
But what was that computation? Does it live up to the hype?
I will break it down.🧵
1/
Employees are taking advantage of GenAI to ask for business insights in plain language. Amazon Q Business is an example enabler of this capability. Yesterday Amazon announced updates to Amazon Q.
Unexpected. @amazon is back with Foundation Models. As part of re:Invent they announced 6 new foundation models from text only to text-to-video! 👀 Nova models will be exclusively available through Amazon Bedrock.
TL;DR;
🧠 Micro (text-only), Lite (multimodal), Pro (high-capability), and Premier (coming 2025)
🎨 Canvas (image-generation) and Reel (video-generation)
📊 Context length up to 300K tokens and 200 languages
🥇 Performance on benchmarks similar to Llama 3
🗺️ Models currently only available in AWS Regions in the US
🔒 Includes watermarking capabilities (no details here)
🔧 Can be fine-tuned inside Amazon Bedrock
💰 Micro: $0.035 / $0.14; Lite: $0.06 / $0.24; Pro: $0.80 / $3.20 per million input/output tokens
@rohanpaul_ai I was thinking about the article you posted on how LLMs can navigate GUIs to carry out tasks for a person, all based just on a person's instructions. 1 / 2
ICLR is a top ML conference. All 10k papers from 2025 are in Open Review.
The top rated papers include:
— Scaling LLM interpretability to GPT4 scale
— Changing light source w consistent image
— 100x faster diffusion models
— A provable theory for LLM jailbreaks
Thread...
1/10
NVIDIA’s new sound from text model seems amazing. It’s crazy what you can create these days by just asking for it. Link to their demo video in the comment.