Researchers Achieve 16x Compression Breakthrough to Challenge Bigger AI Context Windows Read here: ow.ly/jkkz50Zb8TG #AIResearch #MachineLearning #DataCompression #AIInnovation #TechBreakthrough #DeepLearning #ContextWindows #AIContext #ResearchInnovation
23
Replying to @badlogicgames
ngl atm - none of them. Though Kimi on our own GPUs is quite interesting, yet with the power consumption it's not cheap to run. For the moment we run tests while thinking about how to build a digital twin of our operations and how to describe this world to some clankers within their respective context windows…
3
81
🤔 Ever wondered how large language models remember what you said 5,000 words ago? It all comes down to one thing — attention. Let’s break down what attention really is, why it matters, and how it helps models expand their context windows. Want to learn how to build scalable LLM Applications? Join Our LLM Bootcamp: hubs.la/Q03MkrX_0 #ArtificialIntelligence #MachineLearning #DeepLearning #NeuralNetworks #Transformers #LLM #AIModels #AttentionMechanism #ContextWindows
1
2
5
2,063
OpenAI just launched GPT-4.1 — and it’s a major leap forward. This new model isn’t just an upgrade—it’s smarter, faster, and more efficient across the board. From 1 million token context windows to major gains in coding, reasoning, and multimodal understanding, GPT-4.1 outperforms GPT-4o and even beats GPT-4.5 in key areas. It follows complex instructions better, remembers previous conversations more accurately, and delivers high precision on code edits and visual tasks. Plus, it’s more affordable and faster to use—especially for repeated prompts. Swipe through to see what’s new and why GPT-4.1 might be the most capable model yet. #OpenAI #ChatGPT #GPT4.1 #Contextwindows
4
6
1,488
LLaMA 4 can handle up to 10 million tokens—but how? It’s not just about slapping on more memory. This carousel breaks down the engineering behind long sequence modeling—the key to making large context windows work in practice. From sparse attention to memory compression, here’s how modern LLMs are scaling up without falling apart. #Llama3 #ContextWindows #Meta #LLMs
2
4
2,194
I guess Sam Altman wants to release Strawberry before his big interview with Oprah tomorrow.
1
1
2
112
Replying to @Prashant_1722
I think you're very wrong and I'll be happy to explain why. The theme of this Google I/O was "AI". They said that throughout. In other words, they said themselves that they put AI first and tried to show the best products. However, they failed miserably. They have not managed to make better products than the competition. If it were only about producing products suitable for the masses, it wouldn't be a problem. But many other companies do the same. Smartphones have now become a mass market. Xiaomi, Huawei, Poco and so on all want to reach this market. Google is no longer alone. They also didn't show products that appeal to the masses but ones that are supposed to be SOTA ("2m CONTEXTWINDOWS" - repeated 10 times I'm sure). Demis Hassabis was brought on stage (for the first time) not to explain the internet to the grandma next door, but as the head of DeepMind to say that they have caught up, that they are as good as, if not better than, OpenAI. Google I/O is the DEVELOPER CONFERENCE, as they say themselves. It's about development and SOTA products. It is neither the place nor the time to appeal to the masses. And that was never the point.
3
7
311
Anthropic - Introducing 100K Token Context Windows, Around 75,000 Words #nlp #contextwindows reddit.com/r/MachineLearning…

1
5
4,027