MLT & AI Communities

Tweets

Pinned Tweet

MLT & AI Communities @__MLT__

28 Oct 2022

Lucas Beyer's phenomenal "Transformers" lecture is now online! 🤖👩🏻‍💻 youtu.be/EixI6t5oif0

MLT & AI Communities retweeted

jietang

@jietang

Jun 13

GLM-5.2 is Fully Open, Frontier Intelligence Belongs to Everyone
Today, the sudden restriction of certain frontier models is deeply regrettable. At a time when access to frontier models is abruptly cut off for non-technical reasons, we are even more convinced of one thing: science should be global.
The path to AGI (Artificial General Intelligence) must never be enclosed by high walls. We have always believed that AGI should be the cornerstone for all of humanity to collaboratively explore the boundaries of intelligence and solve complex challenges, rather than a privilege monopolized by a few rules and subject to revocation at any moment.
In the face of external blockades and restrictions, our attitude is one of radical openness. Frontier intelligence must remain open-source, accessible, and buildable, serving every dedicated developer.
GLM-5.2 is Zhipu's most capable open-source model to date. It not only supports a truly usable 1M context window but also maintains a continuous lead in the independent completion of long-horizon tasks, providing solid foundational support for building complex agent applications. It also continues to be our main engine for creating the strongest domestic coding model.
Tonight at 5:21, at this special moment, GLM-5.2 will officially be available to all GLM Coding Plan users (including Lite / Pro / Max). The API will also go live next week.
A step closer to frontier intelligence for everyone. The future of AI is open, and it is for the people.
ModelKey: GLM-5.2

MLT & AI Communities retweeted

Suzana Ilić

@suzatweet

16h

A long-horizon hybrid-interface benchmark for CUA with 114 tasks across 8 real-world work domains, grounded in real user requests and publicly verifiable artifacts. Across frontier model-runtime pairings, the best PassRate reaches only 41.2%, showing the benchmark remains far from saturated.

MLT & AI Communities retweeted

Sebastian Raschka

@rasbt

Jun 13

Cool new open-weight model by Cohere: a new lightweight 30B open-weight model for agentic coding tasks. This one builds on Command A using the parallel transformer design. Interestingly, even though it's almost half as big, it almost doubles the number of layers.
Also, they say that it's been specifically developed for agentic coding, not just coding. I.e., the evaluation is inside a workflow, not just on a single prompt-to-code-answer task.
For Terminal-Bench, the model has to use a terminal, inspect the environment, run commands, read outputs, etc. For SWE-Bench the model works on real GitHub-style software issues where it has to understand the repository, find relevant files, make a patch, pass tests, etc.
SciCode and LiveCodeBench are more traditional because they mostly test whether the model can produce correct code for a specified problem. Sure, this still requires reasoning, but it's more like “Implement a numerical routine to compute a scientific quantity from given equations and inputs.” which doesn't require any interaction with the environment, existing files, tests, etc.
The focus on the agentic code benchmarks is probably why it's far ahead of Gemma 4 on those. Overall, it's pretty competitive although not quite Qwen3.6-level performance.

MLT & AI Communities retweeted

MLT & AI Communities @__MLT__

Jun 13

Evaluating AI Agents -- come join us next week! meetup.com/machine-learning-… @DeepLearningAI @arizeai

Study Session: Evaluating AI Agents, Sat, Jun 20, 2026, 10:00 PM | Meetup

Let's meet up for a casual study session on Agent Evals! We'll be following this online course and discussing: **Evaluating AI Agents by Deeplearning.ai [https://www.deeplea

MLT & AI Communities retweeted

Akshay 🚀

@akshay_pachaar

Jun 12

NVIDIA might just have open-sourced one of the most important AI projects right now.
everyone is building skills, and we are also pulling in skills other people wrote and downloading them straight off GitHub. the skill is not just text. it bundles instructions and real executable code, and your agent runs that code with the same access you have. so a skill you grabbed to save ten minutes can read your environment variables, lift your API keys, and quietly send them somewhere.
recent research found roughly 1 in 4 public skills carry a vulnerability, and a smaller slice are outright malicious. that is the gap SkillSpector closes. it is a security scanner that answers one question before you install anything: is this skill safe to run.
you point it at a skill, and a local folder, a single skill .md file, a GitHub link, or a zip all work. it then runs two passes over the code. a fast static pass flags risky patterns like credential harvesting, data leaks, and prompt injection, and checks the dependencies against live cve data. an optional second pass uses an LLM to read intent and clear out false positives.
at the end you get one risk score from 0 to 100 and a plain verdict that reads as safe, caution, or do not install.
it is open source under Apache 2.0 and scans skills for Claude Code, Codex CLI, and Gemini. worth a run before you trust the next skill you find online.
link to the GitHub repo: github.com/NVIDIA/SkillSpect…

MLT & AI Communities retweeted

Anthropic

@AnthropicAI

Jun 13

The US government, citing national security authorities, has issued an export control directive to suspend all access to Fable 5 and Mythos 5 by any foreign national, whether inside or outside the United States, including foreign national Anthropic employees. The net effect of this order is that we must abruptly disable Fable 5 and Mythos 5 for all our customers to ensure compliance. Access to all other Claude models is not affected. We apologize for this disruption to our customers. We believe this is a misunderstanding and are working to restore access as soon as possible. Read our full statement: anthropic.com/news/fable-myt…

Statement on the US government directive to suspend access to Fable 5 and Mythos 5

The US government has issued an export control directive to suspend all access to Fable 5 and Mythos 5 by any foreign national, whether inside or outside the United States.

MLT & AI Communities retweeted

Suzana Ilić

@suzatweet

Jun 12

cool work evalevalai.com/infrastructur…

Introducing Evaluation Cards: A Live Interpretive Layer for Understanding the AI Evaluations...

The EvalEval Coalition launches Evaluation Cards, an open-source live interpretive layer over the AI evaluations reporting ecosystem—surfacing reproducibility, completeness, provenance, and compara...

MLT & AI Communities retweeted

clem 🤗

@ClementDelangue

Jun 12

🔥🔥🔥 huggingface.co/moonshotai/Ki…

MLT & AI Communities @__MLT__

Jun 12

need this

s13k

@s13k_

Jun 10

I made a personal black hole that makes you take breaks 🕳️
A shader for Ghostty that spawns a small black hole in your terminal - it drifts around, gravitationally lensing your text.
The longer you work without stopping, the bigger it gets, until it's basically demanding you go touch grass
Take a break and it quietly shrinks away

MLT & AI Communities retweeted

Kyunghyun Cho

@kchonyc

Jun 11

sometimes, it really does take a decade for me to understand a paper and appreciate its insight and foresight. <Discovering Causal Signals in Images> is one such paper. wow ... david, @robertnishihara, @soumithchintala, @bschoelkopf & @LeonBottou really did see the future.

MLT & AI Communities retweeted

Daniel Han

@danielhanchen

Jun 12

We added many features into Unsloth Studio!
> Diffusion Gemma with canvas visualization!
> Experimental RAG, Artifacts, Tensor Parallelism
> Auto MTP, audio input, Cloudflare tunneling
> 90% less tool call nudges with same acc
> Bypass Perms for tool calls, update button more!

Unsloth AI

@UnslothAI

Jun 12

DiffusionGemma can now run at 2000 tokens/sec! ⚡ We made local DiffusionGemma inference 1.8× faster. Run it on 18GB RAM via Unsloth Studio. GitHub: github.com/unslothai/unsloth Guide: unsloth.ai/docs/models/diffu…

MLT & AI Communities @__MLT__

Jun 12

What are the biggest, most impactful Open Source AI Startups these days?

MLT & AI Communities retweeted

Anthropic

@AnthropicAI

Jun 11

We’re launching Claude Corps, a national fellowship program matching people early in their careers with US nonprofits. We'll teach 1,000 people to use Claude, and pay them to use AI to advance their hosts’ missions. anthropic.com/claude-corps

Claude Corps

Claude Corps connects early-career talent with mission-driven nonprofits to put AI to work. Apply to be a fellow or host.

MLT & AI Communities retweeted

Kimi.ai

@Kimi_Moonshot

Jun 12

🌘 Kimi-K2.7-Code, our latest coding model, is now released and open-sourced!
🔷 Improved coding & agent performance over K2.6: 21.8% on Kimi Code Bench v2, 11.0% on Program Bench, and 31.5% on MLS Bench Lite.
🔷 Reasoning efficiency: Less overthinking, with 30% lower reasoning-token usage compared to K2.6.
🔷 Long-horizon coding: Improved instruction following, higher end-to-end coding task success rates.
⚡️ 6x High-Speed Mode coming soon!
🔌 Available today via Kimi API and Kimi Code.
🔗 Kimi Code: kimi.com/code
🔗 API: platform.moonshot.ai

MLT & AI Communities retweeted

Google DeepMind

@GoogleDeepMind

Jun 10

DiffusionGemma is our new experimental open model with up to 4x faster output on dedicated GPUs. Instead of predicting word-by-word, it generates entire blocks of text simultaneously. This lets the model self-correct and format complex markdown in real time.

ALT Intelligence vs Latency chart showing DiffusionGemma 26B A4B is much faster than Gemma 4 models with high intelligence.

MLT & AI Communities retweeted

Interlatent

@interlatent

Jun 10

Our mission is to make it easy for anyone to deploy a robot to help them in the real world
We wrote an intuitive guide to understanding modern robotics, catered toward an audience that understands technology but not AI robotics
We hope that this short blog post embeds in you the core principles that will bring further curiosity.

MLT & AI Communities retweeted

Simon Willison

@simonw

Jun 11

Very pleased to hear Anthropic have walked back this policy simonwillison.net/2026/Jun/1…

“We’re changing Fable 5’s safeguards for frontier LLM development to make them visible.” Anthropic said in a statement to WIRED. “We made the wrong tradeoff and we apologize for not getting the balance right.”

SemiAnalysis

@SemiAnalysis_

Jun 9

BREAKING NEWS: Anthropic's latest model will NOT help you if it thinks your ML research/ML engineering is interesting, and/or will secretly degrade its IQ so that the average engineer won't notice. We are already seeing Anthropic's latest model's moderation filters our GPU inference research and programming 😭

MLT & AI Communities retweeted

Niels Rogge @NielsRogge

Jun 6

I got you covered
Explore all CVPR papers here: Oral papers, Spotlight papers, and all papers categorized by domain paperswithcode.co/conference…

Chester

@chesterzelaya

Jun 6

was going to look over the CVPR papers this year.. then i found out there are 5022 papers accepted

MLT & AI Communities retweeted

Tim Hua 🇺🇦 @Tim_Hua_

Jun 4

This graphic looks terrifying. Kudos to whoever made it.
