ModelScope

ModelScope

8 Photos and videos

Tweets

InclusionAI retweeted

ModelScope

@ModelScope2022

May 14

Ring-2.6-1T is now open source! 1T-parameter thinking model with two reasoning gears and SOTA Agent execution. 🚀 🔗 modelscope.cn/models/inclusi… 🤖 Stronger Agent execution: SOTA on PinchBench (87.60) and ClawEval (63.82), top-tier on TAU2-Bench and GAIA2-search. SWE-Bench Verified 74.00. 🎯 high / xhigh Reasoning Effort: high for production Agent workflows, xhigh for hard reasoning (AIME 26: 95.83, GPQA Diamond: 88.27) ⚡ Async RL Popsicle algorithm: stable long-cycle RL training at trillion-scale Compatible with OpenClaw, Claude Code, OpenCode, Kilo Code, and more.

150

7,028

InclusionAI

InclusionAI @TheInclusionAI

May 14

Introducing. Ring-2.6-1T! #OpenSource #LLM ✅Advanced agentic workflow support ✅Reasoning effort levels: high for agentic tasks, xhigh for complex reasoning ✅Scalable asynchronous RL 🤗 Hugging Face huggingface.co/inclusionAI/R… 📷 ModelScope modelscope.ai/models/inclusi…

inclusionAI/Ring-2.6-1T · Hugging Face

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

huggingface.co

Ant Ling

@AntLingAGI

May 14

🚀 Ring-2.6-1T is now open source. A trillion-scale flagship thinking model built for real-world complex tasks: Agent workflows, coding & engineering, long-horizon tasks, complex reasoning, research, and enterprise automation. It is designed to move beyond “answering” toward execution: understanding context, planning steps, calling tools, and staying stable across long task chains. Highlights： - Advanced agentic workflow support. - Reasoning effort levels: high for agentic tasks, xhigh for complex reasoning. - Scalable asynchronous RL via the IcePop algorithm, enabling stable, trillion-scale training for long-horizon agentic RL.

2,949

Novita AI

InclusionAI retweeted

Novita AI

@novita_labs

May 8

Ring-2.6-1T from @AntLingAGI is now live on @OpenRouter with free access available until May 15 PT. 1T total parameters · 63B active parameters Built for real-world agent workflows: • Coding agents • Tool use • Long-horizon execution • Lower token overhead with adaptive reasoning Designed for advanced autonomous systems where capability, latency, and cost efficiency all matter. Powered by Novita AI

3,877

Kilo

InclusionAI retweeted

Kilo

@kilocode

May 8

Ring 2.6 1T from @TheInclusionAI is live in Kilo, free for a limited time. Worth noting: Ling 2.6 1T shot to the top of our leaderboard right after launch. Both available now in the model picker! 🔥

Ant Ling

@AntLingAGI

May 8

We are launching Ring-2.6-1T, a trillion-parameter flagship thinking model engineered for real-world complex tasks and production env: 🚀 - Adjustable Thinking Effort: dynamic compute mechanism to flexibly balance cognitive depth, token cost, and execution speed; - Agent-Optimized: Built for high-frequency workflows, delivering rapid multi-step execution and tool orchestration with SOTA stability; - Deep Thinking: Unlocks the model's maximum capability ceiling for rigorous mathematical logic and scientific research;

4,851

InclusionAI

InclusionAI @TheInclusionAI

May 9

Ring-2.6-1T is now 1️⃣week free on @OpenRouter! ✅Lower token overhead and rapid multi-step execution ✅Suitable for general and coding agent use cases ✅Designed to deliver reliable execution in production workflows at a rational inference cost start now🤗 openrouter.ai/inclusionai/ri…

Ring-2.6-1T - API Pricing & Benchmarks

Ring-2.6-1T is a 1T-parameter-scale thinking model with 63B active parameters, built for real-world agent workflows that require both strong capability and operational efficiency. 262,144 token...

openrouter.ai

Ant Ling

@AntLingAGI

May 8

695

InclusionAI

InclusionAI @TheInclusionAI

May 7

AReaL is joining the @PyTorch Landscape🎉landscape.pytorch.org Fully async, modular, built for the agentic AI era. AReaL is a scalable, RL infrastructure designed to bridge foundation model training with modern agent-based applications. Explore AReaL: github.com/inclusionAI/AReaL

6,470

Ant Ling

InclusionAI retweeted

Ant Ling

@AntLingAGI

May 7

Announcing Ling-2.6-1T by inclusionAI, now available on OpenRouter. 🚀 This trillion-parameter flagship instruct model is built for real-world agents. It utilizes a “fast thinking” approach to cut costs by ~75% while maintaining SOTA performance on AIME26 and SWE-bench Verified. Ideal for: - Advanced coding - Complex reasoning - Large-scale agent workflows

184

11,992

Ant Ling

InclusionAI retweeted

Ant Ling

@AntLingAGI

Apr 29

One more thing: we welcome you to join the AntLing Discord community! 🤗🥳🥳🥳 discord.gg/jQtDsU5J6C

Join the AntLingLLM Discord Server!

Check out the AntLingLLM community on Discord - hang out with 137 other members and enjoy free voice and text chat.

discord.com

1,122

ModelScope

InclusionAI retweeted

ModelScope

@ModelScope2022

Apr 29

So excited to announce Ling-2.6-1T is now live on ModelScope!🔥 This 1T parameter model is built for complex agent workflows, multi-step execution and long-context understanding. It truly delivers in production. 📊The benchmarks speak for themselves: - AIME26 — leads all non-reasoning models - SWE-bench Verified, TAU2-Bench, BFCL-V4, PinchBench — first-tier open-source - ~16M tokens on Artificial Analysis full eval — same efficiency story as Ling-2.6-flash Works with Claude Code, OpenClaw, OpenCode & CodeBuddy ✅ SGLang & vLLM ready · Open weights available now 🚀 Explore on ModelScope 👇 modelscope.cn/models/inclusi… modelscope.ai/models/inclusi…

143

10,224

InclusionAI

InclusionAI @TheInclusionAI

Apr 29

Ling-2.6-1T, designed for precise instruct task execution is now #OpenSource🥂Built for the Agentic era, Ling-2.6-1T brings about a significant capability upgrade↗️ and top-tier performance.#LLM 🤗 Hugging Face: huggingface.co/inclusionAI/L… 📷 ModelScope: modelscope.cn/models/inclusi…

inclusionAI/Ling-2.6-1T · Hugging Face

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

huggingface.co

Ant Ling

@AntLingAGI

Apr 29

Last week, we introduced Ling-2.6-1T. Today, Ling-2.6-1T is officially an open model~ 🤗 1T total parameters · 63B active parameters We bring values to developers by making it easier to test, deploy, customize, and build. It is optimized to be "token efficiency" for real production needs: • Lower token overhead: strong intelligence without long reasoning traces • Reliable multi-step execution: better instruction, tool, context, and workflow control • Production-ready deployment: from code generation to bug fixing, with broad agent framework compatibility A sneak pick into the agentic capability in @opencode

0:13

3,653

InclusionAI

InclusionAI @TheInclusionAI

Apr 28

📣Fast generation, high token efficiency, real task execution and improved experience, Ling-2.6-flash is now opensource! 🔧Available in BF16, FP8, and INT4 variants for different deployment needs. Hugging Face: huggingface.co/inclusionAI/L… ModelScope: modelscope.cn/models/inclusi…

inclusionAI/Ling-2.6-flash · Hugging Face

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

huggingface.co

Ant Ling

@AntLingAGI

Apr 28

Ling-2.6-flash is now officially open-sourced! A fast, token-efficient Instruct model built for real-world agent workflows. 104B total parameters · 7.4B active parameters Available in BF16, FP8, and INT4 variants for different deployment needs. Key strengths: - Fast generation: 215 tokens/s on Artificial Analysis Output Speed - High token efficiency: only 15M tokens on the full AA Intelligence Index evaluation - Real task execution: strong performance across coding, document processing, and lightweight agent workflows - Improved experience: better Chinese-English switching and smoother compatibility with mainstream coding frameworks

5,492

ZenMux

InclusionAI retweeted

ZenMux

@ZenMuxAI

Apr 27

📢 Ling-2.6-1T from @TheInclusionAI is live on @ZenMuxAI 🔥 Ant Group's trillion-param MoE flagship. 50B active per token. Open-source SOTA on SWE-bench Verified, AIME & agent benchmarks — no thinking tokens needed ⚡️ 🔗 zenmux.ai/inclusionai/ling-2…

138

8,538

ModelScope

InclusionAI retweeted

ModelScope

@ModelScope2022

Apr 27

✨ Inclusion AI's LLaDA2.0-Uni is open-source! A single MoE-based diffusion LLM that unifies visual understanding and image generation — natively, in one model. Download on ModelScope 👉 modelscope.ai/models/inclusi… Built on a single Mask Token Prediction paradigm, LLaDA2.0-Uni handles: 🖼️ Text-to-image synthesis at 1024×1024, with the option to "think" before drawing 🔍 Visual question answering, captioning, and document understanding on par with dedicated VLMs ✏️ Instruction-driven image editing — single or multi-reference, faithful to original details 🎨 Interleaved text-image reasoning, opening the door to a new class of multimodal chains Released under Apache 2.0 — paper, code, and weights all open. 📄 modelscope.ai/papers/2604.20… 🔗 github.com/inclusionAI/LLaDA…

8,228

DailyPapers

InclusionAI retweeted

DailyPapers

@HuggingPapers

Apr 23

LLaDA2.0-Uni from Inclusion AI Unified diffusion LLM for multimodal understanding and generation. Features MoE backbone with SigLIP-VQ tokenizer enabling 8-step image generation and native interleaved reasoning.

3,192

DailyPapers

InclusionAI retweeted

DailyPapers

@HuggingPapers

Apr 23

DR-Venus: a 4B parameter deep research agent trained on only 10K open data Achieves frontier performance on edge-scale devices, outperforming prior 9B agents and narrowing the gap to 30B-class systems through agentic SFT and RL with information gain rewards.

3,414

InclusionAI

InclusionAI @TheInclusionAI

Apr 24

Built for stable, high-speed execution in complex, real-world environments, meet Ling-2.6-1T!🥳Massive leap& top-tier performance, and 🛠️ Engineering-Task-Friendly. We’re unlocking 1 week of free API access. Start testing now at @OpenRouter openrouter.ai/inclusionai/li… Open-source model is on the way! Keep an eye out—more details dropping soon. 👀 #inclusionAI #LLM

Ling-2.6-1T - API Pricing & Benchmarks

Ling-2.6-1T is an instant (instruct) model from inclusionAI and the company’s trillion-parameter flagship, designed for real-world agents that require fast execution and high efficiency at scale....

openrouter.ai

Ant Ling

@AntLingAGI

Apr 23

🚀 Today, we are launching Ling-2.6-1T, a trillion-parameter flagship model designed for precise instruct task execution. By prioritizing a "Fast-Thinking" mechanism, it delivers SOTA intelligence with ultra-low token overhead, making token efficiency a first-class citizen.

2,055

InclusionAI

InclusionAI @TheInclusionAI

Apr 23

Brand new release from our team! 🆕DR-Venus — exploring new possibilities for edge-scale deep research with just 10K open data. 🛠️ Built with a two-stage recipe: 1️⃣ Agentic SFT: strict trajectory cleaning long-horizon resampling for maximum data value 2️⃣ IGPO RL: turn-level rewards via information gain & format-aware regularization for stable long-horizon execution #inclusionAI #deepresearch #AIAgent Edge-ready GGUF format — deploy on your Mac today. 🐙 Code: github.com/inclusionAI/DR-Ve… 📄 Paper: arxiv.org/abs/2604.19859 🤗 Models: huggingface.co/collections/i…

GitHub - inclusionAI/DR-Venus

Contribute to inclusionAI/DR-Venus development by creating an account on GitHub.

github.com

765

InclusionAI

InclusionAI @TheInclusionAI

Apr 23

We’re excited to introduce LLaDA2.0-Uni, the 💥first unified multimodal model in the LLaDA2.0 series. Highlights: 🧠 One paradigm to rule all – With unified block-wise mask token prediction, LLaDA2.0-Uni achieves top-tier performance across visual understanding, high-fidelity image generation, and single/multi-reference image editing. ⚡ Efficient inference – A novel decoding strategy in the dLLM backbone, together with an 8-step distilled diffusion decoder, enables highly efficient inference. (SGLang support soon for even faster inference.) 🔄 Interleaved, intelligent, infinite – Unified discrete representations enable seamless interleaved generation and advanced interleaved reasoning capabilities.

Haoxing Chen @Chenhaoxing249

Apr 23

After two months of teamwork, we’re excited to share our team’s latest achievement — LLaDA2.0-Uni, InclusionAI’s first multimodal LLaDA. A unified discrete diffusion LLM built for both understanding and generation across text and images. Highlights: ● One paradigm for VQA, doc understanding, and image generation ● Efficient inference with a new decoding strategy 8-step distilled decoder ● Interleaved text-image generation enabled by unified discrete representations (SGLang support soon) 🤗 Hugging Face: huggingface.co/inclusionAI/L… 📷 ModelScope: modelscope.cn/models/inclusi…

140

16,306

InclusionAI

InclusionAI @TheInclusionAI

Apr 23

🔧Try out and explore how LLaDA understand and generate the world. 📄 Technical Report: arxiv.org/abs/2604.20796 🤗 Hugging Face: huggingface.co/inclusionAI/L… 📷 ModelScope: modelscope.cn/models/inclusi… 🐙 Code: github.com/inclusionAI/LLaDA… #dLLM #inclusionAI #LLaDA #Multimodal #Opensource #DiffusionModels

LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation...

We present LLaDA2.0-Uni, a unified discrete diffusion large language model (dLLM) that supports multimodal understanding and generation within a natively integrated framework. Its architecture...

arxiv.org

725

Artificial Analysis

InclusionAI retweeted

Artificial Analysis

@ArtificialAnlys

Apr 22

Ant Group's Ling 2.6 Flash scores 26 on the Artificial Analysis Intelligence Index, a 10-point jump from Ling-flash-2.0. It is one of few recent open weights releases focused on non-reasoning capabilities and focuses on a reasonable cost to intelligence ratio. Ling 2.6 Flash is a non-reasoning model from Ant Group's @TheInclusionAI lab. Ant Group's model family comprises three series: Ling (non-reasoning), Ring (reasoning), and Ming (multimodal). Ling-flash-2.0 was the previous flash-tier non-reasoning model. Ling 2.6 Flash is expected to be open weights shortly after release, but as of today the weights have not been released on Hugging Face. Key takeaways: ➤ At 104B total parameters with 7.4B active parameters, Ling 2.6 Flash (26) sits in intelligence near GPT-5.4 nano (Non-Reasoning, 24) and Gemma 4 26B A4B (Non-reasoning, 27), both models with comparable active parameter counts. However, at 18 points behind GLM-5.1 (Non-reasoning, 44), there remains a gap to frontier non-reasoning open weights models ➤ Ling 2.6 Flash is comparatively token efficient, using ~15M output tokens to run the Intelligence Index. This is comparable to Gemma 4 26B A4B (~14M) but a fraction of Qwen3.5 9B (~78M). Compared to models in the similar intelligence tier, Ling 2.6 Flash represents a reasonable efficiency tradeoff, which has positive effects on cost when deployed on larger workloads. At a price of $0.1 / million input tokens and $0.3 / million output tokens, Ling 2.6 Flash costs only ~$23 to run the full Artificial Analysis Intelligence Index. ➤ Gains from Ling-flash-2.0 were driven mostly by improvements agentic capabilities and instruction following. τ²-Bench jumped from 21% to 86% ( 65 points), IFBench from 34% to 57% ( 23 points), and GDPval-AA Elo from 425 to 783 ( 84%). Conversely, GPQA Diamond fell from 66% to 59% (-6 points) and SciCode from 29% to 27% (-2 points). ➤ AA-Omniscience performance is at -66 with 15% accuracy and 96% hallucination rate. This is consistent with the model's small 7.4B active parameter count. Knowledge recall benefits from larger parameter counts, and sub-10B active-parameter models systematically underperform on this metric. Additional model details: ➤ Architecture: MoE, 104B total parameters, 7.4B active parameters ➤ Context window: 262K tokens (doubled from 128K for Ling-flash-2.0) ➤ Pricing: $0.10 / $0.30 per 1M input/output tokens (via Novita API) ➤ License: Weights not yet released ➤ Availability: Third party API through @novita_labs

184

19,494