Joined November 2011
29 Photos and videos
Dango233 retweeted
What?!
mythos will be bad ON PURPOSE on ai "frontier llm research" tasks, this is very very sad for the research community also the fact that this is un purpose not visible to the user is crazy
1
1
1,556
Dango233 retweeted
Make your agent smarter. The II-Commons skill gives your agent reliable knowledge from arxiv, PubMed & more, plug it in Repo: github.com/Intelligent-Inter… Add it to II-Agent: agent.ii.inc
3
12
55
8,008
Dango233 retweeted
变成喵在香港监狱喝咖啡哈哈哈哈哈 @dango233max
2
5
1,921
Dango233 retweeted
20 years ago, my first startup was all about enterprise search. Two decades later, we’re still building search engines. The technology has shifted from NLP to NN and the users from humans to agents. but searching is still the core. opensource the fastest bm25 engine:
so we built psql_bm25s. exact BM25 retrieval. native Postgres access method. ~23x faster than pg_search on the standard benchmark. retrieval stops being a budget item. the harness stops rationing. the agent gets to look things up like it should have the whole time.
5
8
53
20,101
Dango233 retweeted
我們開源了這顆星球🌎上速度最快的低成本 bm25 引擎。
so we built psql_bm25s. exact BM25 retrieval. native Postgres access method. ~23x faster than pg_search on the standard benchmark. retrieval stops being a budget item. the harness stops rationing. the agent gets to look things up like it should have the whole time.
6
32
232
45,966
DS4 is geart! I made a temporary fork with my weekend patches while the PRs are under review - unlock q4 on 192GB MAC - llama.cpp-style raw completions endpoint: enable Pre-filling and custom templates in SillyTaverns etc. Pre-merge convenience fork only :)
Welcome to DS4, a specialized inference engine for DeepSeek v4 Flash. github.com/antirez/ds4 This project would have been impossible without the existence of llama.cpp and GGML and the work of @ggerganov and all the other contributors. Thanks!
1
3
265
Dango233 retweeted
我们始终还是相信 multi-agents 是必须的,尽管很多公司都认为它实现起来难度太大。我承认确实比预期困难一些,但是这应该是目前最“不一样”的多agent框架了。这个视频中每个节点都是agent,没有工作流,它们是自组织的,诞生,合作,互相攻击和死亡都是自主行为。
Unstructured intelligence = chaos Most agent frameworks ship without a nervous system: deadlocks, context loss, vacuum hallucinations. We built Common Ground to fix this, agents coordinate on a shared protocol.
5
13
88
19,591
Dango233 retweeted
Unstructured intelligence = chaos Most agent frameworks ship without a nervous system: deadlocks, context loss, vacuum hallucinations. We built Common Ground to fix this, agents coordinate on a shared protocol.
24
45
447
536,441
Dango233 retweeted
Chinese New Year is rapidly becoming the AI researcher's favorite holiday
40
57
1,344
140,870
Dango233 retweeted
我参与了中文版翻译工作。希望把关于 AI 时代经济与治理的讨论带给更多中文读者,欢迎大家指出任何翻译/术语建议。虽然AI已经能做大部分翻译任务,但翻译过程中还是有很大量的人类对齐工作,尤其一些概念中/英差距很大,又要兼顾原作者表达的语气和方式,整个工作体验还是很有意思的。
你好,中国的朋友们! 《The Last Economy》中文版现已上线,可在我们网站免费阅读。 “The Last Economy” by @EMostaque is now available in Chinese What language should we do next?
19
70
398
62,629
Dango233 retweeted
Our state of the art open source general purpose agent hits V1 Feature equivalent to Replit / Manus / Genspark etc, to make websites to presentations and more connected to all your other tools Readying open repo update in a week or two, give it a try and give feedback!
II-Agent V1 is here. The AI agent built for real work is finally out of beta. Faster, smarter, and production-ready. It’s time to change how you build. 👇 Let’s see what’s new.
35
38
322
30,556
Dango233 retweeted
II-Agent V1 is here. The AI agent built for real work is finally out of beta. Faster, smarter, and production-ready. It’s time to change how you build. 👇 Let’s see what’s new.
21
46
210
132,172
值得想沿着这个思路往下走的朋友们想的问题: 这个思路的核心,是微分几何,还是运筹学/系统工程?
昨天我俩讨论了一下这个paper,首先它的突破性和解决问题的漂亮是没问题的。但有意思的地方是: 1 它是从微分几何获得了一个证明,然后找到了解法,还是 先在工程上凑到了一个解法,然后用流型做证明? 2 沿着这个思路,还有什么可推导的其他用途?
1
614
锐评:挂流形的羊头,卖运筹学的狗肉,论文命名的反向工程,给工程解法找理论爹 千万别被“流形”这个词骗了。说是从流形理论推导的,我敢打赌这绝对是从运筹学“倒着来”的,想从微分几何去理解是南辕北辙。 我工业工程的DNA动了,怪不得这么多人“看不懂”。说是指派问题我的IE同学们是不是能看懂 ?
DeepSeek just dropped a banger paper to wrap up 2025 "mHC: Manifold-Constrained Hyper-Connections" Hyper-Connections turn the single residual “highway” in transformers into n parallel lanes, and each layer learns how to shuffle and share signal between lanes. But if each layer can arbitrarily amplify or shrink lanes, the product of those shuffles across depth makes signals/gradients blow up or fade out. So they force each shuffle to be mass-conserving: a doubly stochastic matrix (nonnegative, every row/column sums to 1). Each layer can only redistribute signal across lanes, not create or destroy it, so the deep skip-path stays stable while features still mix! with n=4 it adds ~6.7% training time, but cuts final loss by ~0.02, and keeps worst-case backward gain ~1.6 (vs ~3000 without the constraint), with consistent benchmark wins across the board
6
1
15
6,774
明明是约束优化和资源调度问题,非要说成是几何拓扑;明明是有棱有角顶点不可导的可行域,非要讲成流型(虽然确实是流形没错,但不符合直觉),Storytelling果然是AI核心竞争力 ​ ​叠甲:不是说文章不好啊,把运筹/系统工程的工具箱拿来怼AI大概率是正路!问题实际,解法精妙。很能体现团队成员多样性
5
2
40
31,044
20 Nov 2025
Pure text prompt, no image init. The race is over. (?) #nanobanana #Gemini3
2
473
Dango233 retweeted
Find research faster with II-Commons Search arXiv PubMed (web app Agent demo, API/MCP/A2A) Beta now live for II-Accounts
7
14
54
502,053
Dango233 retweeted
While building II’s open stack, we put together a Gemini-CLI → MCP OpenAI a bridge to access the tools model in our tests. This lets any MCP-savvy agent tap Gemini its tools via your gemini-cli instance!
6
17
74
32,464