Synodic Month | On the long journey to Tharsis | Anonymous Q&A: ngl.link/wilf_lin

Joined August 2022
325 Photos and videos
Pinned Tweet
Death is a cool summer night, where one can sleep without a care
This is a model merge, and they didn't even use a good merging technique
They couldn't even be bothered to use a decent merge tool
I asked Claude to help me verify the claim:
------
I (Claude) independently verified the claim that Rio-3.5-Open-397B is a weight merge of Nex and Qwen. It checks out.

A developer opened an issue claiming that prefeitura-rio/Rio-3.5-Open-397B is just a ~0.6/0.4 linear blend of the Nex-N2-Pro model and the official Qwen3.5-397B-A17B base, with no original training.

The method
If Rio = α·Nex + (1-α)·Qwen, then for every weight tensor, Rio's deviation from Qwen must point in exactly the same direction as Nex's deviation from Qwen. Two numbers tell the story:
- cos_fit: cosine similarity between (Rio - Qwen) and (Nex - Qwen). For independently trained models in a 2-million-dimensional space, this is ~0 ± 0.0007. For a merge, it's ~1.
- α: how far Rio sits along the line from Qwen toward Nex.

The trick: no 800GB download needed
Safetensors files have a JSON header with byte offsets for each tensor. I used HTTP range requests to fetch only the specific tensor bytes from HuggingFace: a few MB per tensor instead of hundreds of GB per model. The entire verification runs on a laptop.

What I found
I pulled MoE router weights (2M params each) from layers 0, 15, 30, 45, 59, plus shared expert gates and layernorms:

MoE router weights:
Layer 0: α = 0.573, cos_fit = 0.992
Layer 15: α = 0.647, cos_fit = 0.962
Layer 30: α = 0.627, cos_fit = 0.967
Layer 45: α = 0.582, cos_fit = 0.987
Layer 59: α = 0.567, cos_fit = 0.997

Shared expert gates:
Layer 0: α = 0.568, cos_fit = 0.997
Layer 30: α = 0.581, cos_fit = 0.988

What this means
A cos_fit of 0.99 in a 2-million-dimensional space is not "high similarity." It is thousands of standard deviations from what you'd see with independently trained models. There is no innocent explanation.

The recovered α clusters tightly around 0.57 across all layers, matching nex-agi's claim of 0.571 almost exactly. This is one model poured into another at a fixed ratio.

(Layernorm weights show a higher α ~0.9. This is expected: merge tools often handle 1D norm vectors differently from weight matrices, or the interpolation is less clean on small vectors.)

Bottom line
With about 10 HTTP range requests per model and 50 lines of NumPy, anyone can verify this independently. The math is unambiguous: Rio-3.5-Open-397B is approximately 57% Nex-N2-Pro + 43% Qwen3.5-397B-A17B.

Code that you can run for yourself: gist.github.com/xianbaoqian/…
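The check described in the quoted tweet can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the code from the linked gist: the function names `tensor_byte_range` and `merge_fit` are hypothetical, α is estimated here by least-squares projection (one plausible way to compute it), and the demo uses synthetic vectors rather than real model weights.

```python
import json
import struct

import numpy as np


def tensor_byte_range(file_prefix: bytes, name: str) -> tuple[int, int]:
    """Absolute [start, end) byte range of one tensor in a safetensors file.

    `file_prefix` must hold the 8-byte length field plus the whole JSON header.
    """
    (header_len,) = struct.unpack("<Q", file_prefix[:8])  # little-endian u64
    header = json.loads(file_prefix[8 : 8 + header_len])
    begin, end = header[name]["data_offsets"]  # relative to the end of the header
    data_start = 8 + header_len
    # For an HTTP request, send "Range: bytes={start}-{end - 1}" (inclusive end).
    return data_start + begin, data_start + end


def merge_fit(rio: np.ndarray, nex: np.ndarray, qwen: np.ndarray) -> tuple[float, float]:
    """Fit rio ≈ qwen + alpha * (nex - qwen) and return (alpha, cos_fit)."""
    d_rio = (rio - qwen).astype(np.float64).ravel()
    d_nex = (nex - qwen).astype(np.float64).ravel()
    dot = float(d_rio @ d_nex)
    alpha = dot / float(d_nex @ d_nex)  # least-squares projection onto d_nex
    cos_fit = dot / float(np.linalg.norm(d_rio) * np.linalg.norm(d_nex))
    return alpha, cos_fit


if __name__ == "__main__":
    # Synthetic stand-ins for one weight tensor from each model (assumption).
    rng = np.random.default_rng(0)
    qwen = rng.standard_normal(100_000)
    nex = qwen + 0.1 * rng.standard_normal(100_000)
    merged = 0.57 * nex + 0.43 * qwen                    # a pure linear blend
    independent = qwen + 0.1 * rng.standard_normal(100_000)  # unrelated fine-tune

    print("merged:     ", merge_fit(merged, nex, qwen))
    print("independent:", merge_fit(independent, nex, qwen))
```

For a pure blend, `merge_fit` recovers the mixing ratio with a cosine of essentially 1, while an unrelated deviation from the same base gives a cosine near 0. Projection is used for α because it stays well defined even when the fit is imperfect, as with the layernorm tensors mentioned in the tweet.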
Wilf Lin retweeted
> Doctor, I don't feel well. A head has grown on my neck, and all my hallucinations are caused by this head...
RT @aoim33: I've found that being patted on the head and hugged feels really nice
Wilf Lin retweeted
Why do you all want to make love? Am I the only one who wants to be loved?
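With this crystal ball you can look back at a stretch of the past (
LazyCat camera pre-orders are now open! Following the LazyCat MicroServer and the LazyCat AI compute pod, the third LazyCat smart hardware product has launched! It has an interstellar astronaut look and a sci-fi design, with a mold built from scratch and refined over a year and a half. Its head circumference matches that of NIO's Nomi, so once you buy the camera you can mix and match any Nomi headwear. Later on, the LazyCat AI camera will be paired with the LazyCat AI compute pod so users can choose their own AI model, giving a 100% private home AI camera. Most importantly, the LazyCat AI camera is the first smart camera in the NAS world: bring it home, scan the code, and it works. The era of buying third-party cameras and fiddling with cracked tokens is completely over! The regular price is 399 yuan, and 360 yuan during the pre-sale; comment "1", first come, first served. If you want to win this sci-fi camera for free, just follow me, leave any comment, and retweet this tweet. Next Wednesday I will give away 10 LazyCat AI cameras in a draw. Don't miss it, there is no barrier to entry!!!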
Wilf Lin retweeted
I can't keep a straight face anymore
Wilf Lin retweeted
PowerToys has finally reached 0.100. I'd call it the best add-on for Windows!
Collectible 1 💦
Replying to @axzamyzed
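When I eat hot pot with other people, I have to put on an act and order some grown-up dishes to show a big sister's maturity, but when I eat hot pot alone, I can order a huge pot of fatty beef, shrimp paste, and meatballs and eat to my heart's content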
Huawei: :D
The new iOS 27 lets users resize the iPhone mirroring window on the Mac. Tell me that isn't being prepared for the iPhone Duo 🤣
The new iPadOS 27 wallpaper, hmm (
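Confirmed: whoever wrote the web page is an idiot
What the hell do you mean the Apple Watch S9 doesn't support WatchOS 27? And what's with the iPhone 11 getting a full 9-year support cycle? And what does it mean that the iPad Pro 2018 doesn't support OS 27 but the iPad 9 does?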
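My level in each subject: Chinese: can read; Math: know a bit of calculus and linear algebra; English: can read; Physics: fairly solid high school level; Chemistry: studied it, nothing more; Geography: excellent middle school level; Politics: uphold the leadership of the Party; Biology: average middle school level; Computer science: sudo codex
My level in each subject: Chinese: hello, goodbye; Math: roughly up to integral transforms; English: studied a little for the TOEFL; Physics: did physics olympiad (gave up); Chemistry: high school level; Geography: provincial first prize in the geography olympiad; Politics: [CENSORED]; Biology: high school level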
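What the hell do you mean the Apple Watch S9 doesn't support WatchOS 27? And what's with the iPhone 11 getting a full 9-year support cycle? And what does it mean that the iPad Pro 2018 doesn't support OS 27 but the iPad 9 does?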
Stupid Apple.
AfD, YES
🚨WOW. The AfD party have experienced an unprecedented SURGE in the polls this month and are now 6pts ahead! Germans want Remigration… 🇩🇪
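Looks good
LazyCat camera pre-orders are now open! Following the LazyCat MicroServer and the LazyCat AI compute pod, the third LazyCat smart hardware product has launched! It has an interstellar astronaut look and a sci-fi design, with a mold built from scratch and refined over a year and a half. Its head circumference matches that of NIO's Nomi, so once you buy the camera you can mix and match any Nomi headwear. Later on, the LazyCat AI camera will be paired with the LazyCat AI compute pod so users can choose their own AI model, giving a 100% private home AI camera. Most importantly, the LazyCat AI camera is the first smart camera in the NAS world: bring it home, scan the code, and it works. The era of buying third-party cameras and fiddling with cracked tokens is completely over! The regular price is 399 yuan, and 360 yuan during the pre-sale; comment "1", first come, first served. If you want to win this sci-fi camera for free, just follow me, leave any comment, and retweet this tweet. Next Wednesday I will give away 10 LazyCat AI cameras in a draw. Don't miss it, there is no barrier to entry!!!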
Got my hands on some holy relics