Salatgurke

@PicklePioneer

19h

Replying to @Robn_GG @Rainbow6Game

I think it has always been the TrueSkill algorithm.

Robn @Robn_GG

23h

Replying to @Rainbow6Game @PicklePioneer

I just realized ranked MMR is basically stochastic gradient descent. I wondered if MMR includes momentum, looked it up, and came across Microsoft TrueSkill. Very interesting.
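One way to read the "MMR is basically stochastic gradient descent" remark: a plain Elo update is a single SGD step on the logistic log-loss of the match result. The sketch below assumes textbook Elo (K = 32, scale 400), not the actual Rainbow Six MMR formula, which the thread does not specify. All function names are illustrative.

```python
import math

# Slope of the Elo logistic curve: E = sigmoid(LN10_OVER_SCALE * (r_a - r_b)).
LN10_OVER_SCALE = math.log(10.0) / 400.0

def expected_score(r_a, r_b):
    """Elo's predicted probability that A beats B."""
    return 1.0 / (1.0 + 10.0 ** ((r_b - r_a) / 400.0))

def elo_update(r_a, r_b, score_a, k=32.0):
    """Textbook Elo: shift both ratings by K * (actual - expected)."""
    delta = k * (score_a - expected_score(r_a, r_b))
    return r_a + delta, r_b - delta

def log_loss_grad(r_a, r_b, score_a):
    """Gradient of -[s*log(E) + (1-s)*log(1-E)] with respect to r_a."""
    return -(score_a - expected_score(r_a, r_b)) * LN10_OVER_SCALE

def sgd_step(r_a, r_b, score_a, k=32.0):
    """One SGD step on the log-loss; the learning rate is chosen to match K."""
    lr = k / LN10_OVER_SCALE
    g = log_loss_grad(r_a, r_b, score_a)
    # The gradient with respect to r_b is -g, so B moves the opposite way.
    return r_a - lr * g, r_b + lr * g
```

Both functions return the same ratings for the same inputs. Note that plain Elo has no momentum term: each step depends only on the current ratings and the latest result.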

aisile @_Haisile

Jun 12

Replying to @_Haisile @TruthFromATL @Alka_FPS

Ranked 1.0 had this, Ranked 2.0 purposefully hid it, and Ranked 3.0 is bringing it back. When the OP says "Ranked 3 is working" and you say there's not enough data, I would claim there is: it working means a return to a tied visual rank system that runs purely off TrueSkill.

Kons

@KonssnoK

Jun 12

Replying to @cmcgarryjr @NEACETWEETS

Well, what proof do you have to say such a thing? Especially about the current state not being broken. They have been trying to integrate TrueSkill 2 for years now; it's always coming next year :)

Seigmeyer Fan account retweeted

Maynard AIDS Keenan @NAMBLAGroyper

Jun 11

When I was 12 I would be calling you a gook on Halo 2 with a 40 trueskill.

Carrot @carrotthebest

Jun 10

Replying to @leaguepublicacc @Honor5Clownfish

I don't know the specifics. I know, for example, that duo queue was not enabled on KR but was on EUW. There are possibly more differences, because Riot usually rolls things out differently. For example, they have been testing TrueSkill on NA normals for a while, but not on normals elsewhere.

Lucky 7s

@Lucky7sMelee

Jun 10

TOs: you can upload your replays here: luckystats.gg/?tab=replays. No more paying for Google Drive. We will host your replays for you, and uploading .slp files will always be free.

Coming soon:
1. Convert any set or tournament to an mp4, uploaded to your scene's YouTube channel
2. Advanced replay search analytics and discovery
3. Region Admin pages. Publish your PR, choose your season windows, add players to your region, select your ranking system (TrueSkill, Elo, etc.), ingest from Challonge, etc.

While in beta, replay viewing will continue to be free. We will be building out premium features to improve the experience and help offset the storage / compute costs. Details to come 🙏🙏

Lucky Stats | Melee Stats Analytics | Lucky 7s

Advanced Melee tournament analytics with Glicko-2 Elo ratings for 120,000 players and 2.4 million sets.

luckystats.gg

ミューLamu @ghost_orange

Jun 10

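In my next analysis of ranked-match rating gains and losses, I plan to try explaining the promotion bonus with a Glicko-2 model and a TrueSkill model. I'm the bad employee who writes this code at work.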

ミューLamu @ghost_orange

Jun 10

TrueSkill is a bit of a pain, since something like erf shows up in the update equations. But if I put up with that, it's fun.
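For context on where erf comes in, here is a minimal sketch of the standard two-player, no-draw TrueSkill update. It assumes the usual defaults (μ = 25, σ = 25/3, β = 25/6) and leaves out the draw margin and the dynamics factor τ. It is not the game's actual implementation, and the function names are illustrative.

```python
import math

def pdf(x):
    """Standard normal density."""
    return math.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi)

def cdf(x):
    """Standard normal CDF; this is where erf enters the update."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))

def update_1v1(winner, loser, beta=25.0 / 6.0):
    """Return updated (mu, sigma) pairs for the winner and the loser.

    Assumes no draws and no dynamics noise. Not numerically robust for
    extreme upsets, where cdf(t) underflows to zero.
    """
    mu_w, sigma_w = winner
    mu_l, sigma_l = loser
    c = math.sqrt(2.0 * beta**2 + sigma_w**2 + sigma_l**2)
    t = (mu_w - mu_l) / c
    v = pdf(t) / cdf(t)   # mean correction factor
    w = v * (v + t)       # variance correction factor, between 0 and 1
    new_winner = (
        mu_w + sigma_w**2 / c * v,
        sigma_w * math.sqrt(1.0 - sigma_w**2 / c**2 * w),
    )
    new_loser = (
        mu_l - sigma_l**2 / c * v,
        sigma_l * math.sqrt(1.0 - sigma_l**2 / c**2 * w),
    )
    return new_winner, new_loser

# Two fresh players; the first one wins.
print(update_1v1((25.0, 25.0 / 3.0), (25.0, 25.0 / 3.0)))
```

Because the mean shift scales with σ²/c, a player whose σ is still large moves much further after one game than a player whose σ has already shrunk.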

ミューLamu @ghost_orange

Jun 10

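I'm starting to feel that assuming TrueSkill could explain the promotion bonus and the larger-than-expected rating swings! But the problem is that σ isn't disclosed to users! There's only so far you can go treating it as a latent variable!!!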

Saeed Anwar

@saen_dev

Jun 8

Aggregating 200 benchmarks into a single TrueSkill rating is useful until the top models are all within noise margin of each other on your actual task. The real benchmark is always the one you ran on your own data last week.
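A rough way to check the "within noise margin" point: treat each TrueSkill rating as an independent Gaussian (μ, σ) and ask whether the gap between two means exceeds a chosen multiple of the combined uncertainty. This is a sketch under that independence assumption. The 1.96 threshold and the function name are illustrative choices, not anything LLM Stats has described.

```python
import math

def distinguishable(a, b, z=1.96):
    """True if two (mu, sigma) ratings differ by more than z combined sigmas.

    Assumes independent Gaussian beliefs about each rating, so the
    difference of the two has standard deviation sqrt(sigma_a^2 + sigma_b^2).
    """
    (mu_a, sigma_a), (mu_b, sigma_b) = a, b
    gap = abs(mu_a - mu_b)
    margin = z * math.sqrt(sigma_a**2 + sigma_b**2)
    return gap > margin

# Hypothetical ratings: a clear gap, then two models inside each other's noise.
print(distinguishable((30.0, 1.0), (25.0, 1.0)))   # True
print(distinguishable((30.0, 2.0), (29.5, 2.0)))   # False
```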

LLM Stats

@LlmStats

May 6

Today we're introducing the LLM Stats Index. For 3.2 years, we've tracked every frontier model release. The Index aggregates 200 benchmark results into a single TrueSkill rating per model, spanning law, healthcare, coding, tool calling, vision, and reasoning. Across every category and every modality, the leading model on the Pareto Frontier is GPT-5.5 (@OpenAI). On our trajectories, human-knowledge benchmarks saturate by mid-2027. Capability has been the primary axis. The field is converging on it. Two more are opening. The first is efficiency: total task cost is the cleanest proxy we have for intelligence/watt. The second is throughput: inference speed becomes the productivity ceiling once models are cheap and good enough. We're building the next generation of long-horizon coding, tool use, and long context benchmarks. If you're working on long-horizon evaluation in real domains, we'd like to chat.

Steve @SteveChillGame

Jun 6

Replying to @XFerginatorX

The TrueSkill 2 SBMM Menke matchmaking is the true reason some people were fooled into thinking the gameplay was competitive. They were just simply forced into always playing sweaty games and conflated the two things.

Tianjin 💫

@TianjinXiin

Jun 1

🎬 AI Video Leaderboard May 2026 Top 10 (Artificial Analysis Elo, blind human votes)
🥇 HappyHorse-1.0: 1358
🥈 Seedance 2.0: 1272
🥉 Kling 3.0 Pro: 1250
4️⃣ Kling 3.0 Omni: 1235
5️⃣ Grok Imagine Video: 1234
6️⃣ Bach-1.0 Preview: 1231
7️⃣ Vidu Q3 Pro: 1226
8️⃣ Veo 3.1: 1225
9️⃣ Veo 3: 1223
🔟 PixVerse V6: 1223
Based on Artificial Analysis Elo scores (video) & LLM Stats TrueSkill (image)
Snapshot of May 2026. Rankings update live.
Source: artificialanalysis.ai / llm-stats.com

Elolesio @Elolesio_

May 30

bad matchmaking => more volatile games => less solo agency in individual games => takes more games to climb. Ofc good players will always reach their Elo, but holy, I don't have time to play 10 games a day to climb. Bring TrueSkill or sth.

Augustine Mavor-Parker

@MavorParker

May 20

PopuLoRA extends self-play using populations of teachers and students with TrueSkill-weighted cross-evaluation. This creates a more general framework where open-ended co-evolutionary learning dynamics can emerge naturally.

Robert Müller @deepqlearning

May 20

A few hundred lines of code. No pretraining. No language. No "reasoning." Just structured state tracking, cash estimates, simple bidding rules. Across 242 games, it outranked 6 of 7 cost-efficient LLMs on TrueSkill. It doesn't forget who has money or what it's bidding against.

/// // @marcsh

May 17

Replying to @chhopsky @drewlevin

Matchmaking is awesome. I thought for sure two-valued systems like TrueSkill were going to become the new norm. Evaluating things as value plus confidence seemed like a straightforward extension to Elo.

Omar

@kouhxp

May 9

I ranked 1,000 Show HN posts by "merit" (LLM TrueSkill). Here's where it disagrees with upvotes.
