Long document understanding, Multilingual Evals and efficient models mainly, but other #NLProc applications in free time | vim enthusiast

Joined April 2011
646 Photos and videos
.@DuckDuckGo I love you and have been using your search engine for the last 6-7 years, and your browser for 2-3 years. But please stop showing me your ads on Reddit.
Rishu Kumar retweeted
My running friends are discussing whether performance-enhancing drugs are fine for non-professional sports. My academic friends are discussing whether AI use is an acceptable aid for scholarship. --- These are related. Let's undermine the heart of all of our activities for ease.
Auto translation is based based based
today I learned a new insult at work: "working with you is like working alone, but harder"
Rishu Kumar retweeted
The models are becoming more German.
ai writing slop has moved on from em dashes to inventing hyphenated adjectives that no one has ever said before
I don't get the hate for data centers tbh. If you are willing to hate on them, are you willing to give up your social media, YouTube/Netflix/whatever, which requires a data center to stream it to you conveniently?
I am all for halting the country (traffic, internet, flights, industries, whatever) during JEE and NEET exams, as is done in China. But this take is just stupid. If the public is inconvenienced for an exam, why aren't babus publicly executed when they are verifiably corrupt?
Oh hello, @durov. Nobody is using Telegram in India for messaging. Telegram is mostly used by scammers in India. Most financial fraud (billions of dollars) in India happens through Telegram. The Indian government should have banned Telegram years ago. It is long overdue. I’ve been noticing the same pattern for years. Almost every fraudster immediately moves to Telegram. It’s harder to trace, easier to operate. Calling this an internet freedom issue misses the point completely. Telegram became one of the preferred platforms for financial fraud, scam networks, betting groups, piracy, and other illegal activities in India.
I miss those days where I could just pay ₹60/$1 and use WhatsApp without ads or random shit for a year. How long before Meta launches a Meta One subscription to use all their services without feeling like a pover?
Rishu Kumar retweeted
There's an incredibly powerful technique that can do this. It's called training on the test set.
WHAT THE HELL is happening in AI? A 3B parameter model just put up coding benchmark scores in the same league as Claude Opus 4.5. 3 BILLION. The weights are on Hugging Face, anyone can test it. I genuinely don't know if this is a breakthrough or if the benchmarks are broken.
Rishu Kumar retweeted
pivoting to using pangram to make sure i only read slop
Rishu Kumar retweeted
Party is over, time to regularize ColBERT models to fix efficient ANN. MUVERA and SMVE promised to simplify multi-vector retrieval infrastructure but broke on modern ColBERT models. We found a fix, and it does the exact opposite of what we expected.
Rishu Kumar retweeted
It's actually le gros chaton
Rishu Kumar retweeted
Jun 15
The French Government needs to STOP Le Chaton Fat before it's too late. This is the FAT takeoff we've been warned against for years!
Rishu Kumar retweeted
Writing premised on "I'm an expert at Y and am sharing my experience and opinions" should probably be supported by actual experience and opinions, imo. If I cared what Claude had to say about being a good researcher, I could prompt it as well as he can.
Rishu Kumar retweeted
1/ Meaningful AI evaluation doesn’t stop at model performance, especially if we want to understand how people actually use and are affected by AI systems. We introduce PuLSE: Public and Longitudinal Signals for Evaluation — a general approach to monitoring for societally-impactful trends in real time from public data. Joint work with awesome people like @jessicadai_ , Sean Garcia, @beenwrekt, and @2plus2make5.
we analyzed >100k posts from r/ChatGPT over 3 years. on one hand, we saw ChatGPT quickly become normalized as an everyday consumer product, which is pretty cool. on the other hand…
Rishu Kumar retweeted
Oh lawd, he comin'
he comes
.@GeminiApp bros, your summarization is not summarizing.
Rishu Kumar retweeted
Two new blogs available on my website. As someone who’s overthinking all the time, journaling what I have in mind is surprisingly satisfying 😊🥰
Ate junk yesterday, and today my performance almost tanked in the gym, especially while running. Low-key liking that my body is almost forcing me to eat clean.