Nithum

Nithum

Photos and videos

Tweets

24 Jul 2024

Check out our most recent Explorable "Can Large Language Models Explain Their Internal Mechanisms?" pair.withgoogle.com/explorab…

Can Large Language Models Explain Their Internal Mechanisms?

An interactive introduction to Patchscopes, an inspection framework for explaining the hidden representations of large language models, with large language models.

pair.withgoogle.com

This tweet is unavailable

183

Google AI

Nithum retweeted

Google AI

@GoogleAI

24 Jul 2024

Can large language models (LLMs) explain their internal mechanisms? Check out the latest AI Explorable on Patchscopes, an inspection framework that uses LLMs to explain the hidden representations of LLMs. Learn more → goo.gle/patchscopes

ALT A visual walkthrough of the patching process for explaining hidden representations of LLMs.

146

563

49,604

Google AI

Nithum retweeted

Google AI

@GoogleAI

8 Aug 2023

While large language models appear to have a rich understanding of the world, how do we know they’re not simply regurgitating from training data? Check out the latest AI Explorable on a phenomenon called grokking to learn more about how models learn. → goo.gle/45ohnQh

ALT An example of grokking: memorization followed by sudden generalization. The model quickly fits the training data with 100% accuracy, but doesn't do better than random guessing on test data, but after more training, accuracy on the test data improves — the model generalizes.

464

1,772

318,833

Adam Pearce

Nithum retweeted

Adam Pearce @adamrpearce

7 Aug 2023

Do Machine Learning Models Memorize or Generalize? pair.withgoogle.com/explorab… An interactive introduction to grokking and mechanistic interpretability w/ @ghandeharioun, @nadamused_, @Nithum, @wattenberg and @iislucas

244

1,140

256,642

iislucas (Lucas Dixon)

Nithum retweeted

iislucas (Lucas Dixon)@iislucas

5 Apr 2023

Some of my thoughts on generative AI... and a reboot of the PAIR blog... medium.com/people-ai-researc… #responsibleai #hci #machinelearning #GenerativeAI

Meet the new co-leads of PAIR: Lucas Dixon and Michael Terry

Back in 2017, we announced the launch of PAIR by stating, “We believe AI can go much further — and be more useful to all of us — if we…

medium.com

1,453

Adam Pearce

Nithum retweeted

Adam Pearce @adamrpearce

27 Mar 2023

Confidently Incorrect Models to Humble Ensembles by @Nithum, @balajiln and Jasper Snoek pair.withgoogle.com/explorab…

3,922

Nithum

Nithum @Nithum

27 Mar 2023

ML models sometimes make confidently incorrect predictions when they encounter out of distribution data. Ensembles of models can make better predictions by averaging away mistakes. pair.withgoogle.com/explorab…

From Confidently Incorrect Models to Humble Ensembles

ML models sometimes make confidently incorrect predictions when they encounter out of distribution data. Ensembles of models can make better predictions by averaging away mistakes.

pair.withgoogle.com

Andy Coenen

Nithum retweeted

Andy Coenen

@_coenen

8 Dec 2022

In partnership with @GoogleMagenta, we invited 13 professional writers to use Wordcraft, our experimental LaMDA-powered AI writing tool. We've published all of the stories written with the tool, along with a discussion on the future of AI and creativity. g.co/research/wordcraft

Wordcraft Writers Workshop

The Wordcraft Writers Workshop is a collaboration between Google's PAIR and Magenta teams, and 13 professional writers. Together we explore the limits of co-writing with AI.

wordcraft-writers-workshop.appspot.com

Adam Pearce

Nithum retweeted

Adam Pearce @adamrpearce

9 Nov 2022

Most machine learning models are trained by collecting vast amounts of data on a central server. @nicki_mitch and I looked at how federated learning makes it possible to train models without any user's raw data leaving their device. pair.withgoogle.com/explorab…

TensorFlow

Nithum retweeted

TensorFlow

@TensorFlow

27 Jun 2022

🤔 We've come a long way with #NLP, but what have language models actually learned? Watch Senior Software Engineer at Google PAIR, Nithum Thain, discuss AI language model learnings → goo.gle/3HVtolv

Nithum

Nithum @Nithum

22 Mar 2022

Check out our new explorable on machine learning calibration: Machine learning models express their uncertainty as model scores, but through calibration we can transform these scores into probabilities for more effective decision making. pair.withgoogle.com/explorab…

Are Model Predictions Probabilities?

Machine learning models express their uncertainty as model scores, but through calibration we can transform these scores into probabilities for more effective decision making.

pair.withgoogle.com

118

Martin Görner

Nithum retweeted

Martin Görner @martin_gorner

28 Sep 2018

Beautiful "RNN with attention" tutorial from one of the authors of Google's troll-fighting AI @Nithum. github.com/conversationai/co…. We presented this toxic comment detection model together in the "Tensorflow and modern RNNs without a PhD" talk. Excuse our French 🤬!

Martin Görner

Nithum retweeted

Martin Görner @martin_gorner

5 Nov 2017

Replying to @Devoxx

My co-speaker for this session will be @Nithum from Google @JigsawTeam. He fights bad behavior online with neural networks.

Martin Görner

Nithum retweeted

Martin Görner @martin_gorner

4 Nov 2017

"Tensorflow and deep learning without a PhD" continues @Devoxx on Monday 9:30. Deep learning novices welcome, fresh neurons required :-)

103

Sean Mullin

Nithum retweeted

Sean Mullin @MullinSean

24 Feb 2017

Awesome work by @adamscj and @nithum! wired.com/2017/02/googles-tr… via @WIRED

Now Anyone Can Deploy Google’s Troll-Fighting AI

Google subsidiary Jigsaw is now offering developers access to an API for its AI-based detector for abusive comments.

wired.com

Jigsaw

Nithum retweeted

Jigsaw @Jigsaw

23 Feb 2017

Introducing Perspective, using machine learning to improve discussions online. bit.ly/2lIZEjS

149

Jigsaw

Nithum retweeted

Jigsaw @Jigsaw

7 Feb 2017

We collected and labeled over 1 million @Wikimedia page edits to determine where personal attacks were made. bit.ly/2lgWfcB

Jigsaw

Nithum retweeted

Jigsaw @Jigsaw

19 Oct 2016

How can we keep extremists from using technology to cause harm? @POTUS and @WIRED asked our very own @yasmind. bit.ly/2eCOxa8

Amy Zhang

Nithum retweeted

Amy Zhang @amyxzh

20 Jul 2016

Wikipedia building n-gram models to detect personal attacks and harassment: meta.m.wikimedia.org/wiki/Re… x.com/wikiresearch/status/75…

Research:Detox - Meta-Wiki

meta.wikimedia.org

WikiResearch @WikiResearch

20 Jul 2016

wikidetox.appspot.com: a demo of algorithmic classification of personal attacks on Wikipedia talk pages

WikiResearch

Nithum retweeted

WikiResearch @WikiResearch

20 Jul 2016

Detecting personal attacks on Wikipedia: some context from the 2015 survey meta.wikimedia.org/wiki/Rese…