Can large language models (LLMs) explain their internal mechanisms? Check out the latest AI Explorable on Patchscopes, an inspection framework that uses LLMs to explain the hidden representations of LLMs. Learn more → goo.gle/patchscopes
ALT A visual walkthrough of the patching process for explaining hidden representations of LLMs.
While large language models appear to have a rich understanding of the world, how do we know they’re not simply regurgitating from training data? Check out the latest AI Explorable on a phenomenon called grokking to learn more about how models learn. → goo.gle/45ohnQh
ALT An example of grokking: memorization followed by sudden generalization. The model quickly fits the training data with 100% accuracy, but doesn't do better than random guessing on test data, but after more training, accuracy on the test data improves — the model generalizes.
ML models sometimes make confidently incorrect predictions when they encounter out of distribution data. Ensembles of models can make better predictions by averaging away mistakes.
pair.withgoogle.com/explorab…
In partnership with @GoogleMagenta, we invited 13 professional writers to use Wordcraft, our experimental LaMDA-powered AI writing tool. We've published all of the stories written with the tool, along with a discussion on the future of AI and creativity.
g.co/research/wordcraft
Most machine learning models are trained by collecting vast amounts of data on a central server. @nicki_mitch and I looked at how federated learning makes it possible to train models without any user's raw data leaving their device.
pair.withgoogle.com/explorab…
🤔 We've come a long way with #NLP, but what have language models actually learned?
Watch Senior Software Engineer at Google PAIR, Nithum Thain, discuss AI language model learnings → goo.gle/3HVtolv
Check out our new explorable on machine learning calibration:
Machine learning models express their uncertainty as model scores, but through calibration we can transform these scores into probabilities for more effective decision making.
pair.withgoogle.com/explorab…
Beautiful "RNN with attention" tutorial from one of the authors of Google's troll-fighting AI @Nithum. github.com/conversationai/co…. We presented this toxic comment detection model together in the "Tensorflow and modern RNNs without a PhD" talk. Excuse our French 🤬!