Inseq (@InseqLib)

13 Dec 2022

Hello world! 🐛

13 Dec 2022

After a year of restless development, I'm finally happy to announce Inseq, a new tool to democratize post-hoc interpretability of sequence generation models 🐛 github.com/inseq-team/inseq #nlproc #xai Some highlights 👇 1/

Gabriele Sarti

Inseq retweeted

24 Sep 2024

Model Internals-based RAG Evaluation (MIRAGE) 🌴 is accepted to #EMNLP2024 Main! ➡️ To celebrate, here's our new MIRAGE demo combining @InseqLib and Transformers-specific LRP: huggingface.co/spaces/gsarti…. Reach out if you want to catch up in Miami! 🤗🏖️

MIRAGE - a Hugging Face Space by gsarti

Model Internals to generate RAG citations

huggingface.co

Jirui Qi @Jirui_Qi

21 Jun 2024

[1/8] Struggling with verifying the trustworthiness of RAG outputs? Check our latest work where we utilize *model internals* as a powerful and faithful tool for attributing answers to retrieved docs! (w/ @gsarti_ @AriannaBisazza @raquel_dmg) 📄: arxiv.org/abs/2406.13663 #NLProc

3,281

Gabriele Sarti

Inseq retweeted

14 Aug 2024

Very hyped for the new beautiful viz that just landed in the @InseqLib main branch! 🔥 This will empower users to explore attribution tensors more flexibly and intuitively. h/t to @_ddjohnson for his awesome work on the treescope toolkit powering this release!

14 Aug 2024

Thanks to the new treescope integration, @InseqLib now supports interactive visualizations for multidimensional attributions (show_granular), token highlights (show_tokens) and improved viz for attribute_context CLI! 🚀 Install main, will appear in v0.7 x.com/_ddjohnson/status/1821…

Granular visualization of an attention weight matrix for Gemma 2B on a summarization task. The slider on the bottom allows to visualize scores for individual heads without aggregation.

ALT Granular visualization of an attention weight matrix for Gemma 2B on a summarization task. The slider on the bottom allows to visualize scores for individual heads without aggregation.

Generated text with probability highlights (in green). For every generated token, source and target attribution scores can be visualized by opening the collapsed section (scores for token "colore" shown in figure)

ALT Generated text with probability highlights (in green). For every generated token, source and target attribution scores can be visualized by opening the collapsed section (scores for token "colore" shown in figure)

Output visualization of the attribute_context command for a chain-of-thought reasoning task with output-side context. The token "pairs" in the generated sentence "20 pairs of legs." is found to be context-sensitive and its prediction over the non-contextual alternative "horses" is motivated mainly by the presence of "pairs" in the output context.

ALT Output visualization of the attribute_context command for a chain-of-thought reasoning task with output-side context. The token "pairs" in the generated sentence "20 pairs of legs." is found to be context-sensitive and its prediction over the non-contextual alternative "horses" is motivated mainly by the presence of "pairs" in the output context.

959

Inseq

Daniel Johnson @_ddjohnson

14 Aug 2024

ALT Granular visualization of an attention weight matrix for Gemma 2B on a summarization task. The slider on the bottom allows to visualize scores for individual heads without aggregation.

7 Aug 2024

By popular demand, the Treescope pretty-printer from the Penzai neural net library can now be installed separately, and supports both JAX and PyTorch! And that's not all: Penzai itself now has less boilerplate and includes more pretrained Transformer models!

ALT A Google Colab notebook that loads the pretranied Pythia-1B model from HuggingFace and then visualizes it with Treescope.

A Google Colab notebook that converts the pretranied Pythia-1B model from HuggingFace to a Penzai model and then visualizes it with Treescope.

ALT A Google Colab notebook that converts the pretranied Pythia-1B model from HuggingFace to a Penzai model and then visualizes it with Treescope.

1,399

CIS, LMU Munich

Inseq retweeted

CIS, LMU Munich @CisLmu

15 Jul 2024

🎓 We're thrilled to host Gabriele Sarti (@gsarti_) in our PhD seminar series tomorrow, July 16th, from 12:00-13:00 in Oe67 BU 101! Join us for his talk on interpreting context usage in generative language models, featuring the Inseq toolkit and PECoRe framework. 🕛Don't miss it!

4,828

Inseq

22 Jun 2024

The 🐑 PECoRe / 🌴 MIRAGE demo on @huggingface Spaces is powered by our new attribute-context CLI command released in v0.6, and allows to export the code to reproduce your results locally with 🐛 Inseq. Check it out ➡️ hf.co/spaces/gsarti/pecore

Eliana Pastor @eliana__pastor

22 Jun 2024

⚠️ Citations from prompting or NLI seem plausible, but may not faithfully reflect LLM reasoning. 🏝️ MIRAGE detects context dependence in generations via model internals, producing granular and faithful RAG citations. 🚀 Demo: huggingface.co/spaces/gsarti… Fun collab w/ @Jirui_Qi, @AriannaBisazza & @raquel_dmg! Check it out ⬇️

1,493

Eliana Pastor

Inseq retweeted

20 May 2024

Today, we had the first seminar of our #XAI course! @gsarti_ presented the @InseqLib to interpret LMs and the PECORE framework to identify & attribute context dependence in LMs! 🚀🌟 Thank you, it was so interesting! 🤗 Great start to our series! gsarti.com/talk/polito-inseq…

2,211

Javier Ferrando

Inseq retweeted

Javier Ferrando @javifer_96

3 May 2024

[1/4] Introducing “A Primer on the Inner Workings of Transformer-based Language Models”, a comprehensive survey on interpretability methods and the findings into the functioning of language models they have led to. ArXiv: arxiv.org/pdf/2405.00208

127

558

87,744

Inseq

2 May 2024

Today @InseqLib hit 300 ⭐️ on Github! A huge thank you to all our awesome users ❤️ Onwards to the next 300! 🤺

[points to Pyvene]
Inseq: You there, what is your profession?
Pyvene: I do trainable inference-time interventions... sir.
Inseq: [points to another library] And you, TransformerLens, what is your profession?
TransformerLens: I provide a unified hookable architecture to support mechinterp analyses on Transformer models, sir.
Inseq: I see...
[turns to a third soldier]
Inseq: You?
SAELens: I enable the training of sparse autoencoders to mitigate superposition in latent features.
Inseq: [turns back shouting] INSEQ USERS! What is OUR profession?
Inseq Users: HA-OOH! HA-OOH! HA-OOH!
Inseq Users: We do feature attribution of generative LMs! 😇

ALT [points to Pyvene] Inseq: You there, what is your profession? Pyvene: I do trainable inference-time interventions... sir. Inseq: [points to another library] And you, TransformerLens, what is your profession? TransformerLens: I provide a unified hookable architecture to support mechinterp analyses on Transformer models, sir. Inseq: I see... [turns to a third soldier] Inseq: You? SAELens: I enable the training of sparse autoencoders to mitigate superposition in latent features. Inseq: [turns back shouting] INSEQ USERS! What is OUR profession? Inseq Users: HA-OOH! HA-OOH! HA-OOH! Inseq Users: We do feature attribution of generative LMs! 😇

375

Inseq

13 Apr 2024

@InseqLib v0.6 is out now on PyPI! 🔥 New CLI command for context attribution (@gsarti_), new perturbation-based methods by @hmohebbi75 & @casszzx and optimizations incl. multi-gpu support! ⚡️ Huge shoutout to our contributors! ❤️ Release notes ⬇️ github.com/inseq-team/inseq/…

Release v0.6.0: Context Attribution CLI, New Attribution Methods, Performance Improvements and more...

🔙 Context Attribution CLI (#237) The inseq attribute-context CLI command was added to support the PECoRe framework for analyzing context usage in generative language models. The command is highly...

5,683

Inseq

18 Mar 2024

Want to learn how to detect and attribute context usage in LMs using @InseqLib? This @Gradio demo using our CLI can teach you how! 🐛

18 Mar 2024

The official 🐑 PECoRe 🐑 demo to detect and attribute context dependence in LM generations is now available on @huggingface Spaces! 🚀 Includes code examples, a usage guide, useful presets for various dec-only & enc-dec models, and more! Check it out ⬇️ huggingface.co/spaces/gsarti…

2,217

Gabriele Sarti

Inseq retweeted

11 Mar 2024

🐑 PECoRe repository is now public (github.com/gsarti/pecore) and all model/datasets are available on @huggingface (huggingface.co/collections/g…)! 🐑Interested in using PECoRe on your models? Have a look at the @InseqLib implementation (`inseq attribute-context`)!

GitHub - gsarti/pecore: Materials for "Quantifying the Plausibility of Context Reliance in Neural...

Materials for "Quantifying the Plausibility of Context Reliance in Neural Machine Translation" at ICLR'24 🐑 🐑 - gsarti/pecore

4 Oct 2023

[1/8] Our new work (w/ @AriannaBisazza @gchrupala @MalvinaNissim) is finally out! 🎉 We introduce PECoRe, an interpretability framework using model internals to identify & attribute context dependence in language models. 📄Paper: arxiv.org/abs/2310.01188 #NLProc #neuralempty

2,794

Inseq

7 Mar 2024

Did someone say Mamba support? 👀 (Actually gradient-based methods are not working out-of-the-box because of some in-place variable overwriting, but should be fixable!)

Arthur Zucker

@art_zucker

7 Mar 2024

`mamba` is now available in transformers. PEFT finetuning example: gist.github.com/ArthurZucker… Thanks @_albertgu and @tri_dao for this brilliant model! 🚀 and the amazing `mamba-ssm` kernels powering this!

645

Inseq

7 Mar 2024

Tracker for Mamba gradient-based methods support: github.com/huggingface/trans…

Cannot propagate gradients in Mamba · Issue #29514 · huggingface/transformers

System Info transformers version: 4.39.0.dev0 Platform: macOS-14.2.1-arm64-arm-64bit Python version: 3.11.7 Huggingface_hub version: 0.21.4 Safetensors version: 0.4.2 Accelerate version: not instal...

Gabriele Sarti

Inseq retweeted

1 Mar 2024

Excited to present my recent work on @InseqLib and the #ICLR2024 PECoRe interpretability framework for the @SheffieldNLP group this afternoon! Many thanks @casszzx for inviting me! 🤗 🐛 Inseq: github.com/inseq-team/inseq 🐑🐑 PECoRe: openreview.net/forum?id=XTHf…

GitHub - inseq-team/inseq: Interpretability for sequence generation models 🐛 🔍

Interpretability for sequence generation models 🐛 🔍 - inseq-team/inseq

Hosein Mohebbi @hmohebbi75

3,743

Hosein Mohebbi

Inseq retweeted

28 Feb 2024

Wow!! This is great news🥳 Thanks @InseqLib! I’m excited to try it out!

28 Feb 2024

Value Zeroing, a faithful approach for analyzing context mixing in Transformers, is now available on @InseqLib main branch for all @huggingface text generation models! 🔀 🔍Paper introducing VZ: aclanthology.org/2023.eacl-m… 🐛VZ in Inseq: tinyurl.com/inseq-vz

633

Inseq

28 Feb 2024

3,899

Inseq

28 Feb 2024

Authors: @hmohebbi75 @wzuidema @gchrupala @afraalishahi

188

Inseq

16 Feb 2024

After two years with Poetry, Inseq just moved to @astral_sh's new blazing fast package manager uv! Our CI installation step is now ~80% faster! 🔥 Congrats to @charliermarsh and the team on the release, and godspeed for the cargo-like experience you are planning to build!

3,252

Inseq

9 Dec 2023

Mousavi et al. attribute the generation of dialogue models, finding increased influence for more refined versions of dialogue history aclanthology.org/2023.nlp4co… (@mahedmousavi S. Caldarella G. Riccardi @sislab7) 3/

Response Generation in Longitudinal Dialogues: Which Knowledge Representation Helps?

Seyed Mahed Mousavi, Simone Caldarella, Giuseppe Riccardi. Proceedings of the 5th Workshop on NLP for Conversational AI (NLP4ConvAI 2023). 2023.

aclanthology.org

216

more replies

Inseq

9 Dec 2023

Wang et al. propose a metric for LMs factual reliability and show its relation to models’ sensitivity to in-context distractors arxiv.org/abs/2310.09820 (W. Wang @bazril @alexandrabirch1 W. Peng @EdinburghNLP) 7/

1,044

Inseq