Guy Bar-Shalom

Guy Bar-Shalom

8 Photos and videos

Tweets

Pinned Tweet

Guy Bar-Shalom @GuyBarSh

Apr 15

New blogpost out 📃 "Detecting LLM Misbehaviors from the Inside Out with Deep Learning on Structured Data" (ffabffrasca.substack.com/p/d…) [1/8]

1,935

Yam Eitan

Guy Bar-Shalom retweeted

Yam Eitan @ytn_ym

May 25

1/ How much can you compress an LLM’s KV cache? tl;dr it depends on how you train your model. Many strong context compaction methods, such as Cartridges and attention matching, operate post-hoc: given a fixed model and a context, they try to compress the resulting KV cache. @yoav_gelberg and I ask the complementary question: can we train the model to produce KV representations that are easier to compress? In other words: keep the compression method fixed, and change the representations it sees.

29,198

Guy Bar-Shalom

Guy Bar-Shalom @GuyBarSh

Apr 15

New blogpost out 📃 "Detecting LLM Misbehaviors from the Inside Out with Deep Learning on Structured Data" (ffabffrasca.substack.com/p/d…) [1/8]

1,935

more replies

Guy Bar-Shalom

Guy Bar-Shalom @GuyBarSh

Apr 15

- "Neural Message-Passing on Attention Graphs for Hallucination Detection", ICLR 2026 (arxiv.org/pdf/2509.24770) [7/8]

Guy Bar-Shalom

Guy Bar-Shalom @GuyBarSh

Apr 15

- "Beyond Next Token Probabilities: Learnable, Fast Detection of Hallucinations and Data Contamination on LLM Output Distributions", AAAI 2026 (arxiv.org/pdf/2503.14043) [8/8]

Guy Bar-Shalom

Guy Bar-Shalom @GuyBarSh

Feb 19

Check out our new ICLR 2026 paper - we explore hallucination detection through graph learning. Take a look!

Fabrizio Frasca @ffabffrasca

Feb 18

🧵"Neural Message Passing on Attention Graphs for Hallucination Detection" at #ICLR2026 ! 🕸️We apply GNNs on the structured data LLMs produce as they generate text (e.g. attentions) to predict their errors. 📄 arxiv.org/abs/2509.24770 🤝 @GuyBarSh (co-1st) @YftahZ @HaggaiMaron

1,400

Guy Bar-Shalom

Guy Bar-Shalom @GuyBarSh

Feb 2

Happy to share my new #ICLR2026 papers !

3,287

more replies

Guy Bar-Shalom

Guy Bar-Shalom @GuyBarSh

Feb 2

📌 [4/4] On the Expressive Power of GNN Derivatives We study how using gradients of GNNs can increase their expressive power, providing a principled way to go beyond standard message passing. arxiv.org/pdf/2510.02565

307

Guy Bar-Shalom

Guy Bar-Shalom @GuyBarSh

Feb 2

These works were joint efforts with a group of amazing collaborators: @ffabffrasca @HaggaiMaron @YftahZ @ytn_ym @yaniv_galron @yoav_gelberg @itayevron @mayabechlerspei @ido_guy Ami Tavory Moshe Eliasof Ran Elbaz

347

Haggai Maron

Guy Bar-Shalom retweeted

Haggai Maron @HaggaiMaron

30 Nov 2025

📄 Beyond Token Probes: Hallucination Detection via Activation Tensors with ACT-ViT w/ @GuyBarSh , @ffabffrasca, Yaniv Galron, @YftahZ arxiv.org/abs/2510.00296

Beyond Token Probes: Hallucination Detection via Activation...

Detecting hallucinations in Large Language Model-generated text is crucial for their safe deployment. While probing classifiers show promise, they operate on isolated layer-token pairs and are...

arxiv.org

670

Fabrizio Frasca

Guy Bar-Shalom retweeted

Fabrizio Frasca @ffabffrasca

5 Dec 2025

Replying to @GuyBarSh

@GuyBarSh and I will be presenting the poster today, stop by 🤗 📍 Fri, Dec 5 • 4:30–7:30 PM PST • Exhibit Hall C,D,E # 4000

Guy Bar-Shalom @GuyBarSh

4 Oct 2025

[1/7] New paper: "Beyond Token Probes: Hallucination Detection via Activation Tensors with ACT-ViT" #NeurIPS2025 [arxiv.org/pdf/2510.00296] Joint work with: @ffabffrasca (co-first), @yaniv_galron, @YftahZ , @HaggaiMaron

899

Omer Belhasin

Guy Bar-Shalom retweeted

Omer Belhasin @omerbelhasin

27 Nov 2025

🤔 Can discrete diffusion models actually outperform standard classifiers? We show that it can! 📄 arxiv.org/pdf/2511.20263 💻 github.com/omerb01/didicm 🌐 omerb01.github.io/didicm-web

0:44

7,343

Guy Bar-Shalom

Guy Bar-Shalom @GuyBarSh

4 Oct 2025

4,044

more replies

Guy Bar-Shalom

Guy Bar-Shalom @GuyBarSh

4 Oct 2025

[6/7] Results (over 15 LLM/dataset combinations): • Consistently outperforms classic probes • Zero-shot generalization to new datasets • Fast adaptation to unseen LLMs by tuning only their new corresponding adapter

Guy Bar-Shalom

Guy Bar-Shalom @GuyBarSh

4 Oct 2025

[7/7] Code: github.com/BarSGuy/ACT-ViT

GitHub - BarSGuy/ACT-ViT: Beyond Token Probes: Hallucination Detection via Activation Tensors with...

Beyond Token Probes: Hallucination Detection via Activation Tensors with ACT-ViT (NeurIPS 2025) - BarSGuy/ACT-ViT

github.com

295