Taylor Sorensen

Taylor Sorensen

10 Photos and videos

Tweets

Ben Newman retweeted

Taylor Sorensen @ma_tay_

13 Oct 2025

🤖➡️📉 Post-training made LLMs better at chat and reasoning—but worse at distributional alignment, diversity, and sometimes even steering(!) We measure this with our new resource (Spectrum Suite) and introduce Spectrum Tuning (method) to bring them back into our models! 🌈 1/🧵

196

68,328

Ben Newman

Ben Newman @blnewm

11 Nov 2024

✨EMNLP Paper! ✨ Have you ever constructed a table to organize your lit review process? Can we use LMs to generate these automatically? We are excited to present ArxivDIGESTables 🍽️ a study of collecting, generating, and evaluating 🎓 scientific literature review tables 📃!

A screenshot of the first page of the paper discussed in the thread. Figure 1 contains a set of three cartoon papers with related text highlighted in three different colors. To its left, there's an arrow pointing to a cartoon table with a column corresponding to each color and a row corresponding to each table.

ALT A screenshot of the first page of the paper discussed in the thread. Figure 1 contains a set of three cartoon papers with related text highlighted in three different colors. To its left, there's an arrow pointing to a cartoon table with a column corresponding to each color and a row corresponding to each table.

17,714

more replies

Ben Newman

Ben Newman @blnewm

11 Nov 2024

We also find that providing more table context (captions, in-text references) to models leads to higher recall when generating columns but does not help when generating values.

Two plots of recall versus threshold for determining a schema match: one for GPT-3.5 Turbo and another for Mixtral 8x22B. There are five lines in each plot. Each line travels from the top left to bottom right of the plot with y-intercepts that are generally in increasing order by the following types of context: generated caption, baseline, gold caption, in-context examples, caption in-text references.

ALT Two plots of recall versus threshold for determining a schema match: one for GPT-3.5 Turbo and another for Mixtral 8x22B. There are five lines in each plot. Each line travels from the top left to bottom right of the plot with y-intercepts that are generally in increasing order by the following types of context: generated caption, baseline, gold caption, in-context examples, caption in-text references.

290

Ben Newman

Ben Newman @blnewm

11 Nov 2024

This is work with @yoonjoo_le2, @arnaik19, @Siangliulue, @rayrayfok, @imjuhokim, @dsweld, @josephcc, and @kylelostat conducted at @SemanticScholar @allen_ai @uwcse @kaist Code & Data: github.com/bnewm0609/arxivDI… Paper: aclanthology.org/2024.emnlp-…

GitHub - bnewm0609/arxivDIGESTables

Contribute to bnewm0609/arxivDIGESTables development by creating an account on GitHub.

github.com

459

Ben Newman

Ben Newman @blnewm

23 Dec 2023

RT @maria_antoniak: I've started thinking about what I want to do after my postdoc at AI2 💫 If you know of an academic department or indust…

John Hewitt

Ben Newman retweeted

John Hewitt @johnhewtt

15 Nov 2023

I'm on the faculty market! My goal is to build language systems that we understand deeply through discovery and by design, so we can precisely control them and treat their failures. Let's tackle this grand challenge of science and engineering together. nlp.stanford.edu/~johnhew/

407

96,899

Joseph Chee Chang

Ben Newman retweeted

Joseph Chee Chang @josephcc

10 Dec 2023

If you missed our demo at #emnlp2023 this week, you can now play with PaperMage yourself and explore how you can easily access PDF documents using Python for your future projects: (1/5)

EMNLP 2026 @emnlpmeeting

10 Dec 2023

EMNLP 2023 Best Paper Demo PaperMage: A Unified Toolkit for Processing, Representing, and Manipulating Visually-Rich Scientific Documents (Kyle Lo, Zejiang Shen, Benjamin Newman, Joseph Chang et al.) aclanthology.org/2023.emnlp-… #EMNLP2023 #NLProc

6,479

Kyle Lo

Ben Newman retweeted

Kyle Lo

@kylelostat

10 Dec 2023

so happy to have our work recognized at #emnlp2023 🥳 big thanks to @shannonzshen @blnewm @josephcc @soldni and collaborators at @SemanticScholar @allen_ai @UCBerkeley @MIT two of my favorite aspects of this work:

EMNLP 2026 @emnlpmeeting

10 Dec 2023

105

20,320

EMNLP 2026

Ben Newman retweeted

EMNLP 2026 @emnlpmeeting

10 Dec 2023

PaperMage: A Unified Toolkit for Processing, Representing, and Manipulating Visually-Rich Scienti...

Kyle Lo, Zejiang Shen, Benjamin Newman, Joseph Chang, Russell Authur, Erin Bransom, Stefan Candra, Yoganand Chandrasekhar, Regan Huff, Bailey Kuehl, Amanpreet Singh, Chris Wilhelm, Angele Zamarron,...

aclanthology.org

177

47,067

Ben Newman

Ben Newman @blnewm

9 Dec 2023

Hello! We will be presenting our poster on 📃 scientific decontextualization🎓 at #EMNLP2023 TODAY (12/9) at 11am in the east foyer! This is work with @soldni, @rayrayfok @armancohan, and @kylelostat, conducted at @SemanticScholar/@allen_ai

ALT A screenshot of the first page of the paper.

8,080

more replies

Ben Newman

Ben Newman @blnewm

9 Dec 2023

We propose a QA-based decontextualization framework with three stages: 1️⃣ Question Generation: identify clarifying questions 2️⃣ Question Answering: find evidence from the paper and cited papers. 3️⃣ Rewriting: Incorporate the answers in the snippets, which are now decontextualized

ALT Illustration of the three stages of our pipeline.

139

Ben Newman

Ben Newman @blnewm

9 Dec 2023

We find our framework helps annotators when collecting data. ✅ And it leads to LLM pipelines based that improve over end-to-end baselines. ✅ Findings: open models underperform closed ones, and that QG and QA are the most challenging. Link: arxiv.org/abs/2305.14772

ALT Table showing relative performance of open and closed models at the rewriting component of our pipeline.

121

Luca Soldaini 🎀

Ben Newman retweeted

Luca Soldaini 🎀

@soldni

9 Dec 2023

multimodal PDF processing is painful but doesn’t have to! come to our demo at #EMNLP2023 of Papermage, a library for fast manipulation of PDFs (Friday 9/12 @ 9am) we have used it for LLM data cleanup, paper QA, HCI prototypes github.com/allenai/papermage aclanthology.org/2023.emnlp-…

ALT A picture of kyle in front of our poster

ALT A screenshot of the first page of papermage demo paper

307

33,300