NLP research - PhD student @uwcse

Joined June 2017
10 Photos and videos
Ben Newman retweeted
🤖➡️📉 Post-training made LLMs better at chat and reasoning—but worse at distributional alignment, diversity, and sometimes even steering(!) We measure this with our new resource (Spectrum Suite) and introduce Spectrum Tuning (method) to bring them back into our models! 🌈 1/🧵
5
49
196
68,328
11 Nov 2024
✨EMNLP Paper! ✨ Have you ever constructed a table to organize your lit review process? Can we use LMs to generate these automatically? We are excited to present ArxivDIGESTables 🍽️ a study of collecting, generating, and evaluating 🎓 scientific literature review tables 📃!
1
16
77
17,714
11 Nov 2024
We also find that providing more table context (captions, in-text references) to models leads to higher recall when generating columns but does not help when generating values.
1
1
1
290
23 Dec 2023
RT @maria_antoniak: I've started thinking about what I want to do after my postdoc at AI2 💫 If you know of an academic department or indust…
52
Ben Newman retweeted
15 Nov 2023
I'm on the faculty market! My goal is to build language systems that we understand deeply through discovery and by design, so we can precisely control them and treat their failures. Let's tackle this grand challenge of science and engineering together. nlp.stanford.edu/~johnhew/

6
73
407
96,899
Ben Newman retweeted
If you missed our demo at #emnlp2023 this week, you can now play with PaperMage yourself and explore how you can easily access PDF documents using Python for your future projects: (1/5)
EMNLP 2023 Best Paper Demo PaperMage: A Unified Toolkit for Processing, Representing, and Manipulating Visually-Rich Scientific Documents (Kyle Lo, Zejiang Shen, Benjamin Newman, Joseph Chang et al.) aclanthology.org/2023.emnlp-… #EMNLP2023 #NLProc
1
9
28
6,479
Ben Newman retweeted
10 Dec 2023
so happy to have our work recognized at #emnlp2023 🥳 big thanks to @shannonzshen @blnewm @josephcc @soldni and collaborators at @SemanticScholar @allen_ai @UCBerkeley @MIT two of my favorite aspects of this work:
EMNLP 2023 Best Paper Demo PaperMage: A Unified Toolkit for Processing, Representing, and Manipulating Visually-Rich Scientific Documents (Kyle Lo, Zejiang Shen, Benjamin Newman, Joseph Chang et al.) aclanthology.org/2023.emnlp-… #EMNLP2023 #NLProc
7
14
105
20,320
9 Dec 2023
Hello! We will be presenting our poster on 📃 scientific decontextualization🎓 at #EMNLP2023 TODAY (12/9) at 11am in the east foyer! This is work with @soldni, @rayrayfok @armancohan, and @kylelostat, conducted at @SemanticScholar/@allen_ai
1
5
37
8,080
9 Dec 2023
We propose a QA-based decontextualization framework with three stages: 1️⃣ Question Generation: identify clarifying questions 2️⃣ Question Answering: find evidence from the paper and cited papers. 3️⃣ Rewriting: Incorporate the answers in the snippets, which are now decontextualized
1
2
139
9 Dec 2023
We find our framework helps annotators when collecting data. ✅ And it leads to LLM pipelines based that improve over end-to-end baselines. ✅ Findings: open models underperform closed ones, and that QG and QA are the most challenging. Link: arxiv.org/abs/2305.14772
1
121
Ben Newman retweeted
multimodal PDF processing is painful but doesn’t have to! come to our demo at #EMNLP2023 of Papermage, a library for fast manipulation of PDFs (Friday 9/12 @ 9am) we have used it for LLM data cleanup, paper QA, HCI prototypes github.com/allenai/papermage aclanthology.org/2023.emnlp-…
7
45
307
33,300