Associate Professor, School of Information, UC Berkeley. NLP, computational social science, digital humanities. Not active here; find me at @dbamman.bsky.social

Joined October 2009
48 Photos and videos
15 Oct 2024
Lucy is a rock star and you should all hire her!
14 Oct 2024
Hi friends, colleagues, followers. I am on the faculty job market! I am a PhD student @BerkeleyISchool @berkeley_ai. I work on NLP, and I believe all language, whether AI- or human-generated, is ✨social and cultural data✨. My work includes: 🧵
4
37
4,135
David Bamman retweeted
30 Sep 2024
How might one do classification in the era of LLMs for humanities research? 🤔 @dbamman, @KentKChang, @NaitianZhou & I apply LLMs on ten tasks from prior cultural analytics lit. Larger LMs are competitive w/ older methods on established tasks, but perform less well on new ones.
30 Sep 2024
My group just finished up a new paper that I'm excited to get out into the world: "On Classification with Large Language Models in Cultural Analytics" (to be published at CHR): github.com/bamman-group/ca-c…. More info here! bsky.app/profile/dbamman.bsk…
1
2
8
1,873
David Bamman retweeted
In cultural analytics, accuracy is often not the only (or even primary) objective. Here, we explore the myriad ways CA uses classification, how LLMs compare to other commonly used methods, and how they might enable new approaches to sensemaking from text data.
30 Sep 2024
My group just finished up a new paper that I'm excited to get out into the world: "On Classification with Large Language Models in Cultural Analytics" (to be published at CHR): github.com/bamman-group/ca-c…. More info here! bsky.app/profile/dbamman.bsk…
2
1
6
663
30 Sep 2024
My group just finished up a new paper that I'm excited to get out into the world: "On Classification with Large Language Models in Cultural Analytics" (to be published at CHR): github.com/bamman-group/ca-c…. More info here! bsky.app/profile/dbamman.bsk…
2
11
2,954
27 Aug 2024
Big congrats to @KentKChang for passing his qualifying exam today! Lots of super exciting work on measuring social interactions in culture in the pipeline --
1
22
1,861
Well deserved, congrats Kent!
It’s an extraordinary pleasure and honor to teach alongside @dbamman and his wonderful students of NLP, now doubly so to have my small part recognized by @BerkeleyISchool & UC Berkeley.
9
1,060
David Bamman retweeted
It’s an extraordinary pleasure and honor to teach alongside @dbamman and his wonderful students of NLP, now doubly so to have my small part recognized by @BerkeleyISchool & UC Berkeley.
3
2
49
4,378
17 Jun 2024
See Naitian at poster #5!
Hey NLPals, I'll be at #NAACL2024 this upcoming week! Let's chat about sociocultural NLP, what it means to study culture, and finding variation in unusual places (like memes!) I'll be presenting this memes paper at the first poster session.
3
671
17 Jun 2024
For anyone at #NAACL2024 considering lucha libre, I can attest it was spectacular (though that may be influenced by attending with my 9yo)
12
1,108
David Bamman retweeted
15 Jun 2024
I’m headed to NAACL to present this paper! I’m around mostly Sunday evening thru Tuesday. This fall I’ll be doing some thinking about what to do after my PhD; if you have advice/thoughts about this definitely chat with me!
25 Oct 2023
New preprint! 🎉 We examine two contrasting yet common assumptions around what it means for an NLG model or system to be “fair” or “good”: 1⃣ treating all social groups the same, where “bias” = any diff in outputs (invariance), or 2⃣customizing outputs to them (adaptation).
1
7
76
16,660
David Bamman retweeted
Hey NLPals, I'll be at #NAACL2024 this upcoming week! Let's chat about sociocultural NLP, what it means to study culture, and finding variation in unusual places (like memes!) I'll be presenting this memes paper at the first poster session.
Memes are pervasive in online speech. Do they have the socially meaningful variation we see in other aspects of language? YES! New preprint from me, @david__jurgens and @dbamman on the semantic structure and visual diversity of 3.8M Reddit memes. 🌐 naitian.org/social-memeing
1
3
28
5,034
17 Jun 2024
Looking forward to seeing people at #NAACL2024 this week! Today, be sure to check out @NaitianZhou's poster on the sociolinguistics of memes (11am) and @lucy3_li's talk on concepts of fairness in NLG systems at 2:36pm (ethics/bias/fairness 1)
1
1
27
1,662
Join us on Monday, 2/26 at 4:30 pm for a lecture by @dbamman: The Promise and Peril of Large Language Models for Cultural Analytics. RSVP: forms.gle/by1m6xHzTLhjJQbd8 More info: cdh.princeton.edu/events/202… Co-sponsored by @PrincetonPLI.
4
13
1,679
David Bamman retweeted
23 Jan 2024
Very excited to announce the launch of our citizen science initiative "The Lives of Literary Characters" hosted @the_zooniverse. This is the first ever literary citizen science project that aims to promote story understanding. A Thread 🧵 zooniverse.org/projects/citi…
2
17
50
6,863
David Bamman retweeted
16 Jan 2024
New preprint! 📜 We investigate how ten “quality” and English langID filters, drawn from prior lit on LLM pretraining data curation pipelines, affect webpages linked to self-descriptions of their creators. Paper: arxiv.org/abs/2401.06408 Data: huggingface.co/datasets/alle… 🧵(1/6)
3
22
153
32,166
David Bamman retweeted
22 Dec 2023
🚨NLP CSS workshop is back and will be at NAACL 2024! Paper submission deadline: March 24 sites.google.com/site/nlpand… Organizing team: @anjalie_f @dallascard @dirk_hovy and myself
1
13
68
11,799
10 Dec 2023
Congrats to Masha et al on this best industry paper award! (Masha’s a Berkeley School of Information MIMS alum!)
EMNLP 2023 Best Industry Paper Personalized Dense Retrieval on Global Index for Voice-enabled Conversational Systems (Masha Belyi, Charlotte Dzialo, Chaitanya Dwivedi, Prajit Muppidi, Kanna Shimizu) aclanthology.org/2023.emnlp-… #EMNLP2023 #NLProc
3
1,015
10 Dec 2023
Awesome work! Congrats @nikita_mehandru @swetaagrawal20 et al!!!!
I’m thrilled that this Human-Centered MT paper was recognized with an outstanding paper award at #EMNLP2023. Congratulations to lead authors Nikita Mehandru (@ucberkeley iSchool) and @swetaagrawal20 (@umdclip @istecnico) for making this interdisciplinary collaboration a success!
10
1,295
David Bamman retweeted
I’m thrilled that this Human-Centered MT paper was recognized with an outstanding paper award at #EMNLP2023. Congratulations to lead authors Nikita Mehandru (@ucberkeley iSchool) and @swetaagrawal20 (@umdclip @istecnico) for making this interdisciplinary collaboration a success!
Replying to @MarineCarpuat
2/8 "Physician Detection of Clinical Harm in Machine Translation: Quality Estimation Aids in Reliance and Backtranslation Identifies Critical Errors" with @nikita_mehandru @swetaagrawal20 @elainekhoong Niloufar Salehi among others arxiv.org/abs/2310.16924 virtual2023.emnlp.org/paper_…
4
20
113
19,228