Joined October 2024
47 Photos and videos
Our Editor-in-Chief, Dr. Wei Lu, presents a new vision for the journal in the editorial 'Opening a New Chapter for Computational Linguistics', marking a forward-looking transition as the journal enters its second half-century. Read: direct.mit.edu/coli/article/…
3
17
4,911
CL Journal has over 50 years of history in the field. While the field is moving, some of the articles from decades ago provide perspective and still relevant content. What do you think of these articles from 20 and 40 years ago? (2006 and 1986) #NLProc
1
3
14
872
While we're preparing the next paper or next research question to explore, perhaps reading some of the older articles can give us a wider perspective. #CLJournal #NLProc
1
4
145
While we're waiting for the next issue of the CL Journal to introduce the new articles, you can get early access to upcoming articles here: direct.mit.edu/coli/online-e… #NLProc #CLJournal

1
3
313
Like humans, LLMs can be right for the wrong reasons. They can also be wrong for the wrong reasons. Anthropocentric bias has gone largely unexamined, posing a serious obstacle to the objective assessment of LLM capacities. Read more at doi.org/10.1162/COLI.a.582 #NLProc #CLJournal
1
15
1,359
Interpretability provides a toolset for understanding how and why LMs behave in certain ways. This survey proposes a perspective on interpretability research grounded in causal mediation analysis: doi.org/10.1162/COLI.a.572 #NLProc #CLJournal @SunJiuding @ericwtodd
8
54
4,113
How Can We Effectively Expand the Vocabulary of LLMs with 0.01GB of Target Language Text? This article explores a very important question for low-resource languages by experimenting with various techniques across 10 languages.A must-read at:doi.org/10.1162/COLI.a.581 #CLJournal #NLP
4
20
3,068
Should NLP metrics for bilingual code-switching use words as the token level or Intonation Units? Authors of this article show that intonation units will enhance comparisons between bilingual individuals, settings, and communities: doi.org/10.1162/COLI.a.580 #NLProc @rpattichi
1
5
368
Have you heard of soft metrics such as soft micro F1? In this article, the authors argue that for evaluating model predictions with human label variation, the standard metrics may not be sufficient. Read more at: doi.org/10.1162/COLI.a.578 #NLProc
2
8
961
To assess whether multilingual NLP performs well across languages, we need to evaluate it on all world languages. That is not feasible. This paper implements two sampling methods from linguistic typology and provides a Python package to facilitate this: doi.org/10.1162/COLI.a.577
7
25
2,682
Developing methods to assess the factuality of LLMs has become urgent. This paper presents LLM-OASIS for the factuality evaluation task. It turns out it significantly challenges SOTA LLMs. If you're up for improving over what's out there, start reading: doi.org/10.1162/COLI.a.575
7
395
Language Models are susceptible to adversarial attacks, where even subtle perturbations to input texts adversely affect model performance. Yang et al. propose a novel method to tackle this called Defensive Dual Masking (DDM). Read more at: doi.org/10.1162/COLI.a.574 #NLProc #NLP
2
5
874
Do current LLMs with fast-improving functional linguistic abilities exhibit distinct localization of formal (e.g., producing fluent, grammatical text) and functional (e.g., reasoning and consistent fact retrieval) linguistic mechanisms? Answer in: doi.org/10.1162/COLI.a.24 #NLProc
16
1,347
Have you heard of linguistic steganography? It seeks to conceal secret information within natural language text. Liu et al. propose a novel method called SA-ANS. It's a self-adaptive framework based on a self-adjusting Asymmetric Numeral System: doi.org/10.1162/COLI.a.22 #NLProc
7
626
If your research is around metaphor detection and interpretation, this article is for you! It introduces Meta4XNLI, the first parallel dataset for Natural Language Inference (NLI), in both English and Spanish: doi.org/10.1162/COLI.a.20 #NLProc
6
10
1,228
Volume 52, Issue 1 of Computational Linguistics is released 📣 You can access this issue at direct.mit.edu/coli/issue/52… The articles included in this issue will be introduced here and on 🦋 bsky.app/profile/complingjou…! #NLProc #NLP

2
6
464
What % of the NLP papers measure their impact in the real world? This paper proposes an "impact evaluation" of NLP models or systems for real-world usage, changing the research culture of NLP to focus more on real-world impact and less on SOTA-chasing: doi.org/10.1162/COLI.a.18
4
42
5,510
Hallucinations pose a substantial challenge to the reliability of LLMs in real-world scenarios. Zhang et al. survey methods of detection, explanation, & mitigation of hallucination, & provide a taxonomy & list of benchmarks for evaluation in this paper: doi.org/10.1162/COLI.a.16
5
29
3,424
Generative AI has advanced. But, for complex problems, it's lagging. In this position paper, the authors propose Human–AI Co-Construction (HAI-Co2), a framework for human–AI cooperative problem solving that facilitates such interaction. Read more at: doi.org/10.1162/COLI.a.19 #NLP
1
220
Tasks such as summarization, QA & timeline creation need temporal expression normalization. The authors propose a novel method that deals with the known problems of temporal expression normalization on data scarcity, language and domain adaptation:doi.org/10.1162/COLI.a.12 #NLProc
1
1
3
413
Have you heard of BLiMP for evaluating English LMs? Well, the authors of this paper introduce the Dutch version: BLiMP-NL. It's a benchmark set of linguistic minimal pairs for grammatical evaluation of Dutch LMs. Read more at: doi.org/10.1162/COLI_a_00559 #NLPRoc
6
252