Bryan Li

Bryan Li

Photos and videos

Tweets

Colin Cherry retweeted

11 Mar 2025

Externally retrieving knowledge empowers LLMs for domain-adapted MT ⚖️🩺. But how is knowledge best represented, and how viable is generating it from an LLM itself? Our @GoogleAI paper investigates these questions through a careful experimental setup 📜. arxiv.org/abs/2503.05010

446

NAACL HLT 2027

Colin Cherry retweeted

NAACL HLT 2027 @naaclmeeting

14 Mar 2025

<<Call for BoF/Affinity Group meeting>> Applicants should fill out the application form before March 24 2025.naacl.org/calls/affinit… #NAACL2025

Call for Birds of a Feather Session / Affinity Group Meeting Organizer Application

Official website for the 2025 Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies

2025.naacl.org

1,520

iseeaswell꩜bʂky

Colin Cherry retweeted

iseeaswell꩜bʂky @iseeaswell

19 Feb 2025

😼SMOL DATA ALERT! 😼Anouncing SMOL, a professionally-translated dataset for 115 very low-resource languages! Paper: arxiv.org/pdf/2502.12301 Huggingface: huggingface.co/datasets/goog…

4,187

NAACL HLT 2027

Colin Cherry retweeted

NAACL HLT 2027 @naaclmeeting

12 Feb 2025

The call for Diversity and Inclusion Subsidies is out: 2025.naacl.org/calls/dei_sub…

Call for NAACL 2025 Diversity and Inclusion Subsidies

Official website for the 2025 Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies

2025.naacl.org

2,143

Mara Finkelstein

Colin Cherry retweeted

Mara Finkelstein @marafinkels

26 Nov 2024

LLMs are typically evaluated w/ automatic metrics on standard test sets, but metrics test sets are developed independently. This raises a crucial question: Can we design automatic metrics specifically to excel on the test sets we prioritize? Answer: Yes! arxiv.org/abs/2411.15387

11,766

Dan Deutsch

Colin Cherry retweeted

Dan Deutsch @_danieldeutsch

12 Nov 2024

New application link! google.com/about/careers/app… I am at EMNLP/WMT this week. Please come find me if you want to learn more about this role!

Dan Deutsch @_danieldeutsch

18 Oct 2024

Interested in doing research on Google Translate and Gemini? Good news! I’m hiring for full-time roles on the Google Translate Research Team! Apply here: google.com/about/careers/app…

5,536

NAACL HLT 2027

Colin Cherry retweeted

NAACL HLT 2027 @naaclmeeting

29 Oct 2024

📢Don't miss the NAACL Student Research Workshop! 🖇️ CFP & Important dates: naacl2025-srw.github.io/cfp #NLProc

3,003

NAACL

Colin Cherry retweeted

NAACL @naacl

24 Oct 2024

Thank you to those who participated in our recent all-member vote regarding our name change. The change is happening! We are: The Nations of the Americas Chapter of the Association for Computational Linguistics! Announcement 👉 naacl.org/posts/2024-10-24-N…

3,271

NAACL HLT 2027

Colin Cherry retweeted

NAACL HLT 2027 @naaclmeeting

21 Oct 2024

📢 NAACL needs Reviewers & Area Chairs! 📝 If you haven't received an invite for ARR Oct 2024 & want to contribute, sign up by Oct 22nd! ➡️AC form: forms.office.com/r/8j6jXLfAS… ➡️Reviewer form: forms.office.com/r/cjPNtL9gP… Please RT 🔁 and help spread the word! 🗣️ #NLProc @ReviewAcl

9,806

Dan Deutsch

Colin Cherry retweeted

Dan Deutsch @_danieldeutsch

18 Oct 2024

Interested in doing research on Google Translate and Gemini? Good news! I’m hiring for full-time roles on the Google Translate Research Team! Apply here: google.com/about/careers/app…

246

38,341

Slator

Colin Cherry retweeted

Slator

@slatornews

17 Oct 2024

Researchers from @Google reveal that verbose #LLMs, 🤖 which offer multiple translations 🔄 or refuse to translate, 🚫 pose significant challenges ⚠️ to traditional #MT evaluation frameworks. #machinetranslation @ebriakou @ColinCherry @markuseful slator.com/google-finds-refu…

Google Finds ‘Refusal to Translate’ Most Common Form of LLM Verbosity

Google researchers reveal that verbose LLMs, which offer multiple translations or refuse to translate, challenge traditional MT evaluation.

slator.com

475

NAACL HLT 2027

Colin Cherry retweeted

NAACL HLT 2027 @naaclmeeting

17 Oct 2024

📢 Call for demos is out!! #NAACL2025 #NLProc Check the website for submission guidelines and a chance to win the Best Demo Award! 🏆 🖇️ 2025.naacl.org/calls/demo/

Call for System Demonstrations

Official website for the 2025 Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies

2025.naacl.org

6,366

Paola Garcia

Colin Cherry retweeted

Paola Garcia @leibnyPaola

7 Oct 2024

📢📢🌟@jhuclsp Have an Idea? Let’s Hear It! JSALT 2025 Call for proposal is out. Deadline: October 15th, 2024 For more information: clsp.jhu.edu/the-11th-freder…

The 11th Frederick Jelinek Memorial Summer Workshop on Speech and Language Technology - Center for...

Brno University of Technology JSALT2025 site June 9 — August 1, 2025 Each summer, the Center for Language and Speech Processing (CLSP) organizes and hosts multidisciplinary teams of more than 50...

clsp.jhu.edu

4,486

Eleftheria Briakou

Colin Cherry retweeted

Eleftheria Briakou @ebriakou

2 Oct 2024

[1/5] Are verbose #LLM translations skewing evaluation results? TLDR: Yes! Our recent work dives into the prevalence and impact of LLM verbosity in automatic and human evaluations. 📎 Paper: arxiv.org/pdf/2410.00863

4,492

NAACL HLT 2027

Colin Cherry retweeted

NAACL HLT 2027 @naaclmeeting

2 Oct 2024

📢 Second call for papers is out!! #NAACL2025 #NLProc 🖇️ 2025.naacl.org/calls/papers/

9,543

Eleftheria Briakou

Colin Cherry retweeted

Eleftheria Briakou @ebriakou

12 Sep 2024

Translation is a complex task involving pre-translation research and post-translation stages. Can #LLMs handle this process step-by-step, relying solely on their internal knowledge? ✨We show that decomposing the translation process significantly improves #Gemini translation quality of long-form texts across all #WMT24 languages! 📜arxiv.org/pdf/2409.06790

6,564

NAACL HLT 2027

Colin Cherry retweeted

NAACL HLT 2027 @naaclmeeting

12 Sep 2024

📢 Calling all #NLProc enthusiasts! Submit your tutorial and workshop proposals to 2025 *ACL conferences (NAACL, ACL, EMNLP) through one joint call! Tutorials: 2025.naacl.org/calls/tutoria… Workshops:2025.naacl.org/calls/worksho…

3,760

Mara Finkelstein

Colin Cherry retweeted

Mara Finkelstein @marafinkels

27 Aug 2024

🥳 LLMs are changing the game, even for datasets! NewsPaLM, a publicly released LLM-generated dataset, outperforms larger web-crawled corpora for MT. It includes sentence & paragraph-level, MBR-decoded data. See paper for more, incl. LLM self-distillation. arxiv.org/abs/2408.06537

Introducing the NewsPaLM MBR and QE Dataset: LLM-Generated...

Recent research in neural machine translation (NMT) has shown that training on high-quality machine-generated data can outperform training on human-generated data. This work accompanies the...

arxiv.org

3,523

NAACL HLT 2027

Colin Cherry retweeted

NAACL HLT 2027 @naaclmeeting

22 Aug 2024

First call for papers is out! #NAACL2025 🔴2025.naacl.org/calls/papers/

7,917

Rishabh Agarwal

Colin Cherry retweeted

Rishabh Agarwal

@agarwl_

17 Jul 2024

[New paper] If you are sampling multiple outputs from a teacher LLM (e.g., Gemini 1.5 GPT), ranking them, and fine-tuning the student on the best output, you can do better. Simple idea: Fine-tune / Distill on the top-k outputs instead. Consistent gains on machine translation.

184

20,723