We made LLMs speak Tulu, a language with only 2 million speakers.
It wasn't easy because LLMs kept confusing it with Kannada, but we discovered negative constraints really help.
🚨 New Paper
Training an LLM to speak low-resource language
(EACL workshop, 2026)
Tulu is spoken by 2M people in coastal Karnataka and LLMs basically can't speak it. We got to 85% grammar accuracy without fine-tuning anything or collecting a single new training example.