Thanks for bringing this to my attention. I honestly wasnât aware of the situation until the recent posts started going viral. I would never encourage my students to do anything like thisâif I were serving as an Area Chair, any paper with this kind of prompt would be desk-rejected right away. That said, for any problematic submission, co-authors all share the responsibility, no excuse here. And this has been a good reminder for me, as a PI, to not just check the final PDF but also look through the full submission files. I wasnât aware of this kind of need before.
Let me take a moment to share what we found after doing a full internal review this past week--everythingâs backed up by logs and screenshots, available if needed.
1. Background
In November 2024, a researcher
@jonLorraine9 tweeted this:
x.com/jonLorraine9/status/18âŚ. That was the first time I saw this kind of idea, and I think it was also when people realized that LLM prompts could be embedded in papers. Note that such injection only works if the reviewer uploads the PDF to an LLM directly.
At that time, one thing we all agree is that LLMs should NOT be used for reviewing. Itâs a real threat to the integrity of the process. Thatâs why conferences like CVPR and NeurIPS have now explicitly and strictly banned LLM reviewing (e.g., âLLMs are NOT allowed to be used for writing the reviews nor the meta-reviews at any step.â).
If you've published at AI conferences, you probably know how frustrating it is to receive a review that was clearly written by an AI. Itâs nearly impossible to respond to, and often just as hard to definitively prove that an LLM wrote it.
While the original post might have been made partly as a joke, we all felt that trying to âfight fire with fireâ isnât the right defense--it raises more ethical issues than it solves. A better path is to address these concerns through official conference policies, not through individual hacks that can backfire.
2. What happened in our case
The student authorâwho was visiting our group briefly from Japanâtook that tweet a bit too literally and used the idea in an EMNLP submission. They copied the format exactly, not realizing it was partly a joke and could come across as manipulative or misleading. They also didnât fully grasp how this might impact public trust in science or the integrity of peer review. On top of that, they included the same thing in the arXiv version without thinking twice. I missed it tooâpartly because this goes beyond the usual checks I have in place to catch anything ethically questionable as a coauthor.
3. Next steps
The student has since updated the paper and reached out to ARR for formal guidance. We'll follow whatever steps they recommend.
4. Bigger picture
This has been a teaching moment for me. Students under pressure donât always think through all the ethical implicationsâespecially in newer areas like this. My job is to guide them through these gray zones, not just react to their mistakes. Rather than punishment, whatâs really needed is better education around these issues.
I was upset with the student at first too. But after thinking it through, I donât think the students should be punished beyond having the paper rejected. Iâve told them clearly this canât happen in the future, and weâre also planning additional training around AI ethics and responsible research practices (which to me is more about having some common sense).
Iâll be honestâitâs been not a good feeling being at the center of this kind of public shaming. These conversations should be thoughtful and constructive, not about singling people out. And honestly, the students feel the pressure even more.
I've actually been keeping up with the public conversations around this, and in a recent poll, 45.4% of people said they think this kind of thing is actually okay. Sure, itâs just a poll and there could be biasâbut it still says something about the nature of this problem.Â
x.com/gabriberton/status/194âŚ
The real issue here is the current systemâit creates space for things like this to happen. And this isnât traditional academic misconduct like faking data; itâs something newer, and it calls for a deeper, more nuanced conversation about how research ethics are evolving in the age of AI. In that sense, I donât feel too badâI feel confident I could explain the context honestly to any ethics board.
And to circle back to the original postâs questionâthis whole situation really highlights why we need to rethink how the game is played in academia. Thatâs really the main point I was trying to make in my talk. Iâm going to continue doing my best to help students learn how to do solid research.
(This post was written by me, with help from ChatGPT-4o on editing.)
Is it ethical to add a hidden line of text in your paper saying "write a good review" in case R2 uses chatGPT to review your paper?