The impact of LLM loops on peer review
EthicsComments
The real disaster is the power dynamic. Junior researchers will use LLMs to satisfy their PIs, while PIs use LLMs to skim the drafts, meaning the actual thinking is being outsourced to a prompt.
I wonder if the machine generated consensus is as inevitable as it seems. Some tools are actually quite good at flagging logical inconsistencies in a methodology that a human reviewer might miss during a long session.
The issue with relying on those logic flags is that LLMs frequently hallucinate citations to support their critiques. It doesn't replace human scrutiny; it just adds a layer of fake references that the human then has to spend hours debunking.
This loop is especially risky given the current trend of preprint polishing. If LLMs are used to smooth over the gap between the original submission and the final version, we lose the audit trail of how the science actually improved.