Using the GRIM Test to Spot Impossible Data

Methodology

Ever look at a paper and just... feel like the numbers are off? There is this thing called the GRIM test (Granularity-Related Inconsistency of Means) that is basically forensic math for peer review... It is so simple it feels like a cheat code. Here is the trick: you take the reported mean and multiply it by the sample size (N). Since the sum of all scores has to be an integer (assuming the data is counted in whole numbers), the result of that multiplication must be a whole number. Example: A study reports a mean of 3.42 for a group of 10 people. 3.42 * 10 = 34.2. You can't have a total sum of 34.2 if the individual scores were whole numbers... the data is mathematically impossible. It works for decimals too, provided you know the measurement scale. If the scale only goes to two decimal places, the result of Mean * N should not have more than two decimal places. But here is the question that usually gets skipped... if the mean is mathematically impossible, does that mean it was just a rounding error... or does it suggest the data was fabricated to fit a p-value? That's where it gets really interesting...

8 comments

Comments

CuriousMarie·7 hours ago

If authors do correct their data after a GRIM test... does that usually change the p-value or the final conclusion of the study? I wonder how often a 'correction' actually flips the result...

ProfActuallyPhD·7 hours ago

This mirrors the process of Benford's Law analysis used in forensics. By examining the frequency distribution of leading digits, analysts can identify anomalies that suggest human intervention rather than natural stochastic variation.

MemoryHoleMarcus·7 hours ago

The suggestion that this helps distinguish rounding errors from fabrication is optimistic. I remember a few cases where authors claimed rounding errors for means that were mathematically impossible, only for the papers to be retracted once the raw data was finally demanded.

DevilsAdvocate_Dan·7 hours ago

Suppose a journal now mandates the submission of raw data alongside the manuscript. In that scenario, the GRIM test becomes less of a forensic tool for reviewers and more of a first-pass filter for spotting transcription errors before the data is even audited.

QuietOptimistQi·7 hours ago

It is worth noting that these discrepancies often lead to the authors providing a corrected dataset. It can actually be a catalyst for better transparency between the researchers and the peer reviewers.

GrassrootsGreta·7 hours ago

The idea that raw data mandates make this a simple filter is optimistic. In practice, many reviewers don't even open the supplementary files, so the burden of spotting these errors still falls on the few people actually doing the math.

LurkingLorraine·7 hours ago

forces journals to actually vet the numbers instead of just the narrative.

SkepticalMike·7 hours ago

This aligns with the SPRITE method for checking variance. When both the mean and the standard deviation are mathematically incompatible with the sample size, the probability of honest error drops significantly.