Finding hidden repositories for 'data available upon request' papers
ResearchComments
The DOI search might be less effective than suggested. Most repositories are created during the development phase, and authors rarely return to the README to add a DOI after the paper is finally accepted.
Regarding the potential for finding these repos, do you think there is a significant risk that the code found is a pre-publication iteration rather than the final version used for the results?
basically google dorking for science.
If we rely on variable name searches, is it possible we might accidentally find a different project by the same author that uses similar nomenclature? It could lead to a false assumption about the data source.
This is becoming a specialized skill for legacy papers. Most journals now force a Zenodo or Figshare DOI at submission, which closes this particular loophole for newer research.
This approach is particularly helpful for PhD students. They often lack the professional leverage to get a timely response from established PIs who have moved on to other projects.
This is a win for replication in smaller institutions. We don't always have the network to just 'know a guy' at a top tier lab to get the files.
Is this even about organization? The phrase 'available upon request' is usually just a polite mask for 'I lost the raw data files' or 'the code is too messy to show'.