PseudoProof

https://pseudoproof.streamlit.app

Using ML models and generative AI to identify fabricated datasets in academic papers

Project done @ Le Wagon Bootcamp by:

Anaïs Pepey, Mariano Rubio and Despoina Kotsopoulou

Inspired by previous research by Michael S. Bradshaw and Samuel H. Payne:
https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0260395

Step-by-step tutorial:

  • Upload a CSV dataset you have suspicions about
  • Wait about 5 seconds while the random forest model classifies each row
  • Download the results: the original dataset with our prediction added for each row
    • 1: this row has likely been fabricated by a machine
    • 0: this row is likely authentic
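
Once the results are downloaded, the per-row predictions can be summarized with a few lines of Python. The sketch below is only an illustration, not part of the app: it assumes the prediction column is named `prediction` (the actual column name in the downloaded file may differ), and the sample data is made up.

```python
import csv
import io


def summarize_predictions(csv_text, column="prediction"):
    """Count rows flagged as fabricated (1) or authentic (0) in a results CSV.

    `column` is an assumed name for the prediction column; adjust it to
    match the header of the file you actually downloaded.
    """
    reader = csv.DictReader(io.StringIO(csv_text))
    flagged_rows = []  # zero-based indices of rows predicted as fabricated
    total = 0
    for index, row in enumerate(reader):
        total += 1
        # Accept "1", "1.0", "0", "0.0" as written by common CSV exporters.
        if int(float(row[column])) == 1:
            flagged_rows.append(index)
    return {
        "total": total,
        "fabricated": len(flagged_rows),
        "authentic": total - len(flagged_rows),
        "flagged_rows": flagged_rows,
    }


# Hypothetical results file: original columns plus a prediction per row.
sample = "gene,value,prediction\nA,1.2,0\nB,3.4,1\nC,5.6,1\n"
summary = summarize_predictions(sample)
print(summary)
```

To use it on a real download, read the file's contents (for example with `open(path, newline="").read()`) and pass the text to `summarize_predictions`.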

PseudoProof demo:

Demo-PseudoProof.mov

PseudoProof presentation @ Le Wagon:

https://youtu.be/RVEVqyTf8X8?feature=shared&t=4245