ODKCleaner is a web app that generates Stata cleaning code for your ODK-generated dataset. Upload your XLSForm and the app returns cleaning code ready to run on your data. It offers the following features:
- Generate Stata variable labels based on your XLSForm
- Generate value labels for single choice variables
- Handle select_multiple questions, either by splitting them into binary variables or labeling them if they are already split
- Remove note variables from the dataset
- Support questionnaires written in more than one language
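
To make the features above concrete, here is a minimal sketch of how XLSForm rows could be turned into Stata labeling commands. It is not ODKCleaner's actual implementation: the function names `clean_label` and `stata_cleaning_code` and the sample rows are hypothetical, the `survey` and `choices` sheets are represented as plain lists of dicts rather than read from an .xlsx file, and only choices with integer names are turned into value labels.

```python
# Hypothetical XLSForm content; a real form would be read from the .xlsx file.
survey = [
    {"type": "note", "name": "intro", "label": "Welcome"},
    {"type": "integer", "name": "age", "label": "Age of respondent"},
    {"type": "select_one yesno", "name": "consent", "label": "Do you consent?"},
]
choices = [
    {"list_name": "yesno", "name": "1", "label": "Yes"},
    {"list_name": "yesno", "name": "0", "label": "No"},
]


def clean_label(text):
    """Collapse whitespace and swap double quotes so the text fits inside "..."."""
    return " ".join(str(text).split()).replace('"', "'")


def stata_cleaning_code(survey, choices):
    """Return a list of Stata commands (assumed layout, one command per line)."""
    lines = []
    defined = set()  # value-label sets already emitted
    for row in survey:
        qtype, name = row["type"].strip(), row["name"]
        if qtype == "note":
            # Notes carry no data; `capture` avoids an error if the column is absent.
            lines.append(f"capture drop {name}")
            continue
        # Stata variable labels hold at most 80 characters.
        lines.append(f'label variable {name} "{clean_label(row["label"])[:80]}"')
        parts = qtype.split()
        if parts[0] == "select_one" and len(parts) > 1:
            list_name = parts[1]
            # Assumption: only integer choice names can become value labels.
            pairs = [
                (c["name"], clean_label(c["label"]))
                for c in choices
                if c["list_name"] == list_name and c["name"].lstrip("-").isdigit()
            ]
            if pairs:
                if list_name not in defined:
                    body = " ".join(f'{value} "{text}"' for value, text in pairs)
                    lines.append(f"label define {list_name} {body}, replace")
                    defined.add(list_name)
                lines.append(f"label values {name} {list_name}")
    return lines


print("\n".join(stata_cleaning_code(survey, choices)))
```

Note that `label values` only applies to numeric variables, so a column imported as a string would need to be converted (for example with `destring`) before the generated code runs.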
To use ODKCleaner, simply follow these steps:
- Open the web app at ODKCleaner
- Upload your XLSForm (the questionnaire in .xlsx format) from your Kobo/ODK Cloud/SurveyCTO dashboard
- Choose the questionnaire labels to use as your variable labels
- Specify how you want to handle select_multiple questions
- Get the generated Stata cleaning code
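
The label-language and select_multiple choices in the steps above could work roughly as in the sketch below. This is an illustration under stated assumptions, not the app's own code: `label_column` and `split_select_multiple` are hypothetical helpers, label columns are assumed to follow the XLSForm `label::Language` naming, the exported select_multiple column is assumed to be a string of space-separated choice names, and the binary variables are assumed to be named `question_choice`.

```python
def label_column(columns, language=None):
    """Pick the label column for a language, e.g. "label::French"."""
    if language is None:
        return "label"
    for col in columns:
        if col.lower().startswith("label::" + language.lower()):
            return col
    raise KeyError(f"no label column for language {language!r}")


def split_select_multiple(var, list_name, choices, label_col):
    """Return Stata commands that split `var` into one binary variable per choice."""
    lines = []
    for choice in choices:
        if choice["list_name"] != list_name:
            continue
        value = choice["name"]
        new_var = f"{var}_{value}"  # assumed naming scheme
        # Pad with spaces so that choice "1" does not match inside "11".
        lines.append(
            f'gen byte {new_var} = strpos(" " + {var} + " ", " {value} ") > 0 '
            f"if !missing({var})"
        )
        text = str(choice[label_col]).replace('"', "'")
        lines.append(f'label variable {new_var} "{text}"')
    return lines


# Hypothetical choices sheet with labels in two languages.
choices = [
    {"list_name": "fruit", "name": "1", "label::English": "Apple", "label::French": "Pomme"},
    {"list_name": "fruit", "name": "2", "label::English": "Banana", "label::French": "Banane"},
]

col = label_column(choices[0].keys(), "french")
print("\n".join(split_select_multiple("fruits", "fruit", choices, col)))
```

If the export already contains one column per choice, only the `label variable` lines would be needed, with the variable names adjusted to match the export.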
ODKCleaner depends on the following Python packages:
- streamlit
- pandas
- openpyxl
Contributions are welcome! If you'd like to contribute, please follow these steps:
- Fork the repository on GitHub
- Create a new branch for your changes
- Make your changes and commit them to your branch
- Create a pull request to merge your changes into the main repository
- Wait for your pull request to be reviewed and merged