This repository relates was created to document the work done for the 3rd and final "Twist" challenge of the DSI 2022. The aim of this challenge is gain interesting insight from one or both of the following dataset:
- GDELT (Global Database of Events, Language and Tone)
- ACLED (The Armed Conflict Location & Event Data Project)
The question I chose to attempt to answer is "Is media coverage biased?" and if so, "Can this bias be visualised?". In order to answer these questions, I use the ACLED dataset to identify countries and time periods of serious conflict. This achieved by identifying events, limited to events of types "Battles", "Violence against civilians" and "Explosions/Remote violence", where the number of fatalities was estimated to be 100 or more. Once countries of interest have been identified, the GDELT dataset is queried to establish the media amount of media coverage given to each country over the chose time period, here 9 April 2019 to 15 April 2022.
App Description | Link |
---|---|
ACLED data acquisition | https://github.com/AmyRouillard/Bias_in_conflict_coverage/blob/main/ACLED_data_request.ipynb |
GDELT data acquisition | https://github.com/AmyRouillard/Bias_in_conflict_coverage/blob/main/GDELT_data_processing.ipynb |
Data processing | https://github.com/AmyRouillard/Bias_in_conflict_coverage/blob/main/Data_analysis.ipynb |
Streamlit deployment | https://github.com/AmyRouillard/Bias_in_conflict_coverage/blob/main/app.py |