To begin using the project, follow the steps below:
- Clone the repository:
$ git clone https://github.com/open-data-kazakhstan/city-population.git
- Create and activate a virtual environment (venv ships with Python 3, so it needs no separate install):
python -m venv /path/to/localrepo
cd /path/to/localrepo
Scripts\activate  # For Windows users
source bin/activate  # For Linux/macOS users
- Install dependencies:
pip install -r requirements.txt
- Run the script (data transformation):
python scripts/transform.py
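If `pip install` or the script fails unexpectedly, it can help to confirm that the virtual environment from the steps above is actually active. The snippet below is a small sketch using only the standard library; the function name `in_virtualenv` is an illustrative choice and not part of this repository.

```python
import sys


def in_virtualenv() -> bool:
    """Return True when the interpreter runs inside a venv-style environment."""
    # Inside a venv, sys.prefix points at the environment directory, while
    # sys.base_prefix still points at the base Python installation.
    return sys.prefix != sys.base_prefix


print("virtual environment active:", in_virtualenv())
```

Run it with the same `python` you use for `scripts/transform.py`; if it reports `False`, activate the environment again before installing dependencies.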
Population data is sourced from stat.gov.
archive/source.xlsx: Raw data on the population of Kazakhstan as of January 1, 2023.
data/city_population.csv: Cleaned version containing population figures for the regions and three major cities of Kazakhstan.
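Once the cleaned CSV exists, it can be read with Python's standard `csv` module. The sketch below assumes only that the file is UTF-8 and has a header row; the column names are not documented here, so the helper `column_names` (a hypothetical name) reports whatever the file actually contains.

```python
import csv
from pathlib import Path


def load_population(path):
    """Read the cleaned CSV into a list of dicts, one dict per data row."""
    # newline="" lets the csv module handle line endings itself.
    with Path(path).open(newline="", encoding="utf-8") as handle:
        return list(csv.DictReader(handle))


def column_names(rows):
    """Return the header names taken from the first row, or an empty list."""
    return list(rows[0].keys()) if rows else []
```

For example, `rows = load_population("data/city_population.csv")` followed by `column_names(rows)` shows which fields are available before any further processing. All values come back as strings, so numeric columns need an explicit conversion.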
package.py: Creates or updates the datapackage.json file, which contains metadata about the dataset.
transform.py: Converts source.xlsx to CSV format for easier processing.
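To give a rough idea of what a datapackage.json file holds, here is a minimal sketch that writes a descriptor with the standard `json` module. It is not the repository's package.py: the function names, the chosen fields, and the license identifier `ODC-PDDL-1.0` are assumptions made for illustration, and the real script may record more metadata or use a dedicated library.

```python
import json


def build_descriptor():
    """Build a minimal Data Package style descriptor as a plain dict."""
    return {
        "name": "city-population",  # taken from the repository name
        "licenses": [
            {"name": "ODC-PDDL-1.0"}  # assumed identifier for the PDDL
        ],
        "resources": [
            {
                "name": "city_population",  # hypothetical resource name
                "path": "data/city_population.csv",
                "format": "csv",
            }
        ],
    }


def write_descriptor(path="datapackage.json"):
    """Serialize the descriptor to disk as indented UTF-8 JSON."""
    with open(path, "w", encoding="utf-8") as handle:
        json.dump(build_descriptor(), handle, indent=2, ensure_ascii=False)
        handle.write("\n")
```

Calling `write_descriptor()` from the repository root would create or overwrite datapackage.json in place, which mirrors the "create or update" behavior described above.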
This dataset is licensed under the Open Data Commons Public Domain Dedication and License (PDDL).