A bird audio identification and quality control tool, designed to analyze recordings and generate a list of detected bird species using an AI model from the University of Jyväskylä's Muuttolintujen kevät project.
Built with TensorFlow, Python and Docker. The tool is a work in progress and currently in a preliminary stage.
- Analyzes audio recordings (WAV, MP3, FLAC) to detect bird species, either locally or via API
- Files are divided into smaller chunks (controlled by the `chunk_size` parameter), because the model has a limit on the input size. These chunks are then divided into segments of `clip_dur` seconds (currently fixed to 3 seconds), and the `overlap` parameter defines how much each segment overlaps with the next one.
- Uses species distribution and temporal modeling to improve detection accuracy
- Handles batch processing of multiple audio files
- Generates reports with species statistics and sample audio clips to help verify the results
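To make the chunk/segment scheme above concrete, here is a minimal sketch of how segment start times within one chunk could be computed. The function name and signature are illustrative, not the tool's actual API; it only assumes the behavior described above (fixed-length clips, overlap = how much consecutive segments share).

```python
def segment_starts(chunk_dur: float, clip_dur: float = 3.0, overlap: float = 1.0):
    """Return start times (seconds) of clip_dur-second segments within one chunk.

    Consecutive segments overlap by `overlap` seconds, so the hop size
    between segment starts is clip_dur - overlap.
    """
    hop = clip_dur - overlap
    if hop <= 0:
        raise ValueError("overlap must be smaller than clip_dur")
    starts = []
    t = 0.0
    while t + clip_dur <= chunk_dur:
        starts.append(t)
        t += hop
    return starts

# A 10-second chunk with 3 s clips and 1 s overlap yields segments
# starting every 2 seconds:
print(segment_starts(10))  # [0.0, 2.0, 4.0, 6.0]
```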
- Docker
- Git
- The following AI models (available on request):
- BirdNET
- Muuttolintujen kevät
- `git clone`
- `cd bird-identification`
- Place the models (BirdNET and Muuttolintujen kevät) in the `/models` folder
- Build and start the container with `docker compose up --build`; stop it with `docker compose down`
- Access the running Docker container: `docker exec -ti bird-identification bash`
- Run the scripts, see below
Tests can be run from the host machine using Docker Compose:
`docker compose run --rm test`
Tests can also be run from within the container:
`docker exec -ti bird-identification bash`
`pytest /app/tests -v`
This analyzes audio files and generates tabular text files containing the identifications, one for each audio file.
- Place audio files in a folder under `/input`, for example `/input/my_backyard_2025-01`.
- Place a `metadata.yaml` file in the same folder. This contains information shared by all the files. Example format:
lat: 60.123
lon: 24.123
day_of_year: 152 # Note: this will be overridden if audio file names include a date
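The note above says a date embedded in the audio filename overrides `day_of_year` from `metadata.yaml`. A sketch of how that override could work, assuming a `YYYY-MM-DD` date somewhere in the filename (the tool's actual filename parsing may differ):

```python
import re
from datetime import date

def day_of_year_from_filename(filename: str):
    """Return the day of year parsed from a YYYY-MM-DD date in the filename, or None."""
    m = re.search(r"(\d{4})-(\d{2})-(\d{2})", filename)
    if not m:
        return None
    y, mo, d = map(int, m.groups())
    return date(y, mo, d).timetuple().tm_yday

# Fall back to the metadata value when the filename carries no date:
metadata_day = 152
parsed = day_of_year_from_filename("my_backyard_2025-06-01.wav")
day_of_year = parsed if parsed is not None else metadata_day
print(day_of_year)  # 2025-06-01 is day 152
```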
- Run the script with `python main.py --dir <subfolder>`
- Optional parameters:
  - `--thr`: Detection threshold as a decimal number between 0 and 1, default 0.5
  - `--noise`: Include noise in the output, default False
  - `--sdm`: Use the species distribution model to adjust confidence values, default False
  - `--skip`: Skip audio files that already have a corresponding result file, default False
  - `--overlap`: Overlap of the analyzed segments in seconds, default 1
  - `--chunk_size`: Audio files are cut into chunks for analysis; this defines the chunk size in seconds, default 600
- Expects that:
  - Audio filenames are in the format `[part1].[extension]`
  - The extension is `wav`, `mp3` or `flac`
- If classification stops with message "Killed", try restarting the Docker container. It's unclear what causes this issue.
- The model and/or classifier has limitations:
  - Chunks can't be too long: 10 minutes seems to work fine, 30 minutes is too long.
  - Overlap can't be too high: 1 second works fine, 2 seconds does not. A longer overlap leads to the "Killed" message.
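A back-of-the-envelope check makes the overlap limitation plausible: the number of 3-second segments per chunk grows quickly as the hop (`clip_dur - overlap`) shrinks. The numbers below assume the defaults described above (`clip_dur` = 3 s, `chunk_size` = 600 s); the helper is illustrative, not the tool's code.

```python
def segments_per_chunk(chunk_size: float, clip_dur: float = 3.0, overlap: float = 1.0) -> int:
    """Count how many clip_dur-second windows fit in one chunk at the given overlap."""
    hop = clip_dur - overlap
    if hop <= 0:
        raise ValueError("overlap must be smaller than clip_dur")
    return int((chunk_size - clip_dur) // hop) + 1

print(segments_per_chunk(600, overlap=1))  # 299 segments
print(segments_per_chunk(600, overlap=2))  # 598 segments, roughly double the work
```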
This reads tabular files containing species identifications and generates an HTML report with example audio clips for validation, plus statistics and charts of the species.
- First do the species identification, see above. You can also use BirdNET for the identifications (use the CSV export format).
- Validation report generation expects that:
  - Data files are in the same directory as the audio files and named in the format `[part1].[part2].results.csv`
  - Data files have the columns: `Start (s), End (s), Scientific name, Common name, Confidence, [Optional columns]`
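A minimal sketch of reading one results file with the column layout above and keeping rows at or above a confidence threshold. It uses only the standard library, with an inline CSV string standing in for a `.results.csv` file; the tool's own report code may work differently.

```python
import csv
import io

# Example data in the documented column layout (species rows are illustrative).
results_csv = """Start (s),End (s),Scientific name,Common name,Confidence
0,3,Turdus merula,Eurasian Blackbird,0.91
2,5,Parus major,Great Tit,0.32
"""

threshold = 0.5  # same role as the --thr parameter
with io.StringIO(results_csv) as f:
    rows = [r for r in csv.DictReader(f) if float(r["Confidence"]) >= threshold]

for r in rows:
    print(r["Scientific name"], r["Confidence"])  # Turdus merula 0.91
```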
- Run the script with `python main_report.py --dir <subfolder>`
- Optional parameters:
  - `--thr`: Detection threshold as a decimal number between 0 and 1, default 0.5
  - `--padding`: Padding in seconds for the example audio files, default 1
  - `--examples`: Number of example audio files to pick for each species, minimum 5, default 5
Submit data to the `/classify` endpoint. A bare minimum call with the mandatory `latitude` and `longitude` parameters looks like this:
curl -X POST "http://localhost:8000/classify?latitude=60.1699&longitude=24.9384" \
-H "accept: application/json" \
-H "Content-Type: multipart/form-data" \
-F "file=@<path_to_audio_file>"
Call with all parameters:
curl -X POST "http://localhost:8000/classify?latitude=60.1699&longitude=24.9384&threshold=0.5&include_sdm=True&include_noise=True&day_of_year=1&chunk_size=500&overlap=1" \
-H "accept: application/json" \
-H "Content-Type: multipart/form-data" \
-F "file=@<path_to_audio_file>"
- Day of year can be given as a parameter; if it is not, today's date is used.
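The query string from the all-parameters curl example can also be built programmatically. The sketch below uses only the Python standard library; the base URL and parameter names are taken from the examples above, and the actual audio file would still need to be sent in a multipart body.

```python
from urllib.parse import urlencode

# Parameters mirror the full curl example above.
params = {
    "latitude": 60.1699,
    "longitude": 24.9384,
    "threshold": 0.5,
    "include_sdm": True,
    "include_noise": True,
    "day_of_year": 1,
    "chunk_size": 500,
    "overlap": 1,
}
url = "http://localhost:8000/classify?" + urlencode(params)
print(url)
```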
- Analysis
- Include inference details in the analysis result file, or at least identify the inference file?
- Refactor to handle settings in a centralized way, so that adding new parameters is easier
- Add clip_dur as a parameter
- Include both sdm and non-sdm predictions in the output
- Add taxon MX codes to the output
- Check why comparison-audio files are sometimes split into 5, 6 or 7 segments
- Report
- If data from one day only, don't create date histogram
- Include inference metadata into the report, so that it can be shared independently. But what to do if there are multiple inference files?
- Species commonness: what percentage of observations from that area (±100 km) and time window (±10 days) are of this species
- Normalize x-axis for all temporal charts. Get first and last time from the original data when it's loaded?
- Histograms are not made for species with only a few detections. However, an image tag is generated for these on the result service. It would be elegant not to have broken image links, even though they are not visible to users.
- Misc
- Organizing the repos: continue with this repo and include the baim features. Then rethink whether this tool and the analysis tool (Bart) should be bundled together, and how to manage the web interface vs. the desktop app.
- Error handling when functions return None
- More unit testing
- Handle file paths in a more consistent way (directory path, file name, datetime from filename)