Developer Guidelines for BirdNet Pipeline

Current Configuration

Model: The BirdNet classifier currently used is kenya, stored in ml_public_models.
Scripts: The main processing scripts are analyze.py, segments.py, and archive_original_data_assets.py executed in multiple phases.
Environment: The pipeline uses the birdnet-process-env (version 9) with required dependencies.

Project-Specific Configuration

Each organization or project has a unique JSON configuration file that customizes the pipeline. When creating or updating a pipeline for a new project, ensure:

A new JSON config file is created.
Project-specific parameters (e.g., datastore paths, compute resources) are correctly referenced.

How to Modify the Pipeline

Changing the Model

Upload the new model/classifier to the ml_public_models datastore.
Update the model path in the pipeline parameters.

Updating the Script

Modify the required script (analyze.py, segments.py, archive_original_data_assets.py).
Ensure the script is correctly referenced in the pipeline configuration.

Changing Compute Resources

Go to the pipeline settings in AML Studio.
Update the compute parameter in the relevant pipeline steps.

Testing Changes

Always test pipeline modifications in a development environment before deploying to production.