This repository provides a simple toolkit to train baseline models and generate submission files for the Hackathon on Water Scarcity 2025. The baseline models predict water discharge for the 52 stations of eval dataset. You are free to experiment with different modeling approaches or use distinct models per station, as long as your submission file adheres to the required format (see Codabench guidelines).
- Download:
Obtain the dataset from Zenodo. - Setup:
Unzip the dataset and place it in the root directory of the repository.
-
Preprocessing
- 01 - Data Preprocessing
- 02 - Feature Engineering
-
Training and Submission
- 03 - Modelisation
- 01 - Prediction Computation
-
Exploration
- 01 - Performance Comparison
- 02 - Single Model Optimisation
After running the notebooks, create your submission file (data/evaluation/predictions.zip) and upload it to Codabench.
-
Python Version This repository was developed and tested using Python 3.12.6. For best compatibility, please ensure you are using Python 3.12 or a newer version.
-
Create your venv - Mac version
# Create a virtual environment
python3 -m venv .venv
# Activate the virtual environment
source .venv/bin/activate
# Upgrade pip if needed
pip install --upgrade pip
# Install required packages
pip install -r requirements.txt
# Add the current directory to PYTHONPATH
export PYTHONPATH="$PYTHONPATH:$PWD"