STAC Overflow: Map Floodwater from Radar Imagery

Code for the STAC Overflow competition on DrivenData.

Originally I ranked 12th with a single-model solution. Later I worked on this task even further and implemented an ensemble model.

Artifacts

data exploration notebook
data augmentation notebook
Weights & Biases report on k-fold cross-validation:

Local installation for development

pip install -e .[dev]

python -m pretty_errors -s -p

Data splitting for train/validation

DATASTORE="$HOME/datastore/stac-overflow"

VALIDATION=("_" "awc,ayt,hxu,tnp" "ayt,hxu,pxs,qus" "coz,hxu,jja,tht" "coz,kuo,tht,wvy" "hbe,hxu,kuo,qus")

for VAL in $VALIDATION ; do

python stac_overflow/split_images_by_flood_ids.py \
  --metadata_csv="$DATASTORE/flood-training-metadata.csv" \
  --validation_flood_ids="$VAL" \
  --features_dir="$DATASTORE/train_features" \
  --labels_dir="$DATASTORE/train_labels" \
  --destination_dir="$DATASTORE/tfrecords" \
  --log_dir="$DATASTORE/tfrecords_logs" \
  --alsologtostderr \
  -- \
  --runner=DirectRunner \
  --direct_running_mode=multi_processing \
  --direct_num_workers=0

done

Local training with k-fold cross-validation

DATASTORE="$HOME/datastore/stac-overflow"

VALIDATION=("awc,ayt,hxu,tnp" "ayt,hxu,pxs,qus" "coz,hxu,jja,tht" "coz,kuo,tht,wvy" "hbe,hxu,kuo,qus")

for VAL in $VALIDATION ; do

python stac_overflow/train_segmentation_model.py \
  --train_tfrecords="$DATASTORE/tfrecords/$VAL/train-*.tfrecords" \
  --validation_tfrecords="$DATASTORE/tfrecords/$VAL/validation-*.tfrecords" \
  --tfrecords_geo_channel_keys="vv,vh,nasadem,jrc_extent:255:0,jrc_occurrence:255:0,jrc_recurrence:255:0,jrc_seasonality:255:0,jrc_transitions:255:0" \
  --img_height=512 \
  --img_width=512 \
  --models_dir="$DATASTORE/models" \
  --network_type="unet" \
  --backbone="seresnet152" \
  --num_replicas=1 \
  --batch_size_per_replica=4 \
  --train_steps_per_epoch=5 \
  --num_epochs=5 \
  --data_augmentation \
  --wandb_project=stac-overflow \
  --wandb_group=local \
  --wandb_mode=online

done

Grid.ai training with k-fold cross-validation

Check machine specs and hourly rates here.

TODO(stefanistrate): Remove dummy true values from the boolean flags below after this Grid.ai bug is fixed.

VALIDATION=("awc,ayt,hxu,tnp" "ayt,hxu,pxs,qus" "coz,hxu,jja,tht" "coz,kuo,tht,wvy" "hbe,hxu,kuo,qus")

for VAL in $VALIDATION ; do

grid run \
  --instance_type=g4dn.xlarge \
  --cpus=3 \
  --gpus=1 \
  --dockerfile=Dockerfile \
  stac_overflow/train_segmentation_model.py \
  --root_tfrecords="grid:stac-overflow:1" \
  --train_tfrecords="$VAL/train-*.tfrecords" \
  --validation_tfrecords="$VAL/validation-*.tfrecords" \
  --tfrecords_geo_channel_keys="vv,vh,nasadem,jrc_extent:255:0,jrc_occurrence:255:0,jrc_recurrence:255:0,jrc_seasonality:255:0,jrc_transitions:255:0" \
  --img_height=512 \
  --img_width=512 \
  --models_dir="models" \
  --network_type="unet" \
  --backbone="seresnet152" \
  --num_replicas=1 \
  --batch_size_per_replica=4 \
  --train_steps_per_epoch=100 \
  --num_epochs=50 \
  --data_augmentation=true \
  --noearly_stopping=true \
  --noprogress_bar=true \
  --redirect_logs=true \
  --wandb_api_key="$WANDB_API_KEY" \
  --wandb_project=stac-overflow \
  --wandb_group=unet_pc_rewrite_255s \
  --wandb_mode=online

done

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
.github/workflows		.github/workflows
stac_overflow		stac_overflow
.gitignore		.gitignore
.pylintrc		.pylintrc
.style.yapf		.style.yapf
Dockerfile		Dockerfile
LICENSE.md		LICENSE.md
README.md		README.md
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
setup.cfg		setup.cfg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

STAC Overflow: Map Floodwater from Radar Imagery

Artifacts

Local installation for development

Data splitting for train/validation

Local training with k-fold cross-validation

Grid.ai training with k-fold cross-validation

About

Releases

Packages

Languages

License

stefanistrate/drivendata-stac-overflow

Folders and files

Latest commit

History

Repository files navigation

STAC Overflow: Map Floodwater from Radar Imagery

Artifacts

Local installation for development

Data splitting for train/validation

Local training with k-fold cross-validation

Grid.ai training with k-fold cross-validation

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages