Skip to content

Latest commit

 

History

History
71 lines (51 loc) · 2.5 KB

ideas.md

File metadata and controls

71 lines (51 loc) · 2.5 KB

Plan

Dataset Creation

  • read train_metadata.csv
  • select only labels with good rating
  • slice clips into fragments with stride and length or cut based on detection model
  • randomly drop samples classes with too much data
  • "upsample" classes with too little data
  • coarsen coordinates
  • add date coarsened up to season (month?, 1/8 of y?)

  • read train_soundscape_labels.csv
  • add date coarsened date
  • add coarsened coordinates
  • assume rating is '5'
  • assume all labels are primary (?)

  • join datasets (short+long+external?)
  • treat secondary labels with lower value than 1 (?)
  • add folds

Training

  • use rating as sample weight (?)
  • use secondary labels with label value < 1 and linear activation (?)
  • do something about class imbalance
  • do augmentation with mixing of random fragments, (predict all labels mixed up)
  • try "Cosine Annealing Scheduler with warmup"

  • use sounscapes data for fine-tuning

Models

  • Baseline: ensemble of 2d and 1d convnets
  • Perceiver (?)

Prediction

  • Predictions close by in time of the same bird should be treaded with more confidence
  • Try LSTM that takes predicted stream and trains on soundscapes to correct predictions
  • Try different ensembling methods: averaging/sqrt(sum(squares))/voting

Ideas

Crazy Ideas

  • Create synthetic data with GANs

Links