Code Review 8/29/18 #46
base: empty
Conversation
Conflicts: l3embedding/train.py
Add basic functions to sample from audio and video files
Conflicts: l3embedding/train.py
Move script part of train.py to a different file
Add retry loop to opening video files
Add periodic model checkpoints to training
Custom read_video function
Fix model building configuration
Add augmentation for audio and images from L3 paper and add use of validation set
* Move ReLU _after_ batch norm * Use weight decay (assumed L2) * Set learning rate to 1e-4 * Allow for volume gain less than 0dB, as paper seems like it should allow for that
Add more changes to the model so that it is consistent with paper
Add parameter search code
…ue, generalize some gsheets things
…on data, and fix bug where only best parameter results are updated in the metrics history
train_metrics = compute_metrics(y_train, train_pred, num_classes=num_classes)

# Set up train and validation metrics
train_metrics = {
    'loss': metric_cb.train_loss[-1],
Since we're using early stopping, we should find the index of the lowest validation loss (as opposed to the last index) and report those metrics!
This will likely change some of the results...
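Roughly something like this (an untested sketch, assuming metric_cb keeps per-epoch lists of losses and accuracies as in this diff):

import numpy as np

# Report metrics from the epoch with the lowest validation loss,
# not from the final epoch
best_epoch = int(np.argmin(metric_cb.valid_loss))

train_metrics = {
    'loss': metric_cb.train_loss[best_epoch],
    'accuracy': metric_cb.train_acc[best_epoch],
}
valid_metrics = {
    'loss': metric_cb.valid_loss[best_epoch],
    'accuracy': metric_cb.valid_acc[best_epoch],
}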
Also, we need to make sure we're reloading the checkpoint with the best loss, since Keras just leaves the model in whatever state it was in when training finished.
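Something along these lines should work (just a sketch; the model, data variables, and checkpoint path are placeholders, not the PR's actual names):

from keras.callbacks import ModelCheckpoint

# Keep only the weights from the epoch with the lowest validation loss
best_weights_path = 'best_model.h5'  # placeholder path
checkpointer = ModelCheckpoint(best_weights_path, monitor='val_loss',
                               save_best_only=True, save_weights_only=True)

model.fit(X_train, y_train,
          validation_data=(X_valid, y_valid),
          epochs=num_epochs,
          callbacks=[checkpointer])

# Keras leaves the model in its last-epoch state, so restore the best
# weights before computing/reporting the final metrics
model.load_weights(best_weights_path)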
    'accuracy': metric_cb.train_acc[-1],
    'class_accuracy': train_metrics['class_accuracy'],
    'average_class_accuracy': train_metrics['average_class_accuracy']
}
We should also keep track of the training and validation accuracies over time, even if they're also saved in the history CSV.
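For example (a sketch, again assuming metric_cb keeps the per-epoch lists as above), we could stash the full curves next to the summary numbers:

# Store the per-epoch curves alongside the summary metrics
train_metrics['history'] = {
    'loss': list(metric_cb.train_loss),
    'accuracy': list(metric_cb.train_acc),
}
valid_metrics['history'] = {
    'loss': list(metric_cb.valid_loss),
    'accuracy': list(metric_cb.valid_acc),
}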
valid_metrics = {
    'loss': metric_cb.valid_loss[-1],
    'accuracy': metric_cb.valid_acc[-1],
}
Same as the two comments above.
    train_data_skf = train_data
    valid_data_skf = valid_data
else:
    splitter = StratifiedShuffleSplit(n_splits=1, test_size=valid_ratio)
Not a code comment, but we should make sure in the paper we specify that we do a stratified split
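For reference, the split boils down to something like this (sketch only; X, y, and valid_ratio stand in for the actual feature/label arrays and validation fraction used here):

from sklearn.model_selection import StratifiedShuffleSplit

# Single stratified shuffle split that preserves the class proportions
# in both the train and validation partitions
splitter = StratifiedShuffleSplit(n_splits=1, test_size=valid_ratio)
train_idx, valid_idx = next(splitter.split(X, y))
X_train, y_train = X[train_idx], y[train_idx]
X_valid, y_valid = X[valid_idx], y[valid_idx]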
if y.ndim == 2:
    y = np.argmax(y, axis=1)
if pred.ndim == 2:
    pred = np.argmax(pred, axis=1)
Convert y and pred to np.array just in case a list is passed in
This shouldn't actually have affected anything since we're calling arr.ndim, but it's a good precaution.
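i.e. something like this (sketch, in whatever scope y and pred are received):

import numpy as np

# Coerce to arrays so .ndim / np.argmax work even if lists are passed in
y = np.asarray(y)
pred = np.asarray(pred)

if y.ndim == 2:
    y = np.argmax(y, axis=1)
if pred.ndim == 2:
    pred = np.argmax(pred, axis=1)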
@@ -0,0 +1,128 @@
name: l3embedding |
We should update this
# 257 x 199 x 1
y_a = Spectrogram(n_dft=n_dft, n_hop=n_hop, power_spectrogram=1.0,  # n_win=n_win,
                  return_decibel_spectrogram=True, padding='valid')(x_a)
y_a = BatchNormalization()(y_a)
Take this BN out
Or leave it in, depending on what we decide
####
# INPUT
x_i = Input(shape=(224, 224, 3), dtype='float32')
y_i = BatchNormalization()(x_i)
Take this BN out
Or leave it in, depending on what we decide.
          self.best_valid_loss, self.best_train_acc, self.best_valid_acc]

update_experiment(self.service, self.spreadsheet_id, self.param_dict,
                  'R', 'AA', values, 'embedding')
This should be 'Z', though it doesn't seem to be breaking anything.
else:
    m, inputs, outputs = MODELS[model_type](num_gpus=gpus)

loss = 'binary_crossentropy'
The loss is the same as categorical_crossentropy, up to a constant factor: binary_crossentropy = 2 * categorical_crossentropy here. This shouldn't cause any serious optimization issues, but it effectively halves the weight decay constant.
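If we want to double-check the exact scaling, a quick sanity check like this (untested sketch, not part of the PR) would show the constant factor empirically for a two-class softmax output:

import numpy as np
from keras import backend as K
from keras.losses import binary_crossentropy, categorical_crossentropy

y_true = K.constant(np.array([[1., 0.], [0., 1.]]))
y_pred = K.constant(np.array([[0.8, 0.2], [0.3, 0.7]]))

# Per-sample losses under each objective, then their ratio
bce = K.eval(binary_crossentropy(y_true, y_pred))
cce = K.eval(categorical_crossentropy(y_true, y_pred))
print(bce, cce, bce / cce)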