Skip to content

SPEECHCOG/COCO_lextest

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Word separability test for words in MS-COCO concepts (names of labeled visual objects)

Getting started

  • Run your model on the audio files at COCO_synth/ and extract embeddings as corresponding .txt files into /path_to/extracted/embeddings/. Embeddings can be one embedding per .wav or frame-level embeddings.

One embedding per wav: each .txt file should have one vector on the first row, float values separated by white spaces. Frame-level embeddings per wav: each .txt file should have one frame per row, embedding values as floats separated by white spaces.

  • Run evaluation software to get overall separability score.

Running from command line (tested on Narvi and Puhti clusters)

  1. Get a CPU node.

  2. Load MATLAB if not present. e.g.:

module load matlab

3a) For utterance-level embeddings, execute:

sh COCO_lextest.sh '/path_to/original/audios/' '/path_to/extracted/utt_level_embeddings/

3b) For frame-level embeddings, execute:

sh COCO_lextest.sh '/path_to/original/audios/' '/path_to/extracted/frame_level_embeddings/' 'full' 0

or for parallel computing (recommended if parfor available):

sh COCO_lextest.sh '/path_to/original/audios/' '/path_to/extracted/frame_level_embeddings/' 'full' 1

  1. Results will be written to output.txtin COCO_lextest main folder. If you want to specify different output folder for the results, give the path as the fifth argument, e.g.:

sh COCO_lextest.sh '/path_to/original/audios/' '/path_to/extracted/frame_level_embeddings/' 'full' 1 /path_to/output/folder/

By default, audio files are located in COCO_synth/ of this repository.

Running from MATLAB desktop

You can run the code as a normal MATLAB script by calling COCO_lextest.m directly (the same syntax as above).

Baseline replication with provided log-Mel example embeddings

In order to replicate baselines with log-Mel features, run:

sh COCO_lextest.sh 'COCO_synth/' 'demodata/COCO_embs_uttlevel/'

or

sh COCO_lextest.sh 'COCO_synth/' 'demodata/COCO_embs_frame/' 'full' 1

which should produce 0.208 and 0.568, respectively.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published