GitHub

Word separability test for words in MS-COCO concepts (names of labeled visual objects)

Getting started

Run your model on the audio files at COCO_synth/ and extract embeddings as corresponding .txt files into /path_to/extracted/embeddings/. Embeddings can be one embedding per .wav or frame-level embeddings.

One embedding per wav: each .txt file should have one vector on the first row, float values separated by white spaces. Frame-level embeddings per wav: each .txt file should have one frame per row, embedding values as floats separated by white spaces.

Run evaluation software to get overall separability score.

Running from command line (tested on Narvi and Puhti clusters)

Get a CPU node.
Load MATLAB if not present. e.g.:

module load matlab

3a) For utterance-level embeddings, execute:

sh COCO_lextest.sh '/path_to/original/audios/' '/path_to/extracted/utt_level_embeddings/

3b) For frame-level embeddings, execute:

sh COCO_lextest.sh '/path_to/original/audios/' '/path_to/extracted/frame_level_embeddings/' 'full' 0

or for parallel computing (recommended if parfor available):

sh COCO_lextest.sh '/path_to/original/audios/' '/path_to/extracted/frame_level_embeddings/' 'full' 1

Results will be written to output.txtin COCO_lextest main folder. If you want to specify different output folder for the results, give the path as the fifth argument, e.g.:

sh COCO_lextest.sh '/path_to/original/audios/' '/path_to/extracted/frame_level_embeddings/' 'full' 1 /path_to/output/folder/

By default, audio files are located in COCO_synth/ of this repository.

Running from MATLAB desktop

You can run the code as a normal MATLAB script by calling COCO_lextest.m directly (the same syntax as above).

Baseline replication with provided log-Mel example embeddings

In order to replicate baselines with log-Mel features, run:

sh COCO_lextest.sh 'COCO_synth/' 'demodata/COCO_embs_uttlevel/'

or

sh COCO_lextest.sh 'COCO_synth/' 'demodata/COCO_embs_frame/' 'full' 1

which should produce 0.208 and 0.568, respectively.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
COCO_synth		COCO_synth
demodata		demodata
.gitignore		.gitignore
COCO_lextest.m		COCO_lextest.m
COCO_lextest.sh		COCO_lextest.sh
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Word separability test for words in MS-COCO concepts (names of labeled visual objects)

Getting started

Running from command line (tested on Narvi and Puhti clusters)

Running from MATLAB desktop

Baseline replication with provided log-Mel example embeddings

About

Releases

Packages

Languages

SPEECHCOG/COCO_lextest

Folders and files

Latest commit

History

Repository files navigation

Word separability test for words in MS-COCO concepts (names of labeled visual objects)

Getting started

Running from command line (tested on Narvi and Puhti clusters)

Running from MATLAB desktop

Baseline replication with provided log-Mel example embeddings

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages