Skip to content

Latest commit

 

History

History
102 lines (61 loc) · 4.13 KB

README.md

File metadata and controls

102 lines (61 loc) · 4.13 KB

C10-Audio-Based-Interaction-Detection

Challenge

To participate and submit to this challenge, register at the EPIC-Sounds Audio-Based Interaction Detection Codalab Challenge. The labelled train/val annoations are available on the EPIC-Sounds annotations repo.

This repo is a modified version of the existing Action Detection Challenge.

NOTE: For this version of the challenge (version "0.1"), the class "background" (class_id=13) has been redacted from the test set. The evaluation code audio_based_interaction_detection.py will remove background labels from the evaluation.

Evaluation Code

This repository contains the official code to evaluate audio-based interaction detection methods on the EPIC-SOUNDS validation set. Parts of the evaluation code have been adapted from ActivityNet.

To use this code, move to the EvaluationCode directory:

cd EvaluationCode

Requirements

In order to use the evaluation code, you will need to install a few packages. You can install these requirements with:

pip install -r requirements.txt

Usage

You can use this evaluation code to evaluate submissions on the validation set in the official JSON format. To do so, you will need to first download the public EPIC-SOUNDS annotations with:

export PATH_TO_ANNOTATIONS=/desired/path/to/annotations
git clone https://github.com/epic-kitchens/epic-sounds-annotations.git $PATH_TO_ANNOTATIONS

You can then evaluate your json file with:

python audio_based_interaction_detection.py /path/to/json $PATH_TO_ANNOTATIONS/EPIC_Sounds_validation.pkl

Where /path/to/json is the path to the json file to be evaluated and /path/to/annotations is the path to the cloned epic-sounds-annotations repository.

Example json

As an example, we provide a json file generated with the baseline on the validation set. You can evaluate the json file with:

python audio_based_interaction_detection.py actionformer_baseline_validation.json $PATH_TO_ANNOTATIONS/EPIC_Sounds_validation.pkl

Audio-Based Interaction Detection Baseline

The baseline used for this challenge is ActionFormer.

In the following, we provide instructions to train/evaluate this baseline.

Baseline Requirements

We recommend using Anaconda. To install the necessary packages to run the baseline, follow the installation steps in the ActionFormer GitHub.

Features

We provide auditory slowfast features used train the baseline, which you can download here.

Save the features under the path baseline/data/epic_sounds/auditory_slowfast_features in this repository.

Model

You can download the model used to report baseline results here.

Training

You can train the model by moving to the baseline folder and running:

 python train.py configs/epic_sounds_auditory_slowfast.yaml --output reproduce

Validation

You can evaluate the model by moving to the baseline folder and running:

python eval.py configs/epic_sounds_auditory_slowfast.yaml <path_to_checkpoint>

Compute Test Detections

You can compute the test detections using the following command:

python eval.py --saveonly configs/epic_sounds_auditory_slowfast_test.yaml <path_to_checkpoint>

After running the command a .pkl containing the test detections will be generated. you can convert them to a JSON file using python create_json.py <PATH_TO_PKL> in the EvaluationCode folder. This .json file can then be evaluated using:

python audio_based_interaction_detection.py  test.json <PATH_TO_ANNOTIONS>