Please see the project description, goals, methods, and results in the final report: Report
NOTE: This repo is based on the GitHub repo of the BMVC 2021 paper ASFormer: Transformer for Action Segmentation.
main.py
- for running an experiment.
model.py
- includes most of the code for model building and training.
train_efficient.py
- code used to train EfficientNet as part of the experiments described in our work.
In addition, this repo contains all the code needed to run the experiments described in our work, such as the new label definition for the "transition" classes, dimension reduction for the input features, and more.
Output videos can be downloaded from: https://drive.google.com/drive/folders/1S_tcUdrOZF3vKk1Ow9YVfAd2Jv9wyxcD?usp=sharing
In addition to the results described extensively in our work, this repo contains all the outputs of our experiments.
Each results directory provides the results for the relevant test set. The results for each video consist of its raw gesture-recognition output and 3 segmentation images describing the outputs of ASFormer (the third one is taken as the model's output). Each segmentation image also shows a graph of the model's certainty along the frames.
First, run data_modifier.py in order to save the newly defined labels for the feature extractor.
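Below is a minimal sketch of the kind of relabeling this step performs, assuming the transition class is assigned to frames around each gesture boundary; the function and label names are illustrative and are not the repo's actual identifiers.

```python
# Illustrative sketch only: the actual logic lives in data_modifier.py and may differ.
import numpy as np

TRANSITION_LABEL = -1  # hypothetical id reserved for the new "transition" class

def add_transition_labels(frame_labels, window=15):
    """Relabel frames near gesture boundaries as a separate transition class."""
    labels = np.asarray(frame_labels).copy()
    # Indices where the gesture id changes from one frame to the next.
    boundaries = np.where(labels[1:] != labels[:-1])[0] + 1
    for b in boundaries:
        lo, hi = max(0, b - window), min(len(labels), b + window)
        labels[lo:hi] = TRANSITION_LABEL
    return labels
```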
Then, use the command python train_efficient.py --FOLD {fold_num}, where fold_num can range from 0 to 4.
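For reference, here is a hedged sketch of what per-fold EfficientNet fine-tuning might look like with torchvision; the data loading, class count, and hyperparameters are assumptions, not the repo's actual values.

```python
# Sketch of per-fold EfficientNet fine-tuning (class count and data loading are
# placeholders; see train_efficient.py for the actual implementation).
import argparse
import torch
import torch.nn as nn
from torchvision import models

def build_model(num_classes):
    model = models.efficientnet_b0(weights="IMAGENET1K_V1")
    # Replace the ImageNet head with one matching the gesture label set.
    model.classifier[1] = nn.Linear(model.classifier[1].in_features, num_classes)
    return model

if __name__ == "__main__":
    parser = argparse.ArgumentParser()
    parser.add_argument("--FOLD", type=int, default=0, help="cross-validation fold, 0 to 4")
    args = parser.parse_args()

    device = "cuda" if torch.cuda.is_available() else "cpu"
    model = build_model(num_classes=6).to(device)  # 6 is a placeholder class count
    # ...load the frames belonging to fold args.FOLD and run a standard training loop...
```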
python feature_maker.py
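This step produces the per-frame features that are later fed to the segmentation model. A minimal sketch of such extraction with an EfficientNet backbone follows; checkpoint loading and the exact output layout are assumptions, not the repo's actual code.

```python
# Sketch of per-frame feature extraction with an EfficientNet backbone
# (checkpoint loading and output layout are assumptions; see feature_maker.py).
import torch
import torch.nn as nn
from torchvision import models

backbone = models.efficientnet_b0()
backbone.classifier = nn.Identity()  # drop the classification head, keep the 1280-d features
backbone.eval()

@torch.no_grad()
def extract_features(frames):
    """frames: tensor of shape (num_frames, 3, H, W), already resized and normalized."""
    return backbone(frames)  # -> (num_frames, 1280)
```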
python pca_maker.py --output_size {size} --split {split_num}, where size is the desired feature dimension and split_num is the fold number.
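A minimal sketch of the dimension-reduction step using scikit-learn PCA, fit on the training split only; the function signature and data layout are assumptions, see pca_maker.py for the actual implementation.

```python
# Sketch of PCA-based dimension reduction (data layout is an assumption).
from sklearn.decomposition import PCA

def reduce_features(train_feats, test_feats, output_size):
    """Fit PCA on the training split only, then project both splits to output_size dims."""
    pca = PCA(n_components=output_size)
    train_reduced = pca.fit_transform(train_feats)  # (num_train_frames, output_size)
    test_reduced = pca.transform(test_feats)        # (num_test_frames, output_size)
    return train_reduced, test_reduced
```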
python main.py --split {split_num} --hidden {hidden_dim}, where split_num is the fold number and hidden_dim is the desired feature-space size to use.
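For example, to train on fold 0 with a 128-dimensional feature space (the values here are only illustrative):
python main.py --split 0 --hidden 128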
For prediction, run python main.py --split {split_num} --hidden {hidden_dim} --action predict
Finally, python video_maker.py will run on 3 previously picked videos, based on the extracted results.
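As a rough illustration of what such a rendering step involves, here is a hedged OpenCV sketch that overlays a predicted label on every frame; the paths, label format, and codec are assumptions, not the repo's actual code.

```python
# Sketch of overlaying predicted gesture labels on a video with OpenCV
# (paths, label format, and codec are assumptions; see video_maker.py).
import cv2

def annotate_video(video_path, frame_labels, out_path):
    """Write a copy of the video with the predicted label drawn on each frame."""
    cap = cv2.VideoCapture(video_path)
    fps = cap.get(cv2.CAP_PROP_FPS)
    w = int(cap.get(cv2.CAP_PROP_FRAME_WIDTH))
    h = int(cap.get(cv2.CAP_PROP_FRAME_HEIGHT))
    writer = cv2.VideoWriter(out_path, cv2.VideoWriter_fourcc(*"mp4v"), fps, (w, h))

    for label in frame_labels:
        ok, frame = cap.read()
        if not ok:
            break
        cv2.putText(frame, str(label), (20, 40),
                    cv2.FONT_HERSHEY_SIMPLEX, 1.0, (0, 255, 0), 2)
        writer.write(frame)

    cap.release()
    writer.release()
```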