This directory contains the models that implement First Person Action Recognition using a ResNet34 + CAM and a ConvLSTM module as a spatial attention network, paired with a temporal network (again a ResNet34) that takes optical flow as input and extracts temporal features. Much of this work, as well as the code, is based on the paper: Swathikiran Sudhakaran and Oswald Lanz, "Attention is all we need: Nailing down object-centric attention for egocentric activity recognition", British Machine Vision Conference, 2018.
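The core idea of the spatial stream is to turn a Class Activation Map (CAM) into a spatial attention map that gates the backbone features before they reach the ConvLSTM. Below is a minimal, hedged sketch of that mechanism (function and argument names are illustrative and not the repo's API; it assumes the last-conv features and the classifier weights are already available):

```python
import torch
import torch.nn.functional as F

def cam_attention(feature_map, fc_weight):
    """Sketch of CAM-based spatial attention.
    feature_map: (B, C, H, W) last-conv features from the ResNet34 backbone.
    fc_weight:   (num_classes, C) weights of the final classifier layer."""
    B, C, H, W = feature_map.shape
    # Class scores from global-average-pooled features
    gap = feature_map.mean(dim=(2, 3))                      # (B, C)
    logits = gap @ fc_weight.t()                            # (B, num_classes)
    top_class = logits.argmax(dim=1)                        # (B,)
    # Class activation map of the predicted class
    cam = torch.einsum('bc,bchw->bhw', fc_weight[top_class], feature_map)
    # Normalise spatially into an attention map and gate the features
    attn = F.softmax(cam.view(B, -1), dim=1).view(B, 1, H, W)
    return feature_map * attn                               # attended features for the ConvLSTM
```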
Models:
- resnetMod.py : implementation of ResNet34
- MyConvLSTMCell.py : implementation of a ConvLSTM cell (a minimal cell sketch follows this list)
- objectAttentionModelConvLSTM.py : combines the ResNet34 and ConvLSTM implementations from the previous two files, introducing the CAM attention mechanism paired with the ConvLSTM module and a downstream classifier
- flow_resnet.py : implementation of ResNet34 that extracts temporal features from optical-flow data
- twoStreaModel.py : pairs the spatial and temporal networks into a single two-stream model
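For reference, a ConvLSTM cell is an LSTM cell whose gate computations are convolutions, so the hidden and cell states keep a spatial layout. The sketch below is a minimal, self-contained version under that assumption (it is not the repo's MyConvLSTMCell, just an illustration of the technique):

```python
import torch
import torch.nn as nn

class ConvLSTMCellSketch(nn.Module):
    """Minimal ConvLSTM cell: the four LSTM gates are computed by a single
    convolution over the concatenation of the input and the hidden state."""
    def __init__(self, in_channels, hidden_channels, kernel_size=3):
        super().__init__()
        self.hidden_channels = hidden_channels
        self.gates = nn.Conv2d(in_channels + hidden_channels,
                               4 * hidden_channels, kernel_size,
                               padding=kernel_size // 2)

    def forward(self, x, state=None):
        if state is None:
            zeros = x.new_zeros(x.size(0), self.hidden_channels, x.size(2), x.size(3))
            state = (zeros, zeros)
        h, c = state
        # Split the convolution output into input, forget, output and candidate gates
        i, f, o, g = self.gates(torch.cat([x, h], dim=1)).chunk(4, dim=1)
        i, f, o, g = i.sigmoid(), f.sigmoid(), o.sigmoid(), g.tanh()
        c = f * c + i * g
        h = o * c.tanh()
        return h, c
```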
Transformations:
- spatial_transforms.py : implementations of several of PyTorch's transformations, adapted to work consistently on both RGB and optical-flow frames
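One subtlety when sharing spatial transforms across modalities is that a horizontal flip must also invert the horizontal flow component, otherwise the flipped motion field no longer matches the flipped image. A hedged sketch of such a paired transform (class and argument names are illustrative, and it assumes flow is stored as signed displacement tensors):

```python
import random
import torch

class RandomHorizontalFlipRGBFlow:
    """Flip RGB and flow tensors together; the horizontal (x) flow channel
    is also negated so the mirrored motion field stays consistent."""
    def __init__(self, p=0.5):
        self.p = p

    def __call__(self, rgb, flow_x, flow_y):
        # rgb: (3, H, W); flow_x, flow_y: (1, H, W) tensors
        if random.random() < self.p:
            rgb = torch.flip(rgb, dims=[-1])
            flow_x = -torch.flip(flow_x, dims=[-1])   # mirror and invert x motion
            flow_y = torch.flip(flow_y, dims=[-1])
        return rgb, flow_x, flow_y
```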
Dataset:
- gtea_dataset.py : provides convenient access to the GTEA61 dataset. Contains one class for the RGB frames, one for the optical-flow frames, and one wrapper around the other two that returns both
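The wrapper pattern can be summarised as below. This is a hypothetical sketch, not the repo's class names: it simply assumes two index-aligned datasets that yield (frames, label) pairs for the same clips.

```python
from torch.utils.data import Dataset

class RGBFlowWrapperSketch(Dataset):
    """Pairs an RGB-frame dataset with a flow-frame dataset so a single
    __getitem__ call returns both modalities and the label."""
    def __init__(self, rgb_dataset, flow_dataset):
        assert len(rgb_dataset) == len(flow_dataset)
        self.rgb_dataset = rgb_dataset
        self.flow_dataset = flow_dataset

    def __len__(self):
        return len(self.rgb_dataset)

    def __getitem__(self, idx):
        rgb_frames, label = self.rgb_dataset[idx]
        flow_frames, _ = self.flow_dataset[idx]     # same clip, same label
        return rgb_frames, flow_frames, label
```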
Training:
- train_pipeline.ipynb : Jupyter notebook with all the training steps:
  - Stage 1: training of the ConvLSTM module and its downstream classifier
  - Stage 2: joint training of the ConvLSTM, the classifier and the CAM layer
  - Stage 3: separate training of the temporal network (flow_resnet)
  - Stage 4: joint fine-tuning of the two networks
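The staged training boils down to freezing and unfreezing subsets of parameters between stages. The sketch below shows the general idea; attribute names such as `conv_lstm`, `classifier` and `backbone.layer4` are hypothetical placeholders, not the repo's actual module names:

```python
import torch

def trainable_params(model, train_cam_layers=False):
    """Stage 1: train only the ConvLSTM and classifier.
    Stage 2: additionally unfreeze the layers involved in the CAM attention."""
    for p in model.parameters():
        p.requires_grad = False
    for module in (model.conv_lstm, model.classifier):      # hypothetical attributes
        for p in module.parameters():
            p.requires_grad = True
    if train_cam_layers:
        for p in model.backbone.layer4.parameters():         # hypothetical attribute
            p.requires_grad = True
    return [p for p in model.parameters() if p.requires_grad]

# Example: optimizer over the stage-2 trainable parameters
# optimizer = torch.optim.Adam(trainable_params(spatial_model, train_cam_layers=True), lr=1e-4)
```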