Reproduction of paper "Can Spatiotemporal 3D CNNs Retrace the History of 2D CNNs and ImageNet?"

Reproduction based on PaddleVideo

一、简介

网络结构如图所示，本论文的目的是检验当前的视频数据集是否可以用于训练非常深的3D卷积网络。本项目的结构是ResNet18（如红框所示），目标是UCF101分类准确率达到42.4%。

论文：

[1] Kensho Hara, Hirokatsu Kataoka, Yutaka Satoh. Can Spatiotemporal 3D CNNs Retrace the History of 2D CNNs and ImageNet?

代码参考: -https://github.com/kenshohara/3D-ResNets-PyTorch/tree/CVPR2018

二、复现精度

在UCF101验证集上:

model	acc	log
3d-resnet	43.98%	log

三、数据集

UCF101

四、环境依赖

硬件：Tesla V100 * 1
框架：PaddlePaddle==2.2.1

五、快速开始

本项目基于PaddleVideo开发，使用方法和PaddleVideo完全一致
该项目提供了NoteBook版本，带有权重，可快速体验。

第一步：克隆本项目

git clone https://github.com/justld/3D-ResNets_paddle

安装第三方库

pip install -r requirements.txt

第二步：准备数据

参考UCF101数据准备

第三步：训练

训练需要一个小时左右，可以很快得到训练结果.
权重下载提取码：9exc

python3 main.py -c configs/recognition/resnet18_3d/resnet18_3d_ucf101_frames.yaml --validate --seed=10001

第四步：验证

python3 main.py -c configs/recognition/resnet18_3d/resnet18_3d_ucf101_frames.yaml --test -w {path to pdparams}

第五步：导出预测

先导出模型，然后预测

python3.7 tools/export_model.py -c configs/recognition/resnet18_3d/resnet18_3d_ucf101_frames.yaml -p {path to pdparams} -o inference/Res18_3D

python3.7 tools/predict.py --config configs/recognition/resnet18_3d/resnet18_3d_ucf101_frames.yaml --input_file {path to avi} --model_file inference/Res18_3D/Res18.pdmodel --params_file inference/Res18_3D/Res18.pdiparams --use_gpu=True --use_tensorrt=False

六、模型信息

信息	描述
作者	郎督
框架版本	PaddlePaddle==2.2.1
在线体验	notebook

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
applications		applications
configs		configs
data		data
images		images
log		log
paddlevideo		paddlevideo
tools		tools
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
README.md		README.md
__init__.py		__init__.py
main.py		main.py
requirements.txt		requirements.txt
run.sh		run.sh
setup.py		setup.py
ucf101.md		ucf101.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Reproduction of paper "Can Spatiotemporal 3D CNNs Retrace the History of 2D CNNs and ImageNet?"

Reproduction based on PaddleVideo

一、简介

二、复现精度

三、数据集

四、环境依赖

五、快速开始

第一步：克隆本项目

安装第三方库

第二步：准备数据

第三步：训练

第四步：验证

第五步：导出预测

六、模型信息

About

Releases

Packages

Languages

License

justld/3D-ResNets_paddle

Folders and files

Latest commit

History

Repository files navigation

Reproduction of paper "Can Spatiotemporal 3D CNNs Retrace the History of 2D CNNs and ImageNet?"

Reproduction based on PaddleVideo

一、简介

二、复现精度

三、数据集

四、环境依赖

五、快速开始

第一步：克隆本项目

安装第三方库

第二步：准备数据

第三步：训练

第四步：验证

第五步：导出预测

六、模型信息

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages