Commit 9afe256

update README (open-mmlab#914)
1 parent df24a04 commit 9afe256

File tree: 4 files changed, +86 -1 lines changed


README.md (+2)

@@ -96,6 +96,7 @@ Supported methods for Action Recognition:
  - [MultiModality: Audio](configs/recognition_audio/resnet/README.md) (ArXiv'2020)
  - [TANet](configs/recognition/tanet/README.md) (ArXiv'2020)
  - [TRN](configs/recognition/trn/README.md) (CVPR'2015)
+ - [PoseC3D](configs/skeleton/posec3d/README.md) (ArXiv'2021)

  </details>

@@ -115,6 +116,7 @@ Supported methods for Spatial Temporal Action Detection:
  <details open>
  <summary>(click to collapse)</summary>

+ - [ACRN](configs/detection/acrn/README.md) (ECCV'2018)
  - [SlowOnly+Fast R-CNN](configs/detection/ava/README.md) (ICCV'2019)
  - [SlowFast+Fast R-CNN](configs/detection/ava/README.md) (ICCV'2019)
  - [Long-Term Feature Bank](configs/detection/lfb/README.md) (CVPR'2019)

README_zh-CN.md (+2)

@@ -90,6 +90,7 @@ v0.15.0 was released on May 31, 2021; see the [changelog
  - [MultiModality: Audio](/configs/recognition_audio/resnet/README_zh-CN.md) (ArXiv'2020)
  - [TANet](/configs/recognition/tanet/README_zh-CN.md) (ArXiv'2020)
  - [TRN](/configs/recognition/trn/README_zh-CN.md) (CVPR'2015)
+ - [PoseC3D](configs/skeleton/posec3d/README.md) (ArXiv'2021)

  </details>

@@ -109,6 +110,7 @@ v0.15.0 was released on May 31, 2021; see the [changelog
  <details open>
  <summary>(click to collapse)</summary>

+ - [ACRN](configs/detection/acrn/README_zh-CN.md) (ECCV'2018)
  - [SlowOnly+Fast R-CNN](/configs/detection/ava/README_zh-CN.md) (ICCV'2019)
  - [SlowFast+Fast R-CNN](/configs/detection/ava/README_zh-CN.md) (ICCV'2019)
  - [Long-Term Feature Bank](/configs/detection/lfb/README_zh-CN.md) (CVPR'2019)
configs/detection/acrn/README_zh-CN.md (+81, new file)

@@ -0,0 +1,81 @@
# ACRN

## Introduction

<!-- [DATASET] -->

```BibTeX
@inproceedings{gu2018ava,
  title={Ava: A video dataset of spatio-temporally localized atomic visual actions},
  author={Gu, Chunhui and Sun, Chen and Ross, David A and Vondrick, Carl and Pantofaru, Caroline and Li, Yeqing and Vijayanarasimhan, Sudheendra and Toderici, George and Ricco, Susanna and Sukthankar, Rahul and others},
  booktitle={Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition},
  pages={6047--6056},
  year={2018}
}
```

<!-- [ALGORITHM] -->

```BibTeX
@inproceedings{sun2018actor,
  title={Actor-centric relation network},
  author={Sun, Chen and Shrivastava, Abhinav and Vondrick, Carl and Murphy, Kevin and Sukthankar, Rahul and Schmid, Cordelia},
  booktitle={Proceedings of the European Conference on Computer Vision (ECCV)},
  pages={318--334},
  year={2018}
}
```

## Model Zoo

### AVA2.1

| Config | Modality | Pretrain | Backbone | Input | GPUs | mAP | log | json | ckpt |
| :-: | :-: | :-: | :-: | :-: | :-: | :-: | :-: | :-: | :-: |
| [slowfast_acrn_kinetics_pretrained_r50_8x8x1_cosine_10e_ava_rgb](/configs/detection/acrn/slowfast_acrn_kinetics_pretrained_r50_8x8x1_cosine_10e_ava_rgb.py) | RGB | Kinetics-400 | ResNet50 | 32x2 | 8 | 27.1 | [log](https://download.openmmlab.com/mmaction/detection/acrn/slowfast_acrn_kinetics_pretrained_r50_8x8x1_cosine_10e_ava_rgb/slowfast_acrn_kinetics_pretrained_r50_8x8x1_cosine_10e_ava_rgb.log) | [json](https://download.openmmlab.com/mmaction/detection/acrn/slowfast_acrn_kinetics_pretrained_r50_8x8x1_cosine_10e_ava_rgb/slowfast_acrn_kinetics_pretrained_r50_8x8x1_cosine_10e_ava_rgb.json) | [ckpt](https://download.openmmlab.com/mmaction/detection/acrn/slowfast_acrn_kinetics_pretrained_r50_8x8x1_cosine_10e_ava_rgb/slowfast_acrn_kinetics_pretrained_r50_8x8x1_cosine_10e_ava_rgb-49b07bf2.pth) |

### AVA2.2

| Config | Modality | Pretrain | Backbone | Input | GPUs | mAP | log | json | ckpt |
| :-: | :-: | :-: | :-: | :-: | :-: | :-: | :-: | :-: | :-: |
| [slowfast_acrn_kinetics_pretrained_r50_8x8x1_cosine_10e_ava22_rgb](/configs/detection/acrn/slowfast_acrn_kinetics_pretrained_r50_8x8x1_cosine_10e_ava22_rgb.py) | RGB | Kinetics-400 | ResNet50 | 32x2 | 8 | 27.8 | [log](https://download.openmmlab.com/mmaction/detection/acrn/slowfast_acrn_kinetics_pretrained_r50_8x8x1_cosine_10e_ava22_rgb/slowfast_acrn_kinetics_pretrained_r50_8x8x1_cosine_10e_ava22_rgb.log) | [json](https://download.openmmlab.com/mmaction/detection/acrn/slowfast_acrn_kinetics_pretrained_r50_8x8x1_cosine_10e_ava22_rgb/slowfast_acrn_kinetics_pretrained_r50_8x8x1_cosine_10e_ava22_rgb.json) | [ckpt](https://download.openmmlab.com/mmaction/detection/acrn/slowfast_acrn_kinetics_pretrained_r50_8x8x1_cosine_10e_ava22_rgb/slowfast_acrn_kinetics_pretrained_r50_8x8x1_cosine_10e_ava22_rgb-2be32625.pth) |

Notes:

1. The **GPUs** column is the number of GPUs used to obtain the checkpoint; by default, the configs provided by MMAction2 assume training on 8 GPUs.
   Following the [linear scaling rule](https://arxiv.org/abs/1706.02677), if you train with a different number of GPUs or a different number of videos per GPU, scale the learning rate proportionally to the total batch size,
   e.g. lr=0.01 for 4 GPUs x 2 videos/GPU and lr=0.08 for 16 GPUs x 4 videos/GPU (a quick calculation is sketched below).
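The scaling is plain proportionality, so a one-liner is enough to sanity-check a setting before editing a config. This is only an illustrative sketch, not an MMAction2 tool; it assumes the reference point stated in the note above (lr=0.01 at 4 GPUs x 2 videos/GPU, i.e. a total batch size of 8).

```shell
# Illustration only (not part of MMAction2): scale lr linearly with the total batch size.
# Assumed reference point from the note above: lr=0.01 at 4 GPUs x 2 videos/GPU (batch size 8).
NUM_GPUS=16
VIDEOS_PER_GPU=4
awk -v g="$NUM_GPUS" -v v="$VIDEOS_PER_GPU" \
    'BEGIN { printf "scaled lr = %.3f\n", 0.01 * g * v / 8 }'
# -> scaled lr = 0.080, matching the 16 GPUs x 4 videos/GPU example in the note.
```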
For details on data preparation, please refer to [Data Preparation](/docs_zh_CN/data_preparation.md).

## How to train

You can use the following command to train a model:

```shell
python tools/train.py ${CONFIG_FILE} [optional arguments]
```

Example: train ACRN with a SlowFast backbone on AVA, with periodic validation:

```shell
python tools/train.py configs/detection/acrn/slowfast_acrn_kinetics_pretrained_r50_8x8x1_cosine_10e_ava22_rgb.py --validate
```

For more training details, refer to the **Training setting** section of the [getting started guide](/docs_zh_CN/getting_started.md#训练配置).
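Since the provided configs assume 8 GPUs, multi-GPU training is the usual setup. The sketch below assumes this checkout ships the standard OpenMMLab launcher `tools/dist_train.sh`, as other OpenMMLab repositories do; verify the script exists before relying on it.

```shell
# Hedged sketch: distributed training on 8 GPUs with periodic validation,
# assuming the usual OpenMMLab tools/dist_train.sh wrapper is present in this repo.
bash tools/dist_train.sh \
    configs/detection/acrn/slowfast_acrn_kinetics_pretrained_r50_8x8x1_cosine_10e_ava22_rgb.py \
    8 --validate
```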
## How to test

You can use the following command to test a model:

```shell
python tools/test.py ${CONFIG_FILE} ${CHECKPOINT_FILE} [optional arguments]
```

Example: test ACRN with a SlowFast backbone on AVA and dump the results to a csv file:

```shell
python tools/test.py configs/detection/acrn/slowfast_acrn_kinetics_pretrained_r50_8x8x1_cosine_10e_ava22_rgb.py checkpoints/SOME_CHECKPOINT.pth --eval mAP --out results.csv
```

For more testing details, refer to the **Test a dataset** section of the [getting started guide](/docs_zh_CN/getting_started.md#测试某个数据集).
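Testing can likewise be spread across several GPUs. As above, this assumes the standard OpenMMLab `tools/dist_test.sh` launcher with its usual signature (config, checkpoint, GPU count, then optional arguments); treat it as a sketch rather than a documented command for this repo.

```shell
# Hedged sketch: distributed testing on 8 GPUs, evaluating mAP and writing results to csv,
# assuming tools/dist_test.sh follows the usual OpenMMLab convention.
bash tools/dist_test.sh \
    configs/detection/acrn/slowfast_acrn_kinetics_pretrained_r50_8x8x1_cosine_10e_ava22_rgb.py \
    checkpoints/SOME_CHECKPOINT.pth 8 \
    --eval mAP --out results.csv
```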

configs/detection/ava/README_zh-CN.md (+1 -1)

@@ -72,7 +72,7 @@
  e.g. lr=0.01 for 4 GPUs x 2 videos/GPU and lr=0.08 for 16 GPUs x 4 videos/GPU.
  2. **Context** means that both RoI features and global features are used for classification, which brings about 1% extra mAP.

- For details on data preparation, please refer to [Data Preparation](/docs_zh-CN/data_preparation.md)
+ For details on data preparation, please refer to [Data Preparation](/docs_zh_CN/data_preparation.md)

  ## How to train
