mindspore-lab
diff --git a/‎README.md
+1 b/‎README.md
+1
diff --git a/‎configs/yolov11/README.md
+82 b/‎configs/yolov11/README.md
+82
diff --git a/‎configs/yolov11/hyp.scratch.l.yaml
+45 b/‎configs/yolov11/hyp.scratch.l.yaml
+45
diff --git a/‎configs/yolov11/hyp.scratch.m.yaml
+45 b/‎configs/yolov11/hyp.scratch.m.yaml
+45
diff --git a/‎configs/yolov11/hyp.scratch.n.yaml
+39 b/‎configs/yolov11/hyp.scratch.n.yaml
+39
diff --git a/‎configs/yolov11/hyp.scratch.s.yaml
+45 b/‎configs/yolov11/hyp.scratch.s.yaml
+45
diff --git a/‎configs/yolov11/hyp.scratch.x.yaml
+45 b/‎configs/yolov11/hyp.scratch.x.yaml
+45
diff --git a/‎configs/yolov11/yolov11-base.yaml
+77 b/‎configs/yolov11/yolov11-base.yaml
+77
@@ -31,6 +31,7 @@ The following is the corresponding `mindyolo` versions and supported `mindspore`
 See [Benchmark Results](benchmark_results.md).
 
 ## supported model list
+- [x] [YOLOv11](configs/yolov11)
 - [x] [YOLOv10](configs/yolov10)
 - [x] [YOLOv9](configs/yolov9)
 - [x] [YOLOv8](configs/yolov8)
 
@@ -0,0 +1,82 @@
+# YOLOv11
+
+## Abstract
+Ultralytics YOLO11 is a cutting-edge, state-of-the-art (SOTA) model that builds upon the success of previous YOLO versions and introduces new features and improvements to further boost performance and flexibility. YOLO11 is designed to be fast, accurate, and easy to use, making it an excellent choice for a wide range of object detection and tracking, instance segmentation, image classification and pose estimation tasks.
+
+<div align=center>
+<img src="https://github.com/user-attachments/assets/10b2a1f7-b75c-40fe-8cc2-59e21c2d4d08"/>
+</div>
+
+## Requirements
+
+| mindspore | ascend driver | firmware     | cann toolkit/kernel |
+| :-------: | :-----------: | :----------: |:-------------------:|
+| 2.5.0     | 24.1.0      | 7.5.0.3.220  |   8.0.0.beta1     |
+
+## Quick Start
+
+Please refer to the [GETTING_STARTED](https://github.com/mindspore-lab/mindyolo/blob/master/GETTING_STARTED.md) in MindYOLO for details.
+
+### Training
+
+<details open>
+<summary><b>View More</b></summary>
+
+#### - Distributed Training
+
+It is easy to reproduce the reported results with the pre-defined training recipe. For distributed training on multiple Ascend 910 devices, please run
+```shell
+# distributed training on multiple Ascend devices
+msrun --worker_num=3 --local_worker_num=3 --bind_core=True --log_dir=./yolov11_log python train.py --config ./configs/yolov11/yolov11-n.yaml --device_target Ascend --is_parallel True
+```
+
+**Note:** For more information about msrun configuration, please refer to [here](https://www.mindspore.cn/tutorials/experts/zh-CN/r2.3.1/parallel/msrun_launcher.html).
+
+For detailed illustration of all hyper-parameters, please refer to [config.py](https://github.com/mindspore-lab/mindyolo/blob/master/mindyolo/utils/config.py).
+
+**Note:**  As the global batch size  (batch_size x num_devices) is an important hyper-parameter, it is recommended to keep the global batch size unchanged for reproduction or adjust the learning rate linearly to a new global batch size.
+
+#### - Standalone Training
+
+If you want to train or finetune the model on a smaller dataset without distributed training, please run:
+
+```shell
+# standalone training on a CPU/Ascend device
+python train.py --config ./configs/yolov11/yolov11-n.yaml --device_target Ascend
+```
+
+</details>
+
+### Validation and Test
+
+To validate the accuracy of the trained model, you can use `test.py` and parse the checkpoint path with `--weight`.
+
+```
+python test.py --config ./configs/yolov11/yolov11-n.yaml --device_target Ascend --weight /PATH/TO/WEIGHT.ckpt
+```
+
+## Performance
+
+
+### Detection
+
+Experiments are tested on Ascend 910* with mindspore 2.5.0 graph mode.
+
+|  model name  |  scale  | cards  | batch size | resolution |  jit level  | ms/step | img/s |  map  |            recipe            |                                                weight                                                |
+|  :--------:  |  :---:  |  :---: |   :---:    |   :---:    |    :---:    | :---: | :---:  |:-----:|            :---:             |:----------------------------------------------------------------------------------------------------:|
+|    YOLOv11    |    N    |    1   |     128     |  640x640   |     O2      | 383.78 | 333.52 | 39.2% |    [yaml](./yolov11-n.yaml)    | [weights](https://download.mindspore.cn/toolkits/mindyolo/yolov11/yolov11n_600e_MAP392-78fd292c.ckpt) |
+|    YOLOv11    |    S    |    1   |     128     |  640x640   |     O2      | 488.65 | 261.95 | 46.4% |    [yaml](./yolov11-s.yaml)    | [weights](https://download.mindspore.cn/toolkits/mindyolo/yolov11/yolov11s_600e_MAP464-26f6efa4.ckpt) |
+|    YOLOv11    |    M    |    1   |     108     |  640x640   |     O2      | 721.72 | 149.64 | 51.1% |    [yaml](./yolov11-m.yaml)    | [weights](https://download.mindspore.cn/toolkits/mindyolo/yolov11/yolov11m_600e_MAP511-94a7cf04.ckpt) |
+|    YOLOv11    |    L    |    2   |     64     |  640x640   |     O2      | 637.84 | 200.68 | 52.6% |    [yaml](./yolov11-l.yaml)    | [weights](https://download.mindspore.cn/toolkits/mindyolo/yolov11/yolov11l_600e_MAP526-48494760.ckpt) |
+|    YOLOv11    |    X    |    3   |     43     |  640x640   |     O2      | 622.68 | 207.17 | 54.2% |    [yaml](./yolov11-x.yaml)    | [weights](https://download.mindspore.cn/toolkits/mindyolo/yolov11/yolov11x_600e_MAP542-19131881.ckpt) |
+
+### Notes
+
+- map: Accuracy reported on the validation set.
+- When using 8 cards and 16 batch size for training, the total training time will be significantly reduced, but the accuracy may slightly decrease. Based on testing, the accuracy for both the n and x specifications has dropped by 0.3%.
+- We refer to the official [YOLOV11](https://github.com/ultralytics/ultralytics) to reproduce the P5 series model.
+
+## References
+
+<!--- Guideline: Citation format should follow GB/T 7714. -->
+[1] Jocher Glenn. Ultralytics YOLOv11. https://github.com/ultralytics/ultralytics, 2024.
@@ -0,0 +1,45 @@
+data:
+  num_parallel_workers: 16
+
+  # multi-stage data augment
+  train_transforms: {
+    stage_epochs: [ 590, 10 ],
+    trans_list: [
+      [
+        { func_name: mosaic, prob: 1.0 },
+        { func_name: copy_paste, prob: 0.5, sorted: True },
+        {func_name: resample_segments},
+        { func_name: random_perspective, prob: 1.0, degrees: 0.0, translate: 0.1, scale: 0.9, shear: 0.0 },
+        { func_name: mixup, alpha: 32.0, beta: 32.0, prob: 0.15, pre_transform: [
+          { func_name: mosaic, prob: 1.0 },
+          { func_name: copy_paste, prob: 0.5, sorted: True },
+          { func_name: resample_segments },
+          { func_name: random_perspective, prob: 1.0, degrees: 0.0, translate: 0.1, scale: 0.9, shear: 0.0 }, ]
+        },
+        {func_name: albumentations},
+        {func_name: hsv_augment, prob: 1.0, hgain: 0.015, sgain: 0.7, vgain: 0.4},
+        {func_name: fliplr, prob: 0.5},
+        {func_name: label_norm, xyxy2xywh_: True},
+        {func_name: label_pad, padding_size: 160, padding_value: -1},
+        {func_name: image_norm, scale: 255.},
+        {func_name: image_transpose, bgr2rgb: True, hwc2chw: True}
+      ],
+      [
+        {func_name: letterbox, scaleup: True},
+        {func_name: resample_segments},
+        {func_name: random_perspective, prob: 1.0, degrees: 0.0, translate: 0.1, scale: 0.9, shear: 0.0},
+        {func_name: albumentations},
+        {func_name: hsv_augment, prob: 1.0, hgain: 0.015, sgain: 0.7, vgain: 0.4},
+        {func_name: fliplr, prob: 0.5},
+        {func_name: label_norm, xyxy2xywh_: True},
+        {func_name: label_pad, padding_size: 160, padding_value: -1},
+        {func_name: image_norm, scale: 255.},
+        {func_name: image_transpose, bgr2rgb: True, hwc2chw: True}
+      ]]
+  }
+
+  test_transforms: [
+    {func_name: letterbox, scaleup: False, only_image: True},
+    {func_name: image_norm, scale: 255.},
+    {func_name: image_transpose, bgr2rgb: True, hwc2chw: True}
+  ]
@@ -0,0 +1,45 @@
+data:
+  num_parallel_workers: 16
+
+  # multi-stage data augment
+  train_transforms: {
+    stage_epochs: [ 590, 10 ],
+    trans_list: [
+      [
+        { func_name: mosaic, prob: 1.0 },
+        { func_name: copy_paste, prob: 0.4, sorted: True },
+        {func_name: resample_segments},
+        { func_name: random_perspective, prob: 1.0, degrees: 0.0, translate: 0.1, scale: 0.9, shear: 0.0 },
+        { func_name: mixup, alpha: 32.0, beta: 32.0, prob: 0.15, pre_transform: [
+          { func_name: mosaic, prob: 1.0 },
+          { func_name: copy_paste, prob: 0.4, sorted: True },
+          { func_name: resample_segments },
+          { func_name: random_perspective, prob: 1.0, degrees: 0.0, translate: 0.1, scale: 0.9, shear: 0.0 }, ]
+        },
+        {func_name: albumentations},
+        {func_name: hsv_augment, prob: 1.0, hgain: 0.015, sgain: 0.7, vgain: 0.4},
+        {func_name: fliplr, prob: 0.5},
+        {func_name: label_norm, xyxy2xywh_: True},
+        {func_name: label_pad, padding_size: 160, padding_value: -1},
+        {func_name: image_norm, scale: 255.},
+        {func_name: image_transpose, bgr2rgb: True, hwc2chw: True}
+      ],
+      [
+        {func_name: letterbox, scaleup: True},
+        {func_name: resample_segments},
+        {func_name: random_perspective, prob: 1.0, degrees: 0.0, translate: 0.1, scale: 0.9, shear: 0.0},
+        {func_name: albumentations},
+        {func_name: hsv_augment, prob: 1.0, hgain: 0.015, sgain: 0.7, vgain: 0.4},
+        {func_name: fliplr, prob: 0.5},
+        {func_name: label_norm, xyxy2xywh_: True},
+        {func_name: label_pad, padding_size: 160, padding_value: -1},
+        {func_name: image_norm, scale: 255.},
+        {func_name: image_transpose, bgr2rgb: True, hwc2chw: True}
+      ]]
+  }
+
+  test_transforms: [
+    {func_name: letterbox, scaleup: False, only_image: True},
+    {func_name: image_norm, scale: 255.},
+    {func_name: image_transpose, bgr2rgb: True, hwc2chw: True}
+  ]
@@ -0,0 +1,39 @@
+data:
+  num_parallel_workers: 16
+
+  # multi-stage data augment
+  train_transforms: {
+    stage_epochs: [ 590, 10 ],
+    trans_list: [
+      [
+        { func_name: mosaic, prob: 1.0 },
+        { func_name: copy_paste, prob: 0.1, sorted: True },
+        {func_name: resample_segments},
+        { func_name: random_perspective, prob: 1.0, degrees: 0.0, translate: 0.1, scale: 0.5, shear: 0.0 },
+        {func_name: albumentations},
+        {func_name: hsv_augment, prob: 1.0, hgain: 0.015, sgain: 0.7, vgain: 0.4},
+        {func_name: fliplr, prob: 0.5},
+        {func_name: label_norm, xyxy2xywh_: True},
+        {func_name: label_pad, padding_size: 160, padding_value: -1},
+        {func_name: image_norm, scale: 255.},
+        {func_name: image_transpose, bgr2rgb: True, hwc2chw: True}
+      ],
+      [
+        {func_name: letterbox, scaleup: True},
+        {func_name: resample_segments},
+        {func_name: random_perspective, prob: 1.0, degrees: 0.0, translate: 0.1, scale: 0.5, shear: 0.0},
+        {func_name: albumentations},
+        {func_name: hsv_augment, prob: 1.0, hgain: 0.015, sgain: 0.7, vgain: 0.4},
+        {func_name: fliplr, prob: 0.5},
+        {func_name: label_norm, xyxy2xywh_: True},
+        {func_name: label_pad, padding_size: 160, padding_value: -1},
+        {func_name: image_norm, scale: 255.},
+        {func_name: image_transpose, bgr2rgb: True, hwc2chw: True}
+      ]]
+  }
+
+  test_transforms: [
+    {func_name: letterbox, scaleup: False, only_image: True},
+    {func_name: image_norm, scale: 255.},
+    {func_name: image_transpose, bgr2rgb: True, hwc2chw: True}
+  ]
@@ -0,0 +1,45 @@
+data:
+  num_parallel_workers: 16
+
+  # multi-stage data augment
+  train_transforms: {
+    stage_epochs: [ 590, 10 ],
+    trans_list: [
+      [
+        { func_name: mosaic, prob: 1.0 },
+        { func_name: copy_paste, prob: 0.15, sorted: True },
+        {func_name: resample_segments},
+        { func_name: random_perspective, prob: 1.0, degrees: 0.0, translate: 0.1, scale: 0.9, shear: 0.0 },
+        { func_name: mixup, alpha: 32.0, beta: 32.0, prob: 0.05, pre_transform: [
+          { func_name: mosaic, prob: 1.0 },
+          { func_name: copy_paste, prob: 0.15, sorted: True },
+          { func_name: resample_segments },
+          { func_name: random_perspective, prob: 1.0, degrees: 0.0, translate: 0.1, scale: 0.9, shear: 0.0 }, ]
+        },
+        {func_name: albumentations},
+        {func_name: hsv_augment, prob: 1.0, hgain: 0.015, sgain: 0.7, vgain: 0.4},
+        {func_name: fliplr, prob: 0.5},
+        {func_name: label_norm, xyxy2xywh_: True},
+        {func_name: label_pad, padding_size: 160, padding_value: -1},
+        {func_name: image_norm, scale: 255.},
+        {func_name: image_transpose, bgr2rgb: True, hwc2chw: True}
+      ],
+      [
+        {func_name: letterbox, scaleup: True},
+        {func_name: resample_segments},
+        {func_name: random_perspective, prob: 1.0, degrees: 0.0, translate: 0.1, scale: 0.9, shear: 0.0},
+        {func_name: albumentations},
+        {func_name: hsv_augment, prob: 1.0, hgain: 0.015, sgain: 0.7, vgain: 0.4},
+        {func_name: fliplr, prob: 0.5},
+        {func_name: label_norm, xyxy2xywh_: True},
+        {func_name: label_pad, padding_size: 160, padding_value: -1},
+        {func_name: image_norm, scale: 255.},
+        {func_name: image_transpose, bgr2rgb: True, hwc2chw: True}
+      ]]
+  }
+
+  test_transforms: [
+    {func_name: letterbox, scaleup: False, only_image: True},
+    {func_name: image_norm, scale: 255.},
+    {func_name: image_transpose, bgr2rgb: True, hwc2chw: True}
+  ]
@@ -0,0 +1,45 @@
+data:
+  num_parallel_workers: 16
+
+  # multi-stage data augment
+  train_transforms: {
+    stage_epochs: [ 590, 10 ],
+    trans_list: [
+      [
+        { func_name: mosaic, prob: 1.0 },
+        { func_name: copy_paste, prob: 0.6, sorted: True },
+        {func_name: resample_segments},
+        { func_name: random_perspective, prob: 1.0, degrees: 0.0, translate: 0.1, scale: 0.9, shear: 0.0 },
+        { func_name: mixup, alpha: 32.0, beta: 32.0, prob: 0.2, pre_transform: [
+          { func_name: mosaic, prob: 1.0 },
+          { func_name: copy_paste, prob: 0.6, sorted: True },
+          { func_name: resample_segments },
+          { func_name: random_perspective, prob: 1.0, degrees: 0.0, translate: 0.1, scale: 0.9, shear: 0.0 }, ]
+        },
+        {func_name: albumentations},
+        {func_name: hsv_augment, prob: 1.0, hgain: 0.015, sgain: 0.7, vgain: 0.4},
+        {func_name: fliplr, prob: 0.5},
+        {func_name: label_norm, xyxy2xywh_: True},
+        {func_name: label_pad, padding_size: 160, padding_value: -1},
+        {func_name: image_norm, scale: 255.},
+        {func_name: image_transpose, bgr2rgb: True, hwc2chw: True}
+      ],
+      [
+        {func_name: letterbox, scaleup: True},
+        {func_name: resample_segments},
+        {func_name: random_perspective, prob: 1.0, degrees: 0.0, translate: 0.1, scale: 0.9, shear: 0.0},
+        {func_name: albumentations},
+        {func_name: hsv_augment, prob: 1.0, hgain: 0.015, sgain: 0.7, vgain: 0.4},
+        {func_name: fliplr, prob: 0.5},
+        {func_name: label_norm, xyxy2xywh_: True},
+        {func_name: label_pad, padding_size: 160, padding_value: -1},
+        {func_name: image_norm, scale: 255.},
+        {func_name: image_transpose, bgr2rgb: True, hwc2chw: True}
+      ]]
+  }
+
+  test_transforms: [
+    {func_name: letterbox, scaleup: False, only_image: True},
+    {func_name: image_norm, scale: 255.},
+    {func_name: image_transpose, bgr2rgb: True, hwc2chw: True}
+  ]
@@ -0,0 +1,77 @@
+epochs: 600  # total train epochs
+per_batch_size: 128
+img_size: 640
+iou_thres: 0.7
+conf_free: True
+clip_grad: True
+ms_loss_scaler: dynamic
+ms_loss_scaler_value: 65536.0
+overflow_still_update: False
+ms_amp_level: O2
+sync_bn: False
+anchor_base: False
+opencv_threads_num: 0  # opencv: disable threading optimizations
+
+optimizer:
+  optimizer: momentum
+  lr_init: 0.01  # initial learning rate (SGD=1E-2, Adam=1E-3)
+  momentum: 0.937  # SGD momentum/Adam beta1
+  nesterov: True  # update gradients with NAG(Nesterov Accelerated Gradient) algorithm
+  loss_scale: 1.0  # loss scale for optimizer
+  warmup_epochs: 3  # warmup epochs (fractions ok)
+  warmup_momentum: 0.8  # warmup initial momentum
+  warmup_bias_lr: 0.0  # warmup initial bias lr
+  min_warmup_step: 1000  # minimum warmup step
+  group_param: yolov8  # group param strategy
+  gp_weight_decay: 0.0005  # group param weight decay 5e-4
+  start_factor: 1.0
+  end_factor: 0.01
+
+loss:
+  name: YOLOv11Loss
+  box: 7.5  # box loss gain
+  cls: 0.5  # cls loss gain
+  dfl: 1.5  # dfl loss gain
+  reg_max: 16
+
+network:
+  model_name: yolov11
+  nc: 80  # number of classes
+  reg_max: 16
+
+  stride: [8, 16, 32]
+
+  # YOLOv8.0n backbone
+  backbone:
+    # [from, repeats, module, args]
+    - [-1, 1, ConvNormAct, [64, 3, 2]]  # 0-P1/2
+    - [-1, 1, ConvNormAct, [128, 3, 2]]  # 1-P2/4
+    - [-1, 2, C3k2, [256, False, 0.25]]
+    - [-1, 1, ConvNormAct, [256, 3, 2]]  # 3-P3/8
+    - [-1, 2, C3k2, [512, False, 0.25]]
+    - [-1, 1, ConvNormAct, [512, 3, 2]]  # 5-P4/16
+    - [-1, 2, C3k2, [512, True]]
+    - [-1, 1, ConvNormAct, [1024, 3, 2]]  # 7-P5/32
+    - [-1, 2, C3k2, [1024, True]]
+    - [-1, 1, SPPF, [1024, 5]]  # 9
+    - [-1, 2, C2PSA, [1024]] # 10
+
+  # YOLO11n head
+  head:
+    - [-1, 1, Upsample, [None, 2, 'nearest']]
+    - [[-1, 6], 1, Concat, [1]]  # cat backbone P4
+    - [-1, 2, C3k2, [512, False]]  # 13
+
+    - [-1, 1, Upsample, [None, 2, 'nearest']]
+    - [[-1, 4], 1, Concat, [1] ]  # cat backbone P3
+    - [-1, 2, C3k2, [256, False]]  # 16 (P3/8-small)
+
+    - [-1, 1, ConvNormAct, [256, 3, 2]]
+    - [[ -1, 13], 1, Concat, [1]]  # cat head P4
+    - [-1, 2, C3k2, [512, False]]  # 19 (P4/16-medium)
+
+    - [-1, 1, ConvNormAct, [512, 3, 2]]
+    - [[-1, 10], 1, Concat, [1]]  # cat head P5
+    - [-1, 2, C3k2, [1024, True]]  # 22 (P5/32-large)
+
+    - [[16, 19, 22], 1, YOLOv11Head, [nc, reg_max, stride]]  # Detect(P3, P4, P5)