onnx export RuntimeError: Input type (torch.cuda.FloatTensor) and weight type (torch.FloatTensor) should be the same? #41
@jinfagang you should be able to export to onnx like this. Run this command from the /yolov5 directory. export PYTHONPATH="$PWD"
python models/onnx_export.py --weights ./weights/yolov5s.pt --img 640 640 --batch 1
Output should look like this. You might want to ...
%416 = Shape(%285)
%417 = Constant[value = <Scalar Tensor []>]()
%418 = Gather[axis = 0](%416, %417)
%419 = Shape(%285)
%420 = Constant[value = <Scalar Tensor []>]()
%421 = Gather[axis = 0](%419, %420)
%422 = Shape(%285)
%423 = Constant[value = <Scalar Tensor []>]()
%424 = Gather[axis = 0](%422, %423)
%427 = Unsqueeze[axes = [0]](%418)
%430 = Unsqueeze[axes = [0]](%421)
%431 = Unsqueeze[axes = [0]](%424)
%432 = Concat[axis = 0](%427, %439, %440, %430, %431)
%433 = Reshape(%285, %432)
%434 = Transpose[perm = [0, 1, 3, 4, 2]](%433)
return %output, %415, %434
}
Export complete. ONNX model saved to ./weights/yolov5s.onnx
View with https://github.com/lutzroeder/netron
@glenn-jocher It turns out my model was trained on GPU and serialized on the cuda device, so when the input is not on cuda it throws this error. However, when I force the input to cuda, I get the opposite error; it seems some code inside the model still uses CPU tensors. Is there a special reason for using a CPU tensor there?
@jinfagang onnx export should only be done when the model is on cpu. The netron image you show is correct. The boxes are part of the v5 architecture, they are not related to image augmentation during training. At the moment onnx export stops at the output features. This is an example P3 output (smallest boxes) for 3 anchors with a grid size 40x24. The 85 features are xywh, objectness, and 80 class confidences.
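As a rough illustration of that feature layout (the shape and slicing below are assumptions based on the description above, not code from the repo):
import torch
# Hypothetical P3 head output as described: batch 1, 3 anchors, 40x24 grid, 85 features
p3 = torch.rand(1, 3, 40, 24, 85)
xywh = p3[..., 0:4]   # box center x, y and width, height
obj = p3[..., 4:5]    # objectness score
cls = p3[..., 5:]     # 80 class confidences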
@jinfagang I ran into a cuda issue with an onnx export today, and pushed a fix 1e2cb6b for this. This may or may not solve your original issue.
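For reference, a minimal export sketch with everything kept on CPU, which sidesteps the cuda/cpu mismatch above. It assumes the checkpoint stores the nn.Module under a 'model' key (as yolov5 checkpoints typically do); adjust the loading and opset to your setup.
import torch
weights = './weights/yolov5s.pt'
ckpt = torch.load(weights, map_location='cpu')   # load the checkpoint onto CPU
model = ckpt['model'].float().eval()             # assumes the module is stored under 'model'
img = torch.zeros(1, 3, 640, 640)                # dummy CPU input: batch 1, 3x640x640
torch.onnx.export(model, img, weights.replace('.pt', '.onnx'),
                  opset_version=11, input_names=['images'], output_names=['output'])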
@glenn-jocher So the output is the same as yolov3 in your previous repo? I want to access the outputs and accelerate it with tensorrt.
@glenn-jocher Can the anchor decode process also be exported into onnx? That way it would be more end-to-end when transferring to other platforms for inference.
@jinfagang yes, this would be more useful. It is more complicated to implement though, especially if you want a clean onnx graph. We will try to add this in the future.
@glenn-jocher I ran a tiny experiment on this; it ends up involving a ScatterND op, which is hard to convert to other platforms. If we want to eliminate this op, the postprocess code (the Detect layer here) needs to be rewritten (only for export mode, in a more complicated way, but it can export and works perfectly).
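For reference, the ScatterND usually comes from in-place slice assignment in the decode (writes like y[..., 0:2] = ...). A rough, hypothetical export-mode rewrite that avoids it uses split and cat instead; grid, anchor_grid and stride below stand in for the Detect layer's buffers, and the decode math follows the v5-style formulas, so check it against the repo.
import torch
def decode_export(y, grid, anchor_grid, stride, nc=80):
    # y: sigmoid output of one head, shape (bs, na, ny, nx, 5 + nc)
    # In-place writes like y[..., 0:2] = ... export as ScatterND; split/cat avoids that.
    xy, wh, rest = y.split((2, 2, nc + 1), dim=-1)
    xy = (xy * 2.0 - 0.5 + grid) * stride      # decoded box centers
    wh = (wh * 2.0) ** 2 * anchor_grid         # decoded box sizes
    return torch.cat((xy, wh, rest), dim=-1)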
@jinfagang I also ran into this issue. I resolved it by converting the model to cuda and then saving the weights, and used those weights to convert to an onnx model, but I ran into some issues converting onnx to TensorRT. If you successfully converted the model to TensorRT please let me know how you did that.
@makaveli10 I already converted the model to onnx and ran inference on TensorRT. However, this involved some special operations different from what this repo does, and accordingly the TensorRT side needs some special handling. Overall, the TensorRT-accelerated speed is about 38ms at a 1280x768 input resolution; the performance is quite good. You can add my wechat:
@jinfagang great work! What is the speedup compared to using detect.py? What GPU are you using?
@glenn-jocher I am using a GTX 1080 Ti; the speed was tested on it. The measured time includes post-processing (from the engine forward pass to NMS and copying data back to the CPU, etc.). I think the speed is almost the same as the darknet yolov4 converted to tensorrt (I previously tested with 800x800 input). The speed can still be optimized by moving all post-processing into a CUDA kernel and using fp16 or int8 quantization.
@jinfagang amazing to see you got it running in such a short time. I'm able to convert the pth files to onnx format but I keep getting this error when I try to convert to tensorrt6:
@jinfagang I don't have an account on WeChat, nor do I have a friend who can verify a new account. Can you please share your code for running onnx inference on TensorRT somehow? I am getting incorrect outputs from the engine that I generated using the onnx model.
@makaveli10 mind sharing how you were able to generate an onnx model that worked with TensorRT? Also, which version of TRT did you use?
@aj-ames https://github.com/TrojanXu/yolov5-tensorrt
@makaveli10 thanks. I will update my findings here.
I use this project, but I encounter the same error when I try to convert to tensorrt6: If you have some pointers for me, I would really appreciate it.
When I convert onnx to tensorrt, I encounter the same error when trying to convert to tensorrt6: I use tensorrt 6.0 with onnx 1.5.0 or 1.6.0, and neither works.
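For anyone hitting these parse failures, printing the parser errors usually shows exactly which op (for example the ScatterND mentioned above) TensorRT 6 cannot handle. A rough sketch using the TensorRT Python API, treating the file name and builder settings as assumptions (the config API differs between TensorRT versions):
import tensorrt as trt
TRT_LOGGER = trt.Logger(trt.Logger.WARNING)
EXPLICIT_BATCH = 1 << int(trt.NetworkDefinitionCreationFlag.EXPLICIT_BATCH)
builder = trt.Builder(TRT_LOGGER)
network = builder.create_network(EXPLICIT_BATCH)
parser = trt.OnnxParser(network, TRT_LOGGER)
with open('yolov5s.onnx', 'rb') as f:
    if not parser.parse(f.read()):
        for i in range(parser.num_errors):
            print(parser.get_error(i))         # shows the node/op that failed to parse
    else:
        builder.max_workspace_size = 1 << 30   # 1 GB workspace (legacy builder attribute)
        builder.fp16_mode = True               # optional fp16 if the GPU supports it
        engine = builder.build_cuda_engine(network)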
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.
Running onnx export I got this error:
The weights were trained on GPU, and I converted both the model and the image to the cuda device, so why does this error still happen?