
CUDA Illegal memory access was encountered #1941

Closed
neurosynapse opened this issue Aug 19, 2022 · 8 comments
Comments

@neurosynapse

neurosynapse commented Aug 19, 2022

Hello,

I'm trying to test several different segmentation approaches on a custom dataset with three classes (background, object1, object2). In a lot of cases (for example sem_fpn, vit) I get "what(): CUDA error: an illegal memory access was encountered". I have tried the dataset with both reduce_zero_label=False and reduce_zero_label=True, with no change. It would be nice if you could help me with this.
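For reference, this is the kind of check I can run over the ground-truth masks; pixel values above num_classes - 1 (other than the 255 ignore index) would index out of bounds in the CUDA loss kernels. The directory path and num_classes below are placeholders for my setup:

```python
# Sanity check on the ground-truth masks: any pixel value >= num_classes
# (other than the 255 ignore index) can cause an out-of-bounds index in the
# CUDA loss kernels and surface as "illegal memory access".
# The directory path and num_classes are placeholders.
import glob
import numpy as np
from PIL import Image

num_classes = 3                              # background, object1, object2
mask_dir = "data/my_dataset/ann_dir/train"   # placeholder path

bad = set()
for path in glob.glob(f"{mask_dir}/*.png"):
    values = np.unique(np.array(Image.open(path)))
    bad.update(int(v) for v in values if v >= num_classes and v != 255)

print("unexpected label values:", sorted(bad) or "none")
```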

[screenshot attached]

Best regards,
Roberts

@sainivedh19pt

Hi @Franko9999, I'm trying the same kind of training with 3 classes (background, cls1, cls2), with both reduce_zero_label=False and reduce_zero_label=True. In both cases the training output is very bad; only the first class gets trained.

{"mode": "val", "epoch": 1300, "iter": 2, "lr": 0.00645, "aAcc": 0.6337, "mIoU": 0.2112, "mAcc": 0.3333, "IoU.background": 0.6337, "IoU.cat": 0.0, "IoU.dog": 0.0, "Acc.background": 1.0, "Acc.cat": 0.0, "Acc.dog": 0.0}

Not sure how to resolve this
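For reference, this is roughly what reduce_zero_label=True does to a mask in which 0 is a real background class, as I understand it (a small illustration, not the library's exact code):

```python
# Illustration (assumed semantics): with reduce_zero_label=True, label 0 is
# mapped to the ignore index 255 and the remaining labels are shifted down by
# one. If 0 is a real class (background) rather than "unlabelled", it is
# silently ignored during training and evaluation.
import numpy as np

mask = np.array([[0, 0, 1],
                 [1, 2, 2]], dtype=np.uint8)   # background, cat, dog

reduced = mask.copy()
reduced[mask == 0] = 255    # background becomes "ignore"
reduced[mask > 0] -= 1      # cat -> 0, dog -> 1

print(reduced)
# [[255 255   0]
#  [  0   1   1]]
```

If the dataset class and the pipeline disagree on this flag, the labels no longer line up with the declared CLASSES, which can produce exactly this kind of degenerate result.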

@neurosynapse
Author

Yes, the problem is that I have tried a lot of models (at least 50%) and the same problem persists for use cases with a 2- or 3-class dataset. Only the first class gets trained, or some weird errors appear. It would be nice if someone could look into it. Is that possible?

Best regards,
Roberts

@xiexinch
Collaborator

Hi, @Franko9999, @sainivedh19pt,
We would like to reproduce this error. If possible, please tell us what changes you have made to the code.

@xiexinch
Collaborator

xiexinch commented Aug 23, 2022

There were similar issues before: #270 and #1330.

@xiexinch xiexinch added awaiting response and removed WIP Work in process labels Aug 23, 2022
@neurosynapse
Author

Hi,

Thank you, I solved my problem. I had to change the masks so that pixels are labelled in the range (0, num_classes-1). However, I now experience the same problem as you, that is, only the background class gets trained.
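Something along these lines (a simplified sketch; the grayscale-value mapping and directory paths are placeholders):

```python
# Remap arbitrary mask pixel values (e.g. 0/127/255) to contiguous class
# indices 0..num_classes-1, which is what the loss expects.
# The value mapping and paths are placeholders.
import glob
import os
import numpy as np
from PIL import Image

value_to_index = {0: 0, 127: 1, 255: 2}   # background, object1, object2
src_dir = "data/my_dataset/ann_raw"       # original masks
dst_dir = "data/my_dataset/ann_dir"       # remapped masks used for training
os.makedirs(dst_dir, exist_ok=True)

for path in glob.glob(f"{src_dir}/*.png"):
    mask = np.array(Image.open(path))
    remapped = np.zeros_like(mask, dtype=np.uint8)
    for value, index in value_to_index.items():
        remapped[mask == value] = index
    Image.fromarray(remapped).save(os.path.join(dst_dir, os.path.basename(path)))
```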

[screenshot attached]

Best regards,
Roberts

@neurosynapse
Author

Is there some way to concentrate training on the non-background classes (or apply some weighting towards them rather than towards the background)?
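For example, would down-weighting the background class in the decode head's loss be the right approach? A config sketch of what I have in mind (the weights here are placeholders, not tuned values):

```python
# Sketch: MMSegmentation's CrossEntropyLoss accepts a class_weight list,
# so the background class can be down-weighted relative to the others.
model = dict(
    decode_head=dict(
        num_classes=3,
        loss_decode=dict(
            type='CrossEntropyLoss',
            use_sigmoid=False,
            loss_weight=1.0,
            class_weight=[0.5, 1.0, 1.0],   # background, object1, object2
        ),
    ),
)
```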

Best regards,
Roberts

@neurosynapse
Author

neurosynapse commented Aug 23, 2022

Hello,

Finally found the answer regarding model training. In my case, in the configs/_base_/datasets configuration file I had set dict(type='LoadAnnotations', reduce_zero_label=True). reduce_zero_label should be False, the same as in the mmseg/datasets/ file you create for your dataset.
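In other words, the two places have to agree. A sketch with illustrative file and class names (module paths as in MMSegmentation 0.x):

```python
# 1) configs/_base_/datasets/my_dataset.py  (illustrative file name)
train_pipeline = [
    dict(type='LoadImageFromFile'),
    dict(type='LoadAnnotations', reduce_zero_label=False),  # 0 is a real class
    # ... remaining transforms
]

# 2) mmseg/datasets/my_dataset.py  (illustrative file name)
from mmseg.datasets.builder import DATASETS
from mmseg.datasets.custom import CustomDataset

@DATASETS.register_module()
class MyDataset(CustomDataset):
    CLASSES = ('background', 'object1', 'object2')
    PALETTE = [[0, 0, 0], [128, 0, 0], [0, 128, 0]]

    def __init__(self, **kwargs):
        super().__init__(img_suffix='.jpg',
                         seg_map_suffix='.png',
                         reduce_zero_label=False,  # must match the pipeline
                         **kwargs)
```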

[screenshots of the two configuration files attached]

Best regards,
Roberts

@timothylimyl

The weird things about this error in this repo are that:

  1. It is a recent error for custom dataset training; I have not seen it before.
  2. Sometimes training works, but then the error pops up during validation (see the note below).
  3. Sometimes, without any code changes, training and validation will suddenly work.
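On point 2: CUDA kernel launches are asynchronous, so an illegal access in one op is often only reported at a later, unrelated call (for example during validation). Forcing synchronous launches usually makes the traceback point at the op that actually faulted; this is a standard PyTorch debugging step, not specific to this repo:

```python
# Force synchronous CUDA kernel launches so the error is raised where it
# occurs. Set this before torch/CUDA is initialised, e.g. at the top of
# tools/train.py, or export CUDA_LAUNCH_BLOCKING=1 in the shell instead.
import os
os.environ['CUDA_LAUNCH_BLOCKING'] = '1'
```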
