Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

“died with <Signals.SIGSEGV: 11>.” when evaluate #56

Closed
ppbangKGT opened this issue Oct 19, 2022 · 4 comments
Closed

“died with <Signals.SIGSEGV: 11>.” when evaluate #56

ppbangKGT opened this issue Oct 19, 2022 · 4 comments

Comments

@ppbangKGT
Copy link

Hi, thanks for your nice work! When I learned your code and evaluate the validation dataset of openlane, there was an error:
subprocess.CalledProcessError: Command '['/home/pbw/anaconda3/envs/lanemd_torch18/bin/python3', '-u', 'main_persformer.py', '--local_rank=1', '--mod=PersFormer', '--batch_size=8']' died with <Signals.SIGSEGV: 11>.
Here is the screenshot:
image

@ilnehc
Copy link
Collaborator

ilnehc commented Oct 19, 2022

@ppbangKGT Maybe you could check https://github.com/OpenPerceptionX/PersFormer_3DLane/issues/16 if it is the same problem concerning of CUDA environment.

@ppbangKGT
Copy link
Author

@ppbangKGT Maybe you could check #16 if it is the same problem concerning of CUDA environment.

Actually, I don't meet the CUDA_HOME error. And if i run the code like #8 python main_persformer.py --mod=PersFormer --batch_size=1, there will be an error "Segmentation fault".
image

@ilnehc
Copy link
Collaborator

ilnehc commented Oct 19, 2022

Did you try to use 1 GPU (launch with --nproc_per_node 1)?

In similar issues, others said "Signals.SIGSEGV: 11 seems to be a common bug when you are using different libraries/environments for compiling and running the code."

Could you post your environment information and driver version? A helpful script could be collect_env.

@ppbangKGT
Copy link
Author

Did you try to use 1 GPU (launch with --nproc_per_node 1)?

In similar issues, others said "Signals.SIGSEGV: 11 seems to be a common bug when you are using different libraries/environments for compiling and running the code."

Could you post your environment information and driver version? A helpful script could be collect_env.

I think this error may be caused by the version of cuda. When I use another server whose CUDA version is 11.3. The problem is solved. Thanks for your time.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants