Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

check_sumstat_format failed (likely due to leading space in the first column) #261

Open
ofrei opened this issue Dec 13, 2021 · 3 comments
Open

Comments

@ofrei
Copy link
Collaborator

ofrei commented Dec 13, 2021

I'm getting the following error
image

My input has an extra space in front of the first column name:
image

The second file, .pandas, shows the input after fixing this problem - indeed this enabled cleansumstats.sh pipeline to run without errors.

@pappewaio
Copy link
Contributor

It seems you managed to solve the issue yourself, am I correct?

For the future, it is possible to access more log-files with more specific error reporting if you enter the work directory specified at the last line of your output above. But remember, the path you see is within the image, so if you want to access the workdir from outside the image it will be present in the tmp/fake-home/ directory, which is automatically mounted every time the image is being called. It might be a nice feature to make them more easily available directly in the user specified "outdir".

@ofrei
Copy link
Collaborator Author

ofrei commented Dec 22, 2021

Thanks!

Thanks for pointing to the tmp/fake-home folder - indeed, it had very useful info:
image

Yes, I solved this using python's pandas to read a .csv, then 'strip' all column names to get rid of spaces, and save back to disk. It's nice if cleansumstats.sh can do this, but it's not a showstopper - after all it's quite weird to have spaces in column names.

@pappewaio
Copy link
Contributor

I think we can add a comment in the FAQ on how to do this removal of spaces in the beginning. Maybe it is even worth having a whole docs section only about how to handle strangeness in sumstats.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants