-
Notifications
You must be signed in to change notification settings - Fork 197
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add command to extract Neff scores for MSA #647
Conversation
It appears like some of the CI tests are not passing. Am I missing something or are parts of the CI pipeline broken? Can someone help me on that? |
No idea why windows is failing in azure, you didn't change anything that would affect that. Cirrus is currently okay to fail, something changed on their side and I didn't get around to fix the issue. I think you are still using the wrong function. Neff is stored as a char, you need to use |
Ok good, then I will ignore these pipelines. The Neff scores I use come from the MMseqs2/src/commons/Sequence.h Line 453 in 7b95387
Therefore I think the MMseqs2/src/commons/MathUtil.h Lines 216 to 224 in 7b95387
Sorry if I am missing something here. Is there another location/a better way of extracting the Neff scores? (I don't know whether this is just personal preference, but I like the idea of values not being floats when writing them to an output. A fixed range from [1;255] somehow sound more appealing to me than a floating point number with an obscure precision.) |
Okay, sorry I didn't remember the code very well. Your initial implementation without the MathUtil functions was correct, the Sequence object already deals with the correction to float. I wouldn't use |
Sorry for the long round-trip delay, I've reverted my changes to the original implementation :) |
Thank you. I was traveling and forgot about the PR, sorry! |
As discussed in #638, this code adds a new command to output Neff scores. The command is called
profile2neff
. It takes a profile database as input and outputs per-residue Neff scores for a query sequence.The scores are written to a
DBWriter
that then contains two lines for each sequence: a header similar toprofile2pssm
's output and a line containing tab-separated Neff scores (from the range [1;255]) for each residue. The score is converted from the internal float representation to char using theconvertNeffToChar
function fromMathUtil.h
.