Skip to content

What is the significance of the numbering system for the recommended mutations in the python bin/recommend.py output? #40

Answered by brianhie
danielguion asked this question in Q&A
Discussion options

You must be logged in to vote

This is documented in the README: "the script will output a list of substitutions and the number of recommending language models."

The number indicates the count of language models for which the corresponding mutation has higher LM likelihood than wildtype. We use an ensemble of six language models, which is why the number is out of 6. We use these counts to prioritize mutations that have a consensus across multiple language models, as described in the methods of the paper.

Hope that helps!

Replies: 1 comment 2 replies

Comment options

You must be logged in to vote
2 replies
@brianhie
Comment options

Answer selected by brianhie
@jamesrgraham
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
3 participants