-
Notifications
You must be signed in to change notification settings - Fork 67
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
About arguments in hallucination.py for two-chain hallucination #9
Comments
Sorry, the free binder hallucination was added somewhat recently, toward
the end of our paper, so these features are not as well documented. See my
responses below.
*Q1*
I want to know whether the --mask=10,A6-10,65 (total 80 aa) and --mask=80
are same when the argument --spike is set to 0.05.
your first --mask argument specifies to hallucinate 10 AAs, followed by a
motif consisting of residues A6-10 from the input pdb, followed by 65
hallucinated residues. The second mask argument would in theory specify a
completely hallucinated protein of 80 residues, but in practice will
probably trigger an error. If you want to do completely unconstrained
hallucination, you should put at least 1 motif residue, like 80,A6, but
then set the cce loss weight to 0 (--w_cce 0). That way the motif won't
affect the optimization.
the --spike argument specifies an initialization for the sequence, so the
behavior will still be different depending on which --mask you input.
In addition,
Since the --spike=0.05 means random sequence, I think that there is no
significancy which sequence is enter to the --spike_fas, but there is
significancy only in the sequence length. is it right ?
Yes I think this is right. Nowadays we only really use --spike=0.99 (or
something else close to 1) with MCMC because that tells the script to
completely initialize with a particular sequence. It's unclear how useful
--spike is for gradient descent.
*Q2*
In the Fig S17-C of the 2022 Science paper,
it seems that only the binding target (= receptor?) was used as template.
If it is right, how can I enter the binding target as template ? which
argument is required ?
or is binding target automatically provided as template to hallucination
if the binding target pdb file is provided to the --receptor argument ?
Yes, providing --receptor will automatically cause the receptor to be used
as template
*Q3*
To perform free hallucination, I run the script without --use_template,
but some error was occurred.
Saving /proc_1/RFDesign/binder/test/test_1: Traceback (most recent call last):
File "/opt/tools/RFDesign/hallucination/hallucinate.py", line 739, in <module>
main()
File "/opt/tools/RFDesign/hallucination/hallucinate.py", line 731, in main
optimization.save_result(out_prefix, Net, ml, trb['msa'], args, trb,
File "/opt/tools/RFDesign/hallucination/util/../optimization.py", line 179, in save_result
idx_tmpl = net_kwargs['idx'].cpu().numpy()[0]
KeyError: 'idx'
When I enter the dummy value (like Z999-1000) to the --use_template, then
the script worked well.
If you don't want to template any part of the binder, you can input
--use_template no_contig. You can also put --use_template True (to template
all of the contigs, usually a binding interface motif) or --use_template
A6-10 (representing, e.g., some subset of the interface motif residues).Message
ID: ***@***.***>
… |
Thank you for kind and helpful reply ! I have last two questions. Q1 Is there any way to specify binder binding site on the receptor without I want to design binders using unconstrained hallucination with the specific binding site on the receptor. or is it a right way that use the Q2 (Cao et al., 2022 Nature // Dauparas et al., 2022 BioRxiv // Bennett et al., 2022 BioRxiv) Comparing those two similar methods with this RFDesign, which approach has a higher success rate to design binder proteins ? Sincerely, Jongseo |
*Q1*
Is there any way to specify binder binding site on the receptor without
--use_template ?
I want to design binders using unconstrained hallucination with the
specific binding site on the receptor.
or is it a right way that use the --mask and --use_template as well as
--w_cce=0
There isn't a way to specify the binding site in this version of the code.
One of the authors is working on a followup manuscript with loss functions
for controlling the binding site, but this isn't ready to be released
publicly yet.
*Q2*
In the case of protein binder design, I already read 2~3 previous papers
that describe docking > seq. design > scoring.
(Cao et al., 2022 Nature // Dauparas et al., 2022 BioRxiv // Bennett et
al., 2022 BioRxiv)
https://www.nature.com/articles/s41586-022-04654-9
https://www.biorxiv.org/content/10.1101/2022.06.03.494563v1
https://www.biorxiv.org/content/10.1101/2022.06.15.495993v1
Comparing those two similar methods with this RFDesign, which approach has
a higher success rate to design binder proteins ?
The highest success rate will be the method in Bennett et al.
… Message ID: ***@***.***>
|
I have several questions about free hallucination to generate protein binders.
Here is my command line input.
Q1
I want to know whether the
--mask=10,A6-10,65
(total 80 aa) and--mask=80
are same when the argument--spike
is set to 0.05.In addition,
Since the
--spike=0.05
means random sequence, I think that there is no significancy which sequence is enter to the--spike_fas
, but there is significancy only in the sequence length. is it right ?Q2
In the Fig S17-C of the 2022 Science paper,
it seems that only the binding target (= receptor?) was used as template.
If it is right, how can I enter the binding target as template ? which argument is required ?
or is binding target automatically provided as template to hallucination if the binding target pdb file is provided to the
--receptor
argument ?Q3
To perform free hallucination, I run the script without
--use_template
, but some error was occurred.When I enter the dummy value (like Z999-1000) to the
--use_template
, then the script worked well.is it proper way to perform free hallucination ?
Sincerely,
jongseo
The text was updated successfully, but these errors were encountered: