Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Generate 3D images of flu HA proteins with variants highlighted #2

Open
Deena-B opened this issue Jul 10, 2019 · 0 comments
Open

Generate 3D images of flu HA proteins with variants highlighted #2

Deena-B opened this issue Jul 10, 2019 · 0 comments
Assignees

Comments

@Deena-B
Copy link
Collaborator

Deena-B commented Jul 10, 2019

Background

3D protein structures hold richer information than nucleotide sequences. Images can be analyzed using CNNs. Need I say more? Let's make some images!

Aim

Generate about 5 panels of 3D images of flu H1 HA proteins with amino acid variants highlighted

Method

  1. Find a good PDB structure for influenza H1N1 A/California/4/2009 or A/California/7/2009. Good is defined by:
  • not having an antibody bound
  • no (or few) missing residues when the surface view of the protein trimer is spun 360 degrees
  1. Download human H1N1 HA sequences from the 2013-14 flu season (see fluHA issue 1 on where to get the sequences)
  2. Align & cluster using a hierarchical tree method
  3. Create consensus sequences for the whole season and a consensus sequence for each major cluster
  4. Use Pymol to generate 3D images using each cluster's consensus with mutated residues from where they differ from the PDB structure and color the residues that vary from the consensus of the whole season
  5. Take X number of snapshots of each HA from different angles and degrees of rotation and put the snapsots into a single 'panel' image

Document your work

Please fork & clone this repo. Check out a branch with the issue number (e.g. 3Di2) for your work, then push and make a PR for us to merge.

Add a note with your process to the nb folder (nb stands for notebook). Your note can be .md or .ipynb or other types. For an example directory structure, see overview/wiki dir-struct.

Your note should be named as follows:

  • DATE-issue#-shortdescription.ext

e.g.:

  • 20190701-i02-extract-chrM-fa.md

If you keep large files within the fluHA repo, remember to add their extension to the .gitignore file. If you can make an example file that is small, then use the annotation .eg.ext and create a .gitignore exemption with !. (See mitolin .gitignore for examples)

Questions?

Please put questions related to this issue in this issue thread. If you want a quick response, post a link to your comment in this thread to Slack #deepcelllineage or DM @deena. To join Slack enter your email address here. For questions NOT specifically related to this issue, get in touch through any of the communication methods listed in DCL's overview README.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants