anvi-profile improvements #1362

Merged
merged 462 commits into from
Mar 5, 2020

@ekiefl ekiefl commented Feb 26, 2020

This is a big change to anvi'o, so I wanted to open the pull request early and get critique from others throughout the process.

As of this post, I am focused on developing tests that compare results to v6.1.

Instructions for running the tests are below:

  1. Download and unzip this: testing.zip

  2. Set the shell variable a to the path of your anvi'o repository, for example:

a=/Users/evan/Software/anvio
  3. Your working directory should look like this:
SNV-IMPROVEMENTS/
├── test_SKIP_SNV.py
├── test_SNV.py
├── test_SCV.py
├── run.sh
├── ANE_004_05M.bam
├── ANE_004_05M.bam.bai
├── 05_CONTIGS.db
└── with.sh
  4. Fire away:
./run.sh

class fetches all its attributes from self.read to mimic the
AlignedSegment class
successful; aligned_sequence matches with reference sequence
return a pysam.AlignmentFile object. No more get() method and no more
run and progress attributes :(
except trims reads if they overhang the start and start
positions as additional data to ProcessAlleleCounts and it filters them
alongside its normal operations
contig in one function call. For a contig of length 1M, this info is
retrieved in 0.03s, compared to 4s if calling get_nt_info for each position
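
The contrast in the last note, one call per position versus one call per contig, can be sketched roughly as follows. This is only an illustration under assumptions: `get_position_info` and `get_contig_info` are hypothetical names, and the (base, is-G-or-C) tuple stands in for whatever per-nucleotide information the real `get_nt_info` returns, which is not shown in this thread.

```python
def get_position_info(sequence, position):
    """Hypothetical per-position lookup: one function call per nucleotide."""
    base = sequence[position]
    return base, base in "GC"


def get_contig_info(sequence):
    """Hypothetical whole-contig lookup: one call returns info for every position."""
    return [(base, base in "GC") for base in sequence]


contig = "ACGTTGCA"

# Slow pattern: the call overhead is paid once per position.
per_position = [get_position_info(contig, i) for i in range(len(contig))]

# Fast pattern: the call overhead is paid once per contig.
per_contig = get_contig_info(contig)

# Both patterns must produce identical results.
assert per_position == per_contig
```

Any actual speedup depends on how expensive each individual call is; the sketch only shows that the two access patterns are interchangeable in their output.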

ekiefl commented Mar 4, 2020

The above fixes the memory leak in the SNV calculations, but not in the SCV routine, because that routine calls other numba-ized functions. After digging in, I discovered that the leak was caused by numba generators in utils, so I posted a reproducible example in the numba repo: numba/numba#5350

For now, I will avoid the use of numba generators :\
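
One way to avoid a numba generator is to have the jitted function build and return a list instead of yielding values. The sketch below is an assumption about the general shape of such a rewrite, not the code in this branch; `window_starts` and `sum_window_starts` are hypothetical names, and the fallback decorator only exists so the example also runs without numba installed.

```python
try:
    from numba import njit
except ImportError:
    # Fallback so the sketch still runs as plain Python without numba.
    def njit(func):
        return func


@njit
def window_starts(sequence_length, window_size):
    """Return every full-window start index as a list (no `yield`)."""
    starts = []
    for start in range(sequence_length - window_size + 1):
        starts.append(start)
    return starts


@njit
def sum_window_starts(sequence_length, window_size):
    """A second jitted function that consumes the list, as a caller of a generator would."""
    total = 0
    for start in window_starts(sequence_length, window_size):
        total += start
    return total
```

The trade-off is that the whole list is held in memory at once, which a generator would avoid.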

ekiefl commented Mar 5, 2020

@meren, I have reached the point where I am just adding emojis to the codebase (2d326b6). I think this means I have nothing more to do here. I'm not sure how much you have reviewed this branch, but at this point I'll wait for you to review as much as you can. If everything is good I want to merge tomorrow :S

meren commented Mar 5, 2020

I say go for it and merge, Evan, and we all play with it for a while.

@ekiefl ekiefl merged commit 107ae4c into master Mar 5, 2020
meren added a commit that referenced this pull request Mar 5, 2020
@meren meren deleted the snv-improvements branch July 31, 2020 20:36