You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Angelica informed me that there isn't really much difference in speed and that the first time running through the annotation will be longer due to cacheing of the results in mongoDB.
I experienced - around 400 VCFs an hour first time running through the site. (Granted the process probably sped up if there are duplicated variants)
I think it would be a good idea to do some real performance comparisons ... seeing how long a run for a center using hundreds of small vcf files take to complete. And compare that to an annotation run for the same center using a single merged maf (perhaps from the output of the first run) as the input.
Based on a previous conversation with @thomasyu888 the original decision to annotate the MAFs individually first before generating a "per-center" MAF was based on whether the suggested approach would be scalable or not. I believe it's been decided to do some performance tests first based on the currently available GENIE data. Performance tests will be:
total time to standardize and annotate individual MAFs from a center
total time to standardize and merge individual MAFs into a "per-center" MAF and then run that through the annotator
average distribution of variants of VCFs:
The text was updated successfully, but these errors were encountered: