This solution has been replaced by the AWS Solution for secondary analysis:
This project demonstrates how to run a large-scale, genomics secondary-analysis pipeline on AWS using AWS Step Functions and AWS Batch. You can learn more about how we used the new AWS Step Functions integration with AWS Batch in our blog post, Building Simpler Genomics Workflows on AWS Step Functions.
- AWS CLI
- Admin permissions for deployment
Will build and deploy everything needed to run a secondary-analysis pipeline on AWS. This includes:
- Custom AMI with 1TB scratch volume using the ECS AMI for your region (10 min)
- Isaac, Strelka, Samtools-Stats and SnpEff as Docker images in ECR (40 min)
- AWS Batch queues, compute environments, job definitions and an AWS Step Functions state machine (10 min)
- Roles and buckets needed.
$ setup.sh
Run the Workflow
Copy the workflow input that was output to the terminal window after the deployment completed. Use that input to run the pipeline in the StepFunctions console. You can also find example workflow inputs in the outputs section of the batch-genomics-pipeline Cloudformation stack.
Update Pipeline Stack
Deploy changes made to local Cloudformation templates.
$ update.sh
Teardown Pipeline Stack
Deletes everything except the custom AMI and ECR images.
$ teardown.sh