2.4.0
ProTECT 2.4.0 is here with so many goodies to share!
Major new features include:
- HG38 support: ProTECT now supports neoantigen prediction on samples using the GRCh38/hg38 reference genome. As with hg19, hg38 references are provided in our S3 bucket at
s3://cgl-protect-data/hg38_references/
- Altered-self support: ProTECT now calculates the binding scores of the corresponding wild-type peptide for each neoepitope and takes them into account when ranking the results.
- User-specified dockerized-tool versions: ProTECT now allows the user to specify the version of a tool to use in the analysis. The user must ensure that the tag exists in the provided Docker Hub.
- http, https and ftp download support: ProTECT can now pull samples and references from http, https and ftp endpoints.
- Support for non-standard naming schemes for fastqs: ProTECT now allows users to optionally specify a separate link for the _2.fastq file (and drops the previous requirement that the _1 and _2 files share a common naming scheme and exist in the same directory). This functionality, along with https support, allows users to pull signed AWS S3 links into their pipelines.
- Added code that produces Dockerized versions of ProTECT (currently it hardcodes the version to 3.2.0, but this will be addressed in #173)
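To illustrate the new download support, here is a minimal sketch of how an input location could be classified by its URL scheme. It is not ProTECT's actual code: the function name `classify_input` and the `REMOTE_SCHEMES` set are hypothetical, and the example URLs other than the S3 reference bucket are made up.

```python
from urllib.parse import urlparse

# Hypothetical set of remote schemes, taken from the feature list above.
# This is an illustration, not ProTECT's implementation.
REMOTE_SCHEMES = {"s3", "http", "https", "ftp"}


def classify_input(location):
    """Return the URL scheme of a remote input, or 'file' for a local path."""
    scheme = urlparse(location).scheme.lower()
    if scheme in REMOTE_SCHEMES:
        return scheme
    if scheme in ("", "file"):
        # No scheme (a plain POSIX path) or an explicit file:// URL.
        return "file"
    raise ValueError("Unsupported scheme: %r" % scheme)
```

Because a signed S3 link is an ordinary https URL with query parameters, it would be handled the same way as any other https endpoint in this sketch, which is why the _1 and _2 fastq links no longer need to share a name or directory.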
Other minor features and bugfixes:
- Updated the flow chart, docstrings and all associated supporting material to more accurately describe ProTECT.
- Bumped Transgene to 2.0.0 (using the correct tag of the Docker image)
- Fixed rsem to correctly request disk space
- Bumped Toil dependency to 3.5.2 (handles some issues seen in a 90-sample scale run)
- Fixed a small bug where, in certain cases, STAR was asking for float disk requirements and causing Toil to crash.
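The float disk requirement bug above is a common pitfall when a disk request is derived by multiplying file sizes by a scaling factor. The following is a minimal sketch of the general remedy, rounding the estimate up to a whole number of bytes. It is not the actual ProTECT fix: the function name `disk_request_bytes` and the `overhead` factor are assumptions made for illustration.

```python
import math


def disk_request_bytes(input_sizes, overhead=1.5):
    """Return an integer disk request (in bytes) covering the inputs plus headroom.

    `overhead` is a hypothetical scaling factor; multiplying by it can yield
    a float, so the result is rounded up and cast to int before being used.
    """
    return int(math.ceil(sum(input_sizes) * overhead))
```

Rounding up rather than truncating keeps the request from falling just short of the estimate.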