Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

SAM format as input/output #1

Closed
kclem opened this issue Feb 17, 2022 · 3 comments
Closed

SAM format as input/output #1

kclem opened this issue Feb 17, 2022 · 3 comments
Assignees

Comments

@kclem
Copy link
Collaborator

kclem commented Feb 17, 2022

It looks like the core implementation of process_bam should easily be applied to sam files too. The CIGAR string and alignment position need to be modified (implemented properly).

Helpful links:
Explain sam flags:
https://broadinstitute.github.io/picard/explain-flags.html

IVG viewer:
https://software.broadinstitute.org/software/igv/

@kclem
Copy link
Collaborator Author

kclem commented Feb 17, 2022

OK, I pushed some demo code to 2f7d321. If you change to that directory ('CRISPResso2/sam_test') you should be able to run python sam_test.py.

Note that sam_test.py includes a function process_bam that mirrors the code in CRISPRessoCORE. The goal is that once this is working, we can plop process_bam from sam_test.py back into CRISPRessoCORE, so be aware that any changes you make to the functionality or process_bam signature, etc. mean we have to make changes to other parts in CRISPRessoCORE.

if your run python sam_test.py you'll see demo output from a bam file with 50 reads.

I just printed out a couple of things including

  • the old cigar string for when this read was aligned to the genome,
old cigar was 164M34D76M4I4M
  • The sam line elements (separated by tabs so you can index into them as in the sam specification. Note that CRISPResso adds on the "c2:Z" tags at the end. The rest are from the input sam file.
line els: ['M05634:17:000000000-BKJP5:1:1101:23018:4446', '16', 'chr11', '992', '255', '164M34D76M4I4M', '*', '0', '0', 'CTTACCTCCAGTGTTTGTGCAGCCGCCGCTCCAGAGCCGTGCGAATGGGGCCATGCCGACCAAAGCGCCGATGGATGTGGCGCAGGTAGCGCGCCCACTGCAAGGCCCGGCGCACGGTGGCGGGGTCCCAGGTGCTGACGTAGGTAGTGCTTGAGACCGCCAGAATTCCATGAGGTGCGCGAAGGCCCTACTTCCGCTTTCACCTTGGAGACGGCGACTCTCTGCGTACTGATTGGAACATCCGTCAC', ';9/..9;.9/.A9.FFFF.;-999=>-F99/FBD=A;CBDFFBB9.-:/A9;;B=AFFFF9.D;ABA?:/BFFBF9AA.;@A/FB9B@@DCB.A9F00BC9/GCGD@?DGDC.GGHGCFCGGFDGG/HG0?0EHGGHHGGHFHGBHHGHG2GG?E<GGEGHGFHHHHGHGHHFFCGGGGGFEFEBAGFFBGCGCGHHGE5GEGHHFDFGGGGGFCHHDG2EGGGGHGFDGGGGGGGFFBBBDABAAAB', 'AS:i:-143', 'XN:i:0', 'XM:i:5', 'XO:i:2', 'XG:i:38', 'NM:i:43', 'MD:Z:3G6C2G150^AGCTCGGAAAAGCGATCCAGGTGCTGCAGAAGGG78C0G0', 'YT:Z:UU', 'c2:Z:ALN=Reference CLASS=Reference_MODIFIED MODS=D34;I0;S0 DEL=80(34) INS= SUB= ALN_REF=----CGGATGTTCCAATCAGTACGCAGAGAGTCGCCGTCTCCAAGGTGAAAGCGGAAGTAGGGCCTTCGCGCACCTCATGGAATCCCTTCTGCAGCACCTGGATCGCTTTTCCGAGCTTCTGGCGGTCTCAAGCACTACCTACGTCAGCACCTGGGACCCCGCCACCGTGCGCCGGGCCTTGCAGTGGGCGCGCTACCTGCGCCACATCCATCGGCGCTTTGGTCGG------------------------------------------------------- ALN_SEQ=GTGACGGATGTTCCAATCAGTACGCAGAGAGTCGCCGTCTCCAAGGTGAAAGCGGAAGTAGGGCCTTCGCGCACCTCATGGAAT----------------------------------TCTGGCGGTCTCAAGCACTACCTACGTCAGCACCTGGGACCCCGCCACCGTGCGCCGGGCCTTGCAGTGGGCGCGCTACCTGCGCCACATCCATCGGCGCTTTGGTCGGCATGGCCCCATTCGCACGGCTCTGGAGCGGCGGCTGCACAAACACTGGAGGTAAG']
  • the new cigar string when the read is aligned to this amplicon sequence
potentially setting cigar to : 4I80M34D109M55I from alignment
  • The alignment between the read (from the input bam) and the reference amplicon (provided by the user):
aln: GTGACGGATGTTCCAATCAGTACGCAGAGAGTCGCCGTCTCCAAGGTGAAAGCGGAAGTAGGGCCTTCGCGCACCTCATGGAAT----------------------------------TCTGGCGGTCTCAAGCACTACCTACGTCAGCACCTGGGACCCCGCCACCGTGCGCCGGGCCTTGCAGTGGGCGCGCTACCTGCGCCACATCCATCGGCGCTTTGGTCGGCATGGCCCCATTCGCACGGCTCTGGAGCGGCGGCTGCACAAACACTGGAGGTAAG
ref: ----CGGATGTTCCAATCAGTACGCAGAGAGTCGCCGTCTCCAAGGTGAAAGCGGAAGTAGGGCCTTCGCGCACCTCATGGAATCCCTTCTGCAGCACCTGGATCGCTTTTCCGAGCTTCTGGCGGTCTCAAGCACTACCTACGTCAGCACCTGGGACCCCGCCACCGTGCGCCGGGCCTTGCAGTGGGCGCGCTACCTGCGCCACATCCATCGGCGCTTTGGTCGG-------------------------------------------------------
  • and the alignment position reported by the alignment. These positions were produced by the short read aligner bowtie2 and are all in reference to a very short genome at smallGenome/smallGenome.fa.
alignment position from aligner is 255

@kclem
Copy link
Collaborator Author

kclem commented Feb 17, 2022

You should be able to view the input and output bam files in IGV.

The goal is to have the CRISPResso alignment reflected in the output bam and viewable in the IGV. So for example, many of these reads have a 4bp insertion at the beginning (4I at the beginning of the new CIGAR) and we'd like this to show up in the IGV as well, while modifying the start position to be accurate.

I've created code to create the CIGAR string, but you will have to change the "MD:Z" tag - which the IGV may use for display. You can read about this in the SAM spec.

@Colelyman
Copy link
Collaborator

Something I stumbled on that we should be aware of, apparently there is a design flaw in BAM format that CIGAR strings will break when they have >65535 operations (source: https://github.com/lh3/minimap2#working-with-65535-cigar-operations). This will obviously only affect us when we start using long reads, but it wouldn't be hard to do what minimap2 does and have a clipped CIGAR and add the full one to the CG tag.

Snicker7 pushed a commit that referenced this issue Sep 19, 2022
Colelyman added a commit that referenced this issue Sep 21, 2023
* Add converted Python3 files

These were just run with 2to3 from the original Python2 repository.

* Add Python3 compatible CRISPResso2Align Cython module

* Add Python3 compatible CRISPRessoCOREResources file

Had to convert some strings to PyBytes objects to get rid of compiler errors.

* Allow for int-looking chromosome names in WGS input file

In CRISPRessoWGS, the region file contains a 'chr_id' column which is sometimes mis-recognized as ints when read by pandas if using the chromosome notation without 'chr' (e.g. 1,2,3 in stead of chr1,chr2,chr3). This bug fix forces chr_ids to be read as strs.

* Add setup.py that compiles

* Add .gitattributes

* Add .gitignore

* Spacing Updates

* Remove repeated source files within CRISPResso2 package

I'm not sure how these got here... it must have been a mistake when running
2to3.

* Update CRISPResso2Align compiled file

* Fix float error when displaying logo

* Add test cases and data

* Add distance matrix and MANIFEST.in

* Add templates directory

* Fix encoding type on CRISPRessoCORE

* Encode strings as bytes in `global_align` calls

* Fix bytes and string error in CRISPRessoCOREResources.pyx

* Update methods for modern Pandas

- Remove check based on Pandas version for sorting
- Refactor .ix calls to .loc
- Cast string numbers to number types

* Fix float error when computing nSamples while plotting

* Fix casting to unicode (because strings are already unicode encoded)

* Replace using bytes instead of strings in CRISPRessoShared

* Fix bug where the ref_positions weren't being properly computed

This was because the CRISPRessoCOREResources.pyx was dealing with a bytes array
instead of a string, after properly decoding the issue seems to be resolved.

* Change iteritems to items for Python3 compatibility

* Cast map iterator to list for seaborn version check

* Cast int value to str for heatmap annotation

* Update pad to pad_inches keyword when saving figure

The pad keyword argument is deprecated and will soon result in an error. I
believe that pad_inches is somewhat equivalent, but may require some tweaking.

* Fix indentation errors in filterFastqs

* Properly import filterFastqs module in CRISPResssoCORE

* Further byte/string fixes in CRISPRessoCORE

* Read gzipped files in text mode

* Open gzipped files in test mode

* Pandas casting to numeric in DataFrame

* Change iteritems to items for Python3 compatibility

* Change `.upper` calls and unneeded string conversion

* pull out some aln methods to crispressoshared

pulled out and refactored:
get_sgRNA_mismatch_vals
get_best_aln_pos_and_mismatches
get_mismatches
get_alignment_coordinates

* Fix previous push for aln_matrix

* Add first simple unit test for `get_mismatches`

* Fix spacing in CRISPResso2Align.pyx

This normalizes the indentation to multiples of 4, to ensure that there isn't
any unexpected behavior.

* Remove strlen import and use Python methods to calculate string length

This should be more performant because strlen is always takes linear time, but
often the Python string length is cached.

* Fix CRISPRessoShared typo and add aln_matrix parameter to get_mismatches

* Closes #1 by adding an additional length check

* Properly convert bytes to string in slugify

* Implement extra logic in global_align to prevent integer overflow

* Add extra test case that seg faults before the previous commit

* Fix float error

* Change pad to pad_inches in matplotlib savefig calls

* Some more pandas fixes to work with updated pandas version

* Fix clean_filename to work with Python 3 strings

* Update paths in FANC.batch

* Fix indentation level in CRISPRessoWGSSCORE

* Properly decode strings in cleaning filenames and gzip

* Update test cases

Update bam alignment for testing

* Update test scripts and expected results

* CRISPRessoCompare pickle fix

Create method 'write_crispresso_info' in CRISPRessoShared to handle future removal of cpickle

* Add Reference_Sequence as strings (instead of bytes) in df

* Update genome for tests

Update genome files so CRISPRessoWGS tests pass

* delete old tests

* Re-add new tests

* Move test diffs to after their running commands and ignore whitespace

* test for COREResources

* Fix byte decoding issues

Finally moved the decoding into the Align Cython code, which hopefully cleans
some things up and fixes bugs.

* Pass in strings to global_align

This also includes changes to the alignment algorithm because without them there
is an indexing error. I'm not sure why there is an error when you change the
type from bytes to string, but I worry that the error was failing silently
before and only surfaced now.

* Pass in strings to global_align

* Update checking for edge cases in global_align

* Making string construction more Cythonic

These changes include allocating the alignment strings using `malloc` and then
decoding them to Python strings after they have been set.

* Create test_CRISPResso2Align.py

Test cases CRISPResso2Align

* Change array type from np.int to int

* Update test_CRISPRessoCOREResources.py

Add test and fix wrong test

* Update test_CRISPResso2Align.py

Change gap incentive type to int

* Update test_CRISPResso2Align.py

ATTA test

* Close alignment matrix file after opening

* Fix test case error (by making gap arguments negative)

* Update CRISPRessoShared.py

Add commentary to get_mismatches

* Update CRISPRessoShared.py

Move last comment

* gap_incentive test cases + bug fix for when gap incentive is 1 at position 0

* Update WGS test fastq and expected results

* Fix numpy array/scalar complaint

* Update smallGenome.fa

Test change to smallGenome.fa (this is just a test)

* Update smallGenome.fa

Revert test change to smallGenome.fa

* test remove fasta reference

* re-add fasta reference + gitattributes to force lf

* Round alignment scores to 3 decimal places

* Fix bug in the 3b plot "Substitutions Frequency" link

This fixes the last link in the modifications lengths portions of the report to
point to the correct file path.

* Fix padding around nucleotide quilt plot

* Implement testing script to handle ignoring diffs in floats

* Add more test files for CRISPResso_on_FANC.Cas9

* Add more test files for CRISPRessoBatch_on_FANC

* Add more test files for CRISPRessoWGS

* Add more test files for CRISPResso_on_params

* Use diff.py in testRelease.sh

* Add all test files for CRISPRessoPooled

* Add test files for CRISPRessoCompare

* Fix #5

Frameshift plots (plot 5) are skipped if no frameshift data is available. Previously this resulted in a Matplotlib deprecation warning.

* Implement encoding and decoding of CRISPResso2_info object in JSON

* Add crispresso_info_file_name to load_crispresso_info function

This is mainly used in CRISPRessoAggregate. Everywhere this function is used, it
just uses the default value.

* Use JSON serialization in CRISPRessoCORE

* Use JSON serialization in CRISPRessoBatch

* Use JSON serialization in CRISPRessoPooledCORE

* Use JSON serialization in CRISPRessoWGS

* Use JSON serialization in CRISPRessoCompare

* Use JSON serialization in CRISPRessoMeta

* Use JSON serialization in CRISPRessoAggregate

* Remove pickle imports from CRISPRessoReport

Co-authored-by: Cole Lyman <cole@colelyman.com>
Colelyman added a commit that referenced this issue Mar 22, 2024
Snicker7 added a commit that referenced this issue May 10, 2024
* Create aws_ecr.yml (#1)

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* us-east-1

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml
Colelyman added a commit that referenced this issue May 13, 2024
* Passing sgRNA sequences to regular and Batch D3 plots (#73)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------

Co-authored-by: Cole Lyman <cole@colelyman.com>

* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------

Co-authored-by: Sam <snic9004@gmail.com>

* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------

Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Cole Lyman <cole@colelyman.com>

* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------

Co-authored-by: Cole Lyman <cole@colelyman.com>

* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------

Co-authored-by: Sam <snic9004@gmail.com>

* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------

Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Cole Lyman <cole@colelyman.com>

* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Update integration_tests.yml to point back at master

---------

Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Cole Lyman <cole@colelyman.com>

* Push new releases to ECR (#74)

* Create aws_ecr.yml (#1)

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* us-east-1

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Fix d3 sgRNA sequences (#76)

* Pass correct sgRNA_sequences to d3 plot

* Pass correct sgRNA sequence to prime editor plot for d3

* Resize plotly (#75)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------

Co-authored-by: Cole Lyman <cole@colelyman.com>

* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------

Co-authored-by: Sam <snic9004@gmail.com>

* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------

Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Cole Lyman <cole@colelyman.com>

* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------

Co-authored-by: Cole Lyman <cole@colelyman.com>

* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------

Co-authored-by: Sam <snic9004@gmail.com>

* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------

Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Cole Lyman <cole@colelyman.com>

* Remove token from integration tests file

* Pass div id for plotly

* Remove debug

---------

Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Cole Lyman <cole@colelyman.com>

---------

Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
mbowcut2 added a commit that referenced this issue Jun 19, 2024
* Passing sgRNA sequences to regular and Batch D3 plots (#73)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------

Co-authored-by: Cole Lyman <cole@colelyman.com>

* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------

Co-authored-by: Sam <snic9004@gmail.com>

* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------

Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Cole Lyman <cole@colelyman.com>

* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------

Co-authored-by: Cole Lyman <cole@colelyman.com>

* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------

Co-authored-by: Sam <snic9004@gmail.com>

* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------

Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Cole Lyman <cole@colelyman.com>

* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Update integration_tests.yml to point back at master

---------

Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Cole Lyman <cole@colelyman.com>

* Push new releases to ECR (#74)

* Create aws_ecr.yml (#1)

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* us-east-1

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Fix d3 sgRNA sequences (#76)

* Pass correct sgRNA_sequences to d3 plot

* Pass correct sgRNA sequence to prime editor plot for d3

* Resize plotly (#75)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------

Co-authored-by: Cole Lyman <cole@colelyman.com>

* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------

Co-authored-by: Sam <snic9004@gmail.com>

* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------

Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Cole Lyman <cole@colelyman.com>

* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------

Co-authored-by: Cole Lyman <cole@colelyman.com>

* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------

Co-authored-by: Sam <snic9004@gmail.com>

* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------

Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Cole Lyman <cole@colelyman.com>

* Remove token from integration tests file

* Pass div id for plotly

* Remove debug

---------

Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Cole Lyman <cole@colelyman.com>

---------

Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
mbowcut2 added a commit that referenced this issue Jul 31, 2024
commit 71dd12786fec6c4aba0170a3bfd9022b06f5eede
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Wed Jul 31 14:10:29 2024 -0600

    Squashed commit of the following:

    commit 5e3b30515c4bc437127e7fb21f53cb0bd511c4ca
    Author: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
    Date:   Mon Jul 22 09:31:44 2024 -0600

        D3-Enhancements (#78)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        * Provide sgRNA_sequences to plot_nucleotide_quilt plots

        * Passing sgRNA_sequences to plot

        * Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

        * Add max-height to Batch report samples

        * Change testing branch

        * Fix wrong check for large Batch plots

        * Fix typo and move flexiguide to debug (#77)

        * Change flexiguide output to debug level

        * Fix typo in fastp merged output file name

        * Adding id tags for d3 script enhancements

        * pointing to test branch

        * Add amplicon_name parameter to allele heatmap and line plots

        * Add function to extract quantification window regions from include_idxs

        * Scale the quantification window according to the coordinates of the sgRNA plot

        * added c2pro check, added space in args.json

        * Correct the quantification window indexes for multiple guides

        * Fix name of nucleotide conversion plot when guides are not the same

        * Fix jinja variables that aren't found

        * Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

        * Remove unneeded variable and extra whitespace

        * Switch test branch to master

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

    commit 09e5d9720ad21e44fc7916d71bde3fd7a9dfa7ef
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu Jul 18 14:31:54 2024 -0600

        Asymmetrical cut point (#457)

        * add cut_point_ind to plot_alleles_heatmap for asymmetrical plotting

        * Cole asymmetrical cut point (#453)

        * Pin versions of numpy and matplotlib in CI environment (#84) (#452)

        * Reduce duplication and implement cut_point_ind in plot_alleles_heatmap_hist

        ---------

        Co-authored-by: Cole Lyman <Cole@colelyman.com>

    commit 8d92972694ddff629dad844a6ad100459f69751d
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu Jul 18 14:29:40 2024 -0600

        Cole/update args (#85) (#456)

    commit 44f692ecabf5e2eb96ee0cfd7bae62343da7810c
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Mon Jul 15 16:17:29 2024 -0600

        Implement new pooled mixed-mode default behavior (#454)

        * changes for pooled mixed-mode default (#83)

        * changes for pooled mixed-mode default

        * deprecated old arg

        * added integration tests for mixed mode

        * fixed test target

        * updated test name

        * pinned numpy

        * Fix integration tests yml

        * pinning matplotlib

        * added print to CI tests

        * changed mixed mode info string

        * Remove pooled-mixed-mode-align-to-genome step from Github Actions

        * Update demultiplex_genome_wide parameter and help

        * Convert args.json to unix line endings

        * Add Pooled mixed mode demux run

        * Update the name of the argument in Pooled

        * Point integration tests back to master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Revert change to pooled mixed mode info statement (#86)

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit 79b482b55a0e8edbc03ec22bd2714bade1e90323
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Tue Jul 9 12:53:23 2024 -0600

        Pin versions of numpy and matplotlib in CI environment (#84) (#452)

    commit 80dc1bdd72d50f989717bfc5f8156bc3495c45f4
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu May 30 14:07:42 2024 -0600

        Add padding to image

    commit 381755daf0939aaf2745df0a802c809633aff47d
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu May 30 13:59:57 2024 -0600

        White background for schematic for dark mode

    commit d649db71e610bd8840fbb8d46fadb07789b67390
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Fri May 24 12:45:53 2024 -0600

        Fix typo and move flexiguide to debug (#77) (#438)

        * Change flexiguide output to debug level

        * Fix typo in fastp merged output file name

    commit 71181f50ef2b39015523b1a71d9fd1bf0dce14eb
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Mon May 13 13:34:00 2024 -0600

        Prefix the release Docker tag with a `v` (#434)

    commit d2c2be18a6bb64b0e742cc24c4665980a24324bc
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Mon May 13 09:41:32 2024 -0600

        Showing sgRNA sequences on hover in CRISPRessoPro (#432)

        * Passing sgRNA sequences to regular and Batch D3 plots (#73)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        * Provide sgRNA_sequences to plot_nucleotide_quilt plots

        * Passing sgRNA_sequences to plot

        * Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

        * Add max-height to Batch report samples

        * Change testing branch

        * Fix wrong check for large Batch plots

        * Update integration_tests.yml to point back at master

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Push new releases to ECR (#74)

        * Create aws_ecr.yml (#1)

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * us-east-1

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Fix d3 sgRNA sequences (#76)

        * Pass correct sgRNA_sequences to d3 plot

        * Pass correct sgRNA sequence to prime editor plot for d3

        * Resize plotly (#75)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        * Pass div id for plotly

        * Remove debug

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit 1c504274818b6b17fb60620d48fd92cb2e50566d
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu May 9 14:16:25 2024 -0600

        Fix plots and improve plot error handling (#431)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit acb2ea8e26dff4cd11f71301b344f81b1cec9040
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu May 2 13:49:33 2024 -0600

        Use recent docker image for CircleCI testing that includes updated pandas

    commit 38fd76dbd7ce2087468f9f454b548777de959a68
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Wed May 1 16:42:28 2024 -0600

        Cole/fix status file name (#69) (#430)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

    commit 3ec22e5fd09e432c9997d30e5f9ee51a2cc00d7b
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Wed May 1 13:08:11 2024 -0600

        Remove linked space in readme

    commit 340a4e16795a5e500411e11572ec267525985009
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Wed May 1 13:07:14 2024 -0600

        Fix batch mode pandas warning. (#70) (#429)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit 1bc9e906f0ded81f80761d1ec375ee50a4f882a9
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Fri Apr 26 16:26:27 2024 -0600

        Bump version to 2.3.1 and change default CRISPRessoPooled behavior to change in 2.3.2 (#428)

    commit 5638a1f6ffa973231f23422e9c757fa8cd4af7cc
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Wed Apr 24 18:00:43 2024 -0600

        Spelling fixes

    commit d6011f29db16d8fc1c1e7222457b7f9a1f671de6
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Wed Apr 24 09:33:53 2024 -0600

        Extract `jinja_partials` and fix CRISPRessoPooled fastp errors (#425)

        * Updated README (#64)

        * Updating README to fix argument, email, and formatting

        * removing superfluous files

        * Add link to CRISPRessoPro, move CRISPRessoPro section to end, and fix JSON formatting

        * Remove link to CRISPRessoPro

        * Replace Docker badge with link to tags

        * Add bullet points to Guardrails section and improve formatting

        * Fix typo and removed colons from guardrails

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Extract jinja_partials  (#65)

        * Extract jinja_partials code

        * Remove Plotly dependency from setup.py

        * Fix CRISPRessoPooled flash errors (#68)

        * Fix replacing flash intermediate files with fastp intermediate files

        This also moves where the files are added to `files_to_remove` up to
        near where they are created.

        * Update to run test branch with paired end Pooled test

        * Add pooled-paired-sim test to integration tests

        * Replace flash and trimmomatic with fastp and remove plotly from Github Actions environment

        * Change test branch back to master

        ---------

        Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>

    commit f4858a30c43374f54058b3ad9c1e965e1ab7fb46
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Tue Apr 23 17:00:28 2024 -0600

        Updated README (#64) (#424)

        * Updating README to fix argument, email, and formatting

        * removing superfluous files

        * Add link to CRISPRessoPro, move CRISPRessoPro section to end, and fix JSON formatting

        * Remove link to CRISPRessoPro

        * Replace Docker badge with link to tags

        * Add bullet points to Guardrails section and improve formatting

        * Fix typo and removed colons from guardrails

        ---------

        Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>

    commit c3dbff0fccd44b0b1a9c246dd2aa629ddc515787
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Mon Apr 22 11:24:59 2024 -0600

        Update CRISPRessoPooledCORE.py (#423)

        Fix bug in error reporting if duplicate names are present

    commit 20903c14877e5166b1b8a7b50b8fcab450ea3ca6
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu Apr 18 16:55:39 2024 -0600

        Remove extra imports from CRISPRessoCore (#67) (#422)

    commit 4aae57e5be475cd717792265bee36a71a99425de
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu Apr 18 10:00:19 2024 -0600

        Cole/refactor jinja undefined (#66) (#421)

        * Replace Jinja2 PackageLoader with FileSystemLoader

        The PackageLoader doesn't work with a fairly recent version of Jinja2 (3.0.1)
        and Python 3.9. Replacing with FileSystemLoader work with the older version and
        the latest version.

        * Fix undefined variable `amplicon_name` in report template

        * Refactor logging Jinja2 undefined variable warnings

        * Revert plot_11a update

        * Update intedration test branch

        * Update jinja to warn on undefined but not fail. Fix all undefined warnings

        * Fix github integration tests ref

        * One more undefined variable

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>

    commit 768c3c05bf1786a2a32e135b6e145cd6503c3db1
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Tue Apr 9 17:30:10 2024 -0600

        Fix Jinja2 undefined variables (#63) (#417)

        * Replace Jinja2 PackageLoader with FileSystemLoader

        The PackageLoader doesn't work with a fairly recent version of Jinja2 (3.0.1)
        and Python 3.9. Replacing with FileSystemLoader work with the older version and
        the latest version.

        * Fix undefined variable `amplicon_name` in report template

        * Revert plot_11a update

        * Update intedration test branch

        * Update branch for integration tests

    commit 7e18f08cc1ac5f247a0fd1bbb394ccd9b0a07c2e
    Author: Han Dai <github@daihan.me>
    Date:   Fri Apr 5 18:36:41 2024 -0400

        fix: change all U+00A0 to U+0020 (#400)

    commit 235dc29c0cd0fcca2e999148d4660acf00b07221
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Fri Apr 5 16:36:16 2024 -0600

        Fastp, args as data, guardrails, and PE fix (#415)

        * Change CRISPResso_status.txt format to JSON (#46)

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * add json read for status file

        * changed Formatter to json format

        * fixed json access variable name: message

        * changed  perentage_complete to numeric

        * changed status file to .json

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * New makefile commands

        * changed file to .json

        * changed status to json file

        * Make JSON human readable by adding new lines

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * Introduce pandas sorting in CRISPRessoCompare (#47)

        * New makefile commands

        * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

        * Extract out split_interleaved_fastq function to CRISPRessoShared

        * Implement splitting interleaved fastq files in CRISPRessoPooled

        * Suppress split_interleaved_input from CRISPRessoWGS parameters

        * Suppress other parameters in CRISPRessoWGS

        * Move where interleaved fastq files are split to be trimmed properly

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * On push no branches

        * On push no branches

        * All in one file

        * Fix yml errors

        * Rename jobs

        * Remove old workflow files

        * Remove paths

        * Run jobs in parallel

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Move read filtering to after merging in CRISPResso (#39)

        * Move read filtering to after merging

        This is in an effort to be consistent with the behavior and results of
        CRISPRessoPooled.

        * Properly assign the correct file names for read filtering

        * Add space around operators

        * GitHub actions on pr (#51)

        * Run integration tests on pull_request

        * Run pytest on pull_request

        * Run pylint on pull_request

        * Run tests on PR only when opening PR (#53)

        * Update reports (#52)

        * Update report changes

        * Switch branch of integration test repo

        * Remove extraneous `crispresso_data_path`

        * Point integration tests back to master

        * point to test branch

        * pointed CI config to testing branch

        * Update integration_tests.yml

        point to master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>
        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>

        * Trevor/fastp integration (#50)

        * Update check_program to check versions and create check_fastq function

        * Update fastq arg, implement fastp in get_most_frequent_reads

        * Bump version to 2.3.0

        * Deprecate Flash and Trimmomatic parameters, and update fastp params

        * Update guess_amplicons and guess_guides to remove max_paired_end_reads_overlap

        * Implement trimming of single end reads

        * Merge (and trim) reads in CRISPRessoCORE with fastp

        * Modify error handling to account for fastp errors

        * Replace flash and trimmomatic with fastp in Docker dependencies

        * Update LICENSE.txt with fastp info

        * Remove min and max amplicon length (no longer needed)

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Implement trimming with fastp in CRISPRessoPooled

        * Implemend merging (and trimming) with fastp in CRISPRessoPooled

        * Fixed minor fastp errors

        * Move read filtering to after merging in CRISPResso (#39)

        * Move read filtering to after merging

        This is in an effort to be consistent with the behavior and results of
        CRISPRessoPooled.

        * Properly assign the correct file names for read filtering

        * Add space around operators

        * GitHub actions on pr (#51)

        * Run integration tests on pull_request

        * Run pytest on pull_request

        * Run pylint on pull_request

        * Run tests on PR only when opening PR (#53)

        * Update reports (#52)

        * Update report changes

        * Switch branch of integration test repo

        * Remove extraneous `crispresso_data_path`

        * Point integration tests back to master

        * Update where the test point to

        * Fix 'Prime-edited' key not found (#32)

        * Move 'Prime-edited' amplicon name check

        By moving this, it will check if there is an amplicon named
        'Prime-edited' (which is a reserved name) even if the
        `prime_editing_pegRNA_extension_seq` parameter is empty.

        * Only search for scaffold integration when pegRNA extension seq is provided

        * Remove spaces at the end of lines

        * Docker size (#49)

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * Introduce pandas sorting in CRISPRessoCompare (#47)

        * New makefile commands

        * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

        * Extract out split_interleaved_fastq function to CRISPRessoShared

        * Implement splitting interleaved fastq files in CRISPRessoPooled

        * Suppress split_interleaved_input from CRISPRessoWGS parameters

        * Suppress other parameters in CRISPRessoWGS

        * Move where interleaved fastq files are split to be trimmed properly

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * On push no branches

        * On push no branches

        * All in one file

        * Fix yml errors

        * Rename jobs

        * Remove old workflow files

        * Remove paths

        * Run jobs in parallel

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * 3.4->2.08

        * Put ttf-mscorefonts-installer back above apt-get clean

        * restore slash, replace fastp with trimmomatic and flash, add autoremove step

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * initial readme modifications

        * Updated readme to remove deprecated commands, updated help text to reflect new version and fastp

        * Pointing test branch back at master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>

        * Guardrails clean history (#34)

        * Include guardrail functions

        * Add CRISPRessoReports subtree

        * Refactor to use CRISPRessoReports module

        * Include guardrail functions

        * Functional guardrails, needs reports update

        * Add guardrail partial

        * fix guardrials partial

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * Introduce pandas sorting in CRISPRessoCompare (#47)

        * New makefile commands

        * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

        * Extract out split_interleaved_fastq function to CRISPRessoShared

        * Implement splitting interleaved fastq files in CRISPRessoPooled

        * Suppress split_interleaved_input from CRISPRessoWGS parameters

        * Suppress other parameters in CRISPRessoWGS

        * Move where interleaved fastq files are split to be trimmed properly

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * On push no branches

        * On push no branches

        * All in one file

        * Fix yml errors

        * Rename jobs

        * Remove old workflow files

        * Remove paths

        * Run jobs in parallel

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Update C cythonized files

        * Add exact numbers to guardrails printouts

        * Remove extraneous whitespace from CRISPRessoCOREResources.pyx

        * Fix calculation of `total_mods` from being negative

        The issue was that `all_deletion_coordinates` just tells you how many deletions
        were present, but not how long the deletion is.

        * Changes to message

        * Remove old tag

        * Point tests at guardrails

        * Restore C2 pro check

        * Save message with guardrail name

        * Point tests repo at master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

        * Fix case sensitivity in Prime Editing mode (#54)

        * Move read filtering to after merging in CRISPResso (#39)

        * Move read filtering to after merging

        This is in an effort to be consistent with the behavior and results of
        CRISPRessoPooled.

        * Properly assign the correct file names for read filtering

        * Add space around operators

        * GitHub actions on pr (#51)

        * Run integration tests on pull_request

        * Run pytest on pull_request

        * Run pylint on pull_request

        * Run tests on PR only when opening PR (#53)

        * Update reports (#52)

        * Update report changes

        * Switch branch of integration test repo

        * Remove extraneous `crispresso_data_path`

        * Point integration tests back to master

        * Make all amplicons in amplicon_seq_arr uppercase

        This fixes https://github.com/pinellolab/CRISPResso2/issues/396

        * Allow RNA values to be provided for prime_editing_pegRNA_scaffold_seq

        * Fix 'Prime-edited' key not found (#32)

        * Move 'Prime-edited' amplicon name check

        By moving this, it will check if there is an amplicon named
        'Prime-edited' (which is a reserved name) even if the
        `prime_editing_pegRNA_extension_seq` parameter is empty.

        * Only search for scaffold integration when pegRNA extension seq is provided

        * Remove spaces at the end of lines

        * Docker size (#49)

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * Introduce pandas sorting in CRISPRessoCompare (#47)

        * New makefile commands

        * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

        * Extract out split_interleaved_fastq function to CRISPRessoShared

        * Implement splitting interleaved fastq files in CRISPRessoPooled

        * Suppress split_interleaved_input from CRISPRessoWGS parameters

        * Suppress other parameters in CRISPRessoWGS

        * Move where interleaved fastq files are split to be trimmed properly

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * On push no branches

        * On push no branches

        * All in one file

        * Fix yml errors

        * Rename jobs

        * Remove old workflow files

        * Remove paths

        * Run jobs in parallel

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * 3.4->2.08

        * Put ttf-mscorefonts-installer back above apt-get clean

        * restore slash, replace fastp with trimmomatic and flash, add autoremove step

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Guardrails clean history (#34)

        * Include guardrail functions

        * Add CRISPRessoReports subtree

        * Refactor to use CRISPRessoReports module

        * Include guardrail functions

        * Functional guardrails, needs reports update

        * Add guardrail partial

        * fix guardrials partial

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * Introduce pandas sorting in CRISPRessoCompare (#47)

        * New makefile commands

        * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

        * Extract out split_interleaved_fastq function to CRISPRessoShared

        * Implement splitting interleaved fastq files in CRISPRessoPooled

        * Suppress split_interleaved_input from CRISPRessoWGS parameters

        * Suppress other parameters in CRISPRessoWGS

        * Move where interleaved fastq files are split to be trimmed properly

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * On push no branches

        * On push no branches

        * All in one file

        * Fix yml errors

        * Rename jobs

        * Remove old workflow files

        * Remove paths

        * Run jobs in parallel

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Update C cythonized files

        * Add exact numbers to guardrails printouts

        * Remove extraneous whitespace from CRISPRessoCOREResources.pyx

        * Fix calculation of `total_mods` from being negative

        The issue was that `all_deletion_coordinates` just tells you how many deletions
        were present, but not how long the deletion is.

        * Changes to message

        * Remove old tag

        * Point tests at guardrails

        * Restore C2 pro check

        * Save message with guardrail name

        * Point tests repo at master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: trevormartinj7 <trevormartinj7@gmail.com>

        * Batch d3 clean (#55)

        * imports C2Pro plots if available

        * added --use_matplotlib flag

        * added C2Pro
        matched api funciton signatures

        * added api args for plotly

        * added **kwargs

        * renamed config to custom_config, more specificity

        * added backend flag for plotly kaleido

        * added pro_installed boolean for templates, added plotly dependency to report templates

        * Squashed commit of the following:

        commit c909ea3b34e87ce637e00dac075d2bb2f8bfb954
        Author: McKay <mbow…
Colelyman added a commit that referenced this issue Aug 15, 2024
* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------

Co-authored-by: Cole Lyman <cole@colelyman.com>

* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------

Co-authored-by: Sam <snic9004@gmail.com>

* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------

Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Cole Lyman <cole@colelyman.com>

* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------

Co-authored-by: Cole Lyman <cole@colelyman.com>

* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------

Co-authored-by: Sam <snic9004@gmail.com>

* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------

Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Cole Lyman <cole@colelyman.com>

* Remove token from integration tests file

* Passing sgRNA sequences to regular and Batch D3 plots (#73)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------

Co-authored-by: Cole Lyman <cole@colelyman.com>

* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------

Co-authored-by: Sam <snic9004@gmail.com>

* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------

Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Cole Lyman <cole@colelyman.com>

* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------

Co-authored-by: Cole Lyman <cole@colelyman.com>

* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------

Co-authored-by: Sam <snic9004@gmail.com>

* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------

Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Cole Lyman <cole@colelyman.com>

* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Update integration_tests.yml to point back at master

---------

Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Cole Lyman <cole@colelyman.com>

* Push new releases to ECR (#74)

* Create aws_ecr.yml (#1)

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* us-east-1

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Pass div id for plotly

* Remove debug

* Don't use thread pool with 1 process

* Fix logger issue

* Catchup

* Remove extra print statements

* Restrict generation of multiprocessing pool to when n_processes > 1

* Switch test branch to version bump

* Fix variable name error

* Change test branch back to master

* Fix CRISPRessoAggregate bug and other improvements (#95)

* D3-Enhancements (#78)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------

Co-authored-by: Cole Lyman <cole@colelyman.com>

* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------

Co-authored-by: Sam <snic9004@gmail.com>

* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------

Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Cole Lyman <cole@colelyman.com>

* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------

Co-authored-by: Cole Lyman <cole@colelyman.com>

* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------

Co-authored-by: Sam <snic9004@gmail.com>

* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------

Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Cole Lyman <cole@colelyman.com>

* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Fix typo and move flexiguide to debug (#77)

* Change flexiguide output to debug level

* Fix typo in fastp merged output file name

* Adding id tags for d3 script enhancements

* pointing to test branch

* Add amplicon_name parameter to allele heatmap and line plots

* Add function to extract quantification window regions from include_idxs

* Scale the quantification window according to the coordinates of the sgRNA plot

* added c2pro check, added space in args.json

* Correct the quantification window indexes for multiple guides

* Fix name of nucleotide conversion plot when guides are not the same

* Fix jinja variables that aren't found

* Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

* Remove unneeded variable and extra whitespace

* Switch test branch to master

---------

Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Cole Lyman <cole@colelyman.com>

* Add amplicon_name to plot functions

* Add sgRNA sequences to nucleotide quilt parameters in Aggregate

* Add custom_colors to Aggregate plot functions

* Update Aggregate and make_aggregate_report to have logger and tool

* Write command_used to Aggregate .json info file

* Point to new test branch and add Aggregate run

* Make the order of Aggregate runs explicit

* Sort all instances of crispresso2_folder_info in Aggregate

* Sort df_summary_quantification df in Aggregate

* Try sorting with a list of single column

* Update to correct test branch

* Move to master test branch

---------

Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

---------

Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Cole Lyman <cole@colelyman.com>
Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
Colelyman added a commit that referenced this issue Aug 15, 2024
* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Passing sgRNA sequences to regular and Batch D3 plots (#73)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Update integration_tests.yml to point back at master

---------





* Push new releases to ECR (#74)

* Create aws_ecr.yml (#1)

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* us-east-1

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Pass div id for plotly

* Remove debug

* Don't use thread pool with 1 process

* Fix logger issue

* Catchup

* Remove extra print statements

* Restrict generation of multiprocessing pool to when n_processes > 1

* Switch test branch to version bump

* Fix variable name error

* Change test branch back to master

* Fix CRISPRessoAggregate bug and other improvements (#95)

* D3-Enhancements (#78)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Fix typo and move flexiguide to debug (#77)

* Change flexiguide output to debug level

* Fix typo in fastp merged output file name

* Adding id tags for d3 script enhancements

* pointing to test branch

* Add amplicon_name parameter to allele heatmap and line plots

* Add function to extract quantification window regions from include_idxs

* Scale the quantification window according to the coordinates of the sgRNA plot

* added c2pro check, added space in args.json

* Correct the quantification window indexes for multiple guides

* Fix name of nucleotide conversion plot when guides are not the same

* Fix jinja variables that aren't found

* Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

* Remove unneeded variable and extra whitespace

* Switch test branch to master

---------





* Add amplicon_name to plot functions

* Add sgRNA sequences to nucleotide quilt parameters in Aggregate

* Add custom_colors to Aggregate plot functions

* Update Aggregate and make_aggregate_report to have logger and tool

* Write command_used to Aggregate .json info file

* Point to new test branch and add Aggregate run

* Make the order of Aggregate runs explicit

* Sort all instances of crispresso2_folder_info in Aggregate

* Sort df_summary_quantification df in Aggregate

* Try sorting with a list of single column

* Update to correct test branch

* Move to master test branch

---------





---------

Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
mbowcut2 added a commit that referenced this issue Aug 16, 2024
commit 6ec98a05ee70f85b5aa0ac15ab6094b7f1f20d08
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Tue Aug 13 16:44:39 2024 -0600

    dict key changes

commit 7cfd5acf06da4eb6f49453144ee1fed1e1488a7a
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Thu Aug 8 15:30:31 2024 -0600

    added C2PRO install check back

commit bfb0003329ea61b5c79c7e1df8d9a73ec5a508db
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Fri Aug 2 13:08:12 2024 -0600

    fixed key error conditionals

commit 84444e7480605206cb3efa4a0db675c55e717304
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Fri Aug 2 09:22:44 2024 -0600

    use local jinja_paritals file

commit 71dd12786fec6c4aba0170a3bfd9022b06f5eede
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Wed Jul 31 14:10:29 2024 -0600

    Squashed commit of the following:

    commit 5e3b30515c4bc437127e7fb21f53cb0bd511c4ca
    Author: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
    Date:   Mon Jul 22 09:31:44 2024 -0600

        D3-Enhancements (#78)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        * Provide sgRNA_sequences to plot_nucleotide_quilt plots

        * Passing sgRNA_sequences to plot

        * Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

        * Add max-height to Batch report samples

        * Change testing branch

        * Fix wrong check for large Batch plots

        * Fix typo and move flexiguide to debug (#77)

        * Change flexiguide output to debug level

        * Fix typo in fastp merged output file name

        * Adding id tags for d3 script enhancements

        * pointing to test branch

        * Add amplicon_name parameter to allele heatmap and line plots

        * Add function to extract quantification window regions from include_idxs

        * Scale the quantification window according to the coordinates of the sgRNA plot

        * added c2pro check, added space in args.json

        * Correct the quantification window indexes for multiple guides

        * Fix name of nucleotide conversion plot when guides are not the same

        * Fix jinja variables that aren't found

        * Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

        * Remove unneeded variable and extra whitespace

        * Switch test branch to master

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

    commit 09e5d9720ad21e44fc7916d71bde3fd7a9dfa7ef
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu Jul 18 14:31:54 2024 -0600

        Asymmetrical cut point (#457)

        * add cut_point_ind to plot_alleles_heatmap for asymmetrical plotting

        * Cole asymmetrical cut point (#453)

        * Pin versions of numpy and matplotlib in CI environment (#84) (#452)

        * Reduce duplication and implement cut_point_ind in plot_alleles_heatmap_hist

        ---------

        Co-authored-by: Cole Lyman <Cole@colelyman.com>

    commit 8d92972694ddff629dad844a6ad100459f69751d
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu Jul 18 14:29:40 2024 -0600

        Cole/update args (#85) (#456)

    commit 44f692ecabf5e2eb96ee0cfd7bae62343da7810c
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Mon Jul 15 16:17:29 2024 -0600

        Implement new pooled mixed-mode default behavior (#454)

        * changes for pooled mixed-mode default (#83)

        * changes for pooled mixed-mode default

        * deprecated old arg

        * added integration tests for mixed mode

        * fixed test target

        * updated test name

        * pinned numpy

        * Fix integration tests yml

        * pinning matplotlib

        * added print to CI tests

        * changed mixed mode info string

        * Remove pooled-mixed-mode-align-to-genome step from Github Actions

        * Update demultiplex_genome_wide parameter and help

        * Convert args.json to unix line endings

        * Add Pooled mixed mode demux run

        * Update the name of the argument in Pooled

        * Point integration tests back to master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Revert change to pooled mixed mode info statement (#86)

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit 79b482b55a0e8edbc03ec22bd2714bade1e90323
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Tue Jul 9 12:53:23 2024 -0600

        Pin versions of numpy and matplotlib in CI environment (#84) (#452)

    commit 80dc1bdd72d50f989717bfc5f8156bc3495c45f4
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu May 30 14:07:42 2024 -0600

        Add padding to image

    commit 381755daf0939aaf2745df0a802c809633aff47d
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu May 30 13:59:57 2024 -0600

        White background for schematic for dark mode

    commit d649db71e610bd8840fbb8d46fadb07789b67390
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Fri May 24 12:45:53 2024 -0600

        Fix typo and move flexiguide to debug (#77) (#438)

        * Change flexiguide output to debug level

        * Fix typo in fastp merged output file name

    commit 71181f50ef2b39015523b1a71d9fd1bf0dce14eb
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Mon May 13 13:34:00 2024 -0600

        Prefix the release Docker tag with a `v` (#434)

    commit d2c2be18a6bb64b0e742cc24c4665980a24324bc
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Mon May 13 09:41:32 2024 -0600

        Showing sgRNA sequences on hover in CRISPRessoPro (#432)

        * Passing sgRNA sequences to regular and Batch D3 plots (#73)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        * Provide sgRNA_sequences to plot_nucleotide_quilt plots

        * Passing sgRNA_sequences to plot

        * Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

        * Add max-height to Batch report samples

        * Change testing branch

        * Fix wrong check for large Batch plots

        * Update integration_tests.yml to point back at master

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Push new releases to ECR (#74)

        * Create aws_ecr.yml (#1)

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * us-east-1

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Fix d3 sgRNA sequences (#76)

        * Pass correct sgRNA_sequences to d3 plot

        * Pass correct sgRNA sequence to prime editor plot for d3

        * Resize plotly (#75)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        * Pass div id for plotly

        * Remove debug

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit 1c504274818b6b17fb60620d48fd92cb2e50566d
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu May 9 14:16:25 2024 -0600

        Fix plots and improve plot error handling (#431)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit acb2ea8e26dff4cd11f71301b344f81b1cec9040
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu May 2 13:49:33 2024 -0600

        Use recent docker image for CircleCI testing that includes updated pandas

    commit 38fd76dbd7ce2087468f9f454b548777de959a68
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Wed May 1 16:42:28 2024 -0600

        Cole/fix status file name (#69) (#430)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

    commit 3ec22e5fd09e432c9997d30e5f9ee51a2cc00d7b
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Wed May 1 13:08:11 2024 -0600

        Remove linked space in readme

    commit 340a4e16795a5e500411e11572ec267525985009
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Wed May 1 13:07:14 2024 -0600

        Fix batch mode pandas warning. (#70) (#429)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit 1bc9e906f0ded81f80761d1ec375ee50a4f882a9
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Fri Apr 26 16:26:27 2024 -0600

        Bump version to 2.3.1 and change default CRISPRessoPooled behavior to change in 2.3.2 (#428)

    commit 5638a1f6ffa973231f23422e9c757fa8cd4af7cc
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Wed Apr 24 18:00:43 2024 -0600

        Spelling fixes

    commit d6011f29db16d8fc1c1e7222457b7f9a1f671de6
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Wed Apr 24 09:33:53 2024 -0600

        Extract `jinja_partials` and fix CRISPRessoPooled fastp errors (#425)

        * Updated README (#64)

        * Updating README to fix argument, email, and formatting

        * removing superfluous files

        * Add link to CRISPRessoPro, move CRISPRessoPro section to end, and fix JSON formatting

        * Remove link to CRISPRessoPro

        * Replace Docker badge with link to tags

        * Add bullet points to Guardrails section and improve formatting

        * Fix typo and removed colons from guardrails

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Extract jinja_partials  (#65)

        * Extract jinja_partials code

        * Remove Plotly dependency from setup.py

        * Fix CRISPRessoPooled flash errors (#68)

        * Fix replacing flash intermediate files with fastp intermediate files

        This also moves where the files are added to `files_to_remove` up to
        near where they are created.

        * Update to run test branch with paired end Pooled test

        * Add pooled-paired-sim test to integration tests

        * Replace flash and trimmomatic with fastp and remove plotly from Github Actions environment

        * Change test branch back to master

        ---------

        Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>

    commit f4858a30c43374f54058b3ad9c1e965e1ab7fb46
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Tue Apr 23 17:00:28 2024 -0600

        Updated README (#64) (#424)

        * Updating README to fix argument, email, and formatting

        * removing superfluous files

        * Add link to CRISPRessoPro, move CRISPRessoPro section to end, and fix JSON formatting

        * Remove link to CRISPRessoPro

        * Replace Docker badge with link to tags

        * Add bullet points to Guardrails section and improve formatting

        * Fix typo and removed colons from guardrails

        ---------

        Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>

    commit c3dbff0fccd44b0b1a9c246dd2aa629ddc515787
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Mon Apr 22 11:24:59 2024 -0600

        Update CRISPRessoPooledCORE.py (#423)

        Fix bug in error reporting if duplicate names are present

    commit 20903c14877e5166b1b8a7b50b8fcab450ea3ca6
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu Apr 18 16:55:39 2024 -0600

        Remove extra imports from CRISPRessoCore (#67) (#422)

    commit 4aae57e5be475cd717792265bee36a71a99425de
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu Apr 18 10:00:19 2024 -0600

        Cole/refactor jinja undefined (#66) (#421)

        * Replace Jinja2 PackageLoader with FileSystemLoader

        The PackageLoader doesn't work with a fairly recent version of Jinja2 (3.0.1)
        and Python 3.9. Replacing with FileSystemLoader work with the older version and
        the latest version.

        * Fix undefined variable `amplicon_name` in report template

        * Refactor logging Jinja2 undefined variable warnings

        * Revert plot_11a update

        * Update intedration test branch

        * Update jinja to warn on undefined but not fail. Fix all undefined warnings

        * Fix github integration tests ref

        * One more undefined variable

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>

    commit 768c3c05bf1786a2a32e135b6e145cd6503c3db1
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Tue Apr 9 17:30:10 2024 -0600

        Fix Jinja2 undefined variables (#63) (#417)

        * Replace Jinja2 PackageLoader with FileSystemLoader

        The PackageLoader doesn't work with a fairly recent version of Jinja2 (3.0.1)
        and Python 3.9. Replacing with FileSystemLoader work with the older version and
        the latest version.

        * Fix undefined variable `amplicon_name` in report template

        * Revert plot_11a update

        * Update intedration test branch

        * Update branch for integration tests

    commit 7e18f08cc1ac5f247a0fd1bbb394ccd9b0a07c2e
    Author: Han Dai <github@daihan.me>
    Date:   Fri Apr 5 18:36:41 2024 -0400

        fix: change all U+00A0 to U+0020 (#400)

    commit 235dc29c0cd0fcca2e999148d4660acf00b07221
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Fri Apr 5 16:36:16 2024 -0600

        Fastp, args as data, guardrails, and PE fix (#415)

        * Change CRISPResso_status.txt format to JSON (#46)

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * add json read for status file

        * changed Formatter to json format

        * fixed json access variable name: message

        * changed  perentage_complete to numeric

        * changed status file to .json

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * New makefile commands

        * changed file to .json

        * changed status to json file

        * Make JSON human readable by adding new lines

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * Introduce pandas sorting in CRISPRessoCompare (#47)

        * New makefile commands

        * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

        * Extract out split_interleaved_fastq function to CRISPRessoShared

        * Implement splitting interleaved fastq files in CRISPRessoPooled

        * Suppress split_interleaved_input from CRISPRessoWGS parameters

        * Suppress other parameters in CRISPRessoWGS

        * Move where interleaved fastq files are split to be trimmed properly

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * On push no branches

        * On push no branches

        * All in one file

        * Fix yml errors

        * Rename jobs

        * Remove old workflow files

        * Remove paths

        * Run jobs in parallel

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Move read filtering to after merging in CRISPResso (#39)

        * Move read filtering to after merging

        This is in an effort to be consistent with the behavior and results of
        CRISPRessoPooled.

        * Properly assign the correct file names for read filtering

        * Add space around operators

        * GitHub actions on pr (#51)

        * Run integration tests on pull_request

        * Run pytest on pull_request

        * Run pylint on pull_request

        * Run tests on PR only when opening PR (#53)

        * Update reports (#52)

        * Update report changes

        * Switch branch of integration test repo

        * Remove extraneous `crispresso_data_path`

        * Point integration tests back to master

        * point to test branch

        * pointed CI config to testing branch

        * Update integration_tests.yml

        point to master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>
        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>

        * Trevor/fastp integration (#50)

        * Update check_program to check versions and create check_fastq function

        * Update fastq arg, implement fastp in get_most_frequent_reads

        * Bump version to 2.3.0

        * Deprecate Flash and Trimmomatic parameters, and update fastp params

        * Update guess_amplicons and guess_guides to remove max_paired_end_reads_overlap

        * Implement trimming of single end reads

        * Merge (and trim) reads in CRISPRessoCORE with fastp

        * Modify error handling to account for fastp errors

        * Replace flash and trimmomatic with fastp in Docker dependencies

        * Update LICENSE.txt with fastp info

        * Remove min and max amplicon length (no longer needed)

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Implement trimming with fastp in CRISPRessoPooled

        * Implemend merging (and trimming) with fastp in CRISPRessoPooled

        * Fixed minor fastp errors

        * Move read filtering to after merging in CRISPResso (#39)

        * Move read filtering to after merging

        This is in an effort to be consistent with the behavior and results of
        CRISPRessoPooled.

        * Properly assign the correct file names for read filtering

        * Add space around operators

        * GitHub actions on pr (#51)

        * Run integration tests on pull_request

        * Run pytest on pull_request

        * Run pylint on pull_request

        * Run tests on PR only when opening PR (#53)

        * Update reports (#52)

        * Update report changes

        * Switch branch of integration test repo

        * Remove extraneous `crispresso_data_path`

        * Point integration tests back to master

        * Update where the test point to

        * Fix 'Prime-edited' key not found (#32)

        * Move 'Prime-edited' amplicon name check

        By moving this, it will check if there is an amplicon named
        'Prime-edited' (which is a reserved name) even if the
        `prime_editing_pegRNA_extension_seq` parameter is empty.

        * Only search for scaffold integration when pegRNA extension seq is provided

        * Remove spaces at the end of lines

        * Docker size (#49)

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * Introduce pandas sorting in CRISPRessoCompare (#47)

        * New makefile commands

        * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

        * Extract out split_interleaved_fastq function to CRISPRessoShared

        * Implement splitting interleaved fastq files in CRISPRessoPooled

        * Suppress split_interleaved_input from CRISPRessoWGS parameters

        * Suppress other parameters in CRISPRessoWGS

        * Move where interleaved fastq files are split to be trimmed properly

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * On push no branches

        * On push no branches

        * All in one file

        * Fix yml errors

        * Rename jobs

        * Remove old workflow files

        * Remove paths

        * Run jobs in parallel

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * 3.4->2.08

        * Put ttf-mscorefonts-installer back above apt-get clean

        * restore slash, replace fastp with trimmomatic and flash, add autoremove step

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * initial readme modifications

        * Updated readme to remove deprecated commands, updated help text to reflect new version and fastp

        * Pointing test branch back at master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>

        * Guardrails clean history (#34)

        * Include guardrail functions

        * Add CRISPRessoReports subtree

        * Refactor to use CRISPRessoReports module

        * Include guardrail functions

        * Functional guardrails, needs reports update

        * Add guardrail partial

        * fix guardrials partial

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * Introduce pandas sorting in CRISPRessoCompare (#47)

        * New makefile commands

        * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

        * Extract out split_interleaved_fastq function to CRISPRessoShared

        * Implement splitting interleaved fastq files in CRISPRessoPooled

        * Suppress split_interleaved_input from CRISPRessoWGS parameters

        * Suppress other parameters in CRISPRessoWGS

        * Move where interleaved fastq files are split to be trimmed properly

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * On push no branches

        * On push no branches

        * All in one file

        * Fix yml errors

        * Rename jobs

        * Remove old workflow files

        * Remove paths

        * Run jobs in parallel

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Update C cythonized files

        * Add exact numbers to guardrails printouts

        * Remove extraneous whitespace from CRISPRessoCOREResources.pyx

        * Fix calculation of `total_mods` from being negative

        The issue was that `all_deletion_coordinates` just tells you how many deletions
        were present, but not how long the deletion is.

        * Changes to message

        * Remove old tag

        * Point tests at guardrails

        * Restore C2 pro check

        * Save message with guardrail name

        * Point tests repo at master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

        * Fix case sensitivity in Prime Editing mode (#54)

        * Move read filtering to after merging in CRISPResso (#39)

        * Move read filtering to after merging

        This is in an effort to be consistent with the behavior and results of
        CRISPRessoPooled.

        * Properly assign the correct file names for read filtering

        * Add space around operators

        * GitHub actions on pr (#51)

        * Run integration tests on pull_request

        * Run pytest on pull_request

        * Run pylint on pull_request

        * Run tests on PR only when opening PR (#53)

        * Update reports (#52)

        * Update report changes

        * Switch branch of integration test repo

        * Remove extraneous `crispresso_data_path`

        * Point integration tests back to master

        * Make all amplicons in amplicon_seq_arr uppercase

        This fixes https://github.com/pinellolab/CRISPResso2/issues/396

        * Allow RNA values to be provided for prime_editing_pegRNA_scaffold_seq

        * Fix 'Prime-edited' key not found (#32)

        * Move 'Prime-edited' amplicon name check

        By moving this, it will check if there is an amplicon named
        'Prime-edited' (which is a reserved name) even if the
        `prime_editing_pegRNA_extension_seq` parameter is empty.

        * Only search for scaffold integration when pegRNA extension seq is provided

        * Remove spaces at the end of lines

        * Docker size (#49)

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * Introduce pandas sorting in CRISPRessoCompare (#47)

        * New makefile commands

        * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

        * Extract out split_interleaved_fastq function to CRISPRessoShared

        * Implement splitting interleaved fastq files in CRISPRessoPooled

        * Suppress split_interleaved_input from CRISPRessoWGS parameters

        * Suppress other parameters in CRISPRessoWGS

        * Move where interleaved fastq files are split to be trimmed properly

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * On push no branches

        * On push no branches

        * All in one file

        * Fix yml errors

        * Rename jobs

        * Remove old workflow files

        * Remove paths

        * Run jobs in parallel

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * 3.4->2.08

        * Put ttf-mscorefonts-installer back above apt-get clean

        * restore slash, replace fastp with trimmomatic and flash, add autoremove step

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Guardrails clean history (#34)

        * Include guardrail functions

        * Add CRISPRessoReports subtree

        * Refactor to use CRISPRessoReports module

        * Include guardrail functions

        * Functional guardrails, needs reports update

        * Add guardrail partial

        * fix guardrials partial

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * Introduce pandas sorting in CRISPRessoCompare (#47)

        * New makefile commands

        * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

        * Extract out split_interleaved_fastq function to CRISPRessoShared

        * Implement splitting interleaved fastq files in CRISPRessoPooled

        * Suppress split_interleaved_input from CRISPRessoWGS parameters

        * Suppress other parameters in CRISPRessoWGS

        * Move where interleaved fastq files are split to be trimmed properly

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * On push no branches

        * On push no branches

        * All in one file

        * Fix yml errors

        * Rename jobs

        * Remove old workflow files

        * Remove paths

        * Run jobs in parallel

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Update C cythonized files

        * Add exact numbers to guardrails printouts

        * Remove extraneous whitespace from CRISPRessoCOREResources.pyx

        * Fix calculation of `total_mods` from being negative

        The issue was that `all_deletion_coordinates` just tells you how many deletions
        were present, but not how long the deletion is.

        * Changes to message

        * Remove old tag

        * Point tests at guardrails

        * Restore C2 pro check

        * Save message with guardrail name

        * Point tests repo at master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-…
mbowcut2 added a commit that referenced this issue Aug 19, 2024
commit 354f962d4201ae4784108b0a89467796af6ec326
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Mon Aug 19 16:49:32 2024 -0600

    specify encoding

commit 823250b8108ede27b29d59d1239c74c93d74395f
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Mon Aug 19 16:46:17 2024 -0600

    changed _ROOT to _root

commit fb8cfa7614598f23e578b55b92ca9b685eef3156
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Mon Aug 19 16:44:32 2024 -0600

    added docstrings

commit e0777a805e4da9729118b429fb44687ecc15d590
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Mon Aug 19 16:32:34 2024 -0600

    fixes for linting

commit 3ade759b85992484ae59aa7db3f10fe8e6bbdc50
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Mon Aug 19 16:04:43 2024 -0600

    Squashed commit of the following:

    commit 42be4bb0bcd58cdf9c902a448a7e5bde888fb1d3
    Author: mbowcut2 <mbowcut@gmail.com>
    Date:   Mon Aug 19 14:36:55 2024 -0600

        added pro check for plotly import in batchReport

    commit 31873d15fff9b9b12c1e8d8c62efa686c12cbd37
    Author: mbowcut2 <mbowcut@gmail.com>
    Date:   Fri Aug 16 15:05:58 2024 -0600

        pointed integration tests at test branch

    commit d449600561074c8721322e94a8375b3ad742a88c
    Author: mbowcut2 <mbowcut@gmail.com>
    Date:   Fri Aug 16 14:24:03 2024 -0600

        Squashed commit of the following:

        commit 6ec98a05ee70f85b5aa0ac15ab6094b7f1f20d08
        Author: mbowcut2 <mbowcut@gmail.com>
        Date:   Tue Aug 13 16:44:39 2024 -0600

            dict key changes

        commit 7cfd5acf06da4eb6f49453144ee1fed1e1488a7a
        Author: mbowcut2 <mbowcut@gmail.com>
        Date:   Thu Aug 8 15:30:31 2024 -0600

            added C2PRO install check back

        commit bfb0003329ea61b5c79c7e1df8d9a73ec5a508db
        Author: mbowcut2 <mbowcut@gmail.com>
        Date:   Fri Aug 2 13:08:12 2024 -0600

            fixed key error conditionals

        commit 84444e7480605206cb3efa4a0db675c55e717304
        Author: mbowcut2 <mbowcut@gmail.com>
        Date:   Fri Aug 2 09:22:44 2024 -0600

            use local jinja_paritals file

        commit 71dd12786fec6c4aba0170a3bfd9022b06f5eede
        Author: mbowcut2 <mbowcut@gmail.com>
        Date:   Wed Jul 31 14:10:29 2024 -0600

            Squashed commit of the following:

            commit 5e3b30515c4bc437127e7fb21f53cb0bd511c4ca
            Author: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
            Date:   Mon Jul 22 09:31:44 2024 -0600

                D3-Enhancements (#78)

                * Sam/try plots (#71)

                * Fix batch mode pandas warning. (#70)

                * refactor to call method on DataFrame, rather than Series.
                Removes warning.

                * Fix pandas future warning in CRISPRessoWGS

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Functional

                * Cole/fix status file name (#69)

                * Update config file logging messages

                This removes printing the exception (which is essentially a duplicate),
                and adds a condition if no config file was provided. Also changes `json`
                to `config` so that it is more clear.

                * Fix divide by zero when no amplicons are present in Batch mode

                * Don't append file_prefix to status file name

                * Place status files in output directories

                * Update tests branch for file_prefix addition

                * Load D3 and plotly figures with pro with multiple amplicons

                * Update batch

                * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

                Before this fix, when using a file_prefix the second run that was compared
                would not be displayed as a data in the first figure of the report.

                * Import CRISPRessoPro instead of importing the version

                When installed via conda, the version is not available

                * Remove `get_amplicon_output` unused function from CRISPRessoCompare

                Also remove unused argparse import

                * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

                * Allow for matching of multiple guides in the same amplicon

                * Fix pandas FutureWarning

                * Change test branch back to master

                ---------

                Co-authored-by: Sam <snic9004@gmail.com>

                * Try catch all futures

                * Fix test fail plots

                * Point test to try-plots

                * Fix d3 not showing and plotly mixing with matplotlib

                * Use logger for warnings and debug statements

                * Point tests back at master

                ---------

                Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Sam/fix plots (#72)

                * Fix batch mode pandas warning. (#70)

                * refactor to call method on DataFrame, rather than Series.
                Removes warning.

                * Fix pandas future warning in CRISPRessoWGS

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Functional

                * Cole/fix status file name (#69)

                * Update config file logging messages

                This removes printing the exception (which is essentially a duplicate),
                and adds a condition if no config file was provided. Also changes `json`
                to `config` so that it is more clear.

                * Fix divide by zero when no amplicons are present in Batch mode

                * Don't append file_prefix to status file name

                * Place status files in output directories

                * Update tests branch for file_prefix addition

                * Load D3 and plotly figures with pro with multiple amplicons

                * Update batch

                * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

                Before this fix, when using a file_prefix the second run that was compared
                would not be displayed as a data in the first figure of the report.

                * Import CRISPRessoPro instead of importing the version

                When installed via conda, the version is not available

                * Remove `get_amplicon_output` unused function from CRISPRessoCompare

                Also remove unused argparse import

                * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

                * Allow for matching of multiple guides in the same amplicon

                * Fix pandas FutureWarning

                * Change test branch back to master

                ---------

                Co-authored-by: Sam <snic9004@gmail.com>

                * Try catch all futures

                * Fix test fail plots

                * Fix d3 not showing and plotly mixing with matplotlib

                ---------

                Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Remove token from integration tests file

                * Provide sgRNA_sequences to plot_nucleotide_quilt plots

                * Passing sgRNA_sequences to plot

                * Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

                * Add max-height to Batch report samples

                * Change testing branch

                * Fix wrong check for large Batch plots

                * Fix typo and move flexiguide to debug (#77)

                * Change flexiguide output to debug level

                * Fix typo in fastp merged output file name

                * Adding id tags for d3 script enhancements

                * pointing to test branch

                * Add amplicon_name parameter to allele heatmap and line plots

                * Add function to extract quantification window regions from include_idxs

                * Scale the quantification window according to the coordinates of the sgRNA plot

                * added c2pro check, added space in args.json

                * Correct the quantification window indexes for multiple guides

                * Fix name of nucleotide conversion plot when guides are not the same

                * Fix jinja variables that aren't found

                * Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

                * Remove unneeded variable and extra whitespace

                * Switch test branch to master

                ---------

                Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
                Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
                Co-authored-by: Cole Lyman <cole@colelyman.com>

            commit 09e5d9720ad21e44fc7916d71bde3fd7a9dfa7ef
            Author: Kendell Clement <k.clement.dev@gmail.com>
            Date:   Thu Jul 18 14:31:54 2024 -0600

                Asymmetrical cut point (#457)

                * add cut_point_ind to plot_alleles_heatmap for asymmetrical plotting

                * Cole asymmetrical cut point (#453)

                * Pin versions of numpy and matplotlib in CI environment (#84) (#452)

                * Reduce duplication and implement cut_point_ind in plot_alleles_heatmap_hist

                ---------

                Co-authored-by: Cole Lyman <Cole@colelyman.com>

            commit 8d92972694ddff629dad844a6ad100459f69751d
            Author: Cole Lyman <Cole@colelyman.com>
            Date:   Thu Jul 18 14:29:40 2024 -0600

                Cole/update args (#85) (#456)

            commit 44f692ecabf5e2eb96ee0cfd7bae62343da7810c
            Author: Cole Lyman <Cole@colelyman.com>
            Date:   Mon Jul 15 16:17:29 2024 -0600

                Implement new pooled mixed-mode default behavior (#454)

                * changes for pooled mixed-mode default (#83)

                * changes for pooled mixed-mode default

                * deprecated old arg

                * added integration tests for mixed mode

                * fixed test target

                * updated test name

                * pinned numpy

                * Fix integration tests yml

                * pinning matplotlib

                * added print to CI tests

                * changed mixed mode info string

                * Remove pooled-mixed-mode-align-to-genome step from Github Actions

                * Update demultiplex_genome_wide parameter and help

                * Convert args.json to unix line endings

                * Add Pooled mixed mode demux run

                * Update the name of the argument in Pooled

                * Point integration tests back to master

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Revert change to pooled mixed mode info statement (#86)

                ---------

                Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

            commit 79b482b55a0e8edbc03ec22bd2714bade1e90323
            Author: Cole Lyman <Cole@colelyman.com>
            Date:   Tue Jul 9 12:53:23 2024 -0600

                Pin versions of numpy and matplotlib in CI environment (#84) (#452)

            commit 80dc1bdd72d50f989717bfc5f8156bc3495c45f4
            Author: Kendell Clement <k.clement.dev@gmail.com>
            Date:   Thu May 30 14:07:42 2024 -0600

                Add padding to image

            commit 381755daf0939aaf2745df0a802c809633aff47d
            Author: Kendell Clement <k.clement.dev@gmail.com>
            Date:   Thu May 30 13:59:57 2024 -0600

                White background for schematic for dark mode

            commit d649db71e610bd8840fbb8d46fadb07789b67390
            Author: Cole Lyman <Cole@colelyman.com>
            Date:   Fri May 24 12:45:53 2024 -0600

                Fix typo and move flexiguide to debug (#77) (#438)

                * Change flexiguide output to debug level

                * Fix typo in fastp merged output file name

            commit 71181f50ef2b39015523b1a71d9fd1bf0dce14eb
            Author: Cole Lyman <Cole@colelyman.com>
            Date:   Mon May 13 13:34:00 2024 -0600

                Prefix the release Docker tag with a `v` (#434)

            commit d2c2be18a6bb64b0e742cc24c4665980a24324bc
            Author: Cole Lyman <Cole@colelyman.com>
            Date:   Mon May 13 09:41:32 2024 -0600

                Showing sgRNA sequences on hover in CRISPRessoPro (#432)

                * Passing sgRNA sequences to regular and Batch D3 plots (#73)

                * Sam/try plots (#71)

                * Fix batch mode pandas warning. (#70)

                * refactor to call method on DataFrame, rather than Series.
                Removes warning.

                * Fix pandas future warning in CRISPRessoWGS

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Functional

                * Cole/fix status file name (#69)

                * Update config file logging messages

                This removes printing the exception (which is essentially a duplicate),
                and adds a condition if no config file was provided. Also changes `json`
                to `config` so that it is more clear.

                * Fix divide by zero when no amplicons are present in Batch mode

                * Don't append file_prefix to status file name

                * Place status files in output directories

                * Update tests branch for file_prefix addition

                * Load D3 and plotly figures with pro with multiple amplicons

                * Update batch

                * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

                Before this fix, when using a file_prefix the second run that was compared
                would not be displayed as a data in the first figure of the report.

                * Import CRISPRessoPro instead of importing the version

                When installed via conda, the version is not available

                * Remove `get_amplicon_output` unused function from CRISPRessoCompare

                Also remove unused argparse import

                * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

                * Allow for matching of multiple guides in the same amplicon

                * Fix pandas FutureWarning

                * Change test branch back to master

                ---------

                Co-authored-by: Sam <snic9004@gmail.com>

                * Try catch all futures

                * Fix test fail plots

                * Point test to try-plots

                * Fix d3 not showing and plotly mixing with matplotlib

                * Use logger for warnings and debug statements

                * Point tests back at master

                ---------

                Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Sam/fix plots (#72)

                * Fix batch mode pandas warning. (#70)

                * refactor to call method on DataFrame, rather than Series.
                Removes warning.

                * Fix pandas future warning in CRISPRessoWGS

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Functional

                * Cole/fix status file name (#69)

                * Update config file logging messages

                This removes printing the exception (which is essentially a duplicate),
                and adds a condition if no config file was provided. Also changes `json`
                to `config` so that it is more clear.

                * Fix divide by zero when no amplicons are present in Batch mode

                * Don't append file_prefix to status file name

                * Place status files in output directories

                * Update tests branch for file_prefix addition

                * Load D3 and plotly figures with pro with multiple amplicons

                * Update batch

                * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

                Before this fix, when using a file_prefix the second run that was compared
                would not be displayed as a data in the first figure of the report.

                * Import CRISPRessoPro instead of importing the version

                When installed via conda, the version is not available

                * Remove `get_amplicon_output` unused function from CRISPRessoCompare

                Also remove unused argparse import

                * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

                * Allow for matching of multiple guides in the same amplicon

                * Fix pandas FutureWarning

                * Change test branch back to master

                ---------

                Co-authored-by: Sam <snic9004@gmail.com>

                * Try catch all futures

                * Fix test fail plots

                * Fix d3 not showing and plotly mixing with matplotlib

                ---------

                Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Remove token from integration tests file

                * Provide sgRNA_sequences to plot_nucleotide_quilt plots

                * Passing sgRNA_sequences to plot

                * Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

                * Add max-height to Batch report samples

                * Change testing branch

                * Fix wrong check for large Batch plots

                * Update integration_tests.yml to point back at master

                ---------

                Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
                Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Push new releases to ECR (#74)

                * Create aws_ecr.yml (#1)

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * us-east-1

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Fix d3 sgRNA sequences (#76)

                * Pass correct sgRNA_sequences to d3 plot

                * Pass correct sgRNA sequence to prime editor plot for d3

                * Resize plotly (#75)

                * Sam/try plots (#71)

                * Fix batch mode pandas warning. (#70)

                * refactor to call method on DataFrame, rather than Series.
                Removes warning.

                * Fix pandas future warning in CRISPRessoWGS

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Functional

                * Cole/fix status file name (#69)

                * Update config file logging messages

                This removes printing the exception (which is essentially a duplicate),
                and adds a condition if no config file was provided. Also changes `json`
                to `config` so that it is more clear.

                * Fix divide by zero when no amplicons are present in Batch mode

                * Don't append file_prefix to status file name

                * Place status files in output directories

                * Update tests branch for file_prefix addition

                * Load D3 and plotly figures with pro with multiple amplicons

                * Update batch

                * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

                Before this fix, when using a file_prefix the second run that was compared
                would not be displayed as a data in the first figure of the report.

                * Import CRISPRessoPro instead of importing the version

                When installed via conda, the version is not available

                * Remove `get_amplicon_output` unused function from CRISPRessoCompare

                Also remove unused argparse import

                * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

                * Allow for matching of multiple guides in the same amplicon

                * Fix pandas FutureWarning

                * Change test branch back to master

                ---------

                Co-authored-by: Sam <snic9004@gmail.com>

                * Try catch all futures

                * Fix test fail plots

                * Point test to try-plots

                * Fix d3 not showing and plotly mixing with matplotlib

                * Use logger for warnings and debug statements

                * Point tests back at master

                ---------

                Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Sam/fix plots (#72)

                * Fix batch mode pandas warning. (#70)

                * refactor to call method on DataFrame, rather than Series.
                Removes warning.

                * Fix pandas future warning in CRISPRessoWGS

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Functional

                * Cole/fix status file name (#69)

                * Update config file logging messages

                This removes printing the exception (which is essentially a duplicate),
                and adds a condition if no config file was provided. Also changes `json`
                to `config` so that it is more clear.

                * Fix divide by zero when no amplicons are present in Batch mode

                * Don't append file_prefix to status file name

                * Place status files in output directories

                * Update tests branch for file_prefix addition

                * Load D3 and plotly figures with pro with multiple amplicons

                * Update batch

                * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

                Before this fix, when using a file_prefix the second run that was compared
                would not be displayed as a data in the first figure of the report.

                * Import CRISPRessoPro instead of importing the version

                When installed via conda, the version is not available

                * Remove `get_amplicon_output` unused function from CRISPRessoCompare

                Also remove unused argparse import

                * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

                * Allow for matching of multiple guides in the same amplicon

                * Fix pandas FutureWarning

                * Change test branch back to master

                ---------

                Co-authored-by: Sam <snic9004@gmail.com>

                * Try catch all futures

                * Fix test fail plots

                * Fix d3 not showing and plotly mixing with matplotlib

                ---------

                Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Remove token from integration tests file

                * Pass div id for plotly

                * Remove debug

                ---------

                Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
                Co-authored-by: Cole Lyman <cole@colelyman.com>

                ---------

                Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
                Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
                Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

            commit 1c504274818b6b17fb60620d48fd92cb2e50566d
            Author: Cole Lyman <Cole@colelyman.com>
            Date:   Thu May 9 14:16:25 2024 -0600

                Fix plots and improve plot error handling (#431)

                * Sam/try plots (#71)

                * Fix batch mode pandas warning. (#70)

                * refactor to call method on DataFrame, rather than Series.
                Removes warning.

                * Fix pandas future warning in CRISPRessoWGS

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Functional

                * Cole/fix status file name (#69)

                * Update config file logging messages

                This removes printing the exception (which is essentially a duplicate),
                and adds a condition if no config file was provided. Also changes `json`
                to `config` so that it is more clear.

                * Fix divide by zero when no amplicons are present in Batch mode

                * Don't append file_prefix to status file name

                * Place status files in output directories

                * Update tests branch for file_prefix addition

                * Load D3 and plotly figures with pro with multiple amplicons

                * Update batch

                * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

                Before this fix, when using a file_prefix the second run that was compared
                would not be displayed as a data in the first figure of the report.

                * Import CRISPRessoPro instead of importing the version

                When installed via conda, the version is not available

                * Remove `get_amplicon_output` unused function from CRISPRessoCompare

                Also remove unused argparse import

                * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

                * Allow for matching of multiple guides in the same amplicon

                * Fix pandas FutureWarning

                * Change test branch back to master

                ---------

                Co-authored-by: Sam <snic9004@gmail.com>

                * Try catch all futures

                * Fix test fail plots

                * Point test to try-plots

                * Fix d3 not showing and plotly mixing with matplotlib

                * Use logger for warnings and debug statements

                * Point tests back at master

                ---------

                Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Sam/fix plots (#72)

                * Fix batch mode pandas warning. (#70)

                * refactor to call method on DataFrame, rather than Series.
                Removes warning.

                * Fix pandas future warning in CRISPRessoWGS

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Functional

                * Cole/fix status file name (#69)

                * Update config file logging messages

                This removes printing the exception (which is essentially a duplicate),
                and adds a condition if no config file was provided. Also changes `json`
                to `config` so that it is more clear.

                * Fix divide by zero when no amplicons are present in Batch mode

                * Don't append file_prefix to status file name

                * Place status files in output directories

                * Update tests branch for file_prefix addition

                * Load D3 and plotly figures with pro with multiple amplicons

                * Update batch

                * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

                Before this fix, when using a file_prefix the second run that was compared
                would not be displayed as a data in the first figure of the report.

                * Import CRISPRessoPro instead of importing the version

                When installed via conda, the version is not available

                * Remove `get_amplicon_output` unused function from CRISPRessoCompare

                Also remove unused argparse import

                * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

                * Allow for matching of multiple guides in the same amplicon

                * Fix pandas FutureWarning

                * Change test branch back to master

                ---------

                Co-authored-by: Sam <snic9004@gmail.com>

                * Try catch all futures

                * Fix test fail plots

                * Fix d3 not showing and plotly mixing with matplotlib

                ---------

                Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Remove token from integration tests file

                ---------

                Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
                Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

            commit acb2ea8e26dff4cd11f71301b344f81b1cec9040
            Author: Kendell Clement <k.clement.dev@gmail.com>
            Date:   Thu May 2 13:49:33 2024 -0600

                Use recent docker image for CircleCI testing that includes updated pandas

            commit 38fd76dbd7ce2087468f9f454b548777de959a68
            Author: Cole Lyman <Cole@colelyman.com>
            Date:   Wed May 1 16:42:28 2024 -0600

                Cole/fix status file name (#69) (#430)

                * Update config file logging messages

                This removes printing the exception (which is essentially a duplicate),
                and adds a condition if no config file was provided. Also changes `json`
                to `config` so that it is more clear.

                * Fix divide by zero when no amplicons are present in Batch mode

                * Don't append file_prefix to status file name

                * Place status files in output directories

                * Update tests branch for file_prefix addition

                * Load D3 and plotly figures with pro with multiple amplicons

                * Update batch

                * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

                Before this fix, when using a file_prefix the second run that was compared
                would not be displayed as a data in the first figure of the report.

                * Import CRISPRessoPro instead of importing the version

                When installed via conda, the version is not available

                * Remove `get_amplicon_output` unused function from CRISPRessoCompare

                Also remove unused argparse import

                * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

                * Allow for matching of multiple guides in the same amplicon

                * Fix pandas FutureWarning

                * Change test branch back to master

                ---------

                Co-authored-by: Sam <snic9004@gmail.com>

            commit 3ec22e5fd09e432c9997d30e5f9ee51a2cc00d7b
            Author: Kendell Clement <k.clement.dev@gmail.com>
            Date:   Wed May 1 13:08:11 2024 -0600

                Remove linked space in readme

            commit 340a4e16795a5e500411e11572ec267525985009
            Author: Cole Lyman <Cole@colelyman.com>
            Date:   Wed May 1 13:07:14 2024 -0600

                Fix batch mode pandas warning. (#70) (#429)

                * refactor to call method on DataFrame, rather than Series.
                Removes warning.

                * Fix pandas future warning in CRISPRessoWGS

                ---------

                Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

            commit 1bc9e906f0ded81f80761d1ec375ee50a4f882a9
            Author: Cole Lyman <Cole@colelyman.com>
            Date:   Fri Apr 26 16:26:27 2024 -0600

                Bump version to 2.3.1 and change default CRISPRessoPooled behavior to change in 2.3.2 (#428)

            commit 5638a1f6ffa973231f23422e9c757fa8cd4af7cc
            Author: Kendell Clement <k.clement.dev@gmail.com>
            Date:   Wed Apr 24 18:00:43 2024 -0600

                Spelling fixes

            commit d6011f29db16d8fc1c1e7222457b7f9a1f671de6
            Author: Cole Lyman <Cole@colelyman.com>
            Date:   Wed Apr 24 09:33:53 2024 -0600

                Extract `jinja_partials` and fix CRISPRessoPooled fastp errors (#425)

                * Updated README (#64)

                * Updating README to fix argument, email, and formatting

                * removing superfluous files

                * Add link to CRISPRessoPro, move CRISPRessoPro section to end, and fix JSON formatting

                * Remove link to CRISPRessoPro

                * Replace Docker badge with link to tags

                * Add bullet points to Guardrails section and improve formatting

                * Fix typo and removed colons from guardrails

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Extract jinja_partials  (#65)

                * Extract jinja_partials code

                * Remove Plotly dependency from setup.py

                * Fix CRISPRessoPooled flash errors (#68)

                * Fix replacing flash intermediate files with fastp intermediate files

                This also moves where the files are added to `files_to_remove` up to
                near where they are created.

                * Update to run test branch with paired end Pooled test

                * Add pooled-paired-sim test to integration tests

                * Replace flash and trimmomatic with fastp and remove plotly from Github Actions environment

                * Change test branch back to master

                ---------

                Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>

            commit f4858a30c43374f54058b3ad9c1e965e1ab7fb46
            Author: Cole Lyman <Cole@colelyman.com>
            Date:   Tue Apr 23 17:00:28 2024 -0600

                Updated README (#64) (#424)

                * Updating README to fix argument, email, and formatting

                * removing superfluous files

                * Add link to CRISPRessoPro, move CRISPRessoPro section to end, and fix JSON formatting

                * Remove link to CRISPRessoPro

                * Replace Docker badge with link to tags

                * Add bullet points to Guardrails section and improve formatting

                * Fix typo and removed colons from guardrails

                ---------

                Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>

            commit c3dbff0fccd44b0b1a9c246dd2aa629ddc515787
            Author: Kendell Clement <k.clement.dev@gmail.com>
            Date:   Mon Apr 22 11:24:59 2024 -0600

                Update CRISPRessoPooledCORE.py (#423)

                Fix bug in error reporting if duplicate names are present

            commit 20903c14877e5166b1b8a7b50b8fcab450ea3ca6
            Author: Cole Lyman <Cole@colelyman.com>
            Date:   Thu Apr 18 16:55:39 2024 -0600

                Remove extra imports from CRISPRessoCore (#67) (#422)

            commit 4aae57e5be475cd717792265bee36a71a99425de
            Author: Cole Lyman <Cole@colelyman.com>
            Date:   Thu Apr 18 10:00:19 2024 -0600

                Cole/refactor jinja undefined (#66) (#421)

                * Replace Jinja2 PackageLoader with FileSystemLoader

                The PackageLoader doesn't work with a fairly recent version of Jinja2 (3.0.1)
                and Python 3.9. Replacing with FileSystemLoader work with the older version and
                the latest version.

                * Fix undefined variable `amplicon_name` in report template

                * Refactor logging Jinja2 undefined variable warnings

                * Revert plot_11a update

                * Update intedration test branch

                * Update jinja to warn on undefined but not fail. Fix all undefined warnings

                * Fix github integration tests ref

                * One more undefined variable

                ---------

                Co-authored-by: Samuel Nichols <Snic9004@gmail.com>

            commit 768c3c05bf1786a2a32e135b6e145cd6503c3db1
            Author: Cole Lyman <Cole@colelyman.com>
            Date:   Tue Apr 9 17:30:10 2024 -0600

                Fix Jinja2 undefined variables (#63) (#417)

                * Replace Jinja2 PackageLoader with FileSystemLoader

                The PackageLoader doesn't work with a fairly recent version of Jinja2 (3.0.1)
                and Python 3.9. Replacing with FileSystemLoader work with the older version and
                the latest version.

                * Fix undefined variable `amplicon_name` in report template

                * Revert plot_11a update

                * Update intedration test branch

                * Update branch for integration tests

            commit 7e18f08cc1ac5f247a0fd1bbb394ccd9b0a07c2e
            Author: Han Dai <github@daihan.me>
            Date:   Fri Apr 5 18:36:41 2024 -0400

                fix: change all U+00A0 to U+0020 (#400)

            commit 235dc29c0cd0fcca2e999148d4660acf00b07221
            Author: Cole Lyman <Cole@colelyman.com>
            Date:   Fri Apr 5 16:36:16 2024 -0600

                Fastp, args as data, guardrails, and PE fix (#415)

                * Change CRISPResso_status.txt format to JSON (#46)

                * Bug Fix - 367 (#35)

                * - Fixed references to ref_names_for_pe

                * removed extra tabs

                * trying to match empty line, no tabs

                * - changed references to ref_names[0]

                * Mckay/pd warnings (#45)

                * refactor errors='ignore' to try except

                * refactored integer slice to iloc[]

                * moved to_numeric try except to function

                * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

                This change is slightly cleaner because it addresses the root issue that some
                columns are strings (and can therefore not be converted to numeric types). Now
                if an error does occur when converting the dfs to numeric types it won't be
                swallowed up.

                * Add documentation to to_numeric_ignore_columns

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * add json read for status file

                * changed Formatter to json format

                * fixed json access variable name: message

                * changed  perentage_complete to numeric

                * changed status file to .json

                * Create integration_tests.yml

                * Simplify name

                * CRISPRESSO2_DIR environment variable

                * Up one dir

                * ls workspace

                * Install CRISPResso and ydiff

                * Clone repo instead of checkout

                * submodule

                * ls

                * CRISPResso2_copy

                * ls

                * Update env

                * Simplify

                * Pull from githubactions branch

                * Pull githubactions repo

                * Checkout githubactions

                * Run tests individually

                * Pin plotly version

                * Run all tests even if one fails

                * Test on another branch

                * Switch branch with token

                * Update integration_tests.yml

                * New makefile commands

                * changed file to .json

                * changed status to json file

                * Make JSON human readable by adding new lines

                * GitHub actions integration tests (#48)

                * GitHub actions clean (#40)

                * Create pytest.yml

                * Create pylint.yml

                * Create .pylintrc

                * Create test_env.yml

                * Full path

                * Remove conda install

                * Replace path

                * Pytest tests

                * pip -e

                * Create integration_tests.yml

                * Simplify name

                * CRISPRESSO2_DIR environment variable

                * Up one dir

                * ls workspace

                * Install CRISPResso and ydiff

                * Clone repo instead of checkout

                * submodule

                * ls

                * CRISPResso2_copy

                * ls

                * Update env

                * Simplify

                * Pull from githubactions branch

                * Pull githubactions repo

                * Checkout githubactions

                * Mckay/pd warnings (#45)

                * refactor errors='ignore' to try except

                * refactored integer slice to iloc[]

                * moved to_numeric try except to function

                * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

                This change is slightly cleaner because it addresses the root issue that some
                columns are strings (and can therefore not be converted to numeric types). Now
                if an error does occur when converting the dfs to numeric types it won't be
                swallowed up.

                * Add documentation to to_numeric_ignore_columns

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Run tests individually

                * Pin plotly version

                * Run all tests even if one fails

                * Test on another branch

                * Switch branch with token

                * Update integration_tests.yml

                * Introduce pandas sorting in CRISPRessoCompare (#47)

                * New makefile commands

                * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

                * Extract out split_interleaved_fastq function to CRISPRessoShared

                * Implement splitting interleaved fastq files in CRISPRessoPooled

                * Suppress split_interleaved_input from CRISPRessoWGS parameters

                * Suppress other parameters in CRISPRessoWGS

                * Move where interleaved fastq files are split to be trimmed properly

                * Bug Fix - 367 (#35)

                * - Fixed references to ref_names_for_pe

                * removed extra tabs

                * trying to match empty line, no tabs

                * - changed references to ref_names[0]

                * Mckay/pd warnings (#45)

                * refactor errors='ignore' to try except

                * refactored integer slice to iloc[]

                * moved to_numeric try except to function

                * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

                This change is slightly cleaner because it addresses the root issue that some
                columns are strings (and can therefore not be converted to numeric types). Now
                if an error does occur when converting the dfs to numeric types it won't be
                swallowed up.

                * Add documentation to to_numeric_ignore_columns

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * On push no branches

                * On push no branches

                * All in one file

                * Fix yml errors

                * Rename jobs

                * Remove old workflow files

                * Remove paths

                * Run jobs in parallel

                ---------

                Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Move read filtering to after merging in CRISPResso (#39)

                * Move read filtering to after merging

                This is in an effort to be consistent with the behavior and results of
                CRISPRessoPooled.

                * Properly assign the correct file names for read filtering

                * Add space around operators

                * GitHub actions on pr (#51)

                * Run integration tests on pull_request

                * Run pytest on pull_request

                * Run pylint on pull_request

                * Run tests on PR only when opening PR (#53)

                * Update reports (#52)

                * Update report changes

                * Switch branch of integration test repo

                * Remove extraneous `crispresso_data_path`

                * Point integration tests back to master

                * point to test branch

                * pointed CI config to testing branch

                * Update integration_tests.yml

                point to master

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>
                Co-authored-by: Samuel Nichols <Snic9004@gmail.com>

                * Trevor/fastp integration (#50)

                * Update check_program to check versions and create check_fastq function

                * Update fastq arg, implement fastp in get_most_frequent_reads

                * Bump version to 2.3.0

                * Deprecate Flash and Trimmomatic parameters, and update fastp params

                * Update guess_amplicons and guess_guides to remove max_paired_end_reads_overlap

                * Implement trimming of single end reads

                * Merge (and trim) reads in CRISPRessoCORE with fastp

                * Modify error handling to account for fastp errors

                * Replace flash and trimmomatic with fastp in Docker dependencies

                * Update LICENSE.txt with fastp info

                * Remove min and max amplicon length (no longer needed)

                * Mckay/pd warnings (#45)

                * refactor errors='ignore' to try except

                * refactored integer slice to iloc[]

                * moved to_numeric try except to function

                * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

                This change is slightly cleaner because it addresses the root issue that some
                columns are strings (and can therefore not be converted to numeric types). Now
                if an error does occur when converting the dfs to numeric types it won't be
                swallowed up.

                * Add documentation to to_numeric_ignore_columns

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Implement trimming with fastp in CRISPRessoPooled

                * Implemend merging (and trimming) with fastp in CRISPRessoPooled

                * Fixed minor fastp errors

                * Move read filtering to after merging in CRISPResso (#39)

                * Move read filtering to after merging

                This is in an effort to be consistent with the behavior and results of
                CRISPRessoPooled.

                * Properly assign the correct file names for read filtering

                * Add space around operators

                * GitHub actions on pr (#51)

                * Run integration tests on pull_request

                * Run pytest on pull_request

                * Run pylint on pull_request

                * Run tests on PR only when opening PR (#53)

                * Update reports (#52)

                * Update report changes

                * Switch branch of integration test repo

                * Remove extraneous `crispresso_data_path`

                * Point integration tests back to master

                * Update where the test point to

                * Fix 'Prime-edited' key not found (#32)

                * Move 'Prime-edited' amplicon name check

                By moving this, it will check if there is an amplicon named
                'Prime-edited' (which is a reserved name) even if the
                `prime_editing_pegRNA_extension_seq` parameter is empty.

                * Only search for scaffold integration when pegRNA extension seq is provided

                * Remove spaces at the end of lines

                * Docker size (#49)

                * Bug Fix - 367 (#35)

                * - Fixed references to ref_names_for_pe

                * removed extra tabs

                * trying to match empty line, no tabs

                * - changed references to ref_names[0]

                * Mckay/pd warnings (#45)

                * refactor errors='ignore' to try except

                * refactored integer slice to iloc[]

                * moved to_numeric try except to function

                * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

                This change is slightly cleaner because it addresses the root issue that some
                columns are strings (and can therefore not be converted to numeric types). Now
                if an error does occur when converting the dfs to numeric types it won't be
                swallowed up.

                * Add documentation to to_numeric_ignore_columns

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * GitHub actions integration tests (#48)

                * GitHub actions clean (#40)

                * Create pytest.yml

                * Create pylint.yml

                * Create .pylintrc

                * Create test_env.yml

                * Full path

                * Remove conda install

                * Replace path

                * Pytest tests

                * pip -e

                * Create integration_tests.yml

                * Simplify name

                * CRISPRESSO2_DIR environment variable

                * Up one dir

                * ls workspace

                * Install CRISPResso and ydiff

                * Clone repo instead of checkout

                * submodule

                * ls

                * CRISPResso2_copy

                * ls

                * Update env

                * Simplify

                * Pull from githubactions branch

                * Pull githubactions repo

                * Checkout githubactions

                * Mckay/pd warnings (#45)

                * refactor errors='ignore' to try except

                * refactored integer slice to iloc[]

                * moved to_numeric try except to function

                * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

                This change is slightly cleaner because it addresses the root issue that some
                columns are strings (and can therefore not be converted to numeric types). Now
                if an error does occur when converting the dfs to numeric types it won't be
                swallowed up.

                * Add documentation to to_numeric_ignore_columns

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Run tests individually

                * Pin plotly version

                * Run all tests even if one fails

                * Test on another branch

                * Switch branch with token

                * Update integration_tests.yml

                * Introduce pandas sorting in CRISPRessoCompare (#47)

                * New makefile commands

                * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

                * Extract out split_interleaved_fastq function to CRISPRessoShared

                * Implement splitting interleaved fastq files in CRISPRessoPooled

                * Suppress split_interleaved_input from CRISPRessoWGS parameters

                * Suppress other parameters in CRISPRessoWGS

                * Move where interleaved fastq files are split to be trimmed properly

                * Bug Fix - 367 (#35)

                * - Fixed references to ref_names_for_pe

                * removed extra tabs

                * trying to match empty line, no tabs

                * - changed references to ref_names[0]

                * Mckay/pd warnings (#45)

                * refactor errors='ignore' to try except

                * refactored integer slice to iloc[]

                * moved to_numeric try except to function

                * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

                This change is slightly cleaner because it addresses the root issue that some
                columns are strings (and can therefore not be converted to numeric types). Now
                if an error does occur when converting the dfs to numeric types it won't be
                swallowed up.

                * Add documentation to to_numeric_ignore_columns

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * On push no branches

                * On push no branches

                * All in one file

                * Fix yml errors

                * Rename jobs

                * Remove old workflow files

                * Remove paths

                * Run jobs in parallel

                ---------

                Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * 3.4->2.08

                * Put ttf-mscorefonts-installer back above apt-get clean

                * restore slash, replace fastp with trimmomatic and flash, add autoremove step

                ---------

                Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * initial readme modifications

                * Updated readme to remove deprecated commands, updated help text to reflect new version and fastp

                * Pointing test branch back at master

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>
                Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
                Co-authored-by: Samuel Nichols <Snic9004@gmail.com>

                * Guardrails clean history (#34)

                * Include guardrail functions

                * Add CRISPRessoReports subtree

                * Refactor to use CRISPRessoReports module

                * Include guardrail functions

                * Functional guardrails, needs reports update

                * Add guardrail partial

                * fix guardrials partial

                * Bug Fix - 367 (#35)

                * - Fixed references to ref_names_for_pe

                * removed extra tabs

                * trying to match empty line, no tabs

                * - changed references to ref_names[0]

                * Mckay/pd warnings (#45)

                * refactor errors='ignore' to try except

                * refactored integer slice to iloc[]

                * moved to_numeric try except to function

                * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

                This change is slightly cleaner because it addresses the root issue that some
                columns are strings (and can therefore not be converted to numeric types). Now
                if an error does occur when converting the dfs to numeric types it won't be
                swallowed up.

                * Add documentation to to_numeric_ignore_columns

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * GitHub actions integration tests (#48)

                * GitHub actions clean (#40)

                * Create pytest.yml

                * Create pylint.yml

                * Create .pylintrc

                * Create test_env.yml

                * Full path

                * Remove conda install

                * Replace path

                * Pytest tests

                * pip -e

                * Create integration_tests.yml

                * Simplify name

                * CRISPRESSO2_DIR environment variable

                * Up one dir

                * ls workspace

                * Install CRISPResso and ydiff

                * Clone repo instead of checkout

                * submodule

                * ls

                * CRISPResso2_copy

                * ls

                * Update env

                * Simplify

                * Pull from githubactions branch

                * Pull githubactions repo

                * Checkout githubactions

                * Mckay/pd warnings (#45)

                * refactor errors='ignore' to try except

                * refactored integer slice to iloc[]

                * moved to_numeric try except to function

                * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

                This change is slightly cleaner because it addresses the root issue that some
                columns are strings (and can therefore not be converted to numeric types). Now
                if an error does occur when converting the dfs to numeric types it won't be
                swallowed up.

                * Add documentation to to_numeric_ignore_columns

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Run tests individually

                * Pin plotly version

                * Run all tests even if one fails

                * Test on another branch

                * Switch branch with token

                * Update integration_tests.yml

                * Introduce pandas sorting in CRISPRessoCompare (#47)

                * New makefile commands

                * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

                * Extract out split_interleaved_fastq function to CRISPRessoShared

                * Implement splitting interleaved fastq files in CRISPRessoPooled

                * Suppress split_interleaved_input from CRISPRessoWGS parameters

                * Suppress other parameters in CRISPRessoWGS

                * Move where interleaved fastq files are split to be trimmed properly

                * Bug Fix - 367 (#35)

                * - Fixed references to ref_names_for_pe

                * removed extra tabs

                * trying to match empty line, no tabs

                * - changed references to ref_names[0]

                * Mckay/pd warnings (#45)

                * refactor errors='ignore' to try except

                * refactored integer slice to iloc[]

                * moved to_numeric try except to function

                * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

                This change is slightly cleaner because it addresses the root issue that some
                columns are strings (and can therefore not be converted to numeric types). Now
                if an error does occur when converting the dfs to numeric types it won't be
                swallowed up.

                * Add documentation to to_numeric_ignore_columns

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * On push no branches

                * On push no branches

                * All in one file

                * Fix yml errors

                * Rename jobs

                * Remove old workflow files

                * Remove paths

                * Run jobs in parallel

                ---------

                Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Update C cythonized files

                * Add exact numbers to guardrails printouts

                * Remove extraneous whitespace from CRISPRessoCOREResources.pyx

                * Fix calculation of `total_mods` from being negative

                The issue was that `all_deletion_coordinates` just tells you how many deletions
                were present, but not how long the deletion is.

                * Changes to message

                * Remove old tag

                * Point tests at guardrails

                * Restore C2 pro check

                * Save message with guardrail name

                * Point tests repo at master

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>
                Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

                * Fix case sensitivity in Prime Editing mode (#54)

                * Move read filtering to after merging in CRISPResso (#39)

                * Move read filtering to after merging

                This is in an effort to be consistent with the behavior and results of
                CRISPRessoPooled.

                * Properly assign the correct file names for read filtering

                * Add space around operators

                * GitHub actions on pr (#51)

                * Run integration tests on pull_request

                * Run pytest on pull_request

                * Run pylint on pull_request

                * Run tests on PR only when opening PR (#53)

                * Update reports (#52)

                * Update report changes

                * Switch branch of integration test repo

                * Remove extraneous `crispresso_data_path`

                * Point integration tests back to master

                * Make all amplicons in amplicon_seq_arr uppercase

                This fixes https://github.com/pinellolab/CRISPResso2/issues/396

                * Allow RNA values to be provided for prime_editing_pegRNA_scaffold_seq

                * Fix 'Prime-edited' key not found (#32)

                * Move 'Prime-edited' amplicon name check

                By moving this, it will check if there is an amplicon named
                'Prime-edited' (which is a reserved name) even if the
                `prime_editing_pegRNA_extension_seq` parameter is empty.

                * Only search for scaffold integration when pegRNA extension seq is provided

                * Remove spaces at the end of lines

                * Docker size (#49)

                * Bug Fix - 367 (#35)

                * - Fixed references to ref_names_for_pe

                * removed extra tabs

                * trying to match empty line, no tabs

                * - changed references to ref_names[0]

                * Mckay/pd warnings (#45)

                * refactor errors='ignore' to try except

                * refactored integer slice to iloc[]

   …
mbowcut2 added a commit that referenced this issue Aug 22, 2024
commit bb7533405af9f355ee9b367903e1f391375c7ca8
Merge: 354f962 2eff6a1
Author: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Date:   Thu Aug 22 13:47:21 2024 -0600

    Merge branch 'master' into c2pro-reports

commit 354f962d4201ae4784108b0a89467796af6ec326
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Mon Aug 19 16:49:32 2024 -0600

    specify encoding

commit 823250b8108ede27b29d59d1239c74c93d74395f
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Mon Aug 19 16:46:17 2024 -0600

    changed _ROOT to _root

commit fb8cfa7614598f23e578b55b92ca9b685eef3156
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Mon Aug 19 16:44:32 2024 -0600

    added docstrings

commit e0777a805e4da9729118b429fb44687ecc15d590
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Mon Aug 19 16:32:34 2024 -0600

    fixes for linting

commit 3ade759b85992484ae59aa7db3f10fe8e6bbdc50
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Mon Aug 19 16:04:43 2024 -0600

    Squashed commit of the following:

    commit 42be4bb0bcd58cdf9c902a448a7e5bde888fb1d3
    Author: mbowcut2 <mbowcut@gmail.com>
    Date:   Mon Aug 19 14:36:55 2024 -0600

        added pro check for plotly import in batchReport

    commit 31873d15fff9b9b12c1e8d8c62efa686c12cbd37
    Author: mbowcut2 <mbowcut@gmail.com>
    Date:   Fri Aug 16 15:05:58 2024 -0600

        pointed integration tests at test branch

    commit d449600561074c8721322e94a8375b3ad742a88c
    Author: mbowcut2 <mbowcut@gmail.com>
    Date:   Fri Aug 16 14:24:03 2024 -0600

        Squashed commit of the following:

        commit 6ec98a05ee70f85b5aa0ac15ab6094b7f1f20d08
        Author: mbowcut2 <mbowcut@gmail.com>
        Date:   Tue Aug 13 16:44:39 2024 -0600

            dict key changes

        commit 7cfd5acf06da4eb6f49453144ee1fed1e1488a7a
        Author: mbowcut2 <mbowcut@gmail.com>
        Date:   Thu Aug 8 15:30:31 2024 -0600

            added C2PRO install check back

        commit bfb0003329ea61b5c79c7e1df8d9a73ec5a508db
        Author: mbowcut2 <mbowcut@gmail.com>
        Date:   Fri Aug 2 13:08:12 2024 -0600

            fixed key error conditionals

        commit 84444e7480605206cb3efa4a0db675c55e717304
        Author: mbowcut2 <mbowcut@gmail.com>
        Date:   Fri Aug 2 09:22:44 2024 -0600

            use local jinja_paritals file

        commit 71dd12786fec6c4aba0170a3bfd9022b06f5eede
        Author: mbowcut2 <mbowcut@gmail.com>
        Date:   Wed Jul 31 14:10:29 2024 -0600

            Squashed commit of the following:

            commit 5e3b30515c4bc437127e7fb21f53cb0bd511c4ca
            Author: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
            Date:   Mon Jul 22 09:31:44 2024 -0600

                D3-Enhancements (#78)

                * Sam/try plots (#71)

                * Fix batch mode pandas warning. (#70)

                * refactor to call method on DataFrame, rather than Series.
                Removes warning.

                * Fix pandas future warning in CRISPRessoWGS

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Functional

                * Cole/fix status file name (#69)

                * Update config file logging messages

                This removes printing the exception (which is essentially a duplicate),
                and adds a condition if no config file was provided. Also changes `json`
                to `config` so that it is more clear.

                * Fix divide by zero when no amplicons are present in Batch mode

                * Don't append file_prefix to status file name

                * Place status files in output directories

                * Update tests branch for file_prefix addition

                * Load D3 and plotly figures with pro with multiple amplicons

                * Update batch

                * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

                Before this fix, when using a file_prefix the second run that was compared
                would not be displayed as a data in the first figure of the report.

                * Import CRISPRessoPro instead of importing the version

                When installed via conda, the version is not available

                * Remove `get_amplicon_output` unused function from CRISPRessoCompare

                Also remove unused argparse import

                * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

                * Allow for matching of multiple guides in the same amplicon

                * Fix pandas FutureWarning

                * Change test branch back to master

                ---------

                Co-authored-by: Sam <snic9004@gmail.com>

                * Try catch all futures

                * Fix test fail plots

                * Point test to try-plots

                * Fix d3 not showing and plotly mixing with matplotlib

                * Use logger for warnings and debug statements

                * Point tests back at master

                ---------

                Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Sam/fix plots (#72)

                * Fix batch mode pandas warning. (#70)

                * refactor to call method on DataFrame, rather than Series.
                Removes warning.

                * Fix pandas future warning in CRISPRessoWGS

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Functional

                * Cole/fix status file name (#69)

                * Update config file logging messages

                This removes printing the exception (which is essentially a duplicate),
                and adds a condition if no config file was provided. Also changes `json`
                to `config` so that it is more clear.

                * Fix divide by zero when no amplicons are present in Batch mode

                * Don't append file_prefix to status file name

                * Place status files in output directories

                * Update tests branch for file_prefix addition

                * Load D3 and plotly figures with pro with multiple amplicons

                * Update batch

                * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

                Before this fix, when using a file_prefix the second run that was compared
                would not be displayed as a data in the first figure of the report.

                * Import CRISPRessoPro instead of importing the version

                When installed via conda, the version is not available

                * Remove `get_amplicon_output` unused function from CRISPRessoCompare

                Also remove unused argparse import

                * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

                * Allow for matching of multiple guides in the same amplicon

                * Fix pandas FutureWarning

                * Change test branch back to master

                ---------

                Co-authored-by: Sam <snic9004@gmail.com>

                * Try catch all futures

                * Fix test fail plots

                * Fix d3 not showing and plotly mixing with matplotlib

                ---------

                Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Remove token from integration tests file

                * Provide sgRNA_sequences to plot_nucleotide_quilt plots

                * Passing sgRNA_sequences to plot

                * Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

                * Add max-height to Batch report samples

                * Change testing branch

                * Fix wrong check for large Batch plots

                * Fix typo and move flexiguide to debug (#77)

                * Change flexiguide output to debug level

                * Fix typo in fastp merged output file name

                * Adding id tags for d3 script enhancements

                * pointing to test branch

                * Add amplicon_name parameter to allele heatmap and line plots

                * Add function to extract quantification window regions from include_idxs

                * Scale the quantification window according to the coordinates of the sgRNA plot

                * added c2pro check, added space in args.json

                * Correct the quantification window indexes for multiple guides

                * Fix name of nucleotide conversion plot when guides are not the same

                * Fix jinja variables that aren't found

                * Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

                * Remove unneeded variable and extra whitespace

                * Switch test branch to master

                ---------

                Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
                Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
                Co-authored-by: Cole Lyman <cole@colelyman.com>

            commit 09e5d9720ad21e44fc7916d71bde3fd7a9dfa7ef
            Author: Kendell Clement <k.clement.dev@gmail.com>
            Date:   Thu Jul 18 14:31:54 2024 -0600

                Asymmetrical cut point (#457)

                * add cut_point_ind to plot_alleles_heatmap for asymmetrical plotting

                * Cole asymmetrical cut point (#453)

                * Pin versions of numpy and matplotlib in CI environment (#84) (#452)

                * Reduce duplication and implement cut_point_ind in plot_alleles_heatmap_hist

                ---------

                Co-authored-by: Cole Lyman <Cole@colelyman.com>

            commit 8d92972694ddff629dad844a6ad100459f69751d
            Author: Cole Lyman <Cole@colelyman.com>
            Date:   Thu Jul 18 14:29:40 2024 -0600

                Cole/update args (#85) (#456)

            commit 44f692ecabf5e2eb96ee0cfd7bae62343da7810c
            Author: Cole Lyman <Cole@colelyman.com>
            Date:   Mon Jul 15 16:17:29 2024 -0600

                Implement new pooled mixed-mode default behavior (#454)

                * changes for pooled mixed-mode default (#83)

                * changes for pooled mixed-mode default

                * deprecated old arg

                * added integration tests for mixed mode

                * fixed test target

                * updated test name

                * pinned numpy

                * Fix integration tests yml

                * pinning matplotlib

                * added print to CI tests

                * changed mixed mode info string

                * Remove pooled-mixed-mode-align-to-genome step from Github Actions

                * Update demultiplex_genome_wide parameter and help

                * Convert args.json to unix line endings

                * Add Pooled mixed mode demux run

                * Update the name of the argument in Pooled

                * Point integration tests back to master

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Revert change to pooled mixed mode info statement (#86)

                ---------

                Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

            commit 79b482b55a0e8edbc03ec22bd2714bade1e90323
            Author: Cole Lyman <Cole@colelyman.com>
            Date:   Tue Jul 9 12:53:23 2024 -0600

                Pin versions of numpy and matplotlib in CI environment (#84) (#452)

            commit 80dc1bdd72d50f989717bfc5f8156bc3495c45f4
            Author: Kendell Clement <k.clement.dev@gmail.com>
            Date:   Thu May 30 14:07:42 2024 -0600

                Add padding to image

            commit 381755daf0939aaf2745df0a802c809633aff47d
            Author: Kendell Clement <k.clement.dev@gmail.com>
            Date:   Thu May 30 13:59:57 2024 -0600

                White background for schematic for dark mode

            commit d649db71e610bd8840fbb8d46fadb07789b67390
            Author: Cole Lyman <Cole@colelyman.com>
            Date:   Fri May 24 12:45:53 2024 -0600

                Fix typo and move flexiguide to debug (#77) (#438)

                * Change flexiguide output to debug level

                * Fix typo in fastp merged output file name

            commit 71181f50ef2b39015523b1a71d9fd1bf0dce14eb
            Author: Cole Lyman <Cole@colelyman.com>
            Date:   Mon May 13 13:34:00 2024 -0600

                Prefix the release Docker tag with a `v` (#434)

            commit d2c2be18a6bb64b0e742cc24c4665980a24324bc
            Author: Cole Lyman <Cole@colelyman.com>
            Date:   Mon May 13 09:41:32 2024 -0600

                Showing sgRNA sequences on hover in CRISPRessoPro (#432)

                * Passing sgRNA sequences to regular and Batch D3 plots (#73)

                * Sam/try plots (#71)

                * Fix batch mode pandas warning. (#70)

                * refactor to call method on DataFrame, rather than Series.
                Removes warning.

                * Fix pandas future warning in CRISPRessoWGS

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Functional

                * Cole/fix status file name (#69)

                * Update config file logging messages

                This removes printing the exception (which is essentially a duplicate),
                and adds a condition if no config file was provided. Also changes `json`
                to `config` so that it is more clear.

                * Fix divide by zero when no amplicons are present in Batch mode

                * Don't append file_prefix to status file name

                * Place status files in output directories

                * Update tests branch for file_prefix addition

                * Load D3 and plotly figures with pro with multiple amplicons

                * Update batch

                * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

                Before this fix, when using a file_prefix the second run that was compared
                would not be displayed as a data in the first figure of the report.

                * Import CRISPRessoPro instead of importing the version

                When installed via conda, the version is not available

                * Remove `get_amplicon_output` unused function from CRISPRessoCompare

                Also remove unused argparse import

                * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

                * Allow for matching of multiple guides in the same amplicon

                * Fix pandas FutureWarning

                * Change test branch back to master

                ---------

                Co-authored-by: Sam <snic9004@gmail.com>

                * Try catch all futures

                * Fix test fail plots

                * Point test to try-plots

                * Fix d3 not showing and plotly mixing with matplotlib

                * Use logger for warnings and debug statements

                * Point tests back at master

                ---------

                Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Sam/fix plots (#72)

                * Fix batch mode pandas warning. (#70)

                * refactor to call method on DataFrame, rather than Series.
                Removes warning.

                * Fix pandas future warning in CRISPRessoWGS

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Functional

                * Cole/fix status file name (#69)

                * Update config file logging messages

                This removes printing the exception (which is essentially a duplicate),
                and adds a condition if no config file was provided. Also changes `json`
                to `config` so that it is more clear.

                * Fix divide by zero when no amplicons are present in Batch mode

                * Don't append file_prefix to status file name

                * Place status files in output directories

                * Update tests branch for file_prefix addition

                * Load D3 and plotly figures with pro with multiple amplicons

                * Update batch

                * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

                Before this fix, when using a file_prefix the second run that was compared
                would not be displayed as a data in the first figure of the report.

                * Import CRISPRessoPro instead of importing the version

                When installed via conda, the version is not available

                * Remove `get_amplicon_output` unused function from CRISPRessoCompare

                Also remove unused argparse import

                * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

                * Allow for matching of multiple guides in the same amplicon

                * Fix pandas FutureWarning

                * Change test branch back to master

                ---------

                Co-authored-by: Sam <snic9004@gmail.com>

                * Try catch all futures

                * Fix test fail plots

                * Fix d3 not showing and plotly mixing with matplotlib

                ---------

                Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Remove token from integration tests file

                * Provide sgRNA_sequences to plot_nucleotide_quilt plots

                * Passing sgRNA_sequences to plot

                * Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

                * Add max-height to Batch report samples

                * Change testing branch

                * Fix wrong check for large Batch plots

                * Update integration_tests.yml to point back at master

                ---------

                Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
                Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Push new releases to ECR (#74)

                * Create aws_ecr.yml (#1)

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * us-east-1

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Fix d3 sgRNA sequences (#76)

                * Pass correct sgRNA_sequences to d3 plot

                * Pass correct sgRNA sequence to prime editor plot for d3

                * Resize plotly (#75)

                * Sam/try plots (#71)

                * Fix batch mode pandas warning. (#70)

                * refactor to call method on DataFrame, rather than Series.
                Removes warning.

                * Fix pandas future warning in CRISPRessoWGS

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Functional

                * Cole/fix status file name (#69)

                * Update config file logging messages

                This removes printing the exception (which is essentially a duplicate),
                and adds a condition if no config file was provided. Also changes `json`
                to `config` so that it is more clear.

                * Fix divide by zero when no amplicons are present in Batch mode

                * Don't append file_prefix to status file name

                * Place status files in output directories

                * Update tests branch for file_prefix addition

                * Load D3 and plotly figures with pro with multiple amplicons

                * Update batch

                * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

                Before this fix, when using a file_prefix the second run that was compared
                would not be displayed as a data in the first figure of the report.

                * Import CRISPRessoPro instead of importing the version

                When installed via conda, the version is not available

                * Remove `get_amplicon_output` unused function from CRISPRessoCompare

                Also remove unused argparse import

                * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

                * Allow for matching of multiple guides in the same amplicon

                * Fix pandas FutureWarning

                * Change test branch back to master

                ---------

                Co-authored-by: Sam <snic9004@gmail.com>

                * Try catch all futures

                * Fix test fail plots

                * Point test to try-plots

                * Fix d3 not showing and plotly mixing with matplotlib

                * Use logger for warnings and debug statements

                * Point tests back at master

                ---------

                Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Sam/fix plots (#72)

                * Fix batch mode pandas warning. (#70)

                * refactor to call method on DataFrame, rather than Series.
                Removes warning.

                * Fix pandas future warning in CRISPRessoWGS

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Functional

                * Cole/fix status file name (#69)

                * Update config file logging messages

                This removes printing the exception (which is essentially a duplicate),
                and adds a condition if no config file was provided. Also changes `json`
                to `config` so that it is more clear.

                * Fix divide by zero when no amplicons are present in Batch mode

                * Don't append file_prefix to status file name

                * Place status files in output directories

                * Update tests branch for file_prefix addition

                * Load D3 and plotly figures with pro with multiple amplicons

                * Update batch

                * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

                Before this fix, when using a file_prefix the second run that was compared
                would not be displayed as a data in the first figure of the report.

                * Import CRISPRessoPro instead of importing the version

                When installed via conda, the version is not available

                * Remove `get_amplicon_output` unused function from CRISPRessoCompare

                Also remove unused argparse import

                * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

                * Allow for matching of multiple guides in the same amplicon

                * Fix pandas FutureWarning

                * Change test branch back to master

                ---------

                Co-authored-by: Sam <snic9004@gmail.com>

                * Try catch all futures

                * Fix test fail plots

                * Fix d3 not showing and plotly mixing with matplotlib

                ---------

                Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Remove token from integration tests file

                * Pass div id for plotly

                * Remove debug

                ---------

                Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
                Co-authored-by: Cole Lyman <cole@colelyman.com>

                ---------

                Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
                Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
                Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

            commit 1c504274818b6b17fb60620d48fd92cb2e50566d
            Author: Cole Lyman <Cole@colelyman.com>
            Date:   Thu May 9 14:16:25 2024 -0600

                Fix plots and improve plot error handling (#431)

                * Sam/try plots (#71)

                * Fix batch mode pandas warning. (#70)

                * refactor to call method on DataFrame, rather than Series.
                Removes warning.

                * Fix pandas future warning in CRISPRessoWGS

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Functional

                * Cole/fix status file name (#69)

                * Update config file logging messages

                This removes printing the exception (which is essentially a duplicate),
                and adds a condition if no config file was provided. Also changes `json`
                to `config` so that it is more clear.

                * Fix divide by zero when no amplicons are present in Batch mode

                * Don't append file_prefix to status file name

                * Place status files in output directories

                * Update tests branch for file_prefix addition

                * Load D3 and plotly figures with pro with multiple amplicons

                * Update batch

                * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

                Before this fix, when using a file_prefix the second run that was compared
                would not be displayed as a data in the first figure of the report.

                * Import CRISPRessoPro instead of importing the version

                When installed via conda, the version is not available

                * Remove `get_amplicon_output` unused function from CRISPRessoCompare

                Also remove unused argparse import

                * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

                * Allow for matching of multiple guides in the same amplicon

                * Fix pandas FutureWarning

                * Change test branch back to master

                ---------

                Co-authored-by: Sam <snic9004@gmail.com>

                * Try catch all futures

                * Fix test fail plots

                * Point test to try-plots

                * Fix d3 not showing and plotly mixing with matplotlib

                * Use logger for warnings and debug statements

                * Point tests back at master

                ---------

                Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Sam/fix plots (#72)

                * Fix batch mode pandas warning. (#70)

                * refactor to call method on DataFrame, rather than Series.
                Removes warning.

                * Fix pandas future warning in CRISPRessoWGS

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Functional

                * Cole/fix status file name (#69)

                * Update config file logging messages

                This removes printing the exception (which is essentially a duplicate),
                and adds a condition if no config file was provided. Also changes `json`
                to `config` so that it is more clear.

                * Fix divide by zero when no amplicons are present in Batch mode

                * Don't append file_prefix to status file name

                * Place status files in output directories

                * Update tests branch for file_prefix addition

                * Load D3 and plotly figures with pro with multiple amplicons

                * Update batch

                * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

                Before this fix, when using a file_prefix the second run that was compared
                would not be displayed as a data in the first figure of the report.

                * Import CRISPRessoPro instead of importing the version

                When installed via conda, the version is not available

                * Remove `get_amplicon_output` unused function from CRISPRessoCompare

                Also remove unused argparse import

                * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

                * Allow for matching of multiple guides in the same amplicon

                * Fix pandas FutureWarning

                * Change test branch back to master

                ---------

                Co-authored-by: Sam <snic9004@gmail.com>

                * Try catch all futures

                * Fix test fail plots

                * Fix d3 not showing and plotly mixing with matplotlib

                ---------

                Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Remove token from integration tests file

                ---------

                Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
                Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

            commit acb2ea8e26dff4cd11f71301b344f81b1cec9040
            Author: Kendell Clement <k.clement.dev@gmail.com>
            Date:   Thu May 2 13:49:33 2024 -0600

                Use recent docker image for CircleCI testing that includes updated pandas

            commit 38fd76dbd7ce2087468f9f454b548777de959a68
            Author: Cole Lyman <Cole@colelyman.com>
            Date:   Wed May 1 16:42:28 2024 -0600

                Cole/fix status file name (#69) (#430)

                * Update config file logging messages

                This removes printing the exception (which is essentially a duplicate),
                and adds a condition if no config file was provided. Also changes `json`
                to `config` so that it is more clear.

                * Fix divide by zero when no amplicons are present in Batch mode

                * Don't append file_prefix to status file name

                * Place status files in output directories

                * Update tests branch for file_prefix addition

                * Load D3 and plotly figures with pro with multiple amplicons

                * Update batch

                * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

                Before this fix, when using a file_prefix the second run that was compared
                would not be displayed as a data in the first figure of the report.

                * Import CRISPRessoPro instead of importing the version

                When installed via conda, the version is not available

                * Remove `get_amplicon_output` unused function from CRISPRessoCompare

                Also remove unused argparse import

                * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

                * Allow for matching of multiple guides in the same amplicon

                * Fix pandas FutureWarning

                * Change test branch back to master

                ---------

                Co-authored-by: Sam <snic9004@gmail.com>

            commit 3ec22e5fd09e432c9997d30e5f9ee51a2cc00d7b
            Author: Kendell Clement <k.clement.dev@gmail.com>
            Date:   Wed May 1 13:08:11 2024 -0600

                Remove linked space in readme

            commit 340a4e16795a5e500411e11572ec267525985009
            Author: Cole Lyman <Cole@colelyman.com>
            Date:   Wed May 1 13:07:14 2024 -0600

                Fix batch mode pandas warning. (#70) (#429)

                * refactor to call method on DataFrame, rather than Series.
                Removes warning.

                * Fix pandas future warning in CRISPRessoWGS

                ---------

                Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

            commit 1bc9e906f0ded81f80761d1ec375ee50a4f882a9
            Author: Cole Lyman <Cole@colelyman.com>
            Date:   Fri Apr 26 16:26:27 2024 -0600

                Bump version to 2.3.1 and change default CRISPRessoPooled behavior to change in 2.3.2 (#428)

            commit 5638a1f6ffa973231f23422e9c757fa8cd4af7cc
            Author: Kendell Clement <k.clement.dev@gmail.com>
            Date:   Wed Apr 24 18:00:43 2024 -0600

                Spelling fixes

            commit d6011f29db16d8fc1c1e7222457b7f9a1f671de6
            Author: Cole Lyman <Cole@colelyman.com>
            Date:   Wed Apr 24 09:33:53 2024 -0600

                Extract `jinja_partials` and fix CRISPRessoPooled fastp errors (#425)

                * Updated README (#64)

                * Updating README to fix argument, email, and formatting

                * removing superfluous files

                * Add link to CRISPRessoPro, move CRISPRessoPro section to end, and fix JSON formatting

                * Remove link to CRISPRessoPro

                * Replace Docker badge with link to tags

                * Add bullet points to Guardrails section and improve formatting

                * Fix typo and removed colons from guardrails

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Extract jinja_partials  (#65)

                * Extract jinja_partials code

                * Remove Plotly dependency from setup.py

                * Fix CRISPRessoPooled flash errors (#68)

                * Fix replacing flash intermediate files with fastp intermediate files

                This also moves where the files are added to `files_to_remove` up to
                near where they are created.

                * Update to run test branch with paired end Pooled test

                * Add pooled-paired-sim test to integration tests

                * Replace flash and trimmomatic with fastp and remove plotly from Github Actions environment

                * Change test branch back to master

                ---------

                Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>

            commit f4858a30c43374f54058b3ad9c1e965e1ab7fb46
            Author: Cole Lyman <Cole@colelyman.com>
            Date:   Tue Apr 23 17:00:28 2024 -0600

                Updated README (#64) (#424)

                * Updating README to fix argument, email, and formatting

                * removing superfluous files

                * Add link to CRISPRessoPro, move CRISPRessoPro section to end, and fix JSON formatting

                * Remove link to CRISPRessoPro

                * Replace Docker badge with link to tags

                * Add bullet points to Guardrails section and improve formatting

                * Fix typo and removed colons from guardrails

                ---------

                Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>

            commit c3dbff0fccd44b0b1a9c246dd2aa629ddc515787
            Author: Kendell Clement <k.clement.dev@gmail.com>
            Date:   Mon Apr 22 11:24:59 2024 -0600

                Update CRISPRessoPooledCORE.py (#423)

                Fix bug in error reporting if duplicate names are present

            commit 20903c14877e5166b1b8a7b50b8fcab450ea3ca6
            Author: Cole Lyman <Cole@colelyman.com>
            Date:   Thu Apr 18 16:55:39 2024 -0600

                Remove extra imports from CRISPRessoCore (#67) (#422)

            commit 4aae57e5be475cd717792265bee36a71a99425de
            Author: Cole Lyman <Cole@colelyman.com>
            Date:   Thu Apr 18 10:00:19 2024 -0600

                Cole/refactor jinja undefined (#66) (#421)

                * Replace Jinja2 PackageLoader with FileSystemLoader

                The PackageLoader doesn't work with a fairly recent version of Jinja2 (3.0.1)
                and Python 3.9. Replacing with FileSystemLoader work with the older version and
                the latest version.

                * Fix undefined variable `amplicon_name` in report template

                * Refactor logging Jinja2 undefined variable warnings

                * Revert plot_11a update

                * Update intedration test branch

                * Update jinja to warn on undefined but not fail. Fix all undefined warnings

                * Fix github integration tests ref

                * One more undefined variable

                ---------

                Co-authored-by: Samuel Nichols <Snic9004@gmail.com>

            commit 768c3c05bf1786a2a32e135b6e145cd6503c3db1
            Author: Cole Lyman <Cole@colelyman.com>
            Date:   Tue Apr 9 17:30:10 2024 -0600

                Fix Jinja2 undefined variables (#63) (#417)

                * Replace Jinja2 PackageLoader with FileSystemLoader

                The PackageLoader doesn't work with a fairly recent version of Jinja2 (3.0.1)
                and Python 3.9. Replacing with FileSystemLoader work with the older version and
                the latest version.

                * Fix undefined variable `amplicon_name` in report template

                * Revert plot_11a update

                * Update intedration test branch

                * Update branch for integration tests

            commit 7e18f08cc1ac5f247a0fd1bbb394ccd9b0a07c2e
            Author: Han Dai <github@daihan.me>
            Date:   Fri Apr 5 18:36:41 2024 -0400

                fix: change all U+00A0 to U+0020 (#400)

            commit 235dc29c0cd0fcca2e999148d4660acf00b07221
            Author: Cole Lyman <Cole@colelyman.com>
            Date:   Fri Apr 5 16:36:16 2024 -0600

                Fastp, args as data, guardrails, and PE fix (#415)

                * Change CRISPResso_status.txt format to JSON (#46)

                * Bug Fix - 367 (#35)

                * - Fixed references to ref_names_for_pe

                * removed extra tabs

                * trying to match empty line, no tabs

                * - changed references to ref_names[0]

                * Mckay/pd warnings (#45)

                * refactor errors='ignore' to try except

                * refactored integer slice to iloc[]

                * moved to_numeric try except to function

                * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

                This change is slightly cleaner because it addresses the root issue that some
                columns are strings (and can therefore not be converted to numeric types). Now
                if an error does occur when converting the dfs to numeric types it won't be
                swallowed up.

                * Add documentation to to_numeric_ignore_columns

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * add json read for status file

                * changed Formatter to json format

                * fixed json access variable name: message

                * changed  perentage_complete to numeric

                * changed status file to .json

                * Create integration_tests.yml

                * Simplify name

                * CRISPRESSO2_DIR environment variable

                * Up one dir

                * ls workspace

                * Install CRISPResso and ydiff

                * Clone repo instead of checkout

                * submodule

                * ls

                * CRISPResso2_copy

                * ls

                * Update env

                * Simplify

                * Pull from githubactions branch

                * Pull githubactions repo

                * Checkout githubactions

                * Run tests individually

                * Pin plotly version

                * Run all tests even if one fails

                * Test on another branch

                * Switch branch with token

                * Update integration_tests.yml

                * New makefile commands

                * changed file to .json

                * changed status to json file

                * Make JSON human readable by adding new lines

                * GitHub actions integration tests (#48)

                * GitHub actions clean (#40)

                * Create pytest.yml

                * Create pylint.yml

                * Create .pylintrc

                * Create test_env.yml

                * Full path

                * Remove conda install

                * Replace path

                * Pytest tests

                * pip -e

                * Create integration_tests.yml

                * Simplify name

                * CRISPRESSO2_DIR environment variable

                * Up one dir

                * ls workspace

                * Install CRISPResso and ydiff

                * Clone repo instead of checkout

                * submodule

                * ls

                * CRISPResso2_copy

                * ls

                * Update env

                * Simplify

                * Pull from githubactions branch

                * Pull githubactions repo

                * Checkout githubactions

                * Mckay/pd warnings (#45)

                * refactor errors='ignore' to try except

                * refactored integer slice to iloc[]

                * moved to_numeric try except to function

                * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

                This change is slightly cleaner because it addresses the root issue that some
                columns are strings (and can therefore not be converted to numeric types). Now
                if an error does occur when converting the dfs to numeric types it won't be
                swallowed up.

                * Add documentation to to_numeric_ignore_columns

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Run tests individually

                * Pin plotly version

                * Run all tests even if one fails

                * Test on another branch

                * Switch branch with token

                * Update integration_tests.yml

                * Introduce pandas sorting in CRISPRessoCompare (#47)

                * New makefile commands

                * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

                * Extract out split_interleaved_fastq function to CRISPRessoShared

                * Implement splitting interleaved fastq files in CRISPRessoPooled

                * Suppress split_interleaved_input from CRISPRessoWGS parameters

                * Suppress other parameters in CRISPRessoWGS

                * Move where interleaved fastq files are split to be trimmed properly

                * Bug Fix - 367 (#35)

                * - Fixed references to ref_names_for_pe

                * removed extra tabs

                * trying to match empty line, no tabs

                * - changed references to ref_names[0]

                * Mckay/pd warnings (#45)

                * refactor errors='ignore' to try except

                * refactored integer slice to iloc[]

                * moved to_numeric try except to function

                * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

                This change is slightly cleaner because it addresses the root issue that some
                columns are strings (and can therefore not be converted to numeric types). Now
                if an error does occur when converting the dfs to numeric types it won't be
                swallowed up.

                * Add documentation to to_numeric_ignore_columns

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * On push no branches

                * On push no branches

                * All in one file

                * Fix yml errors

                * Rename jobs

                * Remove old workflow files

                * Remove paths

                * Run jobs in parallel

                ---------

                Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Move read filtering to after merging in CRISPResso (#39)

                * Move read filtering to after merging

                This is in an effort to be consistent with the behavior and results of
                CRISPRessoPooled.

                * Properly assign the correct file names for read filtering

                * Add space around operators

                * GitHub actions on pr (#51)

                * Run integration tests on pull_request

                * Run pytest on pull_request

                * Run pylint on pull_request

                * Run tests on PR only when opening PR (#53)

                * Update reports (#52)

                * Update report changes

                * Switch branch of integration test repo

                * Remove extraneous `crispresso_data_path`

                * Point integration tests back to master

                * point to test branch

                * pointed CI config to testing branch

                * Update integration_tests.yml

                point to master

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>
                Co-authored-by: Samuel Nichols <Snic9004@gmail.com>

                * Trevor/fastp integration (#50)

                * Update check_program to check versions and create check_fastq function

                * Update fastq arg, implement fastp in get_most_frequent_reads

                * Bump version to 2.3.0

                * Deprecate Flash and Trimmomatic parameters, and update fastp params

                * Update guess_amplicons and guess_guides to remove max_paired_end_reads_overlap

                * Implement trimming of single end reads

                * Merge (and trim) reads in CRISPRessoCORE with fastp

                * Modify error handling to account for fastp errors

                * Replace flash and trimmomatic with fastp in Docker dependencies

                * Update LICENSE.txt with fastp info

                * Remove min and max amplicon length (no longer needed)

                * Mckay/pd warnings (#45)

                * refactor errors='ignore' to try except

                * refactored integer slice to iloc[]

                * moved to_numeric try except to function

                * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

                This change is slightly cleaner because it addresses the root issue that some
                columns are strings (and can therefore not be converted to numeric types). Now
                if an error does occur when converting the dfs to numeric types it won't be
                swallowed up.

                * Add documentation to to_numeric_ignore_columns

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Implement trimming with fastp in CRISPRessoPooled

                * Implemend merging (and trimming) with fastp in CRISPRessoPooled

                * Fixed minor fastp errors

                * Move read filtering to after merging in CRISPResso (#39)

                * Move read filtering to after merging

                This is in an effort to be consistent with the behavior and results of
                CRISPRessoPooled.

                * Properly assign the correct file names for read filtering

                * Add space around operators

                * GitHub actions on pr (#51)

                * Run integration tests on pull_request

                * Run pytest on pull_request

                * Run pylint on pull_request

                * Run tests on PR only when opening PR (#53)

                * Update reports (#52)

                * Update report changes

                * Switch branch of integration test repo

                * Remove extraneous `crispresso_data_path`

                * Point integration tests back to master

                * Update where the test point to

                * Fix 'Prime-edited' key not found (#32)

                * Move 'Prime-edited' amplicon name check

                By moving this, it will check if there is an amplicon named
                'Prime-edited' (which is a reserved name) even if the
                `prime_editing_pegRNA_extension_seq` parameter is empty.

                * Only search for scaffold integration when pegRNA extension seq is provided

                * Remove spaces at the end of lines

                * Docker size (#49)

                * Bug Fix - 367 (#35)

                * - Fixed references to ref_names_for_pe

                * removed extra tabs

                * trying to match empty line, no tabs

                * - changed references to ref_names[0]

                * Mckay/pd warnings (#45)

                * refactor errors='ignore' to try except

                * refactored integer slice to iloc[]

                * moved to_numeric try except to function

                * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

                This change is slightly cleaner because it addresses the root issue that some
                columns are strings (and can therefore not be converted to numeric types). Now
                if an error does occur when converting the dfs to numeric types it won't be
                swallowed up.

                * Add documentation to to_numeric_ignore_columns

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * GitHub actions integration tests (#48)

                * GitHub actions clean (#40)

                * Create pytest.yml

                * Create pylint.yml

                * Create .pylintrc

                * Create test_env.yml

                * Full path

                * Remove conda install

                * Replace path

                * Pytest tests

                * pip -e

                * Create integration_tests.yml

                * Simplify name

                * CRISPRESSO2_DIR environment variable

                * Up one dir

                * ls workspace

                * Install CRISPResso and ydiff

                * Clone repo instead of checkout

                * submodule

                * ls

                * CRISPResso2_copy

                * ls

                * Update env

                * Simplify

                * Pull from githubactions branch

                * Pull githubactions repo

                * Checkout githubactions

                * Mckay/pd warnings (#45)

                * refactor errors='ignore' to try except

                * refactored integer slice to iloc[]

                * moved to_numeric try except to function

                * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

                This change is slightly cleaner because it addresses the root issue that some
                columns are strings (and can therefore not be converted to numeric types). Now
                if an error does occur when converting the dfs to numeric types it won't be
                swallowed up.

                * Add documentation to to_numeric_ignore_columns

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Run tests individually

                * Pin plotly version

                * Run all tests even if one fails

                * Test on another branch

                * Switch branch with token

                * Update integration_tests.yml

                * Introduce pandas sorting in CRISPRessoCompare (#47)

                * New makefile commands

                * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

                * Extract out split_interleaved_fastq function to CRISPRessoShared

                * Implement splitting interleaved fastq files in CRISPRessoPooled

                * Suppress split_interleaved_input from CRISPRessoWGS parameters

                * Suppress other parameters in CRISPRessoWGS

                * Move where interleaved fastq files are split to be trimmed properly

                * Bug Fix - 367 (#35)

                * - Fixed references to ref_names_for_pe

                * removed extra tabs

                * trying to match empty line, no tabs

                * - changed references to ref_names[0]

                * Mckay/pd warnings (#45)

                * refactor errors='ignore' to try except

                * refactored integer slice to iloc[]

                * moved to_numeric try except to function

                * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

                This change is slightly cleaner because it addresses the root issue that some
                columns are strings (and can therefore not be converted to numeric types). Now
                if an error does occur when converting the dfs to numeric types it won't be
                swallowed up.

                * Add documentation to to_numeric_ignore_columns

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * On push no branches

                * On push no branches

                * All in one file

                * Fix yml errors

                * Rename jobs

                * Remove old workflow files

                * Remove paths

                * Run jobs in parallel

                ---------

                Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * 3.4->2.08

                * Put ttf-mscorefonts-installer back above apt-get clean

                * restore slash, replace fastp with trimmomatic and flash, add autoremove step

                ---------

                Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * initial readme modifications

                * Updated readme to remove deprecated commands, updated help text to reflect new version and fastp

                * Pointing test branch back at master

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>
                Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
                Co-authored-by: Samuel Nichols <Snic9004@gmail.com>

                * Guardrails clean history (#34)

                * Include guardrail functions

                * Add CRISPRessoReports subtree

                * Refactor to use CRISPRessoReports module

                * Include guardrail functions

                * Functional guardrails, needs reports update

                * Add guardrail partial

                * fix guardrials partial

                * Bug Fix - 367 (#35)

                * - Fixed references to ref_names_for_pe

                * removed extra tabs

                * trying to match empty line, no tabs

                * - changed references to ref_names[0]

                * Mckay/pd warnings (#45)

                * refactor errors='ignore' to try except

                * refactored integer slice to iloc[]

                * moved to_numeric try except to function

                * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

                This change is slightly cleaner because it addresses the root issue that some
                columns are strings (and can therefore not be converted to numeric types). Now
                if an error does occur when converting the dfs to numeric types it won't be
                swallowed up.

                * Add documentation to to_numeric_ignore_columns

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * GitHub actions integration tests (#48)

                * GitHub actions clean (#40)

                * Create pytest.yml

                * Create pylint.yml

                * Create .pylintrc

                * Create test_env.yml

                * Full path

                * Remove conda install

                * Replace path

                * Pytest tests

                * pip -e

                * Create integration_tests.yml

                * Simplify name

                * CRISPRESSO2_DIR environment variable

                * Up one dir

                * ls workspace

                * Install CRISPResso and ydiff

                * Clone repo instead of checkout

                * submodule

                * ls

                * CRISPResso2_copy

                * ls

                * Update env

                * Simplify

                * Pull from githubactions branch

                * Pull githubactions repo

                * Checkout githubactions

                * Mckay/pd warnings (#45)

                * refactor errors='ignore' to try except

                * refactored integer slice to iloc[]

                * moved to_numeric try except to function

                * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

                This change is slightly cleaner because it addresses the root issue that some
                columns are strings (and can therefore not be converted to numeric types). Now
                if an error does occur when converting the dfs to numeric types it won't be
                swallowed up.

                * Add documentation to to_numeric_ignore_columns

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Run tests individually

                * Pin plotly version

                * Run all tests even if one fails

                * Test on another branch

                * Switch branch with token

                * Update integration_tests.yml

                * Introduce pandas sorting in CRISPRessoCompare (#47)

                * New makefile commands

                * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

                * Extract out split_interleaved_fastq function to CRISPRessoShared

                * Implement splitting interleaved fastq files in CRISPRessoPooled

                * Suppress split_interleaved_input from CRISPRessoWGS parameters

                * Suppress other parameters in CRISPRessoWGS

                * Move where interleaved fastq files are split to be trimmed properly

                * Bug Fix - 367 (#35)

                * - Fixed references to ref_names_for_pe

                * removed extra tabs

                * trying to match empty line, no tabs

                * - changed references to ref_names[0]

                * Mckay/pd warnings (#45)

                * refactor errors='ignore' to try except

                * refactored integer slice to iloc[]

                * moved to_numeric try except to function

                * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

                This change is slightly cleaner because it addresses the root issue that some
                columns are strings (and can therefore not be converted to numeric types). Now
                if an error does occur when converting the dfs to numeric types it won't be
                swallowed up.

                * Add documentation to to_numeric_ignore_columns

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * On push no branches

                * On push no branches

                * All in one file

                * Fix yml errors

                * Rename jobs

                * Remove old workflow files

                * Remove paths

                * Run jobs in parallel

                ---------

                Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Update C cythonized files

                * Add exact numbers to guardrails printouts

                * Remove extraneous whitespace from CRISPRessoCOREResources.pyx

                * Fix calculation of `total_mods` from being negative

                The issue was that `all_deletion_coordinates` just tells you how many deletions
                were present, but not how long the deletion is.

                * Changes to message

                * Remove old tag

                * Point tests at guardrails

                * Restore C2 pro check

                * Save message with guardrail name

                * Point tests repo at master

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>
                Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

                * Fix case sensitivity in Prime Editing mode (#54)

                * Move read filtering to after merging in CRISPResso (#39)

                * Move read filtering to after merging

                This is in an effort to be consistent with the behavior and results of
                CRISPRessoPooled.

                * Properly assign the correct file names for read filtering

                * Add space around operators

                * GitHub actions on pr (#51)

                * Run integration tests on pull_request

                * Run pytest on pull_request

                * Run pylint on pull_request

                * Run tests on PR only when opening PR (#53)

                * Update reports (#52)

                * Update report changes

                * Switch branch of integration test repo

                * Remove extraneous `crispresso_data_path`

                * Point integration tests back to master

                * Make all amplicons in amplicon_seq_arr uppercase

                This fixes https://github.com/pinellolab/CRISPResso2/issues/396

                * Allow RNA values to be provided for prime_editing_pegRNA_scaffold_seq

                * Fix 'Prime-edited' key not found (#32)

                * Move 'Prime-edited' amplicon name check

                By moving this, it will check if there is an amplicon named
                'Prime-edited' (which is a reserved name) even if the
                `prime_editing_pegRNA_extension_seq` parameter is empty.

                * Only search for scaffold integration when pegRNA extension seq is provided

                * Remove spaces at the end of lines

                * Docker size (#49)

                * Bug Fix - 367 (#35)

                * - Fixed references to ref_names_for_pe

                * removed extra tabs

                * trying to match empty line, no t…
mbowcut2 added a commit that referenced this issue Aug 22, 2024
commit ae22c92e10c757667847a043eddef5a2fe333be7
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Thu Aug 22 16:12:08 2024 -0600

    fix cup download

commit f4f8a9027a4742b883c562e31fc7fe5f45a5f8cf
Merge: 2eff6a1 bb75334
Author: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Date:   Thu Aug 22 15:50:20 2024 -0600

    Merge pull request #24 from edilytics/c2pro-reports

    C2pro reports

commit bb7533405af9f355ee9b367903e1f391375c7ca8
Merge: 354f962 2eff6a1
Author: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Date:   Thu Aug 22 13:47:21 2024 -0600

    Merge branch 'master' into c2pro-reports

commit 354f962d4201ae4784108b0a89467796af6ec326
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Mon Aug 19 16:49:32 2024 -0600

    specify encoding

commit 823250b8108ede27b29d59d1239c74c93d74395f
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Mon Aug 19 16:46:17 2024 -0600

    changed _ROOT to _root

commit fb8cfa7614598f23e578b55b92ca9b685eef3156
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Mon Aug 19 16:44:32 2024 -0600

    added docstrings

commit e0777a805e4da9729118b429fb44687ecc15d590
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Mon Aug 19 16:32:34 2024 -0600

    fixes for linting

commit 3ade759b85992484ae59aa7db3f10fe8e6bbdc50
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Mon Aug 19 16:04:43 2024 -0600

    Squashed commit of the following:

    commit 42be4bb0bcd58cdf9c902a448a7e5bde888fb1d3
    Author: mbowcut2 <mbowcut@gmail.com>
    Date:   Mon Aug 19 14:36:55 2024 -0600

        added pro check for plotly import in batchReport

    commit 31873d15fff9b9b12c1e8d8c62efa686c12cbd37
    Author: mbowcut2 <mbowcut@gmail.com>
    Date:   Fri Aug 16 15:05:58 2024 -0600

        pointed integration tests at test branch

    commit d449600561074c8721322e94a8375b3ad742a88c
    Author: mbowcut2 <mbowcut@gmail.com>
    Date:   Fri Aug 16 14:24:03 2024 -0600

        Squashed commit of the following:

        commit 6ec98a05ee70f85b5aa0ac15ab6094b7f1f20d08
        Author: mbowcut2 <mbowcut@gmail.com>
        Date:   Tue Aug 13 16:44:39 2024 -0600

            dict key changes

        commit 7cfd5acf06da4eb6f49453144ee1fed1e1488a7a
        Author: mbowcut2 <mbowcut@gmail.com>
        Date:   Thu Aug 8 15:30:31 2024 -0600

            added C2PRO install check back

        commit bfb0003329ea61b5c79c7e1df8d9a73ec5a508db
        Author: mbowcut2 <mbowcut@gmail.com>
        Date:   Fri Aug 2 13:08:12 2024 -0600

            fixed key error conditionals

        commit 84444e7480605206cb3efa4a0db675c55e717304
        Author: mbowcut2 <mbowcut@gmail.com>
        Date:   Fri Aug 2 09:22:44 2024 -0600

            use local jinja_paritals file

        commit 71dd12786fec6c4aba0170a3bfd9022b06f5eede
        Author: mbowcut2 <mbowcut@gmail.com>
        Date:   Wed Jul 31 14:10:29 2024 -0600

            Squashed commit of the following:

            commit 5e3b30515c4bc437127e7fb21f53cb0bd511c4ca
            Author: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
            Date:   Mon Jul 22 09:31:44 2024 -0600

                D3-Enhancements (#78)

                * Sam/try plots (#71)

                * Fix batch mode pandas warning. (#70)

                * refactor to call method on DataFrame, rather than Series.
                Removes warning.

                * Fix pandas future warning in CRISPRessoWGS

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Functional

                * Cole/fix status file name (#69)

                * Update config file logging messages

                This removes printing the exception (which is essentially a duplicate),
                and adds a condition if no config file was provided. Also changes `json`
                to `config` so that it is more clear.

                * Fix divide by zero when no amplicons are present in Batch mode

                * Don't append file_prefix to status file name

                * Place status files in output directories

                * Update tests branch for file_prefix addition

                * Load D3 and plotly figures with pro with multiple amplicons

                * Update batch

                * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

                Before this fix, when using a file_prefix the second run that was compared
                would not be displayed as a data in the first figure of the report.

                * Import CRISPRessoPro instead of importing the version

                When installed via conda, the version is not available

                * Remove `get_amplicon_output` unused function from CRISPRessoCompare

                Also remove unused argparse import

                * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

                * Allow for matching of multiple guides in the same amplicon

                * Fix pandas FutureWarning

                * Change test branch back to master

                ---------

                Co-authored-by: Sam <snic9004@gmail.com>

                * Try catch all futures

                * Fix test fail plots

                * Point test to try-plots

                * Fix d3 not showing and plotly mixing with matplotlib

                * Use logger for warnings and debug statements

                * Point tests back at master

                ---------

                Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Sam/fix plots (#72)

                * Fix batch mode pandas warning. (#70)

                * refactor to call method on DataFrame, rather than Series.
                Removes warning.

                * Fix pandas future warning in CRISPRessoWGS

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Functional

                * Cole/fix status file name (#69)

                * Update config file logging messages

                This removes printing the exception (which is essentially a duplicate),
                and adds a condition if no config file was provided. Also changes `json`
                to `config` so that it is more clear.

                * Fix divide by zero when no amplicons are present in Batch mode

                * Don't append file_prefix to status file name

                * Place status files in output directories

                * Update tests branch for file_prefix addition

                * Load D3 and plotly figures with pro with multiple amplicons

                * Update batch

                * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

                Before this fix, when using a file_prefix the second run that was compared
                would not be displayed as a data in the first figure of the report.

                * Import CRISPRessoPro instead of importing the version

                When installed via conda, the version is not available

                * Remove `get_amplicon_output` unused function from CRISPRessoCompare

                Also remove unused argparse import

                * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

                * Allow for matching of multiple guides in the same amplicon

                * Fix pandas FutureWarning

                * Change test branch back to master

                ---------

                Co-authored-by: Sam <snic9004@gmail.com>

                * Try catch all futures

                * Fix test fail plots

                * Fix d3 not showing and plotly mixing with matplotlib

                ---------

                Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Remove token from integration tests file

                * Provide sgRNA_sequences to plot_nucleotide_quilt plots

                * Passing sgRNA_sequences to plot

                * Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

                * Add max-height to Batch report samples

                * Change testing branch

                * Fix wrong check for large Batch plots

                * Fix typo and move flexiguide to debug (#77)

                * Change flexiguide output to debug level

                * Fix typo in fastp merged output file name

                * Adding id tags for d3 script enhancements

                * pointing to test branch

                * Add amplicon_name parameter to allele heatmap and line plots

                * Add function to extract quantification window regions from include_idxs

                * Scale the quantification window according to the coordinates of the sgRNA plot

                * added c2pro check, added space in args.json

                * Correct the quantification window indexes for multiple guides

                * Fix name of nucleotide conversion plot when guides are not the same

                * Fix jinja variables that aren't found

                * Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

                * Remove unneeded variable and extra whitespace

                * Switch test branch to master

                ---------

                Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
                Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
                Co-authored-by: Cole Lyman <cole@colelyman.com>

            commit 09e5d9720ad21e44fc7916d71bde3fd7a9dfa7ef
            Author: Kendell Clement <k.clement.dev@gmail.com>
            Date:   Thu Jul 18 14:31:54 2024 -0600

                Asymmetrical cut point (#457)

                * add cut_point_ind to plot_alleles_heatmap for asymmetrical plotting

                * Cole asymmetrical cut point (#453)

                * Pin versions of numpy and matplotlib in CI environment (#84) (#452)

                * Reduce duplication and implement cut_point_ind in plot_alleles_heatmap_hist

                ---------

                Co-authored-by: Cole Lyman <Cole@colelyman.com>

            commit 8d92972694ddff629dad844a6ad100459f69751d
            Author: Cole Lyman <Cole@colelyman.com>
            Date:   Thu Jul 18 14:29:40 2024 -0600

                Cole/update args (#85) (#456)

            commit 44f692ecabf5e2eb96ee0cfd7bae62343da7810c
            Author: Cole Lyman <Cole@colelyman.com>
            Date:   Mon Jul 15 16:17:29 2024 -0600

                Implement new pooled mixed-mode default behavior (#454)

                * changes for pooled mixed-mode default (#83)

                * changes for pooled mixed-mode default

                * deprecated old arg

                * added integration tests for mixed mode

                * fixed test target

                * updated test name

                * pinned numpy

                * Fix integration tests yml

                * pinning matplotlib

                * added print to CI tests

                * changed mixed mode info string

                * Remove pooled-mixed-mode-align-to-genome step from Github Actions

                * Update demultiplex_genome_wide parameter and help

                * Convert args.json to unix line endings

                * Add Pooled mixed mode demux run

                * Update the name of the argument in Pooled

                * Point integration tests back to master

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Revert change to pooled mixed mode info statement (#86)

                ---------

                Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

            commit 79b482b55a0e8edbc03ec22bd2714bade1e90323
            Author: Cole Lyman <Cole@colelyman.com>
            Date:   Tue Jul 9 12:53:23 2024 -0600

                Pin versions of numpy and matplotlib in CI environment (#84) (#452)

            commit 80dc1bdd72d50f989717bfc5f8156bc3495c45f4
            Author: Kendell Clement <k.clement.dev@gmail.com>
            Date:   Thu May 30 14:07:42 2024 -0600

                Add padding to image

            commit 381755daf0939aaf2745df0a802c809633aff47d
            Author: Kendell Clement <k.clement.dev@gmail.com>
            Date:   Thu May 30 13:59:57 2024 -0600

                White background for schematic for dark mode

            commit d649db71e610bd8840fbb8d46fadb07789b67390
            Author: Cole Lyman <Cole@colelyman.com>
            Date:   Fri May 24 12:45:53 2024 -0600

                Fix typo and move flexiguide to debug (#77) (#438)

                * Change flexiguide output to debug level

                * Fix typo in fastp merged output file name

            commit 71181f50ef2b39015523b1a71d9fd1bf0dce14eb
            Author: Cole Lyman <Cole@colelyman.com>
            Date:   Mon May 13 13:34:00 2024 -0600

                Prefix the release Docker tag with a `v` (#434)

            commit d2c2be18a6bb64b0e742cc24c4665980a24324bc
            Author: Cole Lyman <Cole@colelyman.com>
            Date:   Mon May 13 09:41:32 2024 -0600

                Showing sgRNA sequences on hover in CRISPRessoPro (#432)

                * Passing sgRNA sequences to regular and Batch D3 plots (#73)

                * Sam/try plots (#71)

                * Fix batch mode pandas warning. (#70)

                * refactor to call method on DataFrame, rather than Series.
                Removes warning.

                * Fix pandas future warning in CRISPRessoWGS

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Functional

                * Cole/fix status file name (#69)

                * Update config file logging messages

                This removes printing the exception (which is essentially a duplicate),
                and adds a condition if no config file was provided. Also changes `json`
                to `config` so that it is more clear.

                * Fix divide by zero when no amplicons are present in Batch mode

                * Don't append file_prefix to status file name

                * Place status files in output directories

                * Update tests branch for file_prefix addition

                * Load D3 and plotly figures with pro with multiple amplicons

                * Update batch

                * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

                Before this fix, when using a file_prefix the second run that was compared
                would not be displayed as a data in the first figure of the report.

                * Import CRISPRessoPro instead of importing the version

                When installed via conda, the version is not available

                * Remove `get_amplicon_output` unused function from CRISPRessoCompare

                Also remove unused argparse import

                * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

                * Allow for matching of multiple guides in the same amplicon

                * Fix pandas FutureWarning

                * Change test branch back to master

                ---------

                Co-authored-by: Sam <snic9004@gmail.com>

                * Try catch all futures

                * Fix test fail plots

                * Point test to try-plots

                * Fix d3 not showing and plotly mixing with matplotlib

                * Use logger for warnings and debug statements

                * Point tests back at master

                ---------

                Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Sam/fix plots (#72)

                * Fix batch mode pandas warning. (#70)

                * refactor to call method on DataFrame, rather than Series.
                Removes warning.

                * Fix pandas future warning in CRISPRessoWGS

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Functional

                * Cole/fix status file name (#69)

                * Update config file logging messages

                This removes printing the exception (which is essentially a duplicate),
                and adds a condition if no config file was provided. Also changes `json`
                to `config` so that it is more clear.

                * Fix divide by zero when no amplicons are present in Batch mode

                * Don't append file_prefix to status file name

                * Place status files in output directories

                * Update tests branch for file_prefix addition

                * Load D3 and plotly figures with pro with multiple amplicons

                * Update batch

                * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

                Before this fix, when using a file_prefix the second run that was compared
                would not be displayed as a data in the first figure of the report.

                * Import CRISPRessoPro instead of importing the version

                When installed via conda, the version is not available

                * Remove `get_amplicon_output` unused function from CRISPRessoCompare

                Also remove unused argparse import

                * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

                * Allow for matching of multiple guides in the same amplicon

                * Fix pandas FutureWarning

                * Change test branch back to master

                ---------

                Co-authored-by: Sam <snic9004@gmail.com>

                * Try catch all futures

                * Fix test fail plots

                * Fix d3 not showing and plotly mixing with matplotlib

                ---------

                Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Remove token from integration tests file

                * Provide sgRNA_sequences to plot_nucleotide_quilt plots

                * Passing sgRNA_sequences to plot

                * Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

                * Add max-height to Batch report samples

                * Change testing branch

                * Fix wrong check for large Batch plots

                * Update integration_tests.yml to point back at master

                ---------

                Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
                Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Push new releases to ECR (#74)

                * Create aws_ecr.yml (#1)

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * us-east-1

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Update aws_ecr.yml

                * Fix d3 sgRNA sequences (#76)

                * Pass correct sgRNA_sequences to d3 plot

                * Pass correct sgRNA sequence to prime editor plot for d3

                * Resize plotly (#75)

                * Sam/try plots (#71)

                * Fix batch mode pandas warning. (#70)

                * refactor to call method on DataFrame, rather than Series.
                Removes warning.

                * Fix pandas future warning in CRISPRessoWGS

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Functional

                * Cole/fix status file name (#69)

                * Update config file logging messages

                This removes printing the exception (which is essentially a duplicate),
                and adds a condition if no config file was provided. Also changes `json`
                to `config` so that it is more clear.

                * Fix divide by zero when no amplicons are present in Batch mode

                * Don't append file_prefix to status file name

                * Place status files in output directories

                * Update tests branch for file_prefix addition

                * Load D3 and plotly figures with pro with multiple amplicons

                * Update batch

                * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

                Before this fix, when using a file_prefix the second run that was compared
                would not be displayed as a data in the first figure of the report.

                * Import CRISPRessoPro instead of importing the version

                When installed via conda, the version is not available

                * Remove `get_amplicon_output` unused function from CRISPRessoCompare

                Also remove unused argparse import

                * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

                * Allow for matching of multiple guides in the same amplicon

                * Fix pandas FutureWarning

                * Change test branch back to master

                ---------

                Co-authored-by: Sam <snic9004@gmail.com>

                * Try catch all futures

                * Fix test fail plots

                * Point test to try-plots

                * Fix d3 not showing and plotly mixing with matplotlib

                * Use logger for warnings and debug statements

                * Point tests back at master

                ---------

                Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Sam/fix plots (#72)

                * Fix batch mode pandas warning. (#70)

                * refactor to call method on DataFrame, rather than Series.
                Removes warning.

                * Fix pandas future warning in CRISPRessoWGS

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Functional

                * Cole/fix status file name (#69)

                * Update config file logging messages

                This removes printing the exception (which is essentially a duplicate),
                and adds a condition if no config file was provided. Also changes `json`
                to `config` so that it is more clear.

                * Fix divide by zero when no amplicons are present in Batch mode

                * Don't append file_prefix to status file name

                * Place status files in output directories

                * Update tests branch for file_prefix addition

                * Load D3 and plotly figures with pro with multiple amplicons

                * Update batch

                * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

                Before this fix, when using a file_prefix the second run that was compared
                would not be displayed as a data in the first figure of the report.

                * Import CRISPRessoPro instead of importing the version

                When installed via conda, the version is not available

                * Remove `get_amplicon_output` unused function from CRISPRessoCompare

                Also remove unused argparse import

                * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

                * Allow for matching of multiple guides in the same amplicon

                * Fix pandas FutureWarning

                * Change test branch back to master

                ---------

                Co-authored-by: Sam <snic9004@gmail.com>

                * Try catch all futures

                * Fix test fail plots

                * Fix d3 not showing and plotly mixing with matplotlib

                ---------

                Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Remove token from integration tests file

                * Pass div id for plotly

                * Remove debug

                ---------

                Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
                Co-authored-by: Cole Lyman <cole@colelyman.com>

                ---------

                Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
                Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
                Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

            commit 1c504274818b6b17fb60620d48fd92cb2e50566d
            Author: Cole Lyman <Cole@colelyman.com>
            Date:   Thu May 9 14:16:25 2024 -0600

                Fix plots and improve plot error handling (#431)

                * Sam/try plots (#71)

                * Fix batch mode pandas warning. (#70)

                * refactor to call method on DataFrame, rather than Series.
                Removes warning.

                * Fix pandas future warning in CRISPRessoWGS

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Functional

                * Cole/fix status file name (#69)

                * Update config file logging messages

                This removes printing the exception (which is essentially a duplicate),
                and adds a condition if no config file was provided. Also changes `json`
                to `config` so that it is more clear.

                * Fix divide by zero when no amplicons are present in Batch mode

                * Don't append file_prefix to status file name

                * Place status files in output directories

                * Update tests branch for file_prefix addition

                * Load D3 and plotly figures with pro with multiple amplicons

                * Update batch

                * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

                Before this fix, when using a file_prefix the second run that was compared
                would not be displayed as a data in the first figure of the report.

                * Import CRISPRessoPro instead of importing the version

                When installed via conda, the version is not available

                * Remove `get_amplicon_output` unused function from CRISPRessoCompare

                Also remove unused argparse import

                * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

                * Allow for matching of multiple guides in the same amplicon

                * Fix pandas FutureWarning

                * Change test branch back to master

                ---------

                Co-authored-by: Sam <snic9004@gmail.com>

                * Try catch all futures

                * Fix test fail plots

                * Point test to try-plots

                * Fix d3 not showing and plotly mixing with matplotlib

                * Use logger for warnings and debug statements

                * Point tests back at master

                ---------

                Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Sam/fix plots (#72)

                * Fix batch mode pandas warning. (#70)

                * refactor to call method on DataFrame, rather than Series.
                Removes warning.

                * Fix pandas future warning in CRISPRessoWGS

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Functional

                * Cole/fix status file name (#69)

                * Update config file logging messages

                This removes printing the exception (which is essentially a duplicate),
                and adds a condition if no config file was provided. Also changes `json`
                to `config` so that it is more clear.

                * Fix divide by zero when no amplicons are present in Batch mode

                * Don't append file_prefix to status file name

                * Place status files in output directories

                * Update tests branch for file_prefix addition

                * Load D3 and plotly figures with pro with multiple amplicons

                * Update batch

                * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

                Before this fix, when using a file_prefix the second run that was compared
                would not be displayed as a data in the first figure of the report.

                * Import CRISPRessoPro instead of importing the version

                When installed via conda, the version is not available

                * Remove `get_amplicon_output` unused function from CRISPRessoCompare

                Also remove unused argparse import

                * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

                * Allow for matching of multiple guides in the same amplicon

                * Fix pandas FutureWarning

                * Change test branch back to master

                ---------

                Co-authored-by: Sam <snic9004@gmail.com>

                * Try catch all futures

                * Fix test fail plots

                * Fix d3 not showing and plotly mixing with matplotlib

                ---------

                Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Remove token from integration tests file

                ---------

                Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
                Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

            commit acb2ea8e26dff4cd11f71301b344f81b1cec9040
            Author: Kendell Clement <k.clement.dev@gmail.com>
            Date:   Thu May 2 13:49:33 2024 -0600

                Use recent docker image for CircleCI testing that includes updated pandas

            commit 38fd76dbd7ce2087468f9f454b548777de959a68
            Author: Cole Lyman <Cole@colelyman.com>
            Date:   Wed May 1 16:42:28 2024 -0600

                Cole/fix status file name (#69) (#430)

                * Update config file logging messages

                This removes printing the exception (which is essentially a duplicate),
                and adds a condition if no config file was provided. Also changes `json`
                to `config` so that it is more clear.

                * Fix divide by zero when no amplicons are present in Batch mode

                * Don't append file_prefix to status file name

                * Place status files in output directories

                * Update tests branch for file_prefix addition

                * Load D3 and plotly figures with pro with multiple amplicons

                * Update batch

                * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

                Before this fix, when using a file_prefix the second run that was compared
                would not be displayed as a data in the first figure of the report.

                * Import CRISPRessoPro instead of importing the version

                When installed via conda, the version is not available

                * Remove `get_amplicon_output` unused function from CRISPRessoCompare

                Also remove unused argparse import

                * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

                * Allow for matching of multiple guides in the same amplicon

                * Fix pandas FutureWarning

                * Change test branch back to master

                ---------

                Co-authored-by: Sam <snic9004@gmail.com>

            commit 3ec22e5fd09e432c9997d30e5f9ee51a2cc00d7b
            Author: Kendell Clement <k.clement.dev@gmail.com>
            Date:   Wed May 1 13:08:11 2024 -0600

                Remove linked space in readme

            commit 340a4e16795a5e500411e11572ec267525985009
            Author: Cole Lyman <Cole@colelyman.com>
            Date:   Wed May 1 13:07:14 2024 -0600

                Fix batch mode pandas warning. (#70) (#429)

                * refactor to call method on DataFrame, rather than Series.
                Removes warning.

                * Fix pandas future warning in CRISPRessoWGS

                ---------

                Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

            commit 1bc9e906f0ded81f80761d1ec375ee50a4f882a9
            Author: Cole Lyman <Cole@colelyman.com>
            Date:   Fri Apr 26 16:26:27 2024 -0600

                Bump version to 2.3.1 and change default CRISPRessoPooled behavior to change in 2.3.2 (#428)

            commit 5638a1f6ffa973231f23422e9c757fa8cd4af7cc
            Author: Kendell Clement <k.clement.dev@gmail.com>
            Date:   Wed Apr 24 18:00:43 2024 -0600

                Spelling fixes

            commit d6011f29db16d8fc1c1e7222457b7f9a1f671de6
            Author: Cole Lyman <Cole@colelyman.com>
            Date:   Wed Apr 24 09:33:53 2024 -0600

                Extract `jinja_partials` and fix CRISPRessoPooled fastp errors (#425)

                * Updated README (#64)

                * Updating README to fix argument, email, and formatting

                * removing superfluous files

                * Add link to CRISPRessoPro, move CRISPRessoPro section to end, and fix JSON formatting

                * Remove link to CRISPRessoPro

                * Replace Docker badge with link to tags

                * Add bullet points to Guardrails section and improve formatting

                * Fix typo and removed colons from guardrails

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Extract jinja_partials  (#65)

                * Extract jinja_partials code

                * Remove Plotly dependency from setup.py

                * Fix CRISPRessoPooled flash errors (#68)

                * Fix replacing flash intermediate files with fastp intermediate files

                This also moves where the files are added to `files_to_remove` up to
                near where they are created.

                * Update to run test branch with paired end Pooled test

                * Add pooled-paired-sim test to integration tests

                * Replace flash and trimmomatic with fastp and remove plotly from Github Actions environment

                * Change test branch back to master

                ---------

                Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>

            commit f4858a30c43374f54058b3ad9c1e965e1ab7fb46
            Author: Cole Lyman <Cole@colelyman.com>
            Date:   Tue Apr 23 17:00:28 2024 -0600

                Updated README (#64) (#424)

                * Updating README to fix argument, email, and formatting

                * removing superfluous files

                * Add link to CRISPRessoPro, move CRISPRessoPro section to end, and fix JSON formatting

                * Remove link to CRISPRessoPro

                * Replace Docker badge with link to tags

                * Add bullet points to Guardrails section and improve formatting

                * Fix typo and removed colons from guardrails

                ---------

                Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>

            commit c3dbff0fccd44b0b1a9c246dd2aa629ddc515787
            Author: Kendell Clement <k.clement.dev@gmail.com>
            Date:   Mon Apr 22 11:24:59 2024 -0600

                Update CRISPRessoPooledCORE.py (#423)

                Fix bug in error reporting if duplicate names are present

            commit 20903c14877e5166b1b8a7b50b8fcab450ea3ca6
            Author: Cole Lyman <Cole@colelyman.com>
            Date:   Thu Apr 18 16:55:39 2024 -0600

                Remove extra imports from CRISPRessoCore (#67) (#422)

            commit 4aae57e5be475cd717792265bee36a71a99425de
            Author: Cole Lyman <Cole@colelyman.com>
            Date:   Thu Apr 18 10:00:19 2024 -0600

                Cole/refactor jinja undefined (#66) (#421)

                * Replace Jinja2 PackageLoader with FileSystemLoader

                The PackageLoader doesn't work with a fairly recent version of Jinja2 (3.0.1)
                and Python 3.9. Replacing with FileSystemLoader work with the older version and
                the latest version.

                * Fix undefined variable `amplicon_name` in report template

                * Refactor logging Jinja2 undefined variable warnings

                * Revert plot_11a update

                * Update intedration test branch

                * Update jinja to warn on undefined but not fail. Fix all undefined warnings

                * Fix github integration tests ref

                * One more undefined variable

                ---------

                Co-authored-by: Samuel Nichols <Snic9004@gmail.com>

            commit 768c3c05bf1786a2a32e135b6e145cd6503c3db1
            Author: Cole Lyman <Cole@colelyman.com>
            Date:   Tue Apr 9 17:30:10 2024 -0600

                Fix Jinja2 undefined variables (#63) (#417)

                * Replace Jinja2 PackageLoader with FileSystemLoader

                The PackageLoader doesn't work with a fairly recent version of Jinja2 (3.0.1)
                and Python 3.9. Replacing with FileSystemLoader work with the older version and
                the latest version.

                * Fix undefined variable `amplicon_name` in report template

                * Revert plot_11a update

                * Update intedration test branch

                * Update branch for integration tests

            commit 7e18f08cc1ac5f247a0fd1bbb394ccd9b0a07c2e
            Author: Han Dai <github@daihan.me>
            Date:   Fri Apr 5 18:36:41 2024 -0400

                fix: change all U+00A0 to U+0020 (#400)

            commit 235dc29c0cd0fcca2e999148d4660acf00b07221
            Author: Cole Lyman <Cole@colelyman.com>
            Date:   Fri Apr 5 16:36:16 2024 -0600

                Fastp, args as data, guardrails, and PE fix (#415)

                * Change CRISPResso_status.txt format to JSON (#46)

                * Bug Fix - 367 (#35)

                * - Fixed references to ref_names_for_pe

                * removed extra tabs

                * trying to match empty line, no tabs

                * - changed references to ref_names[0]

                * Mckay/pd warnings (#45)

                * refactor errors='ignore' to try except

                * refactored integer slice to iloc[]

                * moved to_numeric try except to function

                * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

                This change is slightly cleaner because it addresses the root issue that some
                columns are strings (and can therefore not be converted to numeric types). Now
                if an error does occur when converting the dfs to numeric types it won't be
                swallowed up.

                * Add documentation to to_numeric_ignore_columns

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * add json read for status file

                * changed Formatter to json format

                * fixed json access variable name: message

                * changed  perentage_complete to numeric

                * changed status file to .json

                * Create integration_tests.yml

                * Simplify name

                * CRISPRESSO2_DIR environment variable

                * Up one dir

                * ls workspace

                * Install CRISPResso and ydiff

                * Clone repo instead of checkout

                * submodule

                * ls

                * CRISPResso2_copy

                * ls

                * Update env

                * Simplify

                * Pull from githubactions branch

                * Pull githubactions repo

                * Checkout githubactions

                * Run tests individually

                * Pin plotly version

                * Run all tests even if one fails

                * Test on another branch

                * Switch branch with token

                * Update integration_tests.yml

                * New makefile commands

                * changed file to .json

                * changed status to json file

                * Make JSON human readable by adding new lines

                * GitHub actions integration tests (#48)

                * GitHub actions clean (#40)

                * Create pytest.yml

                * Create pylint.yml

                * Create .pylintrc

                * Create test_env.yml

                * Full path

                * Remove conda install

                * Replace path

                * Pytest tests

                * pip -e

                * Create integration_tests.yml

                * Simplify name

                * CRISPRESSO2_DIR environment variable

                * Up one dir

                * ls workspace

                * Install CRISPResso and ydiff

                * Clone repo instead of checkout

                * submodule

                * ls

                * CRISPResso2_copy

                * ls

                * Update env

                * Simplify

                * Pull from githubactions branch

                * Pull githubactions repo

                * Checkout githubactions

                * Mckay/pd warnings (#45)

                * refactor errors='ignore' to try except

                * refactored integer slice to iloc[]

                * moved to_numeric try except to function

                * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

                This change is slightly cleaner because it addresses the root issue that some
                columns are strings (and can therefore not be converted to numeric types). Now
                if an error does occur when converting the dfs to numeric types it won't be
                swallowed up.

                * Add documentation to to_numeric_ignore_columns

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Run tests individually

                * Pin plotly version

                * Run all tests even if one fails

                * Test on another branch

                * Switch branch with token

                * Update integration_tests.yml

                * Introduce pandas sorting in CRISPRessoCompare (#47)

                * New makefile commands

                * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

                * Extract out split_interleaved_fastq function to CRISPRessoShared

                * Implement splitting interleaved fastq files in CRISPRessoPooled

                * Suppress split_interleaved_input from CRISPRessoWGS parameters

                * Suppress other parameters in CRISPRessoWGS

                * Move where interleaved fastq files are split to be trimmed properly

                * Bug Fix - 367 (#35)

                * - Fixed references to ref_names_for_pe

                * removed extra tabs

                * trying to match empty line, no tabs

                * - changed references to ref_names[0]

                * Mckay/pd warnings (#45)

                * refactor errors='ignore' to try except

                * refactored integer slice to iloc[]

                * moved to_numeric try except to function

                * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

                This change is slightly cleaner because it addresses the root issue that some
                columns are strings (and can therefore not be converted to numeric types). Now
                if an error does occur when converting the dfs to numeric types it won't be
                swallowed up.

                * Add documentation to to_numeric_ignore_columns

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * On push no branches

                * On push no branches

                * All in one file

                * Fix yml errors

                * Rename jobs

                * Remove old workflow files

                * Remove paths

                * Run jobs in parallel

                ---------

                Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Move read filtering to after merging in CRISPResso (#39)

                * Move read filtering to after merging

                This is in an effort to be consistent with the behavior and results of
                CRISPRessoPooled.

                * Properly assign the correct file names for read filtering

                * Add space around operators

                * GitHub actions on pr (#51)

                * Run integration tests on pull_request

                * Run pytest on pull_request

                * Run pylint on pull_request

                * Run tests on PR only when opening PR (#53)

                * Update reports (#52)

                * Update report changes

                * Switch branch of integration test repo

                * Remove extraneous `crispresso_data_path`

                * Point integration tests back to master

                * point to test branch

                * pointed CI config to testing branch

                * Update integration_tests.yml

                point to master

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>
                Co-authored-by: Samuel Nichols <Snic9004@gmail.com>

                * Trevor/fastp integration (#50)

                * Update check_program to check versions and create check_fastq function

                * Update fastq arg, implement fastp in get_most_frequent_reads

                * Bump version to 2.3.0

                * Deprecate Flash and Trimmomatic parameters, and update fastp params

                * Update guess_amplicons and guess_guides to remove max_paired_end_reads_overlap

                * Implement trimming of single end reads

                * Merge (and trim) reads in CRISPRessoCORE with fastp

                * Modify error handling to account for fastp errors

                * Replace flash and trimmomatic with fastp in Docker dependencies

                * Update LICENSE.txt with fastp info

                * Remove min and max amplicon length (no longer needed)

                * Mckay/pd warnings (#45)

                * refactor errors='ignore' to try except

                * refactored integer slice to iloc[]

                * moved to_numeric try except to function

                * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

                This change is slightly cleaner because it addresses the root issue that some
                columns are strings (and can therefore not be converted to numeric types). Now
                if an error does occur when converting the dfs to numeric types it won't be
                swallowed up.

                * Add documentation to to_numeric_ignore_columns

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Implement trimming with fastp in CRISPRessoPooled

                * Implemend merging (and trimming) with fastp in CRISPRessoPooled

                * Fixed minor fastp errors

                * Move read filtering to after merging in CRISPResso (#39)

                * Move read filtering to after merging

                This is in an effort to be consistent with the behavior and results of
                CRISPRessoPooled.

                * Properly assign the correct file names for read filtering

                * Add space around operators

                * GitHub actions on pr (#51)

                * Run integration tests on pull_request

                * Run pytest on pull_request

                * Run pylint on pull_request

                * Run tests on PR only when opening PR (#53)

                * Update reports (#52)

                * Update report changes

                * Switch branch of integration test repo

                * Remove extraneous `crispresso_data_path`

                * Point integration tests back to master

                * Update where the test point to

                * Fix 'Prime-edited' key not found (#32)

                * Move 'Prime-edited' amplicon name check

                By moving this, it will check if there is an amplicon named
                'Prime-edited' (which is a reserved name) even if the
                `prime_editing_pegRNA_extension_seq` parameter is empty.

                * Only search for scaffold integration when pegRNA extension seq is provided

                * Remove spaces at the end of lines

                * Docker size (#49)

                * Bug Fix - 367 (#35)

                * - Fixed references to ref_names_for_pe

                * removed extra tabs

                * trying to match empty line, no tabs

                * - changed references to ref_names[0]

                * Mckay/pd warnings (#45)

                * refactor errors='ignore' to try except

                * refactored integer slice to iloc[]

                * moved to_numeric try except to function

                * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

                This change is slightly cleaner because it addresses the root issue that some
                columns are strings (and can therefore not be converted to numeric types). Now
                if an error does occur when converting the dfs to numeric types it won't be
                swallowed up.

                * Add documentation to to_numeric_ignore_columns

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * GitHub actions integration tests (#48)

                * GitHub actions clean (#40)

                * Create pytest.yml

                * Create pylint.yml

                * Create .pylintrc

                * Create test_env.yml

                * Full path

                * Remove conda install

                * Replace path

                * Pytest tests

                * pip -e

                * Create integration_tests.yml

                * Simplify name

                * CRISPRESSO2_DIR environment variable

                * Up one dir

                * ls workspace

                * Install CRISPResso and ydiff

                * Clone repo instead of checkout

                * submodule

                * ls

                * CRISPResso2_copy

                * ls

                * Update env

                * Simplify

                * Pull from githubactions branch

                * Pull githubactions repo

                * Checkout githubactions

                * Mckay/pd warnings (#45)

                * refactor errors='ignore' to try except

                * refactored integer slice to iloc[]

                * moved to_numeric try except to function

                * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

                This change is slightly cleaner because it addresses the root issue that some
                columns are strings (and can therefore not be converted to numeric types). Now
                if an error does occur when converting the dfs to numeric types it won't be
                swallowed up.

                * Add documentation to to_numeric_ignore_columns

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Run tests individually

                * Pin plotly version

                * Run all tests even if one fails

                * Test on another branch

                * Switch branch with token

                * Update integration_tests.yml

                * Introduce pandas sorting in CRISPRessoCompare (#47)

                * New makefile commands

                * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

                * Extract out split_interleaved_fastq function to CRISPRessoShared

                * Implement splitting interleaved fastq files in CRISPRessoPooled

                * Suppress split_interleaved_input from CRISPRessoWGS parameters

                * Suppress other parameters in CRISPRessoWGS

                * Move where interleaved fastq files are split to be trimmed properly

                * Bug Fix - 367 (#35)

                * - Fixed references to ref_names_for_pe

                * removed extra tabs

                * trying to match empty line, no tabs

                * - changed references to ref_names[0]

                * Mckay/pd warnings (#45)

                * refactor errors='ignore' to try except

                * refactored integer slice to iloc[]

                * moved to_numeric try except to function

                * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

                This change is slightly cleaner because it addresses the root issue that some
                columns are strings (and can therefore not be converted to numeric types). Now
                if an error does occur when converting the dfs to numeric types it won't be
                swallowed up.

                * Add documentation to to_numeric_ignore_columns

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * On push no branches

                * On push no branches

                * All in one file

                * Fix yml errors

                * Rename jobs

                * Remove old workflow files

                * Remove paths

                * Run jobs in parallel

                ---------

                Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * 3.4->2.08

                * Put ttf-mscorefonts-installer back above apt-get clean

                * restore slash, replace fastp with trimmomatic and flash, add autoremove step

                ---------

                Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * initial readme modifications

                * Updated readme to remove deprecated commands, updated help text to reflect new version and fastp

                * Pointing test branch back at master

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>
                Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
                Co-authored-by: Samuel Nichols <Snic9004@gmail.com>

                * Guardrails clean history (#34)

                * Include guardrail functions

                * Add CRISPRessoReports subtree

                * Refactor to use CRISPRessoReports module

                * Include guardrail functions

                * Functional guardrails, needs reports update

                * Add guardrail partial

                * fix guardrials partial

                * Bug Fix - 367 (#35)

                * - Fixed references to ref_names_for_pe

                * removed extra tabs

                * trying to match empty line, no tabs

                * - changed references to ref_names[0]

                * Mckay/pd warnings (#45)

                * refactor errors='ignore' to try except

                * refactored integer slice to iloc[]

                * moved to_numeric try except to function

                * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

                This change is slightly cleaner because it addresses the root issue that some
                columns are strings (and can therefore not be converted to numeric types). Now
                if an error does occur when converting the dfs to numeric types it won't be
                swallowed up.

                * Add documentation to to_numeric_ignore_columns

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * GitHub actions integration tests (#48)

                * GitHub actions clean (#40)

                * Create pytest.yml

                * Create pylint.yml

                * Create .pylintrc

                * Create test_env.yml

                * Full path

                * Remove conda install

                * Replace path

                * Pytest tests

                * pip -e

                * Create integration_tests.yml

                * Simplify name

                * CRISPRESSO2_DIR environment variable

                * Up one dir

                * ls workspace

                * Install CRISPResso and ydiff

                * Clone repo instead of checkout

                * submodule

                * ls

                * CRISPResso2_copy

                * ls

                * Update env

                * Simplify

                * Pull from githubactions branch

                * Pull githubactions repo

                * Checkout githubactions

                * Mckay/pd warnings (#45)

                * refactor errors='ignore' to try except

                * refactored integer slice to iloc[]

                * moved to_numeric try except to function

                * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

                This change is slightly cleaner because it addresses the root issue that some
                columns are strings (and can therefore not be converted to numeric types). Now
                if an error does occur when converting the dfs to numeric types it won't be
                swallowed up.

                * Add documentation to to_numeric_ignore_columns

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Run tests individually

                * Pin plotly version

                * Run all tests even if one fails

                * Test on another branch

                * Switch branch with token

                * Update integration_tests.yml

                * Introduce pandas sorting in CRISPRessoCompare (#47)

                * New makefile commands

                * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

                * Extract out split_interleaved_fastq function to CRISPRessoShared

                * Implement splitting interleaved fastq files in CRISPRessoPooled

                * Suppress split_interleaved_input from CRISPRessoWGS parameters

                * Suppress other parameters in CRISPRessoWGS

                * Move where interleaved fastq files are split to be trimmed properly

                * Bug Fix - 367 (#35)

                * - Fixed references to ref_names_for_pe

                * removed extra tabs

                * trying to match empty line, no tabs

                * - changed references to ref_names[0]

                * Mckay/pd warnings (#45)

                * refactor errors='ignore' to try except

                * refactored integer slice to iloc[]

                * moved to_numeric try except to function

                * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

                This change is slightly cleaner because it addresses the root issue that some
                columns are strings (and can therefore not be converted to numeric types). Now
                if an error does occur when converting the dfs to numeric types it won't be
                swallowed up.

                * Add documentation to to_numeric_ignore_columns

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * On push no branches

                * On push no branches

                * All in one file

                * Fix yml errors

                * Rename jobs

                * Remove old workflow files

                * Remove paths

                * Run jobs in parallel

                ---------

                Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
                Co-authored-by: Cole Lyman <cole@colelyman.com>

                * Update C cythonized files

                * Add exact numbers to guardrails printouts

                * Remove extraneous whitespace from CRISPRessoCOREResources.pyx

                * Fix calculation of `total_mods` from being negative

                The issue was that `all_deletion_coordinates` just tells you how many deletions
                were present, but not how long the deletion is.

                * Changes to message

                * Remove old tag

                * Point tests at guardrails

                * Restore C2 pro check

                * Save message with guardrail name

                * Point tests repo at master

                ---------

                Co-authored-by: Cole Lyman <cole@colelyman.com>
                Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

                * Fix case sensitivity in Prime Editing mode (#54)

                * Move read filtering to after merging in CRISPResso (#39)

                * Move read filtering to after merging

                This is in an effort to be consistent with the behavior and results of
                CRISPRessoPooled.

                * Properly assign the correct file names for read filtering

                * Add space around operators

                * GitHub actions on pr (#51)

                * Run integration tests on pull_request

                * Run pytest on pull_request

                * Run pylint on pull_request

                * Run tests on PR only when opening PR (#53)

                * Update reports (#52)

                * Update report changes

                * Switch branch of integration test repo

                * Remove extraneous `crispresso_data_path`

                * Point integration tests back to master

                * Make all amplicons in amplicon_seq_arr uppercase

                This fixes https://github.com/pinellolab/CRISPResso2/issues/396

                * Allow RNA values to be provided for prime_editing_pegRNA_scaffold_seq

                * Fix 'Prime-edited' key not found (#32)

                * Move 'Prime-edited' amplicon name check

                By moving this, it will check if there is an amplicon named
                'Prime-edited' (which is a reserved name) even if the
                `prime_editing_pegRNA_extension…
mbowcut2 added a commit that referenced this issue Aug 22, 2024
* Fix CRISPRessoAggregate bug and other improvements (#95)

* D3-Enhancements (#78)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------

Co-authored-by: Cole Lyman <cole@colelyman.com>

* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------

Co-authored-by: Sam <snic9004@gmail.com>

* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------

Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Cole Lyman <cole@colelyman.com>

* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------

Co-authored-by: Cole Lyman <cole@colelyman.com>

* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------

Co-authored-by: Sam <snic9004@gmail.com>

* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------

Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Cole Lyman <cole@colelyman.com>

* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Fix typo and move flexiguide to debug (#77)

* Change flexiguide output to debug level

* Fix typo in fastp merged output file name

* Adding id tags for d3 script enhancements

* pointing to test branch

* Add amplicon_name parameter to allele heatmap and line plots

* Add function to extract quantification window regions from include_idxs

* Scale the quantification window according to the coordinates of the sgRNA plot

* added c2pro check, added space in args.json

* Correct the quantification window indexes for multiple guides

* Fix name of nucleotide conversion plot when guides are not the same

* Fix jinja variables that aren't found

* Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

* Remove unneeded variable and extra whitespace

* Switch test branch to master

---------

Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Cole Lyman <cole@colelyman.com>

* Add amplicon_name to plot functions

* Add sgRNA sequences to nucleotide quilt parameters in Aggregate

* Add custom_colors to Aggregate plot functions

* Update Aggregate and make_aggregate_report to have logger and tool

* Write command_used to Aggregate .json info file

* Point to new test branch and add Aggregate run

* Make the order of Aggregate runs explicit

* Sort all instances of crispresso2_folder_info in Aggregate

* Sort df_summary_quantification df in Aggregate

* Try sorting with a list of single column

* Update to correct test branch

* Move to master test branch

---------

Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

* Squashed commit of the following:

commit 6ec98a05ee70f85b5aa0ac15ab6094b7f1f20d08
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Tue Aug 13 16:44:39 2024 -0600

    dict key changes

commit 7cfd5acf06da4eb6f49453144ee1fed1e1488a7a
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Thu Aug 8 15:30:31 2024 -0600

    added C2PRO install check back

commit bfb0003329ea61b5c79c7e1df8d9a73ec5a508db
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Fri Aug 2 13:08:12 2024 -0600

    fixed key error conditionals

commit 84444e7480605206cb3efa4a0db675c55e717304
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Fri Aug 2 09:22:44 2024 -0600

    use local jinja_paritals file

commit 71dd12786fec6c4aba0170a3bfd9022b06f5eede
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Wed Jul 31 14:10:29 2024 -0600

    Squashed commit of the following:

    commit 5e3b30515c4bc437127e7fb21f53cb0bd511c4ca
    Author: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
    Date:   Mon Jul 22 09:31:44 2024 -0600

        D3-Enhancements (#78)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        * Provide sgRNA_sequences to plot_nucleotide_quilt plots

        * Passing sgRNA_sequences to plot

        * Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

        * Add max-height to Batch report samples

        * Change testing branch

        * Fix wrong check for large Batch plots

        * Fix typo and move flexiguide to debug (#77)

        * Change flexiguide output to debug level

        * Fix typo in fastp merged output file name

        * Adding id tags for d3 script enhancements

        * pointing to test branch

        * Add amplicon_name parameter to allele heatmap and line plots

        * Add function to extract quantification window regions from include_idxs

        * Scale the quantification window according to the coordinates of the sgRNA plot

        * added c2pro check, added space in args.json

        * Correct the quantification window indexes for multiple guides

        * Fix name of nucleotide conversion plot when guides are not the same

        * Fix jinja variables that aren't found

        * Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

        * Remove unneeded variable and extra whitespace

        * Switch test branch to master

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

    commit 09e5d9720ad21e44fc7916d71bde3fd7a9dfa7ef
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu Jul 18 14:31:54 2024 -0600

        Asymmetrical cut point (#457)

        * add cut_point_ind to plot_alleles_heatmap for asymmetrical plotting

        * Cole asymmetrical cut point (#453)

        * Pin versions of numpy and matplotlib in CI environment (#84) (#452)

        * Reduce duplication and implement cut_point_ind in plot_alleles_heatmap_hist

        ---------

        Co-authored-by: Cole Lyman <Cole@colelyman.com>

    commit 8d92972694ddff629dad844a6ad100459f69751d
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu Jul 18 14:29:40 2024 -0600

        Cole/update args (#85) (#456)

    commit 44f692ecabf5e2eb96ee0cfd7bae62343da7810c
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Mon Jul 15 16:17:29 2024 -0600

        Implement new pooled mixed-mode default behavior (#454)

        * changes for pooled mixed-mode default (#83)

        * changes for pooled mixed-mode default

        * deprecated old arg

        * added integration tests for mixed mode

        * fixed test target

        * updated test name

        * pinned numpy

        * Fix integration tests yml

        * pinning matplotlib

        * added print to CI tests

        * changed mixed mode info string

        * Remove pooled-mixed-mode-align-to-genome step from Github Actions

        * Update demultiplex_genome_wide parameter and help

        * Convert args.json to unix line endings

        * Add Pooled mixed mode demux run

        * Update the name of the argument in Pooled

        * Point integration tests back to master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Revert change to pooled mixed mode info statement (#86)

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit 79b482b55a0e8edbc03ec22bd2714bade1e90323
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Tue Jul 9 12:53:23 2024 -0600

        Pin versions of numpy and matplotlib in CI environment (#84) (#452)

    commit 80dc1bdd72d50f989717bfc5f8156bc3495c45f4
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu May 30 14:07:42 2024 -0600

        Add padding to image

    commit 381755daf0939aaf2745df0a802c809633aff47d
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu May 30 13:59:57 2024 -0600

        White background for schematic for dark mode

    commit d649db71e610bd8840fbb8d46fadb07789b67390
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Fri May 24 12:45:53 2024 -0600

        Fix typo and move flexiguide to debug (#77) (#438)

        * Change flexiguide output to debug level

        * Fix typo in fastp merged output file name

    commit 71181f50ef2b39015523b1a71d9fd1bf0dce14eb
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Mon May 13 13:34:00 2024 -0600

        Prefix the release Docker tag with a `v` (#434)

    commit d2c2be18a6bb64b0e742cc24c4665980a24324bc
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Mon May 13 09:41:32 2024 -0600

        Showing sgRNA sequences on hover in CRISPRessoPro (#432)

        * Passing sgRNA sequences to regular and Batch D3 plots (#73)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        * Provide sgRNA_sequences to plot_nucleotide_quilt plots

        * Passing sgRNA_sequences to plot

        * Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

        * Add max-height to Batch report samples

        * Change testing branch

        * Fix wrong check for large Batch plots

        * Update integration_tests.yml to point back at master

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Push new releases to ECR (#74)

        * Create aws_ecr.yml (#1)

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * us-east-1

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Fix d3 sgRNA sequences (#76)

        * Pass correct sgRNA_sequences to d3 plot

        * Pass correct sgRNA sequence to prime editor plot for d3

        * Resize plotly (#75)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        * Pass div id for plotly

        * Remove debug

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit 1c504274818b6b17fb60620d48fd92cb2e50566d
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu May 9 14:16:25 2024 -0600

        Fix plots and improve plot error handling (#431)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit acb2ea8e26dff4cd11f71301b344f81b1cec9040
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu May 2 13:49:33 2024 -0600

        Use recent docker image for CircleCI testing that includes updated pandas

    commit 38fd76dbd7ce2087468f9f454b548777de959a68
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Wed May 1 16:42:28 2024 -0600

        Cole/fix status file name (#69) (#430)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

    commit 3ec22e5fd09e432c9997d30e5f9ee51a2cc00d7b
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Wed May 1 13:08:11 2024 -0600

        Remove linked space in readme

    commit 340a4e16795a5e500411e11572ec267525985009
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Wed May 1 13:07:14 2024 -0600

        Fix batch mode pandas warning. (#70) (#429)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit 1bc9e906f0ded81f80761d1ec375ee50a4f882a9
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Fri Apr 26 16:26:27 2024 -0600

        Bump version to 2.3.1 and change default CRISPRessoPooled behavior to change in 2.3.2 (#428)

    commit 5638a1f6ffa973231f23422e9c757fa8cd4af7cc
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Wed Apr 24 18:00:43 2024 -0600

        Spelling fixes

    commit d6011f29db16d8fc1c1e7222457b7f9a1f671de6
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Wed Apr 24 09:33:53 2024 -0600

        Extract `jinja_partials` and fix CRISPRessoPooled fastp errors (#425)

        * Updated README (#64)

        * Updating README to fix argument, email, and formatting

        * removing superfluous files

        * Add link to CRISPRessoPro, move CRISPRessoPro section to end, and fix JSON formatting

        * Remove link to CRISPRessoPro

        * Replace Docker badge with link to tags

        * Add bullet points to Guardrails section and improve formatting

        * Fix typo and removed colons from guardrails

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Extract jinja_partials  (#65)

        * Extract jinja_partials code

        * Remove Plotly dependency from setup.py

        * Fix CRISPRessoPooled flash errors (#68)

        * Fix replacing flash intermediate files with fastp intermediate files

        This also moves where the files are added to `files_to_remove` up to
        near where they are created.

        * Update to run test branch with paired end Pooled test

        * Add pooled-paired-sim test to integration tests

        * Replace flash and trimmomatic with fastp and remove plotly from Github Actions environment

        * Change test branch back to master

        ---------

        Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>

    commit f4858a30c43374f54058b3ad9c1e965e1ab7fb46
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Tue Apr 23 17:00:28 2024 -0600

        Updated README (#64) (#424)

        * Updating README to fix argument, email, and formatting

        * removing superfluous files

        * Add link to CRISPRessoPro, move CRISPRessoPro section to end, and fix JSON formatting

        * Remove link to CRISPRessoPro

        * Replace Docker badge with link to tags

        * Add bullet points to Guardrails section and improve formatting

        * Fix typo and removed colons from guardrails

        ---------

        Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>

    commit c3dbff0fccd44b0b1a9c246dd2aa629ddc515787
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Mon Apr 22 11:24:59 2024 -0600

        Update CRISPRessoPooledCORE.py (#423)

        Fix bug in error reporting if duplicate names are present

    commit 20903c14877e5166b1b8a7b50b8fcab450ea3ca6
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu Apr 18 16:55:39 2024 -0600

        Remove extra imports from CRISPRessoCore (#67) (#422)

    commit 4aae57e5be475cd717792265bee36a71a99425de
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu Apr 18 10:00:19 2024 -0600

        Cole/refactor jinja undefined (#66) (#421)

        * Replace Jinja2 PackageLoader with FileSystemLoader

        The PackageLoader doesn't work with a fairly recent version of Jinja2 (3.0.1)
        and Python 3.9. Replacing with FileSystemLoader work with the older version and
        the latest version.

        * Fix undefined variable `amplicon_name` in report template

        * Refactor logging Jinja2 undefined variable warnings

        * Revert plot_11a update

        * Update intedration test branch

        * Update jinja to warn on undefined but not fail. Fix all undefined warnings

        * Fix github integration tests ref

        * One more undefined variable

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>

    commit 768c3c05bf1786a2a32e135b6e145cd6503c3db1
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Tue Apr 9 17:30:10 2024 -0600

        Fix Jinja2 undefined variables (#63) (#417)

        * Replace Jinja2 PackageLoader with FileSystemLoader

        The PackageLoader doesn't work with a fairly recent version of Jinja2 (3.0.1)
        and Python 3.9. Replacing with FileSystemLoader work with the older version and
        the latest version.

        * Fix undefined variable `amplicon_name` in report template

        * Revert plot_11a update

        * Update intedration test branch

        * Update branch for integration tests

    commit 7e18f08cc1ac5f247a0fd1bbb394ccd9b0a07c2e
    Author: Han Dai <github@daihan.me>
    Date:   Fri Apr 5 18:36:41 2024 -0400

        fix: change all U+00A0 to U+0020 (#400)

    commit 235dc29c0cd0fcca2e999148d4660acf00b07221
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Fri Apr 5 16:36:16 2024 -0600

        Fastp, args as data, guardrails, and PE fix (#415)

        * Change CRISPResso_status.txt format to JSON (#46)

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * add json read for status file

        * changed Formatter to json format

        * fixed json access variable name: message

        * changed  perentage_complete to numeric

        * changed status file to .json

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * New makefile commands

        * changed file to .json

        * changed status to json file

        * Make JSON human readable by adding new lines

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * Introduce pandas sorting in CRISPRessoCompare (#47)

        * New makefile commands

        * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

        * Extract out split_interleaved_fastq function to CRISPRessoShared

        * Implement splitting interleaved fastq files in CRISPRessoPooled

        * Suppress split_interleaved_input from CRISPRessoWGS parameters

        * Suppress other parameters in CRISPRessoWGS

        * Move where interleaved fastq files are split to be trimmed properly

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * On push no branches

        * On push no branches

        * All in one file

        * Fix yml errors

        * Rename jobs

        * Remove old workflow files

        * Remove paths

        * Run jobs in parallel

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Move read filtering to after merging in CRISPResso (#39)

        * Move read filtering to after merging

        This is in an effort to be consistent with the behavior and results of
        CRISPRessoPooled.

        * Properly assign the correct file names for read filtering

        * Add space around operators

        * GitHub actions on pr (#51)

        * Run integration tests on pull_request

        * Run pytest on pull_request

        * Run pylint on pull_request

        * Run tests on PR only when opening PR (#53)

        * Update reports (#52)

        * Update report changes

        * Switch branch of integration test repo

        * Remove extraneous `crispresso_data_path`

        * Point integration tests back to master

        * point to test branch

        * pointed CI config to testing branch

        * Update integration_tests.yml

        point to master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>
        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>

        * Trevor/fastp integration (#50)

        * Update check_program to check versions and create check_fastq function

        * Update fastq arg, implement fastp in get_most_frequent_reads

        * Bump version to 2.3.0

        * Deprecate Flash and Trimmomatic parameters, and update fastp params

        * Update guess_amplicons and guess_guides to remove max_paired_end_reads_overlap

        * Implement trimming of single end reads

        * Merge (and trim) reads in CRISPRessoCORE with fastp

        * Modify error handling to account for fastp errors

        * Replace flash and trimmomatic with fastp in Docker dependencies

        * Update LICENSE.txt with fastp info

        * Remove min and max amplicon length (no longer needed)

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Implement trimming with fastp in CRISPRessoPooled

        * Implemend merging (and trimming) with fastp in CRISPRessoPooled

        * Fixed minor fastp errors

        * Move read filtering to after merging in CRISPResso (#39)

        * Move read filtering to after merging

        This is in an effort to be consistent with the behavior and results of
        CRISPRessoPooled.

        * Properly assign the correct file names for read filtering

        * Add space around operators

        * GitHub actions on pr (#51)

        * Run integration tests on pull_request

        * Run pytest on pull_request

        * Run pylint on pull_request

        * Run tests on PR only when opening PR (#53)

        * Update reports (#52)

        * Update report changes

        * Switch branch of integration test repo

        * Remove extraneous `crispresso_data_path`

        * Point integration tests back to master

        * Update where the test point to

        * Fix 'Prime-edited' key not found (#32)

        * Move 'Prime-edited' amplicon name check

        By moving this, it will check if there is an amplicon named
        'Prime-edited' (which is a reserved name) even if the
        `prime_editing_pegRNA_extension_seq` parameter is empty.

        * Only search for scaffold integration when pegRNA extension seq is provided

        * Remove spaces at the end of lines

        * Docker size (#49)

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * Introduce pandas sorting in CRISPRessoCompare (#47)

        * New makefile commands

        * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

        * Extract out split_interleaved_fastq function to CRISPRessoShared

        * Implement splitting interleaved fastq files in CRISPRessoPooled

        * Suppress split_interleaved_input from CRISPRessoWGS parameters

        * Suppress other parameters in CRISPRessoWGS

        * Move where interleaved fastq files are split to be trimmed properly

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * On push no branches

        * On push no branches

        * All in one file

        * Fix yml errors

        * Rename jobs

        * Remove old workflow files

        * Remove paths

        * Run jobs in parallel

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * 3.4->2.08

        * Put ttf-mscorefonts-installer back above apt-get clean

        * restore slash, replace fastp with trimmomatic and flash, add autoremove step

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * initial readme modifications

        * Updated readme to remove deprecated commands, updated help text to reflect new version and fastp

        * Pointing test branch back at master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>

        * Guardrails clean history (#34)

        * Include guardrail functions

        * Add CRISPRessoReports subtree

        * Refactor to use CRISPRessoReports module

        * Include guardrail functions

        * Functional guardrails, needs reports update

        * Add guardrail partial

        * fix guardrials partial

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * Introduce pandas sorting in CRISPRessoCompare (#47)

        * New makefile commands

        * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

        * Extract out split_interleaved_fastq function to CRISPRessoShared

        * Implement splitting interleaved fastq files in CRISPRessoPooled

        * Suppress split_interleaved_input from CRISPRessoWGS parameters

        * Suppress other parameters in CRISPRessoWGS

        * Move where interleaved fastq files are split to be trimmed properly

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * On push no branches

        * On push no branches

        * All in one file

        * Fix yml errors

        * Rename jobs

        * Remove old workflow files

        * Remove paths

        * Run jobs in parallel

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Update C cythonized files

        * Add exact numbers to guardrails printouts

        * Remove extraneous whitespace from CRISPRessoCOREResources.pyx

        * Fix calculation of `total_mods` from being negative

        The issue was that `all_deletion_coordinates` just tells you how many deletions
        were present, but not how long the deletion is.

        * Changes to message

        * Remove old tag

        * Point tests at guardrails

        * Restore C2 pro check

        * Save message with guardrail name

        * Point tests repo at master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

        * Fix case sensitivity in Prime Editing mode (#54)

        * Move read filtering to after merging in CRISPResso (#39)

        * Move read filtering to after merging

        This is in an effort to be consistent with the behavior and results of
        CRISPRessoPooled.

        * Properly assign the correct file names for read filtering

        * Add space around operators

        * GitHub actions on pr (#51)

        * Run integration tests on pull_request

        * Run pytest on pull_request

        * Run pylint on pull_request

        * Run tests on PR only when opening PR (#53)

        * Update reports (#52)

        * Update report changes

        * Switch branch of integration test repo

        * Remove extraneous `crispresso_data_path`

        * Point integration tests back to master

        * Make all amplicons in amplicon_seq_arr uppercase

        This fixes https://github.com/pinellolab/CRISPResso2/issues/396

        * Allow RNA values to be provided for prime_editing_pegRNA_scaffold_seq

        * Fix 'Prime-edited' key not found (#32)

        * Move 'Prime-edited' amplicon name check

        By moving this, it will check if there is an amplicon named
        'Prime-edited' (which is a reserved name) even if the
        `prime_editing_pegRNA_extension_seq` parameter is empty.

        * Only search for scaffold integration when pegRNA extension seq is provided

        * Remove spaces at the end of lines

        * Docker size (#49)

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ----…
trevormartinj7 added a commit that referenced this issue Aug 23, 2024
* Fix CRISPRessoAggregate bug and other improvements (#95) (pinellolab#470)

* D3-Enhancements (#78)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Fix typo and move flexiguide to debug (#77)

* Change flexiguide output to debug level

* Fix typo in fastp merged output file name

* Adding id tags for d3 script enhancements

* pointing to test branch

* Add amplicon_name parameter to allele heatmap and line plots

* Add function to extract quantification window regions from include_idxs

* Scale the quantification window according to the coordinates of the sgRNA plot

* added c2pro check, added space in args.json

* Correct the quantification window indexes for multiple guides

* Fix name of nucleotide conversion plot when guides are not the same

* Fix jinja variables that aren't found

* Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

* Remove unneeded variable and extra whitespace

* Switch test branch to master

---------





* Add amplicon_name to plot functions

* Add sgRNA sequences to nucleotide quilt parameters in Aggregate

* Add custom_colors to Aggregate plot functions

* Update Aggregate and make_aggregate_report to have logger and tool

* Write command_used to Aggregate .json info file

* Point to new test branch and add Aggregate run

* Make the order of Aggregate runs explicit

* Sort all instances of crispresso2_folder_info in Aggregate

* Sort df_summary_quantification df in Aggregate

* Try sorting with a list of single column

* Update to correct test branch

* Move to master test branch

---------

Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

* Display percentages in the CLI output (#88) (pinellolab#473)

* D3-Enhancements (#78)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Fix typo and move flexiguide to debug (#77)

* Change flexiguide output to debug level

* Fix typo in fastp merged output file name

* Adding id tags for d3 script enhancements

* pointing to test branch

* Add amplicon_name parameter to allele heatmap and line plots

* Add function to extract quantification window regions from include_idxs

* Scale the quantification window according to the coordinates of the sgRNA plot

* added c2pro check, added space in args.json

* Correct the quantification window indexes for multiple guides

* Fix name of nucleotide conversion plot when guides are not the same

* Fix jinja variables that aren't found

* Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

* Remove unneeded variable and extra whitespace

* Switch test branch to master

---------





* Display percentages in the CLI output

---------

Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

* No pool (#79) (pinellolab#474)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Passing sgRNA sequences to regular and Batch D3 plots (#73)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Update integration_tests.yml to point back at master

---------





* Push new releases to ECR (#74)

* Create aws_ecr.yml (#1)

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* us-east-1

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Pass div id for plotly

* Remove debug

* Don't use thread pool with 1 process

* Fix logger issue

* Catchup

* Remove extra print statements

* Restrict generation of multiprocessing pool to when n_processes > 1

* Switch test branch to version bump

* Fix variable name error

* Change test branch back to master

* Fix CRISPRessoAggregate bug and other improvements (#95)

* D3-Enhancements (#78)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Fix typo and move flexiguide to debug (#77)

* Change flexiguide output to debug level

* Fix typo in fastp merged output file name

* Adding id tags for d3 script enhancements

* pointing to test branch

* Add amplicon_name parameter to allele heatmap and line plots

* Add function to extract quantification window regions from include_idxs

* Scale the quantification window according to the coordinates of the sgRNA plot

* added c2pro check, added space in args.json

* Correct the quantification window indexes for multiple guides

* Fix name of nucleotide conversion plot when guides are not the same

* Fix jinja variables that aren't found

* Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

* Remove unneeded variable and extra whitespace

* Switch test branch to master

---------





* Add amplicon_name to plot functions

* Add sgRNA sequences to nucleotide quilt parameters in Aggregate

* Add custom_colors to Aggregate plot functions

* Update Aggregate and make_aggregate_report to have logger and tool

* Write command_used to Aggregate .json info file

* Point to new test branch and add Aggregate run

* Make the order of Aggregate runs explicit

* Sort all instances of crispresso2_folder_info in Aggregate

* Sort df_summary_quantification df in Aggregate

* Try sorting with a list of single column

* Update to correct test branch

* Move to master test branch

---------





---------

Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>

* Add percent_complete to subprocess alignment

* Remove extraneous spaces

* Added more percent_complete statements to info blocks

---------

Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Colelyman added a commit that referenced this issue Aug 26, 2024
* Fix CRISPRessoAggregate bug and other improvements (#95)

* D3-Enhancements (#78)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Fix typo and move flexiguide to debug (#77)

* Change flexiguide output to debug level

* Fix typo in fastp merged output file name

* Adding id tags for d3 script enhancements

* pointing to test branch

* Add amplicon_name parameter to allele heatmap and line plots

* Add function to extract quantification window regions from include_idxs

* Scale the quantification window according to the coordinates of the sgRNA plot

* added c2pro check, added space in args.json

* Correct the quantification window indexes for multiple guides

* Fix name of nucleotide conversion plot when guides are not the same

* Fix jinja variables that aren't found

* Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

* Remove unneeded variable and extra whitespace

* Switch test branch to master

---------





* Add amplicon_name to plot functions

* Add sgRNA sequences to nucleotide quilt parameters in Aggregate

* Add custom_colors to Aggregate plot functions

* Update Aggregate and make_aggregate_report to have logger and tool

* Write command_used to Aggregate .json info file

* Point to new test branch and add Aggregate run

* Make the order of Aggregate runs explicit

* Sort all instances of crispresso2_folder_info in Aggregate

* Sort df_summary_quantification df in Aggregate

* Try sorting with a list of single column

* Update to correct test branch

* Move to master test branch

---------





* Squashed commit of the following:

commit 6ec98a05ee70f85b5aa0ac15ab6094b7f1f20d08
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Tue Aug 13 16:44:39 2024 -0600

    dict key changes

commit 7cfd5acf06da4eb6f49453144ee1fed1e1488a7a
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Thu Aug 8 15:30:31 2024 -0600

    added C2PRO install check back

commit bfb0003329ea61b5c79c7e1df8d9a73ec5a508db
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Fri Aug 2 13:08:12 2024 -0600

    fixed key error conditionals

commit 84444e7480605206cb3efa4a0db675c55e717304
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Fri Aug 2 09:22:44 2024 -0600

    use local jinja_paritals file

commit 71dd12786fec6c4aba0170a3bfd9022b06f5eede
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Wed Jul 31 14:10:29 2024 -0600

    Squashed commit of the following:

    commit 5e3b30515c4bc437127e7fb21f53cb0bd511c4ca
    Author: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
    Date:   Mon Jul 22 09:31:44 2024 -0600

        D3-Enhancements (#78)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        * Provide sgRNA_sequences to plot_nucleotide_quilt plots

        * Passing sgRNA_sequences to plot

        * Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

        * Add max-height to Batch report samples

        * Change testing branch

        * Fix wrong check for large Batch plots

        * Fix typo and move flexiguide to debug (#77)

        * Change flexiguide output to debug level

        * Fix typo in fastp merged output file name

        * Adding id tags for d3 script enhancements

        * pointing to test branch

        * Add amplicon_name parameter to allele heatmap and line plots

        * Add function to extract quantification window regions from include_idxs

        * Scale the quantification window according to the coordinates of the sgRNA plot

        * added c2pro check, added space in args.json

        * Correct the quantification window indexes for multiple guides

        * Fix name of nucleotide conversion plot when guides are not the same

        * Fix jinja variables that aren't found

        * Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

        * Remove unneeded variable and extra whitespace

        * Switch test branch to master

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

    commit 09e5d9720ad21e44fc7916d71bde3fd7a9dfa7ef
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu Jul 18 14:31:54 2024 -0600

        Asymmetrical cut point (#457)

        * add cut_point_ind to plot_alleles_heatmap for asymmetrical plotting

        * Cole asymmetrical cut point (#453)

        * Pin versions of numpy and matplotlib in CI environment (#84) (#452)

        * Reduce duplication and implement cut_point_ind in plot_alleles_heatmap_hist

        ---------

        Co-authored-by: Cole Lyman <Cole@colelyman.com>

    commit 8d92972694ddff629dad844a6ad100459f69751d
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu Jul 18 14:29:40 2024 -0600

        Cole/update args (#85) (#456)

    commit 44f692ecabf5e2eb96ee0cfd7bae62343da7810c
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Mon Jul 15 16:17:29 2024 -0600

        Implement new pooled mixed-mode default behavior (#454)

        * changes for pooled mixed-mode default (#83)

        * changes for pooled mixed-mode default

        * deprecated old arg

        * added integration tests for mixed mode

        * fixed test target

        * updated test name

        * pinned numpy

        * Fix integration tests yml

        * pinning matplotlib

        * added print to CI tests

        * changed mixed mode info string

        * Remove pooled-mixed-mode-align-to-genome step from Github Actions

        * Update demultiplex_genome_wide parameter and help

        * Convert args.json to unix line endings

        * Add Pooled mixed mode demux run

        * Update the name of the argument in Pooled

        * Point integration tests back to master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Revert change to pooled mixed mode info statement (#86)

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit 79b482b55a0e8edbc03ec22bd2714bade1e90323
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Tue Jul 9 12:53:23 2024 -0600

        Pin versions of numpy and matplotlib in CI environment (#84) (#452)

    commit 80dc1bdd72d50f989717bfc5f8156bc3495c45f4
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu May 30 14:07:42 2024 -0600

        Add padding to image

    commit 381755daf0939aaf2745df0a802c809633aff47d
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu May 30 13:59:57 2024 -0600

        White background for schematic for dark mode

    commit d649db71e610bd8840fbb8d46fadb07789b67390
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Fri May 24 12:45:53 2024 -0600

        Fix typo and move flexiguide to debug (#77) (#438)

        * Change flexiguide output to debug level

        * Fix typo in fastp merged output file name

    commit 71181f50ef2b39015523b1a71d9fd1bf0dce14eb
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Mon May 13 13:34:00 2024 -0600

        Prefix the release Docker tag with a `v` (#434)

    commit d2c2be18a6bb64b0e742cc24c4665980a24324bc
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Mon May 13 09:41:32 2024 -0600

        Showing sgRNA sequences on hover in CRISPRessoPro (#432)

        * Passing sgRNA sequences to regular and Batch D3 plots (#73)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        * Provide sgRNA_sequences to plot_nucleotide_quilt plots

        * Passing sgRNA_sequences to plot

        * Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

        * Add max-height to Batch report samples

        * Change testing branch

        * Fix wrong check for large Batch plots

        * Update integration_tests.yml to point back at master

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Push new releases to ECR (#74)

        * Create aws_ecr.yml (#1)

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * us-east-1

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Fix d3 sgRNA sequences (#76)

        * Pass correct sgRNA_sequences to d3 plot

        * Pass correct sgRNA sequence to prime editor plot for d3

        * Resize plotly (#75)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        * Pass div id for plotly

        * Remove debug

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit 1c504274818b6b17fb60620d48fd92cb2e50566d
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu May 9 14:16:25 2024 -0600

        Fix plots and improve plot error handling (#431)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit acb2ea8e26dff4cd11f71301b344f81b1cec9040
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu May 2 13:49:33 2024 -0600

        Use recent docker image for CircleCI testing that includes updated pandas

    commit 38fd76dbd7ce2087468f9f454b548777de959a68
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Wed May 1 16:42:28 2024 -0600

        Cole/fix status file name (#69) (#430)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

    commit 3ec22e5fd09e432c9997d30e5f9ee51a2cc00d7b
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Wed May 1 13:08:11 2024 -0600

        Remove linked space in readme

    commit 340a4e16795a5e500411e11572ec267525985009
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Wed May 1 13:07:14 2024 -0600

        Fix batch mode pandas warning. (#70) (#429)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit 1bc9e906f0ded81f80761d1ec375ee50a4f882a9
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Fri Apr 26 16:26:27 2024 -0600

        Bump version to 2.3.1 and change default CRISPRessoPooled behavior to change in 2.3.2 (#428)

    commit 5638a1f6ffa973231f23422e9c757fa8cd4af7cc
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Wed Apr 24 18:00:43 2024 -0600

        Spelling fixes

    commit d6011f29db16d8fc1c1e7222457b7f9a1f671de6
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Wed Apr 24 09:33:53 2024 -0600

        Extract `jinja_partials` and fix CRISPRessoPooled fastp errors (#425)

        * Updated README (#64)

        * Updating README to fix argument, email, and formatting

        * removing superfluous files

        * Add link to CRISPRessoPro, move CRISPRessoPro section to end, and fix JSON formatting

        * Remove link to CRISPRessoPro

        * Replace Docker badge with link to tags

        * Add bullet points to Guardrails section and improve formatting

        * Fix typo and removed colons from guardrails

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Extract jinja_partials  (#65)

        * Extract jinja_partials code

        * Remove Plotly dependency from setup.py

        * Fix CRISPRessoPooled flash errors (#68)

        * Fix replacing flash intermediate files with fastp intermediate files

        This also moves where the files are added to `files_to_remove` up to
        near where they are created.

        * Update to run test branch with paired end Pooled test

        * Add pooled-paired-sim test to integration tests

        * Replace flash and trimmomatic with fastp and remove plotly from Github Actions environment

        * Change test branch back to master

        ---------

        Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>

    commit f4858a30c43374f54058b3ad9c1e965e1ab7fb46
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Tue Apr 23 17:00:28 2024 -0600

        Updated README (#64) (#424)

        * Updating README to fix argument, email, and formatting

        * removing superfluous files

        * Add link to CRISPRessoPro, move CRISPRessoPro section to end, and fix JSON formatting

        * Remove link to CRISPRessoPro

        * Replace Docker badge with link to tags

        * Add bullet points to Guardrails section and improve formatting

        * Fix typo and removed colons from guardrails

        ---------

        Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>

    commit c3dbff0fccd44b0b1a9c246dd2aa629ddc515787
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Mon Apr 22 11:24:59 2024 -0600

        Update CRISPRessoPooledCORE.py (#423)

        Fix bug in error reporting if duplicate names are present

    commit 20903c14877e5166b1b8a7b50b8fcab450ea3ca6
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu Apr 18 16:55:39 2024 -0600

        Remove extra imports from CRISPRessoCore (#67) (#422)

    commit 4aae57e5be475cd717792265bee36a71a99425de
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu Apr 18 10:00:19 2024 -0600

        Cole/refactor jinja undefined (#66) (#421)

        * Replace Jinja2 PackageLoader with FileSystemLoader

        The PackageLoader doesn't work with a fairly recent version of Jinja2 (3.0.1)
        and Python 3.9. Replacing with FileSystemLoader work with the older version and
        the latest version.

        * Fix undefined variable `amplicon_name` in report template

        * Refactor logging Jinja2 undefined variable warnings

        * Revert plot_11a update

        * Update intedration test branch

        * Update jinja to warn on undefined but not fail. Fix all undefined warnings

        * Fix github integration tests ref

        * One more undefined variable

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>

    commit 768c3c05bf1786a2a32e135b6e145cd6503c3db1
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Tue Apr 9 17:30:10 2024 -0600

        Fix Jinja2 undefined variables (#63) (#417)

        * Replace Jinja2 PackageLoader with FileSystemLoader

        The PackageLoader doesn't work with a fairly recent version of Jinja2 (3.0.1)
        and Python 3.9. Replacing with FileSystemLoader work with the older version and
        the latest version.

        * Fix undefined variable `amplicon_name` in report template

        * Revert plot_11a update

        * Update intedration test branch

        * Update branch for integration tests

    commit 7e18f08cc1ac5f247a0fd1bbb394ccd9b0a07c2e
    Author: Han Dai <github@daihan.me>
    Date:   Fri Apr 5 18:36:41 2024 -0400

        fix: change all U+00A0 to U+0020 (#400)

    commit 235dc29c0cd0fcca2e999148d4660acf00b07221
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Fri Apr 5 16:36:16 2024 -0600

        Fastp, args as data, guardrails, and PE fix (#415)

        * Change CRISPResso_status.txt format to JSON (#46)

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * add json read for status file

        * changed Formatter to json format

        * fixed json access variable name: message

        * changed  perentage_complete to numeric

        * changed status file to .json

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * New makefile commands

        * changed file to .json

        * changed status to json file

        * Make JSON human readable by adding new lines

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * Introduce pandas sorting in CRISPRessoCompare (#47)

        * New makefile commands

        * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

        * Extract out split_interleaved_fastq function to CRISPRessoShared

        * Implement splitting interleaved fastq files in CRISPRessoPooled

        * Suppress split_interleaved_input from CRISPRessoWGS parameters

        * Suppress other parameters in CRISPRessoWGS

        * Move where interleaved fastq files are split to be trimmed properly

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * On push no branches

        * On push no branches

        * All in one file

        * Fix yml errors

        * Rename jobs

        * Remove old workflow files

        * Remove paths

        * Run jobs in parallel

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Move read filtering to after merging in CRISPResso (#39)

        * Move read filtering to after merging

        This is in an effort to be consistent with the behavior and results of
        CRISPRessoPooled.

        * Properly assign the correct file names for read filtering

        * Add space around operators

        * GitHub actions on pr (#51)

        * Run integration tests on pull_request

        * Run pytest on pull_request

        * Run pylint on pull_request

        * Run tests on PR only when opening PR (#53)

        * Update reports (#52)

        * Update report changes

        * Switch branch of integration test repo

        * Remove extraneous `crispresso_data_path`

        * Point integration tests back to master

        * point to test branch

        * pointed CI config to testing branch

        * Update integration_tests.yml

        point to master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>
        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>

        * Trevor/fastp integration (#50)

        * Update check_program to check versions and create check_fastq function

        * Update fastq arg, implement fastp in get_most_frequent_reads

        * Bump version to 2.3.0

        * Deprecate Flash and Trimmomatic parameters, and update fastp params

        * Update guess_amplicons and guess_guides to remove max_paired_end_reads_overlap

        * Implement trimming of single end reads

        * Merge (and trim) reads in CRISPRessoCORE with fastp

        * Modify error handling to account for fastp errors

        * Replace flash and trimmomatic with fastp in Docker dependencies

        * Update LICENSE.txt with fastp info

        * Remove min and max amplicon length (no longer needed)

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Implement trimming with fastp in CRISPRessoPooled

        * Implemend merging (and trimming) with fastp in CRISPRessoPooled

        * Fixed minor fastp errors

        * Move read filtering to after merging in CRISPResso (#39)

        * Move read filtering to after merging

        This is in an effort to be consistent with the behavior and results of
        CRISPRessoPooled.

        * Properly assign the correct file names for read filtering

        * Add space around operators

        * GitHub actions on pr (#51)

        * Run integration tests on pull_request

        * Run pytest on pull_request

        * Run pylint on pull_request

        * Run tests on PR only when opening PR (#53)

        * Update reports (#52)

        * Update report changes

        * Switch branch of integration test repo

        * Remove extraneous `crispresso_data_path`

        * Point integration tests back to master

        * Update where the test point to

        * Fix 'Prime-edited' key not found (#32)

        * Move 'Prime-edited' amplicon name check

        By moving this, it will check if there is an amplicon named
        'Prime-edited' (which is a reserved name) even if the
        `prime_editing_pegRNA_extension_seq` parameter is empty.

        * Only search for scaffold integration when pegRNA extension seq is provided

        * Remove spaces at the end of lines

        * Docker size (#49)

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * Introduce pandas sorting in CRISPRessoCompare (#47)

        * New makefile commands

        * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

        * Extract out split_interleaved_fastq function to CRISPRessoShared

        * Implement splitting interleaved fastq files in CRISPRessoPooled

        * Suppress split_interleaved_input from CRISPRessoWGS parameters

        * Suppress other parameters in CRISPRessoWGS

        * Move where interleaved fastq files are split to be trimmed properly

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * On push no branches

        * On push no branches

        * All in one file

        * Fix yml errors

        * Rename jobs

        * Remove old workflow files

        * Remove paths

        * Run jobs in parallel

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * 3.4->2.08

        * Put ttf-mscorefonts-installer back above apt-get clean

        * restore slash, replace fastp with trimmomatic and flash, add autoremove step

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * initial readme modifications

        * Updated readme to remove deprecated commands, updated help text to reflect new version and fastp

        * Pointing test branch back at master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>

        * Guardrails clean history (#34)

        * Include guardrail functions

        * Add CRISPRessoReports subtree

        * Refactor to use CRISPRessoReports module

        * Include guardrail functions

        * Functional guardrails, needs reports update

        * Add guardrail partial

        * fix guardrials partial

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * Introduce pandas sorting in CRISPRessoCompare (#47)

        * New makefile commands

        * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

        * Extract out split_interleaved_fastq function to CRISPRessoShared

        * Implement splitting interleaved fastq files in CRISPRessoPooled

        * Suppress split_interleaved_input from CRISPRessoWGS parameters

        * Suppress other parameters in CRISPRessoWGS

        * Move where interleaved fastq files are split to be trimmed properly

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * On push no branches

        * On push no branches

        * All in one file

        * Fix yml errors

        * Rename jobs

        * Remove old workflow files

        * Remove paths

        * Run jobs in parallel

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Update C cythonized files

        * Add exact numbers to guardrails printouts

        * Remove extraneous whitespace from CRISPRessoCOREResources.pyx

        * Fix calculation of `total_mods` from being negative

        The issue was that `all_deletion_coordinates` just tells you how many deletions
        were present, but not how long the deletion is.

        * Changes to message

        * Remove old tag

        * Point tests at guardrails

        * Restore C2 pro check

        * Save message with guardrail name

        * Point tests repo at master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

        * Fix case sensitivity in Prime Editing mode (#54)

        * Move read filtering to after merging in CRISPResso (#39)

        * Move read filtering to after merging

        This is in an effort to be consistent with the behavior and results of
        CRISPRessoPooled.

        * Properly assign the correct file names for read filtering

        * Add space around operators

        * GitHub actions on pr (#51)

        * Run integration tests on pull_request

        * Run pytest on pull_request

        * Run pylint on pull_request

        * Run tests on PR only when opening PR (#53)

        * Update reports (#52)

        * Update report changes

        * Switch branch of integration test repo

        * Remove extraneous `crispresso_data_path`

        * Point integration tests back to master

        * Make all amplicons in amplicon_seq_arr uppercase

        This fixes https://github.com/pinellolab/CRISPResso2/issues/396

        * Allow RNA values to be provided for prime_editing_pegRNA_scaffold_seq

        * Fix 'Prime-edited' key not found (#32)

        * Move 'Prime-edited' amplicon name check

        By moving this, it will check if there is an amplicon named
        'Prime-edited' (which is a reserved name) even if the
        `prime_editing_pegRNA_extension_seq` parameter is empty.

        * Only search for scaffold integration when pegRNA extension seq is provided

        * Remove spaces at the end of lines

        * Docker size (#49)

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ----…

Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
trevormartinj7 added a commit that referenced this issue Aug 26, 2024
* Initial parallization work

* Initial parallization work

* Fleshed out process function, added tracking for manager dictionary

* lots of debugging of the process function

* parallelization achieved

* Improved boundary function

* Removing prints

* removing old code

* Failed cache generator function

* Created single thread seq_cache generator

* initial functioning parallelization

* adding data to variantCache and N_ constants

* more edits

* replacing old aln stats

* Fixed return values

* changing output file

* fixed boundary error

* Adding more tracking of timings

* optimized stat tracking considerably

* removing imports and unneccesary checks

* more logging

* Unbalancing the processes to allow for better locking interactions

* Adding amplicon name to plotting

* fixing stat tracking

* removed old code

* pin numpy

* Pin versions of numpy and matplotlib in CI environment (#84)

* D3-Enhancements (#78)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------

Co-authored-by: Cole Lyman <cole@colelyman.com>

* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------

Co-authored-by: Sam <snic9004@gmail.com>

* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------

Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Cole Lyman <cole@colelyman.com>

* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------

Co-authored-by: Cole Lyman <cole@colelyman.com>

* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------

Co-authored-by: Sam <snic9004@gmail.com>

* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------

Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Cole Lyman <cole@colelyman.com>

* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Fix typo and move flexiguide to debug (#77)

* Change flexiguide output to debug level

* Fix typo in fastp merged output file name

* Adding id tags for d3 script enhancements

* pointing to test branch

* Add amplicon_name parameter to allele heatmap and line plots

* Add function to extract quantification window regions from include_idxs

* Scale the quantification window according to the coordinates of the sgRNA plot

* added c2pro check, added space in args.json

* Correct the quantification window indexes for multiple guides

* Fix name of nucleotide conversion plot when guides are not the same

* Fix jinja variables that aren't found

* Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

* Remove unneeded variable and extra whitespace

* Switch test branch to master

---------

Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Cole Lyman <cole@colelyman.com>

* old changes

* removed some superfluous code

* removed some superfluous code

* reverting to old flow if one process

* Enriched aln_stats with aligned read_length

* fixed declaration bug

* Fixing stats tracking for irregular reads

* fixed stat tracking by multiplying variant count

* adding some function explanations, cleaning up code, optimizing lock updating

* Cleaned up code, potentially breaks tests

* Revert "Cleaned up code, potentially breaks tests"

This reverts commit e7cfe3d.

* Revert "adding some function explanations, cleaning up code, optimizing lock updating"

This reverts commit 96deee2.

* replaced test branch, reverted to working version, added .upate() inside lock

* checking old cython branch

* reverting breaking changes

* pointing back at my test branch, unreverting non breaking changes

* timing log change

* pin numpy

* Pin versions of numpy and matplotlib in CI environment (#84)

* pointing to correct test branch

* removed debug statement

* memory tracking

* Replaced seq_cache with variantCache for improved memory

* removing print statements

* commenting out memory tracking

* Removing one extra loop to improve processing time

* optimizing info, removing old functions and print statements

* removing empty string handling

* removed empty string, reworded comments

* removing uneccesary files

* Literally minding my p's and q's

* Reset .c files

* Editing function comments, renaming managerCache, removing debug print statement

* Initial attempt at creating a temp_variant file for memory optimization

* update

* reading and writing to tsv to save on memory

* Merged in memory optimization branch

* Updating pytests for equal boundary changes

* Refactored write out fastq, improved tests

* Replace zcat (#94)

* D3-Enhancements (#78)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------

Co-authored-by: Cole Lyman <cole@colelyman.com>

* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------

Co-authored-by: Sam <snic9004@gmail.com>

* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------

Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Cole Lyman <cole@colelyman.com>

* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------

Co-authored-by: Cole Lyman <cole@colelyman.com>

* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------

Co-authored-by: Sam <snic9004@gmail.com>

* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------

Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Cole Lyman <cole@colelyman.com>

* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Fix typo and move flexiguide to debug (#77)

* Change flexiguide output to debug level

* Fix typo in fastp merged output file name

* Adding id tags for d3 script enhancements

* pointing to test branch

* Add amplicon_name parameter to allele heatmap and line plots

* Add function to extract quantification window regions from include_idxs

* Scale the quantification window according to the coordinates of the sgRNA plot

* added c2pro check, added space in args.json

* Correct the quantification window indexes for multiple guides

* Fix name of nucleotide conversion plot when guides are not the same

* Fix jinja variables that aren't found

* Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

* Remove unneeded variable and extra whitespace

* Switch test branch to master

---------

Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Cole Lyman <cole@colelyman.com>

* Replace zcat with gunzip -c in `get_most_frequent_reads`

---------

Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

* Limiting plotting processes

* Fix CRISPRessoAggregate bug and other improvements (#95)

* D3-Enhancements (#78)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------

Co-authored-by: Cole Lyman <cole@colelyman.com>

* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------

Co-authored-by: Sam <snic9004@gmail.com>

* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------

Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Cole Lyman <cole@colelyman.com>

* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------

Co-authored-by: Cole Lyman <cole@colelyman.com>

* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------

Co-authored-by: Sam <snic9004@gmail.com>

* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------

Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Cole Lyman <cole@colelyman.com>

* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Fix typo and move flexiguide to debug (#77)

* Change flexiguide output to debug level

* Fix typo in fastp merged output file name

* Adding id tags for d3 script enhancements

* pointing to test branch

* Add amplicon_name parameter to allele heatmap and line plots

* Add function to extract quantification window regions from include_idxs

* Scale the quantification window according to the coordinates of the sgRNA plot

* added c2pro check, added space in args.json

* Correct the quantification window indexes for multiple guides

* Fix name of nucleotide conversion plot when guides are not the same

* Fix jinja variables that aren't found

* Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

* Remove unneeded variable and extra whitespace

* Switch test branch to master

---------

Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Cole Lyman <cole@colelyman.com>

* Add amplicon_name to plot functions

* Add sgRNA sequences to nucleotide quilt parameters in Aggregate

* Add custom_colors to Aggregate plot functions

* Update Aggregate and make_aggregate_report to have logger and tool

* Write command_used to Aggregate .json info file

* Point to new test branch and add Aggregate run

* Make the order of Aggregate runs explicit

* Sort all instances of crispresso2_folder_info in Aggregate

* Sort df_summary_quantification df in Aggregate

* Try sorting with a list of single column

* Update to correct test branch

* Move to master test branch

---------

Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

* Add BAM test cases to Github Actions integration tests

* Add "Run" to step names

* Point to trevor/fastq-testing test branch

* Add in Aggregate test case to Github Actions

* parallelizing process_single_fastq_write_bam_out function

* wip parallelization for process_bam function

* Fixing write out bug

* Functional bam input parallelized function

* cleaned up code, removed old versions of functions

* trying again

* Remove duplicate c2_tests

* Remove commented out copy c2_tests

* Updating file access to close on error, removing unnecesary bam appendings

* Add `percent_complete`to info statements (#102)

* Fix CRISPRessoAggregate bug and other improvements (#95) (pinellolab#470)

* D3-Enhancements (#78)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Fix typo and move flexiguide to debug (#77)

* Change flexiguide output to debug level

* Fix typo in fastp merged output file name

* Adding id tags for d3 script enhancements

* pointing to test branch

* Add amplicon_name parameter to allele heatmap and line plots

* Add function to extract quantification window regions from include_idxs

* Scale the quantification window according to the coordinates of the sgRNA plot

* added c2pro check, added space in args.json

* Correct the quantification window indexes for multiple guides

* Fix name of nucleotide conversion plot when guides are not the same

* Fix jinja variables that aren't found

* Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

* Remove unneeded variable and extra whitespace

* Switch test branch to master

---------





* Add amplicon_name to plot functions

* Add sgRNA sequences to nucleotide quilt parameters in Aggregate

* Add custom_colors to Aggregate plot functions

* Update Aggregate and make_aggregate_report to have logger and tool

* Write command_used to Aggregate .json info file

* Point to new test branch and add Aggregate run

* Make the order of Aggregate runs explicit

* Sort all instances of crispresso2_folder_info in Aggregate

* Sort df_summary_quantification df in Aggregate

* Try sorting with a list of single column

* Update to correct test branch

* Move to master test branch

---------

Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

* Display percentages in the CLI output (#88) (pinellolab#473)

* D3-Enhancements (#78)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Fix typo and move flexiguide to debug (#77)

* Change flexiguide output to debug level

* Fix typo in fastp merged output file name

* Adding id tags for d3 script enhancements

* pointing to test branch

* Add amplicon_name parameter to allele heatmap and line plots

* Add function to extract quantification window regions from include_idxs

* Scale the quantification window according to the coordinates of the sgRNA plot

* added c2pro check, added space in args.json

* Correct the quantification window indexes for multiple guides

* Fix name of nucleotide conversion plot when guides are not the same

* Fix jinja variables that aren't found

* Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

* Remove unneeded variable and extra whitespace

* Switch test branch to master

---------





* Display percentages in the CLI output

---------

Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

* No pool (#79) (pinellolab#474)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Passing sgRNA sequences to regular and Batch D3 plots (#73)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Update integration_tests.yml to point back at master

---------





* Push new releases to ECR (#74)

* Create aws_ecr.yml (#1)

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* us-east-1

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Pass div id for plotly

* Remove debug

* Don't use thread pool with 1 process

* Fix logger issue

* Catchup

* Remove extra print statements

* Restrict generation of multiprocessing pool to when n_processes > 1

* Switch test branch to version bump

* Fix variable name error

* Change test branch back to master

* Fix CRISPRessoAggregate bug and other improvements (#95)

* D3-Enhancements (#78)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Fix typo and move flexiguide to debug (#77)

* Change flexiguide output to debug level

* Fix typo in fastp merged output file name

* Adding id tags for d3 script enhancements

* pointing to test branch

* Add amplicon_name parameter to allele heatmap and line plots

* Add function to extract quantification window regions from include_idxs

* Scale the quantification window according to the coordinates of the sgRNA plot

* added c2pro check, added space in args.json

* Correct the quantification window indexes for multiple guides

* Fix name of nucleotide conversion plot when guides are not the same

* Fix jinja variables that aren't found

* Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

* Remove unneeded variable and extra whitespace

* Switch test branch to master

---------





* Add amplicon_name to plot functions

* Add sgRNA sequences to nucleotide quilt parameters in Aggregate

* Add custom_colors to Aggregate plot functions

* Update Aggregate and make_aggregate_report to have logger and tool

* Write command_used to Aggregate .json info file

* Point to new test branch and add Aggregate run

* Make the order of Aggregate runs explicit

* Sort all instances of crispresso2_folder_info in Aggregate

* Sort df_summary_quantification df in Aggregate

* Try sorting with a list of single column

* Update to correct test branch

* Move to master test branch

---------





---------

Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>

* Add percent_complete to subprocess alignment

* Remove extraneous spaces

* Added more percent_complete statements to info blocks

---------

Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

* Updated functions with newly merged progress percent from Cole

* Minor code cleanup, removal of comments and print statements etc

* Pointing test branch back at CRISPResso2_tests master branch

* Added checking for failed processes causing not all reads to be counted

---------

Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Cole Lyman <Cole@colelyman.com>
Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
mbowcut2 added a commit that referenced this issue Aug 26, 2024
* Initial parallization work

* Initial parallization work

* Fleshed out process function, added tracking for manager dictionary

* lots of debugging of the process function

* parallelization achieved

* Improved boundary function

* Removing prints

* removing old code

* Failed cache generator function

* Created single thread seq_cache generator

* initial functioning parallelization

* adding data to variantCache and N_ constants

* more edits

* replacing old aln stats

* Fixed return values

* changing output file

* fixed boundary error

* Adding more tracking of timings

* optimized stat tracking considerably

* removing imports and unneccesary checks

* more logging

* Unbalancing the processes to allow for better locking interactions

* Adding amplicon name to plotting

* fixing stat tracking

* removed old code

* pin numpy

* Pin versions of numpy and matplotlib in CI environment (#84)

* D3-Enhancements (#78)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------

Co-authored-by: Cole Lyman <cole@colelyman.com>

* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------

Co-authored-by: Sam <snic9004@gmail.com>

* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------

Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Cole Lyman <cole@colelyman.com>

* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------

Co-authored-by: Cole Lyman <cole@colelyman.com>

* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------

Co-authored-by: Sam <snic9004@gmail.com>

* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------

Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Cole Lyman <cole@colelyman.com>

* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Fix typo and move flexiguide to debug (#77)

* Change flexiguide output to debug level

* Fix typo in fastp merged output file name

* Adding id tags for d3 script enhancements

* pointing to test branch

* Add amplicon_name parameter to allele heatmap and line plots

* Add function to extract quantification window regions from include_idxs

* Scale the quantification window according to the coordinates of the sgRNA plot

* added c2pro check, added space in args.json

* Correct the quantification window indexes for multiple guides

* Fix name of nucleotide conversion plot when guides are not the same

* Fix jinja variables that aren't found

* Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

* Remove unneeded variable and extra whitespace

* Switch test branch to master

---------

Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Cole Lyman <cole@colelyman.com>

* old changes

* removed some superfluous code

* removed some superfluous code

* reverting to old flow if one process

* Enriched aln_stats with aligned read_length

* fixed declaration bug

* Fixing stats tracking for irregular reads

* fixed stat tracking by multiplying variant count

* adding some function explanations, cleaning up code, optimizing lock updating

* Cleaned up code, potentially breaks tests

* Revert "Cleaned up code, potentially breaks tests"

This reverts commit e7cfe3d.

* Revert "adding some function explanations, cleaning up code, optimizing lock updating"

This reverts commit 96deee2.

* replaced test branch, reverted to working version, added .upate() inside lock

* checking old cython branch

* reverting breaking changes

* pointing back at my test branch, unreverting non breaking changes

* timing log change

* pin numpy

* Pin versions of numpy and matplotlib in CI environment (#84)

* pointing to correct test branch

* removed debug statement

* memory tracking

* Replaced seq_cache with variantCache for improved memory

* removing print statements

* commenting out memory tracking

* Removing one extra loop to improve processing time

* optimizing info, removing old functions and print statements

* removing empty string handling

* removed empty string, reworded comments

* removing uneccesary files

* Literally minding my p's and q's

* Reset .c files

* Editing function comments, renaming managerCache, removing debug print statement

* Initial attempt at creating a temp_variant file for memory optimization

* update

* reading and writing to tsv to save on memory

* Merged in memory optimization branch

* Updating pytests for equal boundary changes

* Refactored write out fastq, improved tests

* Replace zcat (#94)

* D3-Enhancements (#78)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------

Co-authored-by: Cole Lyman <cole@colelyman.com>

* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------

Co-authored-by: Sam <snic9004@gmail.com>

* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------

Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Cole Lyman <cole@colelyman.com>

* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------

Co-authored-by: Cole Lyman <cole@colelyman.com>

* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------

Co-authored-by: Sam <snic9004@gmail.com>

* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------

Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Cole Lyman <cole@colelyman.com>

* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Fix typo and move flexiguide to debug (#77)

* Change flexiguide output to debug level

* Fix typo in fastp merged output file name

* Adding id tags for d3 script enhancements

* pointing to test branch

* Add amplicon_name parameter to allele heatmap and line plots

* Add function to extract quantification window regions from include_idxs

* Scale the quantification window according to the coordinates of the sgRNA plot

* added c2pro check, added space in args.json

* Correct the quantification window indexes for multiple guides

* Fix name of nucleotide conversion plot when guides are not the same

* Fix jinja variables that aren't found

* Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

* Remove unneeded variable and extra whitespace

* Switch test branch to master

---------

Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Cole Lyman <cole@colelyman.com>

* Replace zcat with gunzip -c in `get_most_frequent_reads`

---------

Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

* Limiting plotting processes

* Fix CRISPRessoAggregate bug and other improvements (#95)

* D3-Enhancements (#78)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------

Co-authored-by: Cole Lyman <cole@colelyman.com>

* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------

Co-authored-by: Sam <snic9004@gmail.com>

* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------

Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Cole Lyman <cole@colelyman.com>

* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------

Co-authored-by: Cole Lyman <cole@colelyman.com>

* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------

Co-authored-by: Sam <snic9004@gmail.com>

* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------

Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Cole Lyman <cole@colelyman.com>

* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Fix typo and move flexiguide to debug (#77)

* Change flexiguide output to debug level

* Fix typo in fastp merged output file name

* Adding id tags for d3 script enhancements

* pointing to test branch

* Add amplicon_name parameter to allele heatmap and line plots

* Add function to extract quantification window regions from include_idxs

* Scale the quantification window according to the coordinates of the sgRNA plot

* added c2pro check, added space in args.json

* Correct the quantification window indexes for multiple guides

* Fix name of nucleotide conversion plot when guides are not the same

* Fix jinja variables that aren't found

* Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

* Remove unneeded variable and extra whitespace

* Switch test branch to master

---------

Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Cole Lyman <cole@colelyman.com>

* Add amplicon_name to plot functions

* Add sgRNA sequences to nucleotide quilt parameters in Aggregate

* Add custom_colors to Aggregate plot functions

* Update Aggregate and make_aggregate_report to have logger and tool

* Write command_used to Aggregate .json info file

* Point to new test branch and add Aggregate run

* Make the order of Aggregate runs explicit

* Sort all instances of crispresso2_folder_info in Aggregate

* Sort df_summary_quantification df in Aggregate

* Try sorting with a list of single column

* Update to correct test branch

* Move to master test branch

---------

Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

* Add BAM test cases to Github Actions integration tests

* Add "Run" to step names

* Point to trevor/fastq-testing test branch

* Add in Aggregate test case to Github Actions

* parallelizing process_single_fastq_write_bam_out function

* wip parallelization for process_bam function

* Fixing write out bug

* Functional bam input parallelized function

* cleaned up code, removed old versions of functions

* trying again

* Remove duplicate c2_tests

* Remove commented out copy c2_tests

* Updating file access to close on error, removing unnecesary bam appendings

* Add `percent_complete`to info statements (#102)

* Fix CRISPRessoAggregate bug and other improvements (#95) (pinellolab#470)

* D3-Enhancements (#78)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Fix typo and move flexiguide to debug (#77)

* Change flexiguide output to debug level

* Fix typo in fastp merged output file name

* Adding id tags for d3 script enhancements

* pointing to test branch

* Add amplicon_name parameter to allele heatmap and line plots

* Add function to extract quantification window regions from include_idxs

* Scale the quantification window according to the coordinates of the sgRNA plot

* added c2pro check, added space in args.json

* Correct the quantification window indexes for multiple guides

* Fix name of nucleotide conversion plot when guides are not the same

* Fix jinja variables that aren't found

* Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

* Remove unneeded variable and extra whitespace

* Switch test branch to master

---------





* Add amplicon_name to plot functions

* Add sgRNA sequences to nucleotide quilt parameters in Aggregate

* Add custom_colors to Aggregate plot functions

* Update Aggregate and make_aggregate_report to have logger and tool

* Write command_used to Aggregate .json info file

* Point to new test branch and add Aggregate run

* Make the order of Aggregate runs explicit

* Sort all instances of crispresso2_folder_info in Aggregate

* Sort df_summary_quantification df in Aggregate

* Try sorting with a list of single column

* Update to correct test branch

* Move to master test branch

---------

Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

* Display percentages in the CLI output (#88) (pinellolab#473)

* D3-Enhancements (#78)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Fix typo and move flexiguide to debug (#77)

* Change flexiguide output to debug level

* Fix typo in fastp merged output file name

* Adding id tags for d3 script enhancements

* pointing to test branch

* Add amplicon_name parameter to allele heatmap and line plots

* Add function to extract quantification window regions from include_idxs

* Scale the quantification window according to the coordinates of the sgRNA plot

* added c2pro check, added space in args.json

* Correct the quantification window indexes for multiple guides

* Fix name of nucleotide conversion plot when guides are not the same

* Fix jinja variables that aren't found

* Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

* Remove unneeded variable and extra whitespace

* Switch test branch to master

---------





* Display percentages in the CLI output

---------

Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

* No pool (#79) (pinellolab#474)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Passing sgRNA sequences to regular and Batch D3 plots (#73)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Update integration_tests.yml to point back at master

---------





* Push new releases to ECR (#74)

* Create aws_ecr.yml (#1)

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* us-east-1

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Pass div id for plotly

* Remove debug

* Don't use thread pool with 1 process

* Fix logger issue

* Catchup

* Remove extra print statements

* Restrict generation of multiprocessing pool to when n_processes > 1

* Switch test branch to version bump

* Fix variable name error

* Change test branch back to master

* Fix CRISPRessoAggregate bug and other improvements (#95)

* D3-Enhancements (#78)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Fix typo and move flexiguide to debug (#77)

* Change flexiguide output to debug level

* Fix typo in fastp merged output file name

* Adding id tags for d3 script enhancements

* pointing to test branch

* Add amplicon_name parameter to allele heatmap and line plots

* Add function to extract quantification window regions from include_idxs

* Scale the quantification window according to the coordinates of the sgRNA plot

* added c2pro check, added space in args.json

* Correct the quantification window indexes for multiple guides

* Fix name of nucleotide conversion plot when guides are not the same

* Fix jinja variables that aren't found

* Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

* Remove unneeded variable and extra whitespace

* Switch test branch to master

---------





* Add amplicon_name to plot functions

* Add sgRNA sequences to nucleotide quilt parameters in Aggregate

* Add custom_colors to Aggregate plot functions

* Update Aggregate and make_aggregate_report to have logger and tool

* Write command_used to Aggregate .json info file

* Point to new test branch and add Aggregate run

* Make the order of Aggregate runs explicit

* Sort all instances of crispresso2_folder_info in Aggregate

* Sort df_summary_quantification df in Aggregate

* Try sorting with a list of single column

* Update to correct test branch

* Move to master test branch

---------





---------

Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>

* Add percent_complete to subprocess alignment

* Remove extraneous spaces

* Added more percent_complete statements to info blocks

---------

Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

* Updated functions with newly merged progress percent from Cole

* Minor code cleanup, removal of comments and print statements etc

* Pointing test branch back at CRISPResso2_tests master branch

* Added checking for failed processes causing not all reads to be counted

---------

Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Cole Lyman <Cole@colelyman.com>
Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
Colelyman added a commit that referenced this issue Aug 28, 2024
* Initial parallization work

* Initial parallization work

* Fleshed out process function, added tracking for manager dictionary

* lots of debugging of the process function

* parallelization achieved

* Improved boundary function

* Removing prints

* removing old code

* Failed cache generator function

* Created single thread seq_cache generator

* initial functioning parallelization

* adding data to variantCache and N_ constants

* more edits

* replacing old aln stats

* Fixed return values

* changing output file

* fixed boundary error

* Adding more tracking of timings

* optimized stat tracking considerably

* removing imports and unneccesary checks

* more logging

* Unbalancing the processes to allow for better locking interactions

* Adding amplicon name to plotting

* fixing stat tracking

* removed old code

* pin numpy

* Pin versions of numpy and matplotlib in CI environment (#84)

* D3-Enhancements (#78)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Fix typo and move flexiguide to debug (#77)

* Change flexiguide output to debug level

* Fix typo in fastp merged output file name

* Adding id tags for d3 script enhancements

* pointing to test branch

* Add amplicon_name parameter to allele heatmap and line plots

* Add function to extract quantification window regions from include_idxs

* Scale the quantification window according to the coordinates of the sgRNA plot

* added c2pro check, added space in args.json

* Correct the quantification window indexes for multiple guides

* Fix name of nucleotide conversion plot when guides are not the same

* Fix jinja variables that aren't found

* Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

* Remove unneeded variable and extra whitespace

* Switch test branch to master

---------





* old changes

* removed some superfluous code

* removed some superfluous code

* reverting to old flow if one process

* Enriched aln_stats with aligned read_length

* fixed declaration bug

* Fixing stats tracking for irregular reads

* fixed stat tracking by multiplying variant count

* adding some function explanations, cleaning up code, optimizing lock updating

* Cleaned up code, potentially breaks tests

* Revert "Cleaned up code, potentially breaks tests"

This reverts commit e7cfe3d.

* Revert "adding some function explanations, cleaning up code, optimizing lock updating"

This reverts commit 96deee2.

* replaced test branch, reverted to working version, added .upate() inside lock

* checking old cython branch

* reverting breaking changes

* pointing back at my test branch, unreverting non breaking changes

* timing log change

* pin numpy

* Pin versions of numpy and matplotlib in CI environment (#84)

* pointing to correct test branch

* removed debug statement

* memory tracking

* Replaced seq_cache with variantCache for improved memory

* removing print statements

* commenting out memory tracking

* Removing one extra loop to improve processing time

* optimizing info, removing old functions and print statements

* removing empty string handling

* removed empty string, reworded comments

* removing uneccesary files

* Literally minding my p's and q's

* Reset .c files

* Editing function comments, renaming managerCache, removing debug print statement

* Initial attempt at creating a temp_variant file for memory optimization

* update

* reading and writing to tsv to save on memory

* Merged in memory optimization branch

* Updating pytests for equal boundary changes

* Refactored write out fastq, improved tests

* Replace zcat (#94)

* D3-Enhancements (#78)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Fix typo and move flexiguide to debug (#77)

* Change flexiguide output to debug level

* Fix typo in fastp merged output file name

* Adding id tags for d3 script enhancements

* pointing to test branch

* Add amplicon_name parameter to allele heatmap and line plots

* Add function to extract quantification window regions from include_idxs

* Scale the quantification window according to the coordinates of the sgRNA plot

* added c2pro check, added space in args.json

* Correct the quantification window indexes for multiple guides

* Fix name of nucleotide conversion plot when guides are not the same

* Fix jinja variables that aren't found

* Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

* Remove unneeded variable and extra whitespace

* Switch test branch to master

---------





* Replace zcat with gunzip -c in `get_most_frequent_reads`

---------





* Limiting plotting processes

* Fix CRISPRessoAggregate bug and other improvements (#95)

* D3-Enhancements (#78)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Fix typo and move flexiguide to debug (#77)

* Change flexiguide output to debug level

* Fix typo in fastp merged output file name

* Adding id tags for d3 script enhancements

* pointing to test branch

* Add amplicon_name parameter to allele heatmap and line plots

* Add function to extract quantification window regions from include_idxs

* Scale the quantification window according to the coordinates of the sgRNA plot

* added c2pro check, added space in args.json

* Correct the quantification window indexes for multiple guides

* Fix name of nucleotide conversion plot when guides are not the same

* Fix jinja variables that aren't found

* Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

* Remove unneeded variable and extra whitespace

* Switch test branch to master

---------





* Add amplicon_name to plot functions

* Add sgRNA sequences to nucleotide quilt parameters in Aggregate

* Add custom_colors to Aggregate plot functions

* Update Aggregate and make_aggregate_report to have logger and tool

* Write command_used to Aggregate .json info file

* Point to new test branch and add Aggregate run

* Make the order of Aggregate runs explicit

* Sort all instances of crispresso2_folder_info in Aggregate

* Sort df_summary_quantification df in Aggregate

* Try sorting with a list of single column

* Update to correct test branch

* Move to master test branch

---------





* Add BAM test cases to Github Actions integration tests

* Add "Run" to step names

* Point to trevor/fastq-testing test branch

* Add in Aggregate test case to Github Actions

* parallelizing process_single_fastq_write_bam_out function

* wip parallelization for process_bam function

* Fixing write out bug

* Functional bam input parallelized function

* cleaned up code, removed old versions of functions

* trying again

* Remove duplicate c2_tests

* Remove commented out copy c2_tests

* Updating file access to close on error, removing unnecesary bam appendings

* Add `percent_complete`to info statements (#102)

* Fix CRISPRessoAggregate bug and other improvements (#95) (pinellolab#470)

* D3-Enhancements (#78)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Fix typo and move flexiguide to debug (#77)

* Change flexiguide output to debug level

* Fix typo in fastp merged output file name

* Adding id tags for d3 script enhancements

* pointing to test branch

* Add amplicon_name parameter to allele heatmap and line plots

* Add function to extract quantification window regions from include_idxs

* Scale the quantification window according to the coordinates of the sgRNA plot

* added c2pro check, added space in args.json

* Correct the quantification window indexes for multiple guides

* Fix name of nucleotide conversion plot when guides are not the same

* Fix jinja variables that aren't found

* Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

* Remove unneeded variable and extra whitespace

* Switch test branch to master

---------





* Add amplicon_name to plot functions

* Add sgRNA sequences to nucleotide quilt parameters in Aggregate

* Add custom_colors to Aggregate plot functions

* Update Aggregate and make_aggregate_report to have logger and tool

* Write command_used to Aggregate .json info file

* Point to new test branch and add Aggregate run

* Make the order of Aggregate runs explicit

* Sort all instances of crispresso2_folder_info in Aggregate

* Sort df_summary_quantification df in Aggregate

* Try sorting with a list of single column

* Update to correct test branch

* Move to master test branch

---------





* Display percentages in the CLI output (#88) (pinellolab#473)

* D3-Enhancements (#78)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Fix typo and move flexiguide to debug (#77)

* Change flexiguide output to debug level

* Fix typo in fastp merged output file name

* Adding id tags for d3 script enhancements

* pointing to test branch

* Add amplicon_name parameter to allele heatmap and line plots

* Add function to extract quantification window regions from include_idxs

* Scale the quantification window according to the coordinates of the sgRNA plot

* added c2pro check, added space in args.json

* Correct the quantification window indexes for multiple guides

* Fix name of nucleotide conversion plot when guides are not the same

* Fix jinja variables that aren't found

* Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

* Remove unneeded variable and extra whitespace

* Switch test branch to master

---------





* Display percentages in the CLI output

---------





* No pool (#79) (pinellolab#474)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Passing sgRNA sequences to regular and Batch D3 plots (#73)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Update integration_tests.yml to point back at master

---------





* Push new releases to ECR (#74)

* Create aws_ecr.yml (#1)

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* us-east-1

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Pass div id for plotly

* Remove debug

* Don't use thread pool with 1 process

* Fix logger issue

* Catchup

* Remove extra print statements

* Restrict generation of multiprocessing pool to when n_processes > 1

* Switch test branch to version bump

* Fix variable name error

* Change test branch back to master

* Fix CRISPRessoAggregate bug and other improvements (#95)

* D3-Enhancements (#78)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Fix typo and move flexiguide to debug (#77)

* Change flexiguide output to debug level

* Fix typo in fastp merged output file name

* Adding id tags for d3 script enhancements

* pointing to test branch

* Add amplicon_name parameter to allele heatmap and line plots

* Add function to extract quantification window regions from include_idxs

* Scale the quantification window according to the coordinates of the sgRNA plot

* added c2pro check, added space in args.json

* Correct the quantification window indexes for multiple guides

* Fix name of nucleotide conversion plot when guides are not the same

* Fix jinja variables that aren't found

* Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

* Remove unneeded variable and extra whitespace

* Switch test branch to master

---------





* Add amplicon_name to plot functions

* Add sgRNA sequences to nucleotide quilt parameters in Aggregate

* Add custom_colors to Aggregate plot functions

* Update Aggregate and make_aggregate_report to have logger and tool

* Write command_used to Aggregate .json info file

* Point to new test branch and add Aggregate run

* Make the order of Aggregate runs explicit

* Sort all instances of crispresso2_folder_info in Aggregate

* Sort df_summary_quantification df in Aggregate

* Try sorting with a list of single column

* Update to correct test branch

* Move to master test branch

---------





---------





* Add percent_complete to subprocess alignment

* Remove extraneous spaces

* Added more percent_complete statements to info blocks

---------





* Updated functions with newly merged progress percent from Cole

* Minor code cleanup, removal of comments and print statements etc

* Pointing test branch back at CRISPResso2_tests master branch

* Added checking for failed processes causing not all reads to be counted

---------

Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Cole Lyman <Cole@colelyman.com>
Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
Colelyman added a commit that referenced this issue Oct 2, 2024
* Mckay/c2pro reports test (#99)

* Fix CRISPRessoAggregate bug and other improvements (#95)

* D3-Enhancements (#78)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------

Co-authored-by: Cole Lyman <cole@colelyman.com>

* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------

Co-authored-by: Sam <snic9004@gmail.com>

* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------

Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Cole Lyman <cole@colelyman.com>

* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------

Co-authored-by: Cole Lyman <cole@colelyman.com>

* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------

Co-authored-by: Sam <snic9004@gmail.com>

* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------

Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Cole Lyman <cole@colelyman.com>

* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Fix typo and move flexiguide to debug (#77)

* Change flexiguide output to debug level

* Fix typo in fastp merged output file name

* Adding id tags for d3 script enhancements

* pointing to test branch

* Add amplicon_name parameter to allele heatmap and line plots

* Add function to extract quantification window regions from include_idxs

* Scale the quantification window according to the coordinates of the sgRNA plot

* added c2pro check, added space in args.json

* Correct the quantification window indexes for multiple guides

* Fix name of nucleotide conversion plot when guides are not the same

* Fix jinja variables that aren't found

* Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

* Remove unneeded variable and extra whitespace

* Switch test branch to master

---------

Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Cole Lyman <cole@colelyman.com>

* Add amplicon_name to plot functions

* Add sgRNA sequences to nucleotide quilt parameters in Aggregate

* Add custom_colors to Aggregate plot functions

* Update Aggregate and make_aggregate_report to have logger and tool

* Write command_used to Aggregate .json info file

* Point to new test branch and add Aggregate run

* Make the order of Aggregate runs explicit

* Sort all instances of crispresso2_folder_info in Aggregate

* Sort df_summary_quantification df in Aggregate

* Try sorting with a list of single column

* Update to correct test branch

* Move to master test branch

---------

Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

* Squashed commit of the following:

commit 6ec98a05ee70f85b5aa0ac15ab6094b7f1f20d08
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Tue Aug 13 16:44:39 2024 -0600

    dict key changes

commit 7cfd5acf06da4eb6f49453144ee1fed1e1488a7a
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Thu Aug 8 15:30:31 2024 -0600

    added C2PRO install check back

commit bfb0003329ea61b5c79c7e1df8d9a73ec5a508db
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Fri Aug 2 13:08:12 2024 -0600

    fixed key error conditionals

commit 84444e7480605206cb3efa4a0db675c55e717304
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Fri Aug 2 09:22:44 2024 -0600

    use local jinja_paritals file

commit 71dd12786fec6c4aba0170a3bfd9022b06f5eede
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Wed Jul 31 14:10:29 2024 -0600

    Squashed commit of the following:

    commit 5e3b30515c4bc437127e7fb21f53cb0bd511c4ca
    Author: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
    Date:   Mon Jul 22 09:31:44 2024 -0600

        D3-Enhancements (#78)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        * Provide sgRNA_sequences to plot_nucleotide_quilt plots

        * Passing sgRNA_sequences to plot

        * Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

        * Add max-height to Batch report samples

        * Change testing branch

        * Fix wrong check for large Batch plots

        * Fix typo and move flexiguide to debug (#77)

        * Change flexiguide output to debug level

        * Fix typo in fastp merged output file name

        * Adding id tags for d3 script enhancements

        * pointing to test branch

        * Add amplicon_name parameter to allele heatmap and line plots

        * Add function to extract quantification window regions from include_idxs

        * Scale the quantification window according to the coordinates of the sgRNA plot

        * added c2pro check, added space in args.json

        * Correct the quantification window indexes for multiple guides

        * Fix name of nucleotide conversion plot when guides are not the same

        * Fix jinja variables that aren't found

        * Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

        * Remove unneeded variable and extra whitespace

        * Switch test branch to master

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

    commit 09e5d9720ad21e44fc7916d71bde3fd7a9dfa7ef
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu Jul 18 14:31:54 2024 -0600

        Asymmetrical cut point (#457)

        * add cut_point_ind to plot_alleles_heatmap for asymmetrical plotting

        * Cole asymmetrical cut point (#453)

        * Pin versions of numpy and matplotlib in CI environment (#84) (#452)

        * Reduce duplication and implement cut_point_ind in plot_alleles_heatmap_hist

        ---------

        Co-authored-by: Cole Lyman <Cole@colelyman.com>

    commit 8d92972694ddff629dad844a6ad100459f69751d
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu Jul 18 14:29:40 2024 -0600

        Cole/update args (#85) (#456)

    commit 44f692ecabf5e2eb96ee0cfd7bae62343da7810c
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Mon Jul 15 16:17:29 2024 -0600

        Implement new pooled mixed-mode default behavior (#454)

        * changes for pooled mixed-mode default (#83)

        * changes for pooled mixed-mode default

        * deprecated old arg

        * added integration tests for mixed mode

        * fixed test target

        * updated test name

        * pinned numpy

        * Fix integration tests yml

        * pinning matplotlib

        * added print to CI tests

        * changed mixed mode info string

        * Remove pooled-mixed-mode-align-to-genome step from Github Actions

        * Update demultiplex_genome_wide parameter and help

        * Convert args.json to unix line endings

        * Add Pooled mixed mode demux run

        * Update the name of the argument in Pooled

        * Point integration tests back to master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Revert change to pooled mixed mode info statement (#86)

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit 79b482b55a0e8edbc03ec22bd2714bade1e90323
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Tue Jul 9 12:53:23 2024 -0600

        Pin versions of numpy and matplotlib in CI environment (#84) (#452)

    commit 80dc1bdd72d50f989717bfc5f8156bc3495c45f4
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu May 30 14:07:42 2024 -0600

        Add padding to image

    commit 381755daf0939aaf2745df0a802c809633aff47d
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu May 30 13:59:57 2024 -0600

        White background for schematic for dark mode

    commit d649db71e610bd8840fbb8d46fadb07789b67390
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Fri May 24 12:45:53 2024 -0600

        Fix typo and move flexiguide to debug (#77) (#438)

        * Change flexiguide output to debug level

        * Fix typo in fastp merged output file name

    commit 71181f50ef2b39015523b1a71d9fd1bf0dce14eb
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Mon May 13 13:34:00 2024 -0600

        Prefix the release Docker tag with a `v` (#434)

    commit d2c2be18a6bb64b0e742cc24c4665980a24324bc
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Mon May 13 09:41:32 2024 -0600

        Showing sgRNA sequences on hover in CRISPRessoPro (#432)

        * Passing sgRNA sequences to regular and Batch D3 plots (#73)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        * Provide sgRNA_sequences to plot_nucleotide_quilt plots

        * Passing sgRNA_sequences to plot

        * Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

        * Add max-height to Batch report samples

        * Change testing branch

        * Fix wrong check for large Batch plots

        * Update integration_tests.yml to point back at master

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Push new releases to ECR (#74)

        * Create aws_ecr.yml (#1)

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * us-east-1

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Fix d3 sgRNA sequences (#76)

        * Pass correct sgRNA_sequences to d3 plot

        * Pass correct sgRNA sequence to prime editor plot for d3

        * Resize plotly (#75)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        * Pass div id for plotly

        * Remove debug

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit 1c504274818b6b17fb60620d48fd92cb2e50566d
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu May 9 14:16:25 2024 -0600

        Fix plots and improve plot error handling (#431)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit acb2ea8e26dff4cd11f71301b344f81b1cec9040
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu May 2 13:49:33 2024 -0600

        Use recent docker image for CircleCI testing that includes updated pandas

    commit 38fd76dbd7ce2087468f9f454b548777de959a68
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Wed May 1 16:42:28 2024 -0600

        Cole/fix status file name (#69) (#430)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

    commit 3ec22e5fd09e432c9997d30e5f9ee51a2cc00d7b
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Wed May 1 13:08:11 2024 -0600

        Remove linked space in readme

    commit 340a4e16795a5e500411e11572ec267525985009
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Wed May 1 13:07:14 2024 -0600

        Fix batch mode pandas warning. (#70) (#429)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit 1bc9e906f0ded81f80761d1ec375ee50a4f882a9
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Fri Apr 26 16:26:27 2024 -0600

        Bump version to 2.3.1 and change default CRISPRessoPooled behavior to change in 2.3.2 (#428)

    commit 5638a1f6ffa973231f23422e9c757fa8cd4af7cc
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Wed Apr 24 18:00:43 2024 -0600

        Spelling fixes

    commit d6011f29db16d8fc1c1e7222457b7f9a1f671de6
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Wed Apr 24 09:33:53 2024 -0600

        Extract `jinja_partials` and fix CRISPRessoPooled fastp errors (#425)

        * Updated README (#64)

        * Updating README to fix argument, email, and formatting

        * removing superfluous files

        * Add link to CRISPRessoPro, move CRISPRessoPro section to end, and fix JSON formatting

        * Remove link to CRISPRessoPro

        * Replace Docker badge with link to tags

        * Add bullet points to Guardrails section and improve formatting

        * Fix typo and removed colons from guardrails

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Extract jinja_partials  (#65)

        * Extract jinja_partials code

        * Remove Plotly dependency from setup.py

        * Fix CRISPRessoPooled flash errors (#68)

        * Fix replacing flash intermediate files with fastp intermediate files

        This also moves where the files are added to `files_to_remove` up to
        near where they are created.

        * Update to run test branch with paired end Pooled test

        * Add pooled-paired-sim test to integration tests

        * Replace flash and trimmomatic with fastp and remove plotly from Github Actions environment

        * Change test branch back to master

        ---------

        Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>

    commit f4858a30c43374f54058b3ad9c1e965e1ab7fb46
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Tue Apr 23 17:00:28 2024 -0600

        Updated README (#64) (#424)

        * Updating README to fix argument, email, and formatting

        * removing superfluous files

        * Add link to CRISPRessoPro, move CRISPRessoPro section to end, and fix JSON formatting

        * Remove link to CRISPRessoPro

        * Replace Docker badge with link to tags

        * Add bullet points to Guardrails section and improve formatting

        * Fix typo and removed colons from guardrails

        ---------

        Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>

    commit c3dbff0fccd44b0b1a9c246dd2aa629ddc515787
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Mon Apr 22 11:24:59 2024 -0600

        Update CRISPRessoPooledCORE.py (#423)

        Fix bug in error reporting if duplicate names are present

    commit 20903c14877e5166b1b8a7b50b8fcab450ea3ca6
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu Apr 18 16:55:39 2024 -0600

        Remove extra imports from CRISPRessoCore (#67) (#422)

    commit 4aae57e5be475cd717792265bee36a71a99425de
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu Apr 18 10:00:19 2024 -0600

        Cole/refactor jinja undefined (#66) (#421)

        * Replace Jinja2 PackageLoader with FileSystemLoader

        The PackageLoader doesn't work with a fairly recent version of Jinja2 (3.0.1)
        and Python 3.9. Replacing with FileSystemLoader work with the older version and
        the latest version.

        * Fix undefined variable `amplicon_name` in report template

        * Refactor logging Jinja2 undefined variable warnings

        * Revert plot_11a update

        * Update intedration test branch

        * Update jinja to warn on undefined but not fail. Fix all undefined warnings

        * Fix github integration tests ref

        * One more undefined variable

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>

    commit 768c3c05bf1786a2a32e135b6e145cd6503c3db1
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Tue Apr 9 17:30:10 2024 -0600

        Fix Jinja2 undefined variables (#63) (#417)

        * Replace Jinja2 PackageLoader with FileSystemLoader

        The PackageLoader doesn't work with a fairly recent version of Jinja2 (3.0.1)
        and Python 3.9. Replacing with FileSystemLoader work with the older version and
        the latest version.

        * Fix undefined variable `amplicon_name` in report template

        * Revert plot_11a update

        * Update intedration test branch

        * Update branch for integration tests

    commit 7e18f08cc1ac5f247a0fd1bbb394ccd9b0a07c2e
    Author: Han Dai <github@daihan.me>
    Date:   Fri Apr 5 18:36:41 2024 -0400

        fix: change all U+00A0 to U+0020 (#400)

    commit 235dc29c0cd0fcca2e999148d4660acf00b07221
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Fri Apr 5 16:36:16 2024 -0600

        Fastp, args as data, guardrails, and PE fix (#415)

        * Change CRISPResso_status.txt format to JSON (#46)

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * add json read for status file

        * changed Formatter to json format

        * fixed json access variable name: message

        * changed  perentage_complete to numeric

        * changed status file to .json

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * New makefile commands

        * changed file to .json

        * changed status to json file

        * Make JSON human readable by adding new lines

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * Introduce pandas sorting in CRISPRessoCompare (#47)

        * New makefile commands

        * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

        * Extract out split_interleaved_fastq function to CRISPRessoShared

        * Implement splitting interleaved fastq files in CRISPRessoPooled

        * Suppress split_interleaved_input from CRISPRessoWGS parameters

        * Suppress other parameters in CRISPRessoWGS

        * Move where interleaved fastq files are split to be trimmed properly

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * On push no branches

        * On push no branches

        * All in one file

        * Fix yml errors

        * Rename jobs

        * Remove old workflow files

        * Remove paths

        * Run jobs in parallel

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Move read filtering to after merging in CRISPResso (#39)

        * Move read filtering to after merging

        This is in an effort to be consistent with the behavior and results of
        CRISPRessoPooled.

        * Properly assign the correct file names for read filtering

        * Add space around operators

        * GitHub actions on pr (#51)

        * Run integration tests on pull_request

        * Run pytest on pull_request

        * Run pylint on pull_request

        * Run tests on PR only when opening PR (#53)

        * Update reports (#52)

        * Update report changes

        * Switch branch of integration test repo

        * Remove extraneous `crispresso_data_path`

        * Point integration tests back to master

        * point to test branch

        * pointed CI config to testing branch

        * Update integration_tests.yml

        point to master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>
        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>

        * Trevor/fastp integration (#50)

        * Update check_program to check versions and create check_fastq function

        * Update fastq arg, implement fastp in get_most_frequent_reads

        * Bump version to 2.3.0

        * Deprecate Flash and Trimmomatic parameters, and update fastp params

        * Update guess_amplicons and guess_guides to remove max_paired_end_reads_overlap

        * Implement trimming of single end reads

        * Merge (and trim) reads in CRISPRessoCORE with fastp

        * Modify error handling to account for fastp errors

        * Replace flash and trimmomatic with fastp in Docker dependencies

        * Update LICENSE.txt with fastp info

        * Remove min and max amplicon length (no longer needed)

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Implement trimming with fastp in CRISPRessoPooled

        * Implemend merging (and trimming) with fastp in CRISPRessoPooled

        * Fixed minor fastp errors

        * Move read filtering to after merging in CRISPResso (#39)

        * Move read filtering to after merging

        This is in an effort to be consistent with the behavior and results of
        CRISPRessoPooled.

        * Properly assign the correct file names for read filtering

        * Add space around operators

        * GitHub actions on pr (#51)

        * Run integration tests on pull_request

        * Run pytest on pull_request

        * Run pylint on pull_request

        * Run tests on PR only when opening PR (#53)

        * Update reports (#52)

        * Update report changes

        * Switch branch of integration test repo

        * Remove extraneous `crispresso_data_path`

        * Point integration tests back to master

        * Update where the test point to

        * Fix 'Prime-edited' key not found (#32)

        * Move 'Prime-edited' amplicon name check

        By moving this, it will check if there is an amplicon named
        'Prime-edited' (which is a reserved name) even if the
        `prime_editing_pegRNA_extension_seq` parameter is empty.

        * Only search for scaffold integration when pegRNA extension seq is provided

        * Remove spaces at the end of lines

        * Docker size (#49)

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * Introduce pandas sorting in CRISPRessoCompare (#47)

        * New makefile commands

        * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

        * Extract out split_interleaved_fastq function to CRISPRessoShared

        * Implement splitting interleaved fastq files in CRISPRessoPooled

        * Suppress split_interleaved_input from CRISPRessoWGS parameters

        * Suppress other parameters in CRISPRessoWGS

        * Move where interleaved fastq files are split to be trimmed properly

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * On push no branches

        * On push no branches

        * All in one file

        * Fix yml errors

        * Rename jobs

        * Remove old workflow files

        * Remove paths

        * Run jobs in parallel

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * 3.4->2.08

        * Put ttf-mscorefonts-installer back above apt-get clean

        * restore slash, replace fastp with trimmomatic and flash, add autoremove step

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * initial readme modifications

        * Updated readme to remove deprecated commands, updated help text to reflect new version and fastp

        * Pointing test branch back at master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>

        * Guardrails clean history (#34)

        * Include guardrail functions

        * Add CRISPRessoReports subtree

        * Refactor to use CRISPRessoReports module

        * Include guardrail functions

        * Functional guardrails, needs reports update

        * Add guardrail partial

        * fix guardrials partial

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * Introduce pandas sorting in CRISPRessoCompare (#47)

        * New makefile commands

        * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

        * Extract out split_interleaved_fastq function to CRISPRessoShared

        * Implement splitting interleaved fastq files in CRISPRessoPooled

        * Suppress split_interleaved_input from CRISPRessoWGS parameters

        * Suppress other parameters in CRISPRessoWGS

        * Move where interleaved fastq files are split to be trimmed properly

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * On push no branches

        * On push no branches

        * All in one file

        * Fix yml errors

        * Rename jobs

        * Remove old workflow files

        * Remove paths

        * Run jobs in parallel

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Update C cythonized files

        * Add exact numbers to guardrails printouts

        * Remove extraneous whitespace from CRISPRessoCOREResources.pyx

        * Fix calculation of `total_mods` from being negative

        The issue was that `all_deletion_coordinates` just tells you how many deletions
        were present, but not how long the deletion is.

        * Changes to message

        * Remove old tag

        * Point tests at guardrails

        * Restore C2 pro check

        * Save message with guardrail name

        * Point tests repo at master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

        * Fix case sensitivity in Prime Editing mode (#54)

        * Move read filtering to after merging in CRISPResso (#39)

        * Move read filtering to after merging

        This is in an effort to be consistent with the behavior and results of
        CRISPRessoPooled.

        * Properly assign the correct file names for read filtering

        * Add space around operators

        * GitHub actions on pr (#51)

        * Run integration tests on pull_request

        * Run pytest on pull_request

        * Run pylint on pull_request

        * Run tests on PR only when opening PR (#53)

        * Update reports (#52)

        * Update report changes

        * Switch branch of integration test repo

        * Remove extraneous `crispresso_data_path`

        * Point integration tests back to master

        * Make all amplicons in amplicon_seq_arr uppercase

        This fixes https://github.com/pinellolab/CRISPResso2/issues/396

        * Allow RNA values to be provided for prime_editing_pegRNA_scaffold_seq

        * Fix 'Prime-edited' key not found (#32)

        * Move 'Prime-edited' amplicon name check

        By moving this, it will check if there is an amplicon named
        'Prime-edited' (which is a reserved name) even if the
        `prime_editing_pegRNA_extension_seq` parameter is empty.

        * Only search for scaffold integration when pegRNA extension seq is provided

        * Remove spaces at the end of lines

        * Docker size (#49)

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_nu…
mbowcut2 added a commit that referenced this issue Oct 14, 2024
* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Passing sgRNA sequences to regular and Batch D3 plots (#73)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Update integration_tests.yml to point back at master

---------





* Push new releases to ECR (#74)

* Create aws_ecr.yml (#1)

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* us-east-1

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Pass div id for plotly

* Remove debug

* Don't use thread pool with 1 process

* Fix logger issue

* Catchup

* Remove extra print statements

* Restrict generation of multiprocessing pool to when n_processes > 1

* Switch test branch to version bump

* Fix variable name error

* Change test branch back to master

* Fix CRISPRessoAggregate bug and other improvements (#95)

* D3-Enhancements (#78)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Fix typo and move flexiguide to debug (#77)

* Change flexiguide output to debug level

* Fix typo in fastp merged output file name

* Adding id tags for d3 script enhancements

* pointing to test branch

* Add amplicon_name parameter to allele heatmap and line plots

* Add function to extract quantification window regions from include_idxs

* Scale the quantification window according to the coordinates of the sgRNA plot

* added c2pro check, added space in args.json

* Correct the quantification window indexes for multiple guides

* Fix name of nucleotide conversion plot when guides are not the same

* Fix jinja variables that aren't found

* Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

* Remove unneeded variable and extra whitespace

* Switch test branch to master

---------





* Add amplicon_name to plot functions

* Add sgRNA sequences to nucleotide quilt parameters in Aggregate

* Add custom_colors to Aggregate plot functions

* Update Aggregate and make_aggregate_report to have logger and tool

* Write command_used to Aggregate .json info file

* Point to new test branch and add Aggregate run

* Make the order of Aggregate runs explicit

* Sort all instances of crispresso2_folder_info in Aggregate

* Sort df_summary_quantification df in Aggregate

* Try sorting with a list of single column

* Update to correct test branch

* Move to master test branch

---------





---------

Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
mbowcut2 added a commit that referenced this issue Oct 14, 2024
* Fix CRISPRessoAggregate bug and other improvements (#95)

* D3-Enhancements (#78)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Fix typo and move flexiguide to debug (#77)

* Change flexiguide output to debug level

* Fix typo in fastp merged output file name

* Adding id tags for d3 script enhancements

* pointing to test branch

* Add amplicon_name parameter to allele heatmap and line plots

* Add function to extract quantification window regions from include_idxs

* Scale the quantification window according to the coordinates of the sgRNA plot

* added c2pro check, added space in args.json

* Correct the quantification window indexes for multiple guides

* Fix name of nucleotide conversion plot when guides are not the same

* Fix jinja variables that aren't found

* Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

* Remove unneeded variable and extra whitespace

* Switch test branch to master

---------





* Add amplicon_name to plot functions

* Add sgRNA sequences to nucleotide quilt parameters in Aggregate

* Add custom_colors to Aggregate plot functions

* Update Aggregate and make_aggregate_report to have logger and tool

* Write command_used to Aggregate .json info file

* Point to new test branch and add Aggregate run

* Make the order of Aggregate runs explicit

* Sort all instances of crispresso2_folder_info in Aggregate

* Sort df_summary_quantification df in Aggregate

* Try sorting with a list of single column

* Update to correct test branch

* Move to master test branch

---------





* Squashed commit of the following:

commit 6ec98a05ee70f85b5aa0ac15ab6094b7f1f20d08
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Tue Aug 13 16:44:39 2024 -0600

    dict key changes

commit 7cfd5acf06da4eb6f49453144ee1fed1e1488a7a
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Thu Aug 8 15:30:31 2024 -0600

    added C2PRO install check back

commit bfb0003329ea61b5c79c7e1df8d9a73ec5a508db
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Fri Aug 2 13:08:12 2024 -0600

    fixed key error conditionals

commit 84444e7480605206cb3efa4a0db675c55e717304
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Fri Aug 2 09:22:44 2024 -0600

    use local jinja_paritals file

commit 71dd12786fec6c4aba0170a3bfd9022b06f5eede
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Wed Jul 31 14:10:29 2024 -0600

    Squashed commit of the following:

    commit 5e3b30515c4bc437127e7fb21f53cb0bd511c4ca
    Author: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
    Date:   Mon Jul 22 09:31:44 2024 -0600

        D3-Enhancements (#78)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        * Provide sgRNA_sequences to plot_nucleotide_quilt plots

        * Passing sgRNA_sequences to plot

        * Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

        * Add max-height to Batch report samples

        * Change testing branch

        * Fix wrong check for large Batch plots

        * Fix typo and move flexiguide to debug (#77)

        * Change flexiguide output to debug level

        * Fix typo in fastp merged output file name

        * Adding id tags for d3 script enhancements

        * pointing to test branch

        * Add amplicon_name parameter to allele heatmap and line plots

        * Add function to extract quantification window regions from include_idxs

        * Scale the quantification window according to the coordinates of the sgRNA plot

        * added c2pro check, added space in args.json

        * Correct the quantification window indexes for multiple guides

        * Fix name of nucleotide conversion plot when guides are not the same

        * Fix jinja variables that aren't found

        * Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

        * Remove unneeded variable and extra whitespace

        * Switch test branch to master

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

    commit 09e5d9720ad21e44fc7916d71bde3fd7a9dfa7ef
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu Jul 18 14:31:54 2024 -0600

        Asymmetrical cut point (#457)

        * add cut_point_ind to plot_alleles_heatmap for asymmetrical plotting

        * Cole asymmetrical cut point (#453)

        * Pin versions of numpy and matplotlib in CI environment (#84) (#452)

        * Reduce duplication and implement cut_point_ind in plot_alleles_heatmap_hist

        ---------

        Co-authored-by: Cole Lyman <Cole@colelyman.com>

    commit 8d92972694ddff629dad844a6ad100459f69751d
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu Jul 18 14:29:40 2024 -0600

        Cole/update args (#85) (#456)

    commit 44f692ecabf5e2eb96ee0cfd7bae62343da7810c
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Mon Jul 15 16:17:29 2024 -0600

        Implement new pooled mixed-mode default behavior (#454)

        * changes for pooled mixed-mode default (#83)

        * changes for pooled mixed-mode default

        * deprecated old arg

        * added integration tests for mixed mode

        * fixed test target

        * updated test name

        * pinned numpy

        * Fix integration tests yml

        * pinning matplotlib

        * added print to CI tests

        * changed mixed mode info string

        * Remove pooled-mixed-mode-align-to-genome step from Github Actions

        * Update demultiplex_genome_wide parameter and help

        * Convert args.json to unix line endings

        * Add Pooled mixed mode demux run

        * Update the name of the argument in Pooled

        * Point integration tests back to master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Revert change to pooled mixed mode info statement (#86)

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit 79b482b55a0e8edbc03ec22bd2714bade1e90323
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Tue Jul 9 12:53:23 2024 -0600

        Pin versions of numpy and matplotlib in CI environment (#84) (#452)

    commit 80dc1bdd72d50f989717bfc5f8156bc3495c45f4
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu May 30 14:07:42 2024 -0600

        Add padding to image

    commit 381755daf0939aaf2745df0a802c809633aff47d
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu May 30 13:59:57 2024 -0600

        White background for schematic for dark mode

    commit d649db71e610bd8840fbb8d46fadb07789b67390
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Fri May 24 12:45:53 2024 -0600

        Fix typo and move flexiguide to debug (#77) (#438)

        * Change flexiguide output to debug level

        * Fix typo in fastp merged output file name

    commit 71181f50ef2b39015523b1a71d9fd1bf0dce14eb
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Mon May 13 13:34:00 2024 -0600

        Prefix the release Docker tag with a `v` (#434)

    commit d2c2be18a6bb64b0e742cc24c4665980a24324bc
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Mon May 13 09:41:32 2024 -0600

        Showing sgRNA sequences on hover in CRISPRessoPro (#432)

        * Passing sgRNA sequences to regular and Batch D3 plots (#73)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        * Provide sgRNA_sequences to plot_nucleotide_quilt plots

        * Passing sgRNA_sequences to plot

        * Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

        * Add max-height to Batch report samples

        * Change testing branch

        * Fix wrong check for large Batch plots

        * Update integration_tests.yml to point back at master

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Push new releases to ECR (#74)

        * Create aws_ecr.yml (#1)

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * us-east-1

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Fix d3 sgRNA sequences (#76)

        * Pass correct sgRNA_sequences to d3 plot

        * Pass correct sgRNA sequence to prime editor plot for d3

        * Resize plotly (#75)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        * Pass div id for plotly

        * Remove debug

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit 1c504274818b6b17fb60620d48fd92cb2e50566d
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu May 9 14:16:25 2024 -0600

        Fix plots and improve plot error handling (#431)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit acb2ea8e26dff4cd11f71301b344f81b1cec9040
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu May 2 13:49:33 2024 -0600

        Use recent docker image for CircleCI testing that includes updated pandas

    commit 38fd76dbd7ce2087468f9f454b548777de959a68
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Wed May 1 16:42:28 2024 -0600

        Cole/fix status file name (#69) (#430)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

    commit 3ec22e5fd09e432c9997d30e5f9ee51a2cc00d7b
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Wed May 1 13:08:11 2024 -0600

        Remove linked space in readme

    commit 340a4e16795a5e500411e11572ec267525985009
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Wed May 1 13:07:14 2024 -0600

        Fix batch mode pandas warning. (#70) (#429)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit 1bc9e906f0ded81f80761d1ec375ee50a4f882a9
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Fri Apr 26 16:26:27 2024 -0600

        Bump version to 2.3.1 and change default CRISPRessoPooled behavior to change in 2.3.2 (#428)

    commit 5638a1f6ffa973231f23422e9c757fa8cd4af7cc
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Wed Apr 24 18:00:43 2024 -0600

        Spelling fixes

    commit d6011f29db16d8fc1c1e7222457b7f9a1f671de6
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Wed Apr 24 09:33:53 2024 -0600

        Extract `jinja_partials` and fix CRISPRessoPooled fastp errors (#425)

        * Updated README (#64)

        * Updating README to fix argument, email, and formatting

        * removing superfluous files

        * Add link to CRISPRessoPro, move CRISPRessoPro section to end, and fix JSON formatting

        * Remove link to CRISPRessoPro

        * Replace Docker badge with link to tags

        * Add bullet points to Guardrails section and improve formatting

        * Fix typo and removed colons from guardrails

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Extract jinja_partials  (#65)

        * Extract jinja_partials code

        * Remove Plotly dependency from setup.py

        * Fix CRISPRessoPooled flash errors (#68)

        * Fix replacing flash intermediate files with fastp intermediate files

        This also moves where the files are added to `files_to_remove` up to
        near where they are created.

        * Update to run test branch with paired end Pooled test

        * Add pooled-paired-sim test to integration tests

        * Replace flash and trimmomatic with fastp and remove plotly from Github Actions environment

        * Change test branch back to master

        ---------

        Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>

    commit f4858a30c43374f54058b3ad9c1e965e1ab7fb46
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Tue Apr 23 17:00:28 2024 -0600

        Updated README (#64) (#424)

        * Updating README to fix argument, email, and formatting

        * removing superfluous files

        * Add link to CRISPRessoPro, move CRISPRessoPro section to end, and fix JSON formatting

        * Remove link to CRISPRessoPro

        * Replace Docker badge with link to tags

        * Add bullet points to Guardrails section and improve formatting

        * Fix typo and removed colons from guardrails

        ---------

        Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>

    commit c3dbff0fccd44b0b1a9c246dd2aa629ddc515787
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Mon Apr 22 11:24:59 2024 -0600

        Update CRISPRessoPooledCORE.py (#423)

        Fix bug in error reporting if duplicate names are present

    commit 20903c14877e5166b1b8a7b50b8fcab450ea3ca6
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu Apr 18 16:55:39 2024 -0600

        Remove extra imports from CRISPRessoCore (#67) (#422)

    commit 4aae57e5be475cd717792265bee36a71a99425de
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu Apr 18 10:00:19 2024 -0600

        Cole/refactor jinja undefined (#66) (#421)

        * Replace Jinja2 PackageLoader with FileSystemLoader

        The PackageLoader doesn't work with a fairly recent version of Jinja2 (3.0.1)
        and Python 3.9. Replacing with FileSystemLoader work with the older version and
        the latest version.

        * Fix undefined variable `amplicon_name` in report template

        * Refactor logging Jinja2 undefined variable warnings

        * Revert plot_11a update

        * Update intedration test branch

        * Update jinja to warn on undefined but not fail. Fix all undefined warnings

        * Fix github integration tests ref

        * One more undefined variable

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>

    commit 768c3c05bf1786a2a32e135b6e145cd6503c3db1
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Tue Apr 9 17:30:10 2024 -0600

        Fix Jinja2 undefined variables (#63) (#417)

        * Replace Jinja2 PackageLoader with FileSystemLoader

        The PackageLoader doesn't work with a fairly recent version of Jinja2 (3.0.1)
        and Python 3.9. Replacing with FileSystemLoader work with the older version and
        the latest version.

        * Fix undefined variable `amplicon_name` in report template

        * Revert plot_11a update

        * Update intedration test branch

        * Update branch for integration tests

    commit 7e18f08cc1ac5f247a0fd1bbb394ccd9b0a07c2e
    Author: Han Dai <github@daihan.me>
    Date:   Fri Apr 5 18:36:41 2024 -0400

        fix: change all U+00A0 to U+0020 (#400)

    commit 235dc29c0cd0fcca2e999148d4660acf00b07221
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Fri Apr 5 16:36:16 2024 -0600

        Fastp, args as data, guardrails, and PE fix (#415)

        * Change CRISPResso_status.txt format to JSON (#46)

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * add json read for status file

        * changed Formatter to json format

        * fixed json access variable name: message

        * changed  perentage_complete to numeric

        * changed status file to .json

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * New makefile commands

        * changed file to .json

        * changed status to json file

        * Make JSON human readable by adding new lines

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * Introduce pandas sorting in CRISPRessoCompare (#47)

        * New makefile commands

        * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

        * Extract out split_interleaved_fastq function to CRISPRessoShared

        * Implement splitting interleaved fastq files in CRISPRessoPooled

        * Suppress split_interleaved_input from CRISPRessoWGS parameters

        * Suppress other parameters in CRISPRessoWGS

        * Move where interleaved fastq files are split to be trimmed properly

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * On push no branches

        * On push no branches

        * All in one file

        * Fix yml errors

        * Rename jobs

        * Remove old workflow files

        * Remove paths

        * Run jobs in parallel

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Move read filtering to after merging in CRISPResso (#39)

        * Move read filtering to after merging

        This is in an effort to be consistent with the behavior and results of
        CRISPRessoPooled.

        * Properly assign the correct file names for read filtering

        * Add space around operators

        * GitHub actions on pr (#51)

        * Run integration tests on pull_request

        * Run pytest on pull_request

        * Run pylint on pull_request

        * Run tests on PR only when opening PR (#53)

        * Update reports (#52)

        * Update report changes

        * Switch branch of integration test repo

        * Remove extraneous `crispresso_data_path`

        * Point integration tests back to master

        * point to test branch

        * pointed CI config to testing branch

        * Update integration_tests.yml

        point to master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>
        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>

        * Trevor/fastp integration (#50)

        * Update check_program to check versions and create check_fastq function

        * Update fastq arg, implement fastp in get_most_frequent_reads

        * Bump version to 2.3.0

        * Deprecate Flash and Trimmomatic parameters, and update fastp params

        * Update guess_amplicons and guess_guides to remove max_paired_end_reads_overlap

        * Implement trimming of single end reads

        * Merge (and trim) reads in CRISPRessoCORE with fastp

        * Modify error handling to account for fastp errors

        * Replace flash and trimmomatic with fastp in Docker dependencies

        * Update LICENSE.txt with fastp info

        * Remove min and max amplicon length (no longer needed)

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Implement trimming with fastp in CRISPRessoPooled

        * Implemend merging (and trimming) with fastp in CRISPRessoPooled

        * Fixed minor fastp errors

        * Move read filtering to after merging in CRISPResso (#39)

        * Move read filtering to after merging

        This is in an effort to be consistent with the behavior and results of
        CRISPRessoPooled.

        * Properly assign the correct file names for read filtering

        * Add space around operators

        * GitHub actions on pr (#51)

        * Run integration tests on pull_request

        * Run pytest on pull_request

        * Run pylint on pull_request

        * Run tests on PR only when opening PR (#53)

        * Update reports (#52)

        * Update report changes

        * Switch branch of integration test repo

        * Remove extraneous `crispresso_data_path`

        * Point integration tests back to master

        * Update where the test point to

        * Fix 'Prime-edited' key not found (#32)

        * Move 'Prime-edited' amplicon name check

        By moving this, it will check if there is an amplicon named
        'Prime-edited' (which is a reserved name) even if the
        `prime_editing_pegRNA_extension_seq` parameter is empty.

        * Only search for scaffold integration when pegRNA extension seq is provided

        * Remove spaces at the end of lines

        * Docker size (#49)

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * Introduce pandas sorting in CRISPRessoCompare (#47)

        * New makefile commands

        * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

        * Extract out split_interleaved_fastq function to CRISPRessoShared

        * Implement splitting interleaved fastq files in CRISPRessoPooled

        * Suppress split_interleaved_input from CRISPRessoWGS parameters

        * Suppress other parameters in CRISPRessoWGS

        * Move where interleaved fastq files are split to be trimmed properly

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * On push no branches

        * On push no branches

        * All in one file

        * Fix yml errors

        * Rename jobs

        * Remove old workflow files

        * Remove paths

        * Run jobs in parallel

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * 3.4->2.08

        * Put ttf-mscorefonts-installer back above apt-get clean

        * restore slash, replace fastp with trimmomatic and flash, add autoremove step

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * initial readme modifications

        * Updated readme to remove deprecated commands, updated help text to reflect new version and fastp

        * Pointing test branch back at master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>

        * Guardrails clean history (#34)

        * Include guardrail functions

        * Add CRISPRessoReports subtree

        * Refactor to use CRISPRessoReports module

        * Include guardrail functions

        * Functional guardrails, needs reports update

        * Add guardrail partial

        * fix guardrials partial

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * Introduce pandas sorting in CRISPRessoCompare (#47)

        * New makefile commands

        * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

        * Extract out split_interleaved_fastq function to CRISPRessoShared

        * Implement splitting interleaved fastq files in CRISPRessoPooled

        * Suppress split_interleaved_input from CRISPRessoWGS parameters

        * Suppress other parameters in CRISPRessoWGS

        * Move where interleaved fastq files are split to be trimmed properly

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * On push no branches

        * On push no branches

        * All in one file

        * Fix yml errors

        * Rename jobs

        * Remove old workflow files

        * Remove paths

        * Run jobs in parallel

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Update C cythonized files

        * Add exact numbers to guardrails printouts

        * Remove extraneous whitespace from CRISPRessoCOREResources.pyx

        * Fix calculation of `total_mods` from being negative

        The issue was that `all_deletion_coordinates` just tells you how many deletions
        were present, but not how long the deletion is.

        * Changes to message

        * Remove old tag

        * Point tests at guardrails

        * Restore C2 pro check

        * Save message with guardrail name

        * Point tests repo at master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

        * Fix case sensitivity in Prime Editing mode (#54)

        * Move read filtering to after merging in CRISPResso (#39)

        * Move read filtering to after merging

        This is in an effort to be consistent with the behavior and results of
        CRISPRessoPooled.

        * Properly assign the correct file names for read filtering

        * Add space around operators

        * GitHub actions on pr (#51)

        * Run integration tests on pull_request

        * Run pytest on pull_request

        * Run pylint on pull_request

        * Run tests on PR only when opening PR (#53)

        * Update reports (#52)

        * Update report changes

        * Switch branch of integration test repo

        * Remove extraneous `crispresso_data_path`

        * Point integration tests back to master

        * Make all amplicons in amplicon_seq_arr uppercase

        This fixes https://github.com/pinellolab/CRISPResso2/issues/396

        * Allow RNA values to be provided for prime_editing_pegRNA_scaffold_seq

        * Fix 'Prime-edited' key not found (#32)

        * Move 'Prime-edited' amplicon name check

        By moving this, it will check if there is an amplicon named
        'Prime-edited' (which is a reserved name) even if the
        `prime_editing_pegRNA_extension_seq` parameter is empty.

        * Only search for scaffold integration when pegRNA extension seq is provided

        * Remove spaces at the end of lines

        * Docker size (#49)

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ----…

Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
mbowcut2 added a commit that referenced this issue Oct 14, 2024
* Initial parallization work

* Initial parallization work

* Fleshed out process function, added tracking for manager dictionary

* lots of debugging of the process function

* parallelization achieved

* Improved boundary function

* Removing prints

* removing old code

* Failed cache generator function

* Created single thread seq_cache generator

* initial functioning parallelization

* adding data to variantCache and N_ constants

* more edits

* replacing old aln stats

* Fixed return values

* changing output file

* fixed boundary error

* Adding more tracking of timings

* optimized stat tracking considerably

* removing imports and unneccesary checks

* more logging

* Unbalancing the processes to allow for better locking interactions

* Adding amplicon name to plotting

* fixing stat tracking

* removed old code

* pin numpy

* Pin versions of numpy and matplotlib in CI environment (#84)

* D3-Enhancements (#78)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Fix typo and move flexiguide to debug (#77)

* Change flexiguide output to debug level

* Fix typo in fastp merged output file name

* Adding id tags for d3 script enhancements

* pointing to test branch

* Add amplicon_name parameter to allele heatmap and line plots

* Add function to extract quantification window regions from include_idxs

* Scale the quantification window according to the coordinates of the sgRNA plot

* added c2pro check, added space in args.json

* Correct the quantification window indexes for multiple guides

* Fix name of nucleotide conversion plot when guides are not the same

* Fix jinja variables that aren't found

* Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

* Remove unneeded variable and extra whitespace

* Switch test branch to master

---------





* old changes

* removed some superfluous code

* removed some superfluous code

* reverting to old flow if one process

* Enriched aln_stats with aligned read_length

* fixed declaration bug

* Fixing stats tracking for irregular reads

* fixed stat tracking by multiplying variant count

* adding some function explanations, cleaning up code, optimizing lock updating

* Cleaned up code, potentially breaks tests

* Revert "Cleaned up code, potentially breaks tests"

This reverts commit e7cfe3d.

* Revert "adding some function explanations, cleaning up code, optimizing lock updating"

This reverts commit 96deee2.

* replaced test branch, reverted to working version, added .upate() inside lock

* checking old cython branch

* reverting breaking changes

* pointing back at my test branch, unreverting non breaking changes

* timing log change

* pin numpy

* Pin versions of numpy and matplotlib in CI environment (#84)

* pointing to correct test branch

* removed debug statement

* memory tracking

* Replaced seq_cache with variantCache for improved memory

* removing print statements

* commenting out memory tracking

* Removing one extra loop to improve processing time

* optimizing info, removing old functions and print statements

* removing empty string handling

* removed empty string, reworded comments

* removing uneccesary files

* Literally minding my p's and q's

* Reset .c files

* Editing function comments, renaming managerCache, removing debug print statement

* Initial attempt at creating a temp_variant file for memory optimization

* update

* reading and writing to tsv to save on memory

* Merged in memory optimization branch

* Updating pytests for equal boundary changes

* Refactored write out fastq, improved tests

* Replace zcat (#94)

* D3-Enhancements (#78)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Fix typo and move flexiguide to debug (#77)

* Change flexiguide output to debug level

* Fix typo in fastp merged output file name

* Adding id tags for d3 script enhancements

* pointing to test branch

* Add amplicon_name parameter to allele heatmap and line plots

* Add function to extract quantification window regions from include_idxs

* Scale the quantification window according to the coordinates of the sgRNA plot

* added c2pro check, added space in args.json

* Correct the quantification window indexes for multiple guides

* Fix name of nucleotide conversion plot when guides are not the same

* Fix jinja variables that aren't found

* Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

* Remove unneeded variable and extra whitespace

* Switch test branch to master

---------





* Replace zcat with gunzip -c in `get_most_frequent_reads`

---------





* Limiting plotting processes

* Fix CRISPRessoAggregate bug and other improvements (#95)

* D3-Enhancements (#78)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Fix typo and move flexiguide to debug (#77)

* Change flexiguide output to debug level

* Fix typo in fastp merged output file name

* Adding id tags for d3 script enhancements

* pointing to test branch

* Add amplicon_name parameter to allele heatmap and line plots

* Add function to extract quantification window regions from include_idxs

* Scale the quantification window according to the coordinates of the sgRNA plot

* added c2pro check, added space in args.json

* Correct the quantification window indexes for multiple guides

* Fix name of nucleotide conversion plot when guides are not the same

* Fix jinja variables that aren't found

* Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

* Remove unneeded variable and extra whitespace

* Switch test branch to master

---------





* Add amplicon_name to plot functions

* Add sgRNA sequences to nucleotide quilt parameters in Aggregate

* Add custom_colors to Aggregate plot functions

* Update Aggregate and make_aggregate_report to have logger and tool

* Write command_used to Aggregate .json info file

* Point to new test branch and add Aggregate run

* Make the order of Aggregate runs explicit

* Sort all instances of crispresso2_folder_info in Aggregate

* Sort df_summary_quantification df in Aggregate

* Try sorting with a list of single column

* Update to correct test branch

* Move to master test branch

---------





* Add BAM test cases to Github Actions integration tests

* Add "Run" to step names

* Point to trevor/fastq-testing test branch

* Add in Aggregate test case to Github Actions

* parallelizing process_single_fastq_write_bam_out function

* wip parallelization for process_bam function

* Fixing write out bug

* Functional bam input parallelized function

* cleaned up code, removed old versions of functions

* trying again

* Remove duplicate c2_tests

* Remove commented out copy c2_tests

* Updating file access to close on error, removing unnecesary bam appendings

* Add `percent_complete`to info statements (#102)

* Fix CRISPRessoAggregate bug and other improvements (#95) (pinellolab#470)

* D3-Enhancements (#78)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Fix typo and move flexiguide to debug (#77)

* Change flexiguide output to debug level

* Fix typo in fastp merged output file name

* Adding id tags for d3 script enhancements

* pointing to test branch

* Add amplicon_name parameter to allele heatmap and line plots

* Add function to extract quantification window regions from include_idxs

* Scale the quantification window according to the coordinates of the sgRNA plot

* added c2pro check, added space in args.json

* Correct the quantification window indexes for multiple guides

* Fix name of nucleotide conversion plot when guides are not the same

* Fix jinja variables that aren't found

* Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

* Remove unneeded variable and extra whitespace

* Switch test branch to master

---------





* Add amplicon_name to plot functions

* Add sgRNA sequences to nucleotide quilt parameters in Aggregate

* Add custom_colors to Aggregate plot functions

* Update Aggregate and make_aggregate_report to have logger and tool

* Write command_used to Aggregate .json info file

* Point to new test branch and add Aggregate run

* Make the order of Aggregate runs explicit

* Sort all instances of crispresso2_folder_info in Aggregate

* Sort df_summary_quantification df in Aggregate

* Try sorting with a list of single column

* Update to correct test branch

* Move to master test branch

---------





* Display percentages in the CLI output (#88) (pinellolab#473)

* D3-Enhancements (#78)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Fix typo and move flexiguide to debug (#77)

* Change flexiguide output to debug level

* Fix typo in fastp merged output file name

* Adding id tags for d3 script enhancements

* pointing to test branch

* Add amplicon_name parameter to allele heatmap and line plots

* Add function to extract quantification window regions from include_idxs

* Scale the quantification window according to the coordinates of the sgRNA plot

* added c2pro check, added space in args.json

* Correct the quantification window indexes for multiple guides

* Fix name of nucleotide conversion plot when guides are not the same

* Fix jinja variables that aren't found

* Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

* Remove unneeded variable and extra whitespace

* Switch test branch to master

---------





* Display percentages in the CLI output

---------





* No pool (#79) (pinellolab#474)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Passing sgRNA sequences to regular and Batch D3 plots (#73)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Update integration_tests.yml to point back at master

---------





* Push new releases to ECR (#74)

* Create aws_ecr.yml (#1)

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* us-east-1

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Pass div id for plotly

* Remove debug

* Don't use thread pool with 1 process

* Fix logger issue

* Catchup

* Remove extra print statements

* Restrict generation of multiprocessing pool to when n_processes > 1

* Switch test branch to version bump

* Fix variable name error

* Change test branch back to master

* Fix CRISPRessoAggregate bug and other improvements (#95)

* D3-Enhancements (#78)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Fix typo and move flexiguide to debug (#77)

* Change flexiguide output to debug level

* Fix typo in fastp merged output file name

* Adding id tags for d3 script enhancements

* pointing to test branch

* Add amplicon_name parameter to allele heatmap and line plots

* Add function to extract quantification window regions from include_idxs

* Scale the quantification window according to the coordinates of the sgRNA plot

* added c2pro check, added space in args.json

* Correct the quantification window indexes for multiple guides

* Fix name of nucleotide conversion plot when guides are not the same

* Fix jinja variables that aren't found

* Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

* Remove unneeded variable and extra whitespace

* Switch test branch to master

---------





* Add amplicon_name to plot functions

* Add sgRNA sequences to nucleotide quilt parameters in Aggregate

* Add custom_colors to Aggregate plot functions

* Update Aggregate and make_aggregate_report to have logger and tool

* Write command_used to Aggregate .json info file

* Point to new test branch and add Aggregate run

* Make the order of Aggregate runs explicit

* Sort all instances of crispresso2_folder_info in Aggregate

* Sort df_summary_quantification df in Aggregate

* Try sorting with a list of single column

* Update to correct test branch

* Move to master test branch

---------





---------





* Add percent_complete to subprocess alignment

* Remove extraneous spaces

* Added more percent_complete statements to info blocks

---------





* Updated functions with newly merged progress percent from Cole

* Minor code cleanup, removal of comments and print statements etc

* Pointing test branch back at CRISPResso2_tests master branch

* Added checking for failed processes causing not all reads to be counted

---------

Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Cole Lyman <Cole@colelyman.com>
Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
mbowcut2 added a commit that referenced this issue Oct 14, 2024
* Mckay/c2pro reports test (#99)

* Fix CRISPRessoAggregate bug and other improvements (#95)

* D3-Enhancements (#78)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------

Co-authored-by: Cole Lyman <cole@colelyman.com>

* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------

Co-authored-by: Sam <snic9004@gmail.com>

* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------

Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Cole Lyman <cole@colelyman.com>

* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------

Co-authored-by: Cole Lyman <cole@colelyman.com>

* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------

Co-authored-by: Sam <snic9004@gmail.com>

* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------

Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Cole Lyman <cole@colelyman.com>

* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Fix typo and move flexiguide to debug (#77)

* Change flexiguide output to debug level

* Fix typo in fastp merged output file name

* Adding id tags for d3 script enhancements

* pointing to test branch

* Add amplicon_name parameter to allele heatmap and line plots

* Add function to extract quantification window regions from include_idxs

* Scale the quantification window according to the coordinates of the sgRNA plot

* added c2pro check, added space in args.json

* Correct the quantification window indexes for multiple guides

* Fix name of nucleotide conversion plot when guides are not the same

* Fix jinja variables that aren't found

* Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

* Remove unneeded variable and extra whitespace

* Switch test branch to master

---------

Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Cole Lyman <cole@colelyman.com>

* Add amplicon_name to plot functions

* Add sgRNA sequences to nucleotide quilt parameters in Aggregate

* Add custom_colors to Aggregate plot functions

* Update Aggregate and make_aggregate_report to have logger and tool

* Write command_used to Aggregate .json info file

* Point to new test branch and add Aggregate run

* Make the order of Aggregate runs explicit

* Sort all instances of crispresso2_folder_info in Aggregate

* Sort df_summary_quantification df in Aggregate

* Try sorting with a list of single column

* Update to correct test branch

* Move to master test branch

---------

Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

* Squashed commit of the following:

commit 6ec98a05ee70f85b5aa0ac15ab6094b7f1f20d08
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Tue Aug 13 16:44:39 2024 -0600

    dict key changes

commit 7cfd5acf06da4eb6f49453144ee1fed1e1488a7a
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Thu Aug 8 15:30:31 2024 -0600

    added C2PRO install check back

commit bfb0003329ea61b5c79c7e1df8d9a73ec5a508db
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Fri Aug 2 13:08:12 2024 -0600

    fixed key error conditionals

commit 84444e7480605206cb3efa4a0db675c55e717304
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Fri Aug 2 09:22:44 2024 -0600

    use local jinja_paritals file

commit 71dd12786fec6c4aba0170a3bfd9022b06f5eede
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Wed Jul 31 14:10:29 2024 -0600

    Squashed commit of the following:

    commit 5e3b30515c4bc437127e7fb21f53cb0bd511c4ca
    Author: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
    Date:   Mon Jul 22 09:31:44 2024 -0600

        D3-Enhancements (#78)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        * Provide sgRNA_sequences to plot_nucleotide_quilt plots

        * Passing sgRNA_sequences to plot

        * Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

        * Add max-height to Batch report samples

        * Change testing branch

        * Fix wrong check for large Batch plots

        * Fix typo and move flexiguide to debug (#77)

        * Change flexiguide output to debug level

        * Fix typo in fastp merged output file name

        * Adding id tags for d3 script enhancements

        * pointing to test branch

        * Add amplicon_name parameter to allele heatmap and line plots

        * Add function to extract quantification window regions from include_idxs

        * Scale the quantification window according to the coordinates of the sgRNA plot

        * added c2pro check, added space in args.json

        * Correct the quantification window indexes for multiple guides

        * Fix name of nucleotide conversion plot when guides are not the same

        * Fix jinja variables that aren't found

        * Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

        * Remove unneeded variable and extra whitespace

        * Switch test branch to master

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

    commit 09e5d9720ad21e44fc7916d71bde3fd7a9dfa7ef
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu Jul 18 14:31:54 2024 -0600

        Asymmetrical cut point (#457)

        * add cut_point_ind to plot_alleles_heatmap for asymmetrical plotting

        * Cole asymmetrical cut point (#453)

        * Pin versions of numpy and matplotlib in CI environment (#84) (#452)

        * Reduce duplication and implement cut_point_ind in plot_alleles_heatmap_hist

        ---------

        Co-authored-by: Cole Lyman <Cole@colelyman.com>

    commit 8d92972694ddff629dad844a6ad100459f69751d
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu Jul 18 14:29:40 2024 -0600

        Cole/update args (#85) (#456)

    commit 44f692ecabf5e2eb96ee0cfd7bae62343da7810c
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Mon Jul 15 16:17:29 2024 -0600

        Implement new pooled mixed-mode default behavior (#454)

        * changes for pooled mixed-mode default (#83)

        * changes for pooled mixed-mode default

        * deprecated old arg

        * added integration tests for mixed mode

        * fixed test target

        * updated test name

        * pinned numpy

        * Fix integration tests yml

        * pinning matplotlib

        * added print to CI tests

        * changed mixed mode info string

        * Remove pooled-mixed-mode-align-to-genome step from Github Actions

        * Update demultiplex_genome_wide parameter and help

        * Convert args.json to unix line endings

        * Add Pooled mixed mode demux run

        * Update the name of the argument in Pooled

        * Point integration tests back to master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Revert change to pooled mixed mode info statement (#86)

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit 79b482b55a0e8edbc03ec22bd2714bade1e90323
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Tue Jul 9 12:53:23 2024 -0600

        Pin versions of numpy and matplotlib in CI environment (#84) (#452)

    commit 80dc1bdd72d50f989717bfc5f8156bc3495c45f4
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu May 30 14:07:42 2024 -0600

        Add padding to image

    commit 381755daf0939aaf2745df0a802c809633aff47d
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu May 30 13:59:57 2024 -0600

        White background for schematic for dark mode

    commit d649db71e610bd8840fbb8d46fadb07789b67390
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Fri May 24 12:45:53 2024 -0600

        Fix typo and move flexiguide to debug (#77) (#438)

        * Change flexiguide output to debug level

        * Fix typo in fastp merged output file name

    commit 71181f50ef2b39015523b1a71d9fd1bf0dce14eb
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Mon May 13 13:34:00 2024 -0600

        Prefix the release Docker tag with a `v` (#434)

    commit d2c2be18a6bb64b0e742cc24c4665980a24324bc
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Mon May 13 09:41:32 2024 -0600

        Showing sgRNA sequences on hover in CRISPRessoPro (#432)

        * Passing sgRNA sequences to regular and Batch D3 plots (#73)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        * Provide sgRNA_sequences to plot_nucleotide_quilt plots

        * Passing sgRNA_sequences to plot

        * Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

        * Add max-height to Batch report samples

        * Change testing branch

        * Fix wrong check for large Batch plots

        * Update integration_tests.yml to point back at master

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Push new releases to ECR (#74)

        * Create aws_ecr.yml (#1)

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * us-east-1

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Fix d3 sgRNA sequences (#76)

        * Pass correct sgRNA_sequences to d3 plot

        * Pass correct sgRNA sequence to prime editor plot for d3

        * Resize plotly (#75)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        * Pass div id for plotly

        * Remove debug

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit 1c504274818b6b17fb60620d48fd92cb2e50566d
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu May 9 14:16:25 2024 -0600

        Fix plots and improve plot error handling (#431)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit acb2ea8e26dff4cd11f71301b344f81b1cec9040
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu May 2 13:49:33 2024 -0600

        Use recent docker image for CircleCI testing that includes updated pandas

    commit 38fd76dbd7ce2087468f9f454b548777de959a68
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Wed May 1 16:42:28 2024 -0600

        Cole/fix status file name (#69) (#430)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

    commit 3ec22e5fd09e432c9997d30e5f9ee51a2cc00d7b
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Wed May 1 13:08:11 2024 -0600

        Remove linked space in readme

    commit 340a4e16795a5e500411e11572ec267525985009
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Wed May 1 13:07:14 2024 -0600

        Fix batch mode pandas warning. (#70) (#429)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit 1bc9e906f0ded81f80761d1ec375ee50a4f882a9
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Fri Apr 26 16:26:27 2024 -0600

        Bump version to 2.3.1 and change default CRISPRessoPooled behavior to change in 2.3.2 (#428)

    commit 5638a1f6ffa973231f23422e9c757fa8cd4af7cc
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Wed Apr 24 18:00:43 2024 -0600

        Spelling fixes

    commit d6011f29db16d8fc1c1e7222457b7f9a1f671de6
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Wed Apr 24 09:33:53 2024 -0600

        Extract `jinja_partials` and fix CRISPRessoPooled fastp errors (#425)

        * Updated README (#64)

        * Updating README to fix argument, email, and formatting

        * removing superfluous files

        * Add link to CRISPRessoPro, move CRISPRessoPro section to end, and fix JSON formatting

        * Remove link to CRISPRessoPro

        * Replace Docker badge with link to tags

        * Add bullet points to Guardrails section and improve formatting

        * Fix typo and removed colons from guardrails

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Extract jinja_partials  (#65)

        * Extract jinja_partials code

        * Remove Plotly dependency from setup.py

        * Fix CRISPRessoPooled flash errors (#68)

        * Fix replacing flash intermediate files with fastp intermediate files

        This also moves where the files are added to `files_to_remove` up to
        near where they are created.

        * Update to run test branch with paired end Pooled test

        * Add pooled-paired-sim test to integration tests

        * Replace flash and trimmomatic with fastp and remove plotly from Github Actions environment

        * Change test branch back to master

        ---------

        Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>

    commit f4858a30c43374f54058b3ad9c1e965e1ab7fb46
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Tue Apr 23 17:00:28 2024 -0600

        Updated README (#64) (#424)

        * Updating README to fix argument, email, and formatting

        * removing superfluous files

        * Add link to CRISPRessoPro, move CRISPRessoPro section to end, and fix JSON formatting

        * Remove link to CRISPRessoPro

        * Replace Docker badge with link to tags

        * Add bullet points to Guardrails section and improve formatting

        * Fix typo and removed colons from guardrails

        ---------

        Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>

    commit c3dbff0fccd44b0b1a9c246dd2aa629ddc515787
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Mon Apr 22 11:24:59 2024 -0600

        Update CRISPRessoPooledCORE.py (#423)

        Fix bug in error reporting if duplicate names are present

    commit 20903c14877e5166b1b8a7b50b8fcab450ea3ca6
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu Apr 18 16:55:39 2024 -0600

        Remove extra imports from CRISPRessoCore (#67) (#422)

    commit 4aae57e5be475cd717792265bee36a71a99425de
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu Apr 18 10:00:19 2024 -0600

        Cole/refactor jinja undefined (#66) (#421)

        * Replace Jinja2 PackageLoader with FileSystemLoader

        The PackageLoader doesn't work with a fairly recent version of Jinja2 (3.0.1)
        and Python 3.9. Replacing with FileSystemLoader work with the older version and
        the latest version.

        * Fix undefined variable `amplicon_name` in report template

        * Refactor logging Jinja2 undefined variable warnings

        * Revert plot_11a update

        * Update intedration test branch

        * Update jinja to warn on undefined but not fail. Fix all undefined warnings

        * Fix github integration tests ref

        * One more undefined variable

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>

    commit 768c3c05bf1786a2a32e135b6e145cd6503c3db1
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Tue Apr 9 17:30:10 2024 -0600

        Fix Jinja2 undefined variables (#63) (#417)

        * Replace Jinja2 PackageLoader with FileSystemLoader

        The PackageLoader doesn't work with a fairly recent version of Jinja2 (3.0.1)
        and Python 3.9. Replacing with FileSystemLoader work with the older version and
        the latest version.

        * Fix undefined variable `amplicon_name` in report template

        * Revert plot_11a update

        * Update intedration test branch

        * Update branch for integration tests

    commit 7e18f08cc1ac5f247a0fd1bbb394ccd9b0a07c2e
    Author: Han Dai <github@daihan.me>
    Date:   Fri Apr 5 18:36:41 2024 -0400

        fix: change all U+00A0 to U+0020 (#400)

    commit 235dc29c0cd0fcca2e999148d4660acf00b07221
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Fri Apr 5 16:36:16 2024 -0600

        Fastp, args as data, guardrails, and PE fix (#415)

        * Change CRISPResso_status.txt format to JSON (#46)

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * add json read for status file

        * changed Formatter to json format

        * fixed json access variable name: message

        * changed  perentage_complete to numeric

        * changed status file to .json

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * New makefile commands

        * changed file to .json

        * changed status to json file

        * Make JSON human readable by adding new lines

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * Introduce pandas sorting in CRISPRessoCompare (#47)

        * New makefile commands

        * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

        * Extract out split_interleaved_fastq function to CRISPRessoShared

        * Implement splitting interleaved fastq files in CRISPRessoPooled

        * Suppress split_interleaved_input from CRISPRessoWGS parameters

        * Suppress other parameters in CRISPRessoWGS

        * Move where interleaved fastq files are split to be trimmed properly

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * On push no branches

        * On push no branches

        * All in one file

        * Fix yml errors

        * Rename jobs

        * Remove old workflow files

        * Remove paths

        * Run jobs in parallel

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Move read filtering to after merging in CRISPResso (#39)

        * Move read filtering to after merging

        This is in an effort to be consistent with the behavior and results of
        CRISPRessoPooled.

        * Properly assign the correct file names for read filtering

        * Add space around operators

        * GitHub actions on pr (#51)

        * Run integration tests on pull_request

        * Run pytest on pull_request

        * Run pylint on pull_request

        * Run tests on PR only when opening PR (#53)

        * Update reports (#52)

        * Update report changes

        * Switch branch of integration test repo

        * Remove extraneous `crispresso_data_path`

        * Point integration tests back to master

        * point to test branch

        * pointed CI config to testing branch

        * Update integration_tests.yml

        point to master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>
        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>

        * Trevor/fastp integration (#50)

        * Update check_program to check versions and create check_fastq function

        * Update fastq arg, implement fastp in get_most_frequent_reads

        * Bump version to 2.3.0

        * Deprecate Flash and Trimmomatic parameters, and update fastp params

        * Update guess_amplicons and guess_guides to remove max_paired_end_reads_overlap

        * Implement trimming of single end reads

        * Merge (and trim) reads in CRISPRessoCORE with fastp

        * Modify error handling to account for fastp errors

        * Replace flash and trimmomatic with fastp in Docker dependencies

        * Update LICENSE.txt with fastp info

        * Remove min and max amplicon length (no longer needed)

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Implement trimming with fastp in CRISPRessoPooled

        * Implemend merging (and trimming) with fastp in CRISPRessoPooled

        * Fixed minor fastp errors

        * Move read filtering to after merging in CRISPResso (#39)

        * Move read filtering to after merging

        This is in an effort to be consistent with the behavior and results of
        CRISPRessoPooled.

        * Properly assign the correct file names for read filtering

        * Add space around operators

        * GitHub actions on pr (#51)

        * Run integration tests on pull_request

        * Run pytest on pull_request

        * Run pylint on pull_request

        * Run tests on PR only when opening PR (#53)

        * Update reports (#52)

        * Update report changes

        * Switch branch of integration test repo

        * Remove extraneous `crispresso_data_path`

        * Point integration tests back to master

        * Update where the test point to

        * Fix 'Prime-edited' key not found (#32)

        * Move 'Prime-edited' amplicon name check

        By moving this, it will check if there is an amplicon named
        'Prime-edited' (which is a reserved name) even if the
        `prime_editing_pegRNA_extension_seq` parameter is empty.

        * Only search for scaffold integration when pegRNA extension seq is provided

        * Remove spaces at the end of lines

        * Docker size (#49)

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * Introduce pandas sorting in CRISPRessoCompare (#47)

        * New makefile commands

        * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

        * Extract out split_interleaved_fastq function to CRISPRessoShared

        * Implement splitting interleaved fastq files in CRISPRessoPooled

        * Suppress split_interleaved_input from CRISPRessoWGS parameters

        * Suppress other parameters in CRISPRessoWGS

        * Move where interleaved fastq files are split to be trimmed properly

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * On push no branches

        * On push no branches

        * All in one file

        * Fix yml errors

        * Rename jobs

        * Remove old workflow files

        * Remove paths

        * Run jobs in parallel

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * 3.4->2.08

        * Put ttf-mscorefonts-installer back above apt-get clean

        * restore slash, replace fastp with trimmomatic and flash, add autoremove step

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * initial readme modifications

        * Updated readme to remove deprecated commands, updated help text to reflect new version and fastp

        * Pointing test branch back at master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>

        * Guardrails clean history (#34)

        * Include guardrail functions

        * Add CRISPRessoReports subtree

        * Refactor to use CRISPRessoReports module

        * Include guardrail functions

        * Functional guardrails, needs reports update

        * Add guardrail partial

        * fix guardrials partial

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * Introduce pandas sorting in CRISPRessoCompare (#47)

        * New makefile commands

        * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

        * Extract out split_interleaved_fastq function to CRISPRessoShared

        * Implement splitting interleaved fastq files in CRISPRessoPooled

        * Suppress split_interleaved_input from CRISPRessoWGS parameters

        * Suppress other parameters in CRISPRessoWGS

        * Move where interleaved fastq files are split to be trimmed properly

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * On push no branches

        * On push no branches

        * All in one file

        * Fix yml errors

        * Rename jobs

        * Remove old workflow files

        * Remove paths

        * Run jobs in parallel

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Update C cythonized files

        * Add exact numbers to guardrails printouts

        * Remove extraneous whitespace from CRISPRessoCOREResources.pyx

        * Fix calculation of `total_mods` from being negative

        The issue was that `all_deletion_coordinates` just tells you how many deletions
        were present, but not how long the deletion is.

        * Changes to message

        * Remove old tag

        * Point tests at guardrails

        * Restore C2 pro check

        * Save message with guardrail name

        * Point tests repo at master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

        * Fix case sensitivity in Prime Editing mode (#54)

        * Move read filtering to after merging in CRISPResso (#39)

        * Move read filtering to after merging

        This is in an effort to be consistent with the behavior and results of
        CRISPRessoPooled.

        * Properly assign the correct file names for read filtering

        * Add space around operators

        * GitHub actions on pr (#51)

        * Run integration tests on pull_request

        * Run pytest on pull_request

        * Run pylint on pull_request

        * Run tests on PR only when opening PR (#53)

        * Update reports (#52)

        * Update report changes

        * Switch branch of integration test repo

        * Remove extraneous `crispresso_data_path`

        * Point integration tests back to master

        * Make all amplicons in amplicon_seq_arr uppercase

        This fixes https://github.com/pinellolab/CRISPResso2/issues/396

        * Allow RNA values to be provided for prime_editing_pegRNA_scaffold_seq

        * Fix 'Prime-edited' key not found (#32)

        * Move 'Prime-edited' amplicon name check

        By moving this, it will check if there is an amplicon named
        'Prime-edited' (which is a reserved name) even if the
        `prime_editing_pegRNA_extension_seq` parameter is empty.

        * Only search for scaffold integration when pegRNA extension seq is provided

        * Remove spaces at the end of lines

        * Docker size (#49)

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_nu…
mbowcut2 added a commit that referenced this issue Oct 14, 2024
* Fix CRISPRessoAggregate bug and other improvements (#95)

* D3-Enhancements (#78)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Fix typo and move flexiguide to debug (#77)

* Change flexiguide output to debug level

* Fix typo in fastp merged output file name

* Adding id tags for d3 script enhancements

* pointing to test branch

* Add amplicon_name parameter to allele heatmap and line plots

* Add function to extract quantification window regions from include_idxs

* Scale the quantification window according to the coordinates of the sgRNA plot

* added c2pro check, added space in args.json

* Correct the quantification window indexes for multiple guides

* Fix name of nucleotide conversion plot when guides are not the same

* Fix jinja variables that aren't found

* Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

* Remove unneeded variable and extra whitespace

* Switch test branch to master

---------





* Add amplicon_name to plot functions

* Add sgRNA sequences to nucleotide quilt parameters in Aggregate

* Add custom_colors to Aggregate plot functions

* Update Aggregate and make_aggregate_report to have logger and tool

* Write command_used to Aggregate .json info file

* Point to new test branch and add Aggregate run

* Make the order of Aggregate runs explicit

* Sort all instances of crispresso2_folder_info in Aggregate

* Sort df_summary_quantification df in Aggregate

* Try sorting with a list of single column

* Update to correct test branch

* Move to master test branch

---------





* Squashed commit of the following:

commit 6ec98a05ee70f85b5aa0ac15ab6094b7f1f20d08
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Tue Aug 13 16:44:39 2024 -0600

    dict key changes

commit 7cfd5acf06da4eb6f49453144ee1fed1e1488a7a
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Thu Aug 8 15:30:31 2024 -0600

    added C2PRO install check back

commit bfb0003329ea61b5c79c7e1df8d9a73ec5a508db
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Fri Aug 2 13:08:12 2024 -0600

    fixed key error conditionals

commit 84444e7480605206cb3efa4a0db675c55e717304
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Fri Aug 2 09:22:44 2024 -0600

    use local jinja_paritals file

commit 71dd12786fec6c4aba0170a3bfd9022b06f5eede
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Wed Jul 31 14:10:29 2024 -0600

    Squashed commit of the following:

    commit 5e3b30515c4bc437127e7fb21f53cb0bd511c4ca
    Author: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
    Date:   Mon Jul 22 09:31:44 2024 -0600

        D3-Enhancements (#78)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        * Provide sgRNA_sequences to plot_nucleotide_quilt plots

        * Passing sgRNA_sequences to plot

        * Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

        * Add max-height to Batch report samples

        * Change testing branch

        * Fix wrong check for large Batch plots

        * Fix typo and move flexiguide to debug (#77)

        * Change flexiguide output to debug level

        * Fix typo in fastp merged output file name

        * Adding id tags for d3 script enhancements

        * pointing to test branch

        * Add amplicon_name parameter to allele heatmap and line plots

        * Add function to extract quantification window regions from include_idxs

        * Scale the quantification window according to the coordinates of the sgRNA plot

        * added c2pro check, added space in args.json

        * Correct the quantification window indexes for multiple guides

        * Fix name of nucleotide conversion plot when guides are not the same

        * Fix jinja variables that aren't found

        * Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

        * Remove unneeded variable and extra whitespace

        * Switch test branch to master

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

    commit 09e5d9720ad21e44fc7916d71bde3fd7a9dfa7ef
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu Jul 18 14:31:54 2024 -0600

        Asymmetrical cut point (#457)

        * add cut_point_ind to plot_alleles_heatmap for asymmetrical plotting

        * Cole asymmetrical cut point (#453)

        * Pin versions of numpy and matplotlib in CI environment (#84) (#452)

        * Reduce duplication and implement cut_point_ind in plot_alleles_heatmap_hist

        ---------

        Co-authored-by: Cole Lyman <Cole@colelyman.com>

    commit 8d92972694ddff629dad844a6ad100459f69751d
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu Jul 18 14:29:40 2024 -0600

        Cole/update args (#85) (#456)

    commit 44f692ecabf5e2eb96ee0cfd7bae62343da7810c
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Mon Jul 15 16:17:29 2024 -0600

        Implement new pooled mixed-mode default behavior (#454)

        * changes for pooled mixed-mode default (#83)

        * changes for pooled mixed-mode default

        * deprecated old arg

        * added integration tests for mixed mode

        * fixed test target

        * updated test name

        * pinned numpy

        * Fix integration tests yml

        * pinning matplotlib

        * added print to CI tests

        * changed mixed mode info string

        * Remove pooled-mixed-mode-align-to-genome step from Github Actions

        * Update demultiplex_genome_wide parameter and help

        * Convert args.json to unix line endings

        * Add Pooled mixed mode demux run

        * Update the name of the argument in Pooled

        * Point integration tests back to master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Revert change to pooled mixed mode info statement (#86)

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit 79b482b55a0e8edbc03ec22bd2714bade1e90323
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Tue Jul 9 12:53:23 2024 -0600

        Pin versions of numpy and matplotlib in CI environment (#84) (#452)

    commit 80dc1bdd72d50f989717bfc5f8156bc3495c45f4
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu May 30 14:07:42 2024 -0600

        Add padding to image

    commit 381755daf0939aaf2745df0a802c809633aff47d
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu May 30 13:59:57 2024 -0600

        White background for schematic for dark mode

    commit d649db71e610bd8840fbb8d46fadb07789b67390
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Fri May 24 12:45:53 2024 -0600

        Fix typo and move flexiguide to debug (#77) (#438)

        * Change flexiguide output to debug level

        * Fix typo in fastp merged output file name

    commit 71181f50ef2b39015523b1a71d9fd1bf0dce14eb
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Mon May 13 13:34:00 2024 -0600

        Prefix the release Docker tag with a `v` (#434)

    commit d2c2be18a6bb64b0e742cc24c4665980a24324bc
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Mon May 13 09:41:32 2024 -0600

        Showing sgRNA sequences on hover in CRISPRessoPro (#432)

        * Passing sgRNA sequences to regular and Batch D3 plots (#73)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        * Provide sgRNA_sequences to plot_nucleotide_quilt plots

        * Passing sgRNA_sequences to plot

        * Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

        * Add max-height to Batch report samples

        * Change testing branch

        * Fix wrong check for large Batch plots

        * Update integration_tests.yml to point back at master

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Push new releases to ECR (#74)

        * Create aws_ecr.yml (#1)

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * us-east-1

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Fix d3 sgRNA sequences (#76)

        * Pass correct sgRNA_sequences to d3 plot

        * Pass correct sgRNA sequence to prime editor plot for d3

        * Resize plotly (#75)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        * Pass div id for plotly

        * Remove debug

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit 1c504274818b6b17fb60620d48fd92cb2e50566d
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu May 9 14:16:25 2024 -0600

        Fix plots and improve plot error handling (#431)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit acb2ea8e26dff4cd11f71301b344f81b1cec9040
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu May 2 13:49:33 2024 -0600

        Use recent docker image for CircleCI testing that includes updated pandas

    commit 38fd76dbd7ce2087468f9f454b548777de959a68
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Wed May 1 16:42:28 2024 -0600

        Cole/fix status file name (#69) (#430)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

    commit 3ec22e5fd09e432c9997d30e5f9ee51a2cc00d7b
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Wed May 1 13:08:11 2024 -0600

        Remove linked space in readme

    commit 340a4e16795a5e500411e11572ec267525985009
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Wed May 1 13:07:14 2024 -0600

        Fix batch mode pandas warning. (#70) (#429)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit 1bc9e906f0ded81f80761d1ec375ee50a4f882a9
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Fri Apr 26 16:26:27 2024 -0600

        Bump version to 2.3.1 and change default CRISPRessoPooled behavior to change in 2.3.2 (#428)

    commit 5638a1f6ffa973231f23422e9c757fa8cd4af7cc
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Wed Apr 24 18:00:43 2024 -0600

        Spelling fixes

    commit d6011f29db16d8fc1c1e7222457b7f9a1f671de6
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Wed Apr 24 09:33:53 2024 -0600

        Extract `jinja_partials` and fix CRISPRessoPooled fastp errors (#425)

        * Updated README (#64)

        * Updating README to fix argument, email, and formatting

        * removing superfluous files

        * Add link to CRISPRessoPro, move CRISPRessoPro section to end, and fix JSON formatting

        * Remove link to CRISPRessoPro

        * Replace Docker badge with link to tags

        * Add bullet points to Guardrails section and improve formatting

        * Fix typo and removed colons from guardrails

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Extract jinja_partials  (#65)

        * Extract jinja_partials code

        * Remove Plotly dependency from setup.py

        * Fix CRISPRessoPooled flash errors (#68)

        * Fix replacing flash intermediate files with fastp intermediate files

        This also moves where the files are added to `files_to_remove` up to
        near where they are created.

        * Update to run test branch with paired end Pooled test

        * Add pooled-paired-sim test to integration tests

        * Replace flash and trimmomatic with fastp and remove plotly from Github Actions environment

        * Change test branch back to master

        ---------

        Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>

    commit f4858a30c43374f54058b3ad9c1e965e1ab7fb46
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Tue Apr 23 17:00:28 2024 -0600

        Updated README (#64) (#424)

        * Updating README to fix argument, email, and formatting

        * removing superfluous files

        * Add link to CRISPRessoPro, move CRISPRessoPro section to end, and fix JSON formatting

        * Remove link to CRISPRessoPro

        * Replace Docker badge with link to tags

        * Add bullet points to Guardrails section and improve formatting

        * Fix typo and removed colons from guardrails

        ---------

        Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>

    commit c3dbff0fccd44b0b1a9c246dd2aa629ddc515787
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Mon Apr 22 11:24:59 2024 -0600

        Update CRISPRessoPooledCORE.py (#423)

        Fix bug in error reporting if duplicate names are present

    commit 20903c14877e5166b1b8a7b50b8fcab450ea3ca6
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu Apr 18 16:55:39 2024 -0600

        Remove extra imports from CRISPRessoCore (#67) (#422)

    commit 4aae57e5be475cd717792265bee36a71a99425de
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu Apr 18 10:00:19 2024 -0600

        Cole/refactor jinja undefined (#66) (#421)

        * Replace Jinja2 PackageLoader with FileSystemLoader

        The PackageLoader doesn't work with a fairly recent version of Jinja2 (3.0.1)
        and Python 3.9. Replacing with FileSystemLoader work with the older version and
        the latest version.

        * Fix undefined variable `amplicon_name` in report template

        * Refactor logging Jinja2 undefined variable warnings

        * Revert plot_11a update

        * Update intedration test branch

        * Update jinja to warn on undefined but not fail. Fix all undefined warnings

        * Fix github integration tests ref

        * One more undefined variable

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>

    commit 768c3c05bf1786a2a32e135b6e145cd6503c3db1
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Tue Apr 9 17:30:10 2024 -0600

        Fix Jinja2 undefined variables (#63) (#417)

        * Replace Jinja2 PackageLoader with FileSystemLoader

        The PackageLoader doesn't work with a fairly recent version of Jinja2 (3.0.1)
        and Python 3.9. Replacing with FileSystemLoader work with the older version and
        the latest version.

        * Fix undefined variable `amplicon_name` in report template

        * Revert plot_11a update

        * Update intedration test branch

        * Update branch for integration tests

    commit 7e18f08cc1ac5f247a0fd1bbb394ccd9b0a07c2e
    Author: Han Dai <github@daihan.me>
    Date:   Fri Apr 5 18:36:41 2024 -0400

        fix: change all U+00A0 to U+0020 (#400)

    commit 235dc29c0cd0fcca2e999148d4660acf00b07221
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Fri Apr 5 16:36:16 2024 -0600

        Fastp, args as data, guardrails, and PE fix (#415)

        * Change CRISPResso_status.txt format to JSON (#46)

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * add json read for status file

        * changed Formatter to json format

        * fixed json access variable name: message

        * changed  perentage_complete to numeric

        * changed status file to .json

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * New makefile commands

        * changed file to .json

        * changed status to json file

        * Make JSON human readable by adding new lines

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * Introduce pandas sorting in CRISPRessoCompare (#47)

        * New makefile commands

        * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

        * Extract out split_interleaved_fastq function to CRISPRessoShared

        * Implement splitting interleaved fastq files in CRISPRessoPooled

        * Suppress split_interleaved_input from CRISPRessoWGS parameters

        * Suppress other parameters in CRISPRessoWGS

        * Move where interleaved fastq files are split to be trimmed properly

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * On push no branches

        * On push no branches

        * All in one file

        * Fix yml errors

        * Rename jobs

        * Remove old workflow files

        * Remove paths

        * Run jobs in parallel

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Move read filtering to after merging in CRISPResso (#39)

        * Move read filtering to after merging

        This is in an effort to be consistent with the behavior and results of
        CRISPRessoPooled.

        * Properly assign the correct file names for read filtering

        * Add space around operators

        * GitHub actions on pr (#51)

        * Run integration tests on pull_request

        * Run pytest on pull_request

        * Run pylint on pull_request

        * Run tests on PR only when opening PR (#53)

        * Update reports (#52)

        * Update report changes

        * Switch branch of integration test repo

        * Remove extraneous `crispresso_data_path`

        * Point integration tests back to master

        * point to test branch

        * pointed CI config to testing branch

        * Update integration_tests.yml

        point to master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>
        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>

        * Trevor/fastp integration (#50)

        * Update check_program to check versions and create check_fastq function

        * Update fastq arg, implement fastp in get_most_frequent_reads

        * Bump version to 2.3.0

        * Deprecate Flash and Trimmomatic parameters, and update fastp params

        * Update guess_amplicons and guess_guides to remove max_paired_end_reads_overlap

        * Implement trimming of single end reads

        * Merge (and trim) reads in CRISPRessoCORE with fastp

        * Modify error handling to account for fastp errors

        * Replace flash and trimmomatic with fastp in Docker dependencies

        * Update LICENSE.txt with fastp info

        * Remove min and max amplicon length (no longer needed)

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Implement trimming with fastp in CRISPRessoPooled

        * Implemend merging (and trimming) with fastp in CRISPRessoPooled

        * Fixed minor fastp errors

        * Move read filtering to after merging in CRISPResso (#39)

        * Move read filtering to after merging

        This is in an effort to be consistent with the behavior and results of
        CRISPRessoPooled.

        * Properly assign the correct file names for read filtering

        * Add space around operators

        * GitHub actions on pr (#51)

        * Run integration tests on pull_request

        * Run pytest on pull_request

        * Run pylint on pull_request

        * Run tests on PR only when opening PR (#53)

        * Update reports (#52)

        * Update report changes

        * Switch branch of integration test repo

        * Remove extraneous `crispresso_data_path`

        * Point integration tests back to master

        * Update where the test point to

        * Fix 'Prime-edited' key not found (#32)

        * Move 'Prime-edited' amplicon name check

        By moving this, it will check if there is an amplicon named
        'Prime-edited' (which is a reserved name) even if the
        `prime_editing_pegRNA_extension_seq` parameter is empty.

        * Only search for scaffold integration when pegRNA extension seq is provided

        * Remove spaces at the end of lines

        * Docker size (#49)

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * Introduce pandas sorting in CRISPRessoCompare (#47)

        * New makefile commands

        * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

        * Extract out split_interleaved_fastq function to CRISPRessoShared

        * Implement splitting interleaved fastq files in CRISPRessoPooled

        * Suppress split_interleaved_input from CRISPRessoWGS parameters

        * Suppress other parameters in CRISPRessoWGS

        * Move where interleaved fastq files are split to be trimmed properly

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * On push no branches

        * On push no branches

        * All in one file

        * Fix yml errors

        * Rename jobs

        * Remove old workflow files

        * Remove paths

        * Run jobs in parallel

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * 3.4->2.08

        * Put ttf-mscorefonts-installer back above apt-get clean

        * restore slash, replace fastp with trimmomatic and flash, add autoremove step

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * initial readme modifications

        * Updated readme to remove deprecated commands, updated help text to reflect new version and fastp

        * Pointing test branch back at master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>

        * Guardrails clean history (#34)

        * Include guardrail functions

        * Add CRISPRessoReports subtree

        * Refactor to use CRISPRessoReports module

        * Include guardrail functions

        * Functional guardrails, needs reports update

        * Add guardrail partial

        * fix guardrials partial

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * Introduce pandas sorting in CRISPRessoCompare (#47)

        * New makefile commands

        * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

        * Extract out split_interleaved_fastq function to CRISPRessoShared

        * Implement splitting interleaved fastq files in CRISPRessoPooled

        * Suppress split_interleaved_input from CRISPRessoWGS parameters

        * Suppress other parameters in CRISPRessoWGS

        * Move where interleaved fastq files are split to be trimmed properly

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * On push no branches

        * On push no branches

        * All in one file

        * Fix yml errors

        * Rename jobs

        * Remove old workflow files

        * Remove paths

        * Run jobs in parallel

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Update C cythonized files

        * Add exact numbers to guardrails printouts

        * Remove extraneous whitespace from CRISPRessoCOREResources.pyx

        * Fix calculation of `total_mods` from being negative

        The issue was that `all_deletion_coordinates` just tells you how many deletions
        were present, but not how long the deletion is.

        * Changes to message

        * Remove old tag

        * Point tests at guardrails

        * Restore C2 pro check

        * Save message with guardrail name

        * Point tests repo at master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

        * Fix case sensitivity in Prime Editing mode (#54)

        * Move read filtering to after merging in CRISPResso (#39)

        * Move read filtering to after merging

        This is in an effort to be consistent with the behavior and results of
        CRISPRessoPooled.

        * Properly assign the correct file names for read filtering

        * Add space around operators

        * GitHub actions on pr (#51)

        * Run integration tests on pull_request

        * Run pytest on pull_request

        * Run pylint on pull_request

        * Run tests on PR only when opening PR (#53)

        * Update reports (#52)

        * Update report changes

        * Switch branch of integration test repo

        * Remove extraneous `crispresso_data_path`

        * Point integration tests back to master

        * Make all amplicons in amplicon_seq_arr uppercase

        This fixes https://github.com/pinellolab/CRISPResso2/issues/396

        * Allow RNA values to be provided for prime_editing_pegRNA_scaffold_seq

        * Fix 'Prime-edited' key not found (#32)

        * Move 'Prime-edited' amplicon name check

        By moving this, it will check if there is an amplicon named
        'Prime-edited' (which is a reserved name) even if the
        `prime_editing_pegRNA_extension_seq` parameter is empty.

        * Only search for scaffold integration when pegRNA extension seq is provided

        * Remove spaces at the end of lines

        * Docker size (#49)

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ----…

Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
mbowcut2 added a commit that referenced this issue Oct 14, 2024
* Fix CRISPRessoAggregate bug and other improvements (#95)

* D3-Enhancements (#78)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Fix typo and move flexiguide to debug (#77)

* Change flexiguide output to debug level

* Fix typo in fastp merged output file name

* Adding id tags for d3 script enhancements

* pointing to test branch

* Add amplicon_name parameter to allele heatmap and line plots

* Add function to extract quantification window regions from include_idxs

* Scale the quantification window according to the coordinates of the sgRNA plot

* added c2pro check, added space in args.json

* Correct the quantification window indexes for multiple guides

* Fix name of nucleotide conversion plot when guides are not the same

* Fix jinja variables that aren't found

* Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

* Remove unneeded variable and extra whitespace

* Switch test branch to master

---------





* Add amplicon_name to plot functions

* Add sgRNA sequences to nucleotide quilt parameters in Aggregate

* Add custom_colors to Aggregate plot functions

* Update Aggregate and make_aggregate_report to have logger and tool

* Write command_used to Aggregate .json info file

* Point to new test branch and add Aggregate run

* Make the order of Aggregate runs explicit

* Sort all instances of crispresso2_folder_info in Aggregate

* Sort df_summary_quantification df in Aggregate

* Try sorting with a list of single column

* Update to correct test branch

* Move to master test branch

---------





* Squashed commit of the following:

commit 6ec98a05ee70f85b5aa0ac15ab6094b7f1f20d08
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Tue Aug 13 16:44:39 2024 -0600

    dict key changes

commit 7cfd5acf06da4eb6f49453144ee1fed1e1488a7a
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Thu Aug 8 15:30:31 2024 -0600

    added C2PRO install check back

commit bfb0003329ea61b5c79c7e1df8d9a73ec5a508db
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Fri Aug 2 13:08:12 2024 -0600

    fixed key error conditionals

commit 84444e7480605206cb3efa4a0db675c55e717304
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Fri Aug 2 09:22:44 2024 -0600

    use local jinja_paritals file

commit 71dd12786fec6c4aba0170a3bfd9022b06f5eede
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Wed Jul 31 14:10:29 2024 -0600

    Squashed commit of the following:

    commit 5e3b30515c4bc437127e7fb21f53cb0bd511c4ca
    Author: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
    Date:   Mon Jul 22 09:31:44 2024 -0600

        D3-Enhancements (#78)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        * Provide sgRNA_sequences to plot_nucleotide_quilt plots

        * Passing sgRNA_sequences to plot

        * Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

        * Add max-height to Batch report samples

        * Change testing branch

        * Fix wrong check for large Batch plots

        * Fix typo and move flexiguide to debug (#77)

        * Change flexiguide output to debug level

        * Fix typo in fastp merged output file name

        * Adding id tags for d3 script enhancements

        * pointing to test branch

        * Add amplicon_name parameter to allele heatmap and line plots

        * Add function to extract quantification window regions from include_idxs

        * Scale the quantification window according to the coordinates of the sgRNA plot

        * added c2pro check, added space in args.json

        * Correct the quantification window indexes for multiple guides

        * Fix name of nucleotide conversion plot when guides are not the same

        * Fix jinja variables that aren't found

        * Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

        * Remove unneeded variable and extra whitespace

        * Switch test branch to master

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

    commit 09e5d9720ad21e44fc7916d71bde3fd7a9dfa7ef
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu Jul 18 14:31:54 2024 -0600

        Asymmetrical cut point (#457)

        * add cut_point_ind to plot_alleles_heatmap for asymmetrical plotting

        * Cole asymmetrical cut point (#453)

        * Pin versions of numpy and matplotlib in CI environment (#84) (#452)

        * Reduce duplication and implement cut_point_ind in plot_alleles_heatmap_hist

        ---------

        Co-authored-by: Cole Lyman <Cole@colelyman.com>

    commit 8d92972694ddff629dad844a6ad100459f69751d
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu Jul 18 14:29:40 2024 -0600

        Cole/update args (#85) (#456)

    commit 44f692ecabf5e2eb96ee0cfd7bae62343da7810c
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Mon Jul 15 16:17:29 2024 -0600

        Implement new pooled mixed-mode default behavior (#454)

        * changes for pooled mixed-mode default (#83)

        * changes for pooled mixed-mode default

        * deprecated old arg

        * added integration tests for mixed mode

        * fixed test target

        * updated test name

        * pinned numpy

        * Fix integration tests yml

        * pinning matplotlib

        * added print to CI tests

        * changed mixed mode info string

        * Remove pooled-mixed-mode-align-to-genome step from Github Actions

        * Update demultiplex_genome_wide parameter and help

        * Convert args.json to unix line endings

        * Add Pooled mixed mode demux run

        * Update the name of the argument in Pooled

        * Point integration tests back to master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Revert change to pooled mixed mode info statement (#86)

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit 79b482b55a0e8edbc03ec22bd2714bade1e90323
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Tue Jul 9 12:53:23 2024 -0600

        Pin versions of numpy and matplotlib in CI environment (#84) (#452)

    commit 80dc1bdd72d50f989717bfc5f8156bc3495c45f4
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu May 30 14:07:42 2024 -0600

        Add padding to image

    commit 381755daf0939aaf2745df0a802c809633aff47d
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu May 30 13:59:57 2024 -0600

        White background for schematic for dark mode

    commit d649db71e610bd8840fbb8d46fadb07789b67390
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Fri May 24 12:45:53 2024 -0600

        Fix typo and move flexiguide to debug (#77) (#438)

        * Change flexiguide output to debug level

        * Fix typo in fastp merged output file name

    commit 71181f50ef2b39015523b1a71d9fd1bf0dce14eb
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Mon May 13 13:34:00 2024 -0600

        Prefix the release Docker tag with a `v` (#434)

    commit d2c2be18a6bb64b0e742cc24c4665980a24324bc
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Mon May 13 09:41:32 2024 -0600

        Showing sgRNA sequences on hover in CRISPRessoPro (#432)

        * Passing sgRNA sequences to regular and Batch D3 plots (#73)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        * Provide sgRNA_sequences to plot_nucleotide_quilt plots

        * Passing sgRNA_sequences to plot

        * Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

        * Add max-height to Batch report samples

        * Change testing branch

        * Fix wrong check for large Batch plots

        * Update integration_tests.yml to point back at master

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Push new releases to ECR (#74)

        * Create aws_ecr.yml (#1)

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * us-east-1

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Fix d3 sgRNA sequences (#76)

        * Pass correct sgRNA_sequences to d3 plot

        * Pass correct sgRNA sequence to prime editor plot for d3

        * Resize plotly (#75)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        * Pass div id for plotly

        * Remove debug

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit 1c504274818b6b17fb60620d48fd92cb2e50566d
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu May 9 14:16:25 2024 -0600

        Fix plots and improve plot error handling (#431)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit acb2ea8e26dff4cd11f71301b344f81b1cec9040
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu May 2 13:49:33 2024 -0600

        Use recent docker image for CircleCI testing that includes updated pandas

    commit 38fd76dbd7ce2087468f9f454b548777de959a68
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Wed May 1 16:42:28 2024 -0600

        Cole/fix status file name (#69) (#430)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

    commit 3ec22e5fd09e432c9997d30e5f9ee51a2cc00d7b
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Wed May 1 13:08:11 2024 -0600

        Remove linked space in readme

    commit 340a4e16795a5e500411e11572ec267525985009
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Wed May 1 13:07:14 2024 -0600

        Fix batch mode pandas warning. (#70) (#429)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit 1bc9e906f0ded81f80761d1ec375ee50a4f882a9
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Fri Apr 26 16:26:27 2024 -0600

        Bump version to 2.3.1 and change default CRISPRessoPooled behavior to change in 2.3.2 (#428)

    commit 5638a1f6ffa973231f23422e9c757fa8cd4af7cc
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Wed Apr 24 18:00:43 2024 -0600

        Spelling fixes

    commit d6011f29db16d8fc1c1e7222457b7f9a1f671de6
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Wed Apr 24 09:33:53 2024 -0600

        Extract `jinja_partials` and fix CRISPRessoPooled fastp errors (#425)

        * Updated README (#64)

        * Updating README to fix argument, email, and formatting

        * removing superfluous files

        * Add link to CRISPRessoPro, move CRISPRessoPro section to end, and fix JSON formatting

        * Remove link to CRISPRessoPro

        * Replace Docker badge with link to tags

        * Add bullet points to Guardrails section and improve formatting

        * Fix typo and removed colons from guardrails

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Extract jinja_partials  (#65)

        * Extract jinja_partials code

        * Remove Plotly dependency from setup.py

        * Fix CRISPRessoPooled flash errors (#68)

        * Fix replacing flash intermediate files with fastp intermediate files

        This also moves where the files are added to `files_to_remove` up to
        near where they are created.

        * Update to run test branch with paired end Pooled test

        * Add pooled-paired-sim test to integration tests

        * Replace flash and trimmomatic with fastp and remove plotly from Github Actions environment

        * Change test branch back to master

        ---------

        Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>

    commit f4858a30c43374f54058b3ad9c1e965e1ab7fb46
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Tue Apr 23 17:00:28 2024 -0600

        Updated README (#64) (#424)

        * Updating README to fix argument, email, and formatting

        * removing superfluous files

        * Add link to CRISPRessoPro, move CRISPRessoPro section to end, and fix JSON formatting

        * Remove link to CRISPRessoPro

        * Replace Docker badge with link to tags

        * Add bullet points to Guardrails section and improve formatting

        * Fix typo and removed colons from guardrails

        ---------

        Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>

    commit c3dbff0fccd44b0b1a9c246dd2aa629ddc515787
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Mon Apr 22 11:24:59 2024 -0600

        Update CRISPRessoPooledCORE.py (#423)

        Fix bug in error reporting if duplicate names are present

    commit 20903c14877e5166b1b8a7b50b8fcab450ea3ca6
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu Apr 18 16:55:39 2024 -0600

        Remove extra imports from CRISPRessoCore (#67) (#422)

    commit 4aae57e5be475cd717792265bee36a71a99425de
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu Apr 18 10:00:19 2024 -0600

        Cole/refactor jinja undefined (#66) (#421)

        * Replace Jinja2 PackageLoader with FileSystemLoader

        The PackageLoader doesn't work with a fairly recent version of Jinja2 (3.0.1)
        and Python 3.9. Replacing with FileSystemLoader work with the older version and
        the latest version.

        * Fix undefined variable `amplicon_name` in report template

        * Refactor logging Jinja2 undefined variable warnings

        * Revert plot_11a update

        * Update intedration test branch

        * Update jinja to warn on undefined but not fail. Fix all undefined warnings

        * Fix github integration tests ref

        * One more undefined variable

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>

    commit 768c3c05bf1786a2a32e135b6e145cd6503c3db1
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Tue Apr 9 17:30:10 2024 -0600

        Fix Jinja2 undefined variables (#63) (#417)

        * Replace Jinja2 PackageLoader with FileSystemLoader

        The PackageLoader doesn't work with a fairly recent version of Jinja2 (3.0.1)
        and Python 3.9. Replacing with FileSystemLoader work with the older version and
        the latest version.

        * Fix undefined variable `amplicon_name` in report template

        * Revert plot_11a update

        * Update intedration test branch

        * Update branch for integration tests

    commit 7e18f08cc1ac5f247a0fd1bbb394ccd9b0a07c2e
    Author: Han Dai <github@daihan.me>
    Date:   Fri Apr 5 18:36:41 2024 -0400

        fix: change all U+00A0 to U+0020 (#400)

    commit 235dc29c0cd0fcca2e999148d4660acf00b07221
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Fri Apr 5 16:36:16 2024 -0600

        Fastp, args as data, guardrails, and PE fix (#415)

        * Change CRISPResso_status.txt format to JSON (#46)

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * add json read for status file

        * changed Formatter to json format

        * fixed json access variable name: message

        * changed  perentage_complete to numeric

        * changed status file to .json

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * New makefile commands

        * changed file to .json

        * changed status to json file

        * Make JSON human readable by adding new lines

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * Introduce pandas sorting in CRISPRessoCompare (#47)

        * New makefile commands

        * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

        * Extract out split_interleaved_fastq function to CRISPRessoShared

        * Implement splitting interleaved fastq files in CRISPRessoPooled

        * Suppress split_interleaved_input from CRISPRessoWGS parameters

        * Suppress other parameters in CRISPRessoWGS

        * Move where interleaved fastq files are split to be trimmed properly

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * On push no branches

        * On push no branches

        * All in one file

        * Fix yml errors

        * Rename jobs

        * Remove old workflow files

        * Remove paths

        * Run jobs in parallel

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Move read filtering to after merging in CRISPResso (#39)

        * Move read filtering to after merging

        This is in an effort to be consistent with the behavior and results of
        CRISPRessoPooled.

        * Properly assign the correct file names for read filtering

        * Add space around operators

        * GitHub actions on pr (#51)

        * Run integration tests on pull_request

        * Run pytest on pull_request

        * Run pylint on pull_request

        * Run tests on PR only when opening PR (#53)

        * Update reports (#52)

        * Update report changes

        * Switch branch of integration test repo

        * Remove extraneous `crispresso_data_path`

        * Point integration tests back to master

        * point to test branch

        * pointed CI config to testing branch

        * Update integration_tests.yml

        point to master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>
        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>

        * Trevor/fastp integration (#50)

        * Update check_program to check versions and create check_fastq function

        * Update fastq arg, implement fastp in get_most_frequent_reads

        * Bump version to 2.3.0

        * Deprecate Flash and Trimmomatic parameters, and update fastp params

        * Update guess_amplicons and guess_guides to remove max_paired_end_reads_overlap

        * Implement trimming of single end reads

        * Merge (and trim) reads in CRISPRessoCORE with fastp

        * Modify error handling to account for fastp errors

        * Replace flash and trimmomatic with fastp in Docker dependencies

        * Update LICENSE.txt with fastp info

        * Remove min and max amplicon length (no longer needed)

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Implement trimming with fastp in CRISPRessoPooled

        * Implemend merging (and trimming) with fastp in CRISPRessoPooled

        * Fixed minor fastp errors

        * Move read filtering to after merging in CRISPResso (#39)

        * Move read filtering to after merging

        This is in an effort to be consistent with the behavior and results of
        CRISPRessoPooled.

        * Properly assign the correct file names for read filtering

        * Add space around operators

        * GitHub actions on pr (#51)

        * Run integration tests on pull_request

        * Run pytest on pull_request

        * Run pylint on pull_request

        * Run tests on PR only when opening PR (#53)

        * Update reports (#52)

        * Update report changes

        * Switch branch of integration test repo

        * Remove extraneous `crispresso_data_path`

        * Point integration tests back to master

        * Update where the test point to

        * Fix 'Prime-edited' key not found (#32)

        * Move 'Prime-edited' amplicon name check

        By moving this, it will check if there is an amplicon named
        'Prime-edited' (which is a reserved name) even if the
        `prime_editing_pegRNA_extension_seq` parameter is empty.

        * Only search for scaffold integration when pegRNA extension seq is provided

        * Remove spaces at the end of lines

        * Docker size (#49)

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * Introduce pandas sorting in CRISPRessoCompare (#47)

        * New makefile commands

        * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

        * Extract out split_interleaved_fastq function to CRISPRessoShared

        * Implement splitting interleaved fastq files in CRISPRessoPooled

        * Suppress split_interleaved_input from CRISPRessoWGS parameters

        * Suppress other parameters in CRISPRessoWGS

        * Move where interleaved fastq files are split to be trimmed properly

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * On push no branches

        * On push no branches

        * All in one file

        * Fix yml errors

        * Rename jobs

        * Remove old workflow files

        * Remove paths

        * Run jobs in parallel

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * 3.4->2.08

        * Put ttf-mscorefonts-installer back above apt-get clean

        * restore slash, replace fastp with trimmomatic and flash, add autoremove step

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * initial readme modifications

        * Updated readme to remove deprecated commands, updated help text to reflect new version and fastp

        * Pointing test branch back at master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>

        * Guardrails clean history (#34)

        * Include guardrail functions

        * Add CRISPRessoReports subtree

        * Refactor to use CRISPRessoReports module

        * Include guardrail functions

        * Functional guardrails, needs reports update

        * Add guardrail partial

        * fix guardrials partial

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * Introduce pandas sorting in CRISPRessoCompare (#47)

        * New makefile commands

        * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

        * Extract out split_interleaved_fastq function to CRISPRessoShared

        * Implement splitting interleaved fastq files in CRISPRessoPooled

        * Suppress split_interleaved_input from CRISPRessoWGS parameters

        * Suppress other parameters in CRISPRessoWGS

        * Move where interleaved fastq files are split to be trimmed properly

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * On push no branches

        * On push no branches

        * All in one file

        * Fix yml errors

        * Rename jobs

        * Remove old workflow files

        * Remove paths

        * Run jobs in parallel

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Update C cythonized files

        * Add exact numbers to guardrails printouts

        * Remove extraneous whitespace from CRISPRessoCOREResources.pyx

        * Fix calculation of `total_mods` from being negative

        The issue was that `all_deletion_coordinates` just tells you how many deletions
        were present, but not how long the deletion is.

        * Changes to message

        * Remove old tag

        * Point tests at guardrails

        * Restore C2 pro check

        * Save message with guardrail name

        * Point tests repo at master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

        * Fix case sensitivity in Prime Editing mode (#54)

        * Move read filtering to after merging in CRISPResso (#39)

        * Move read filtering to after merging

        This is in an effort to be consistent with the behavior and results of
        CRISPRessoPooled.

        * Properly assign the correct file names for read filtering

        * Add space around operators

        * GitHub actions on pr (#51)

        * Run integration tests on pull_request

        * Run pytest on pull_request

        * Run pylint on pull_request

        * Run tests on PR only when opening PR (#53)

        * Update reports (#52)

        * Update report changes

        * Switch branch of integration test repo

        * Remove extraneous `crispresso_data_path`

        * Point integration tests back to master

        * Make all amplicons in amplicon_seq_arr uppercase

        This fixes https://github.com/pinellolab/CRISPResso2/issues/396

        * Allow RNA values to be provided for prime_editing_pegRNA_scaffold_seq

        * Fix 'Prime-edited' key not found (#32)

        * Move 'Prime-edited' amplicon name check

        By moving this, it will check if there is an amplicon named
        'Prime-edited' (which is a reserved name) even if the
        `prime_editing_pegRNA_extension_seq` parameter is empty.

        * Only search for scaffold integration when pegRNA extension seq is provided

        * Remove spaces at the end of lines

        * Docker size (#49)

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ----…

Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
Colelyman added a commit that referenced this issue Oct 17, 2024
* Mckay/c2pro reports test (#99)

* Fix CRISPRessoAggregate bug and other improvements (#95)

* D3-Enhancements (#78)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Fix typo and move flexiguide to debug (#77)

* Change flexiguide output to debug level

* Fix typo in fastp merged output file name

* Adding id tags for d3 script enhancements

* pointing to test branch

* Add amplicon_name parameter to allele heatmap and line plots

* Add function to extract quantification window regions from include_idxs

* Scale the quantification window according to the coordinates of the sgRNA plot

* added c2pro check, added space in args.json

* Correct the quantification window indexes for multiple guides

* Fix name of nucleotide conversion plot when guides are not the same

* Fix jinja variables that aren't found

* Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

* Remove unneeded variable and extra whitespace

* Switch test branch to master

---------





* Add amplicon_name to plot functions

* Add sgRNA sequences to nucleotide quilt parameters in Aggregate

* Add custom_colors to Aggregate plot functions

* Update Aggregate and make_aggregate_report to have logger and tool

* Write command_used to Aggregate .json info file

* Point to new test branch and add Aggregate run

* Make the order of Aggregate runs explicit

* Sort all instances of crispresso2_folder_info in Aggregate

* Sort df_summary_quantification df in Aggregate

* Try sorting with a list of single column

* Update to correct test branch

* Move to master test branch

---------





* Squashed commit of the following:

commit 6ec98a05ee70f85b5aa0ac15ab6094b7f1f20d08
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Tue Aug 13 16:44:39 2024 -0600

    dict key changes

commit 7cfd5acf06da4eb6f49453144ee1fed1e1488a7a
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Thu Aug 8 15:30:31 2024 -0600

    added C2PRO install check back

commit bfb0003329ea61b5c79c7e1df8d9a73ec5a508db
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Fri Aug 2 13:08:12 2024 -0600

    fixed key error conditionals

commit 84444e7480605206cb3efa4a0db675c55e717304
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Fri Aug 2 09:22:44 2024 -0600

    use local jinja_paritals file

commit 71dd12786fec6c4aba0170a3bfd9022b06f5eede
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Wed Jul 31 14:10:29 2024 -0600

    Squashed commit of the following:

    commit 5e3b30515c4bc437127e7fb21f53cb0bd511c4ca
    Author: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
    Date:   Mon Jul 22 09:31:44 2024 -0600

        D3-Enhancements (#78)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        * Provide sgRNA_sequences to plot_nucleotide_quilt plots

        * Passing sgRNA_sequences to plot

        * Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

        * Add max-height to Batch report samples

        * Change testing branch

        * Fix wrong check for large Batch plots

        * Fix typo and move flexiguide to debug (#77)

        * Change flexiguide output to debug level

        * Fix typo in fastp merged output file name

        * Adding id tags for d3 script enhancements

        * pointing to test branch

        * Add amplicon_name parameter to allele heatmap and line plots

        * Add function to extract quantification window regions from include_idxs

        * Scale the quantification window according to the coordinates of the sgRNA plot

        * added c2pro check, added space in args.json

        * Correct the quantification window indexes for multiple guides

        * Fix name of nucleotide conversion plot when guides are not the same

        * Fix jinja variables that aren't found

        * Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

        * Remove unneeded variable and extra whitespace

        * Switch test branch to master

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

    commit 09e5d9720ad21e44fc7916d71bde3fd7a9dfa7ef
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu Jul 18 14:31:54 2024 -0600

        Asymmetrical cut point (#457)

        * add cut_point_ind to plot_alleles_heatmap for asymmetrical plotting

        * Cole asymmetrical cut point (#453)

        * Pin versions of numpy and matplotlib in CI environment (#84) (#452)

        * Reduce duplication and implement cut_point_ind in plot_alleles_heatmap_hist

        ---------

        Co-authored-by: Cole Lyman <Cole@colelyman.com>

    commit 8d92972694ddff629dad844a6ad100459f69751d
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu Jul 18 14:29:40 2024 -0600

        Cole/update args (#85) (#456)

    commit 44f692ecabf5e2eb96ee0cfd7bae62343da7810c
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Mon Jul 15 16:17:29 2024 -0600

        Implement new pooled mixed-mode default behavior (#454)

        * changes for pooled mixed-mode default (#83)

        * changes for pooled mixed-mode default

        * deprecated old arg

        * added integration tests for mixed mode

        * fixed test target

        * updated test name

        * pinned numpy

        * Fix integration tests yml

        * pinning matplotlib

        * added print to CI tests

        * changed mixed mode info string

        * Remove pooled-mixed-mode-align-to-genome step from Github Actions

        * Update demultiplex_genome_wide parameter and help

        * Convert args.json to unix line endings

        * Add Pooled mixed mode demux run

        * Update the name of the argument in Pooled

        * Point integration tests back to master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Revert change to pooled mixed mode info statement (#86)

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit 79b482b55a0e8edbc03ec22bd2714bade1e90323
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Tue Jul 9 12:53:23 2024 -0600

        Pin versions of numpy and matplotlib in CI environment (#84) (#452)

    commit 80dc1bdd72d50f989717bfc5f8156bc3495c45f4
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu May 30 14:07:42 2024 -0600

        Add padding to image

    commit 381755daf0939aaf2745df0a802c809633aff47d
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu May 30 13:59:57 2024 -0600

        White background for schematic for dark mode

    commit d649db71e610bd8840fbb8d46fadb07789b67390
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Fri May 24 12:45:53 2024 -0600

        Fix typo and move flexiguide to debug (#77) (#438)

        * Change flexiguide output to debug level

        * Fix typo in fastp merged output file name

    commit 71181f50ef2b39015523b1a71d9fd1bf0dce14eb
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Mon May 13 13:34:00 2024 -0600

        Prefix the release Docker tag with a `v` (#434)

    commit d2c2be18a6bb64b0e742cc24c4665980a24324bc
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Mon May 13 09:41:32 2024 -0600

        Showing sgRNA sequences on hover in CRISPRessoPro (#432)

        * Passing sgRNA sequences to regular and Batch D3 plots (#73)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        * Provide sgRNA_sequences to plot_nucleotide_quilt plots

        * Passing sgRNA_sequences to plot

        * Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

        * Add max-height to Batch report samples

        * Change testing branch

        * Fix wrong check for large Batch plots

        * Update integration_tests.yml to point back at master

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Push new releases to ECR (#74)

        * Create aws_ecr.yml (#1)

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * us-east-1

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Fix d3 sgRNA sequences (#76)

        * Pass correct sgRNA_sequences to d3 plot

        * Pass correct sgRNA sequence to prime editor plot for d3

        * Resize plotly (#75)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        * Pass div id for plotly

        * Remove debug

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit 1c504274818b6b17fb60620d48fd92cb2e50566d
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu May 9 14:16:25 2024 -0600

        Fix plots and improve plot error handling (#431)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit acb2ea8e26dff4cd11f71301b344f81b1cec9040
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu May 2 13:49:33 2024 -0600

        Use recent docker image for CircleCI testing that includes updated pandas

    commit 38fd76dbd7ce2087468f9f454b548777de959a68
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Wed May 1 16:42:28 2024 -0600

        Cole/fix status file name (#69) (#430)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

    commit 3ec22e5fd09e432c9997d30e5f9ee51a2cc00d7b
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Wed May 1 13:08:11 2024 -0600

        Remove linked space in readme

    commit 340a4e16795a5e500411e11572ec267525985009
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Wed May 1 13:07:14 2024 -0600

        Fix batch mode pandas warning. (#70) (#429)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit 1bc9e906f0ded81f80761d1ec375ee50a4f882a9
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Fri Apr 26 16:26:27 2024 -0600

        Bump version to 2.3.1 and change default CRISPRessoPooled behavior to change in 2.3.2 (#428)

    commit 5638a1f6ffa973231f23422e9c757fa8cd4af7cc
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Wed Apr 24 18:00:43 2024 -0600

        Spelling fixes

    commit d6011f29db16d8fc1c1e7222457b7f9a1f671de6
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Wed Apr 24 09:33:53 2024 -0600

        Extract `jinja_partials` and fix CRISPRessoPooled fastp errors (#425)

        * Updated README (#64)

        * Updating README to fix argument, email, and formatting

        * removing superfluous files

        * Add link to CRISPRessoPro, move CRISPRessoPro section to end, and fix JSON formatting

        * Remove link to CRISPRessoPro

        * Replace Docker badge with link to tags

        * Add bullet points to Guardrails section and improve formatting

        * Fix typo and removed colons from guardrails

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Extract jinja_partials  (#65)

        * Extract jinja_partials code

        * Remove Plotly dependency from setup.py

        * Fix CRISPRessoPooled flash errors (#68)

        * Fix replacing flash intermediate files with fastp intermediate files

        This also moves where the files are added to `files_to_remove` up to
        near where they are created.

        * Update to run test branch with paired end Pooled test

        * Add pooled-paired-sim test to integration tests

        * Replace flash and trimmomatic with fastp and remove plotly from Github Actions environment

        * Change test branch back to master

        ---------

        Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>

    commit f4858a30c43374f54058b3ad9c1e965e1ab7fb46
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Tue Apr 23 17:00:28 2024 -0600

        Updated README (#64) (#424)

        * Updating README to fix argument, email, and formatting

        * removing superfluous files

        * Add link to CRISPRessoPro, move CRISPRessoPro section to end, and fix JSON formatting

        * Remove link to CRISPRessoPro

        * Replace Docker badge with link to tags

        * Add bullet points to Guardrails section and improve formatting

        * Fix typo and removed colons from guardrails

        ---------

        Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>

    commit c3dbff0fccd44b0b1a9c246dd2aa629ddc515787
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Mon Apr 22 11:24:59 2024 -0600

        Update CRISPRessoPooledCORE.py (#423)

        Fix bug in error reporting if duplicate names are present

    commit 20903c14877e5166b1b8a7b50b8fcab450ea3ca6
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu Apr 18 16:55:39 2024 -0600

        Remove extra imports from CRISPRessoCore (#67) (#422)

    commit 4aae57e5be475cd717792265bee36a71a99425de
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu Apr 18 10:00:19 2024 -0600

        Cole/refactor jinja undefined (#66) (#421)

        * Replace Jinja2 PackageLoader with FileSystemLoader

        The PackageLoader doesn't work with a fairly recent version of Jinja2 (3.0.1)
        and Python 3.9. Replacing with FileSystemLoader work with the older version and
        the latest version.

        * Fix undefined variable `amplicon_name` in report template

        * Refactor logging Jinja2 undefined variable warnings

        * Revert plot_11a update

        * Update intedration test branch

        * Update jinja to warn on undefined but not fail. Fix all undefined warnings

        * Fix github integration tests ref

        * One more undefined variable

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>

    commit 768c3c05bf1786a2a32e135b6e145cd6503c3db1
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Tue Apr 9 17:30:10 2024 -0600

        Fix Jinja2 undefined variables (#63) (#417)

        * Replace Jinja2 PackageLoader with FileSystemLoader

        The PackageLoader doesn't work with a fairly recent version of Jinja2 (3.0.1)
        and Python 3.9. Replacing with FileSystemLoader work with the older version and
        the latest version.

        * Fix undefined variable `amplicon_name` in report template

        * Revert plot_11a update

        * Update intedration test branch

        * Update branch for integration tests

    commit 7e18f08cc1ac5f247a0fd1bbb394ccd9b0a07c2e
    Author: Han Dai <github@daihan.me>
    Date:   Fri Apr 5 18:36:41 2024 -0400

        fix: change all U+00A0 to U+0020 (#400)

    commit 235dc29c0cd0fcca2e999148d4660acf00b07221
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Fri Apr 5 16:36:16 2024 -0600

        Fastp, args as data, guardrails, and PE fix (#415)

        * Change CRISPResso_status.txt format to JSON (#46)

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * add json read for status file

        * changed Formatter to json format

        * fixed json access variable name: message

        * changed  perentage_complete to numeric

        * changed status file to .json

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * New makefile commands

        * changed file to .json

        * changed status to json file

        * Make JSON human readable by adding new lines

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * Introduce pandas sorting in CRISPRessoCompare (#47)

        * New makefile commands

        * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

        * Extract out split_interleaved_fastq function to CRISPRessoShared

        * Implement splitting interleaved fastq files in CRISPRessoPooled

        * Suppress split_interleaved_input from CRISPRessoWGS parameters

        * Suppress other parameters in CRISPRessoWGS

        * Move where interleaved fastq files are split to be trimmed properly

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * On push no branches

        * On push no branches

        * All in one file

        * Fix yml errors

        * Rename jobs

        * Remove old workflow files

        * Remove paths

        * Run jobs in parallel

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Move read filtering to after merging in CRISPResso (#39)

        * Move read filtering to after merging

        This is in an effort to be consistent with the behavior and results of
        CRISPRessoPooled.

        * Properly assign the correct file names for read filtering

        * Add space around operators

        * GitHub actions on pr (#51)

        * Run integration tests on pull_request

        * Run pytest on pull_request

        * Run pylint on pull_request

        * Run tests on PR only when opening PR (#53)

        * Update reports (#52)

        * Update report changes

        * Switch branch of integration test repo

        * Remove extraneous `crispresso_data_path`

        * Point integration tests back to master

        * point to test branch

        * pointed CI config to testing branch

        * Update integration_tests.yml

        point to master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>
        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>

        * Trevor/fastp integration (#50)

        * Update check_program to check versions and create check_fastq function

        * Update fastq arg, implement fastp in get_most_frequent_reads

        * Bump version to 2.3.0

        * Deprecate Flash and Trimmomatic parameters, and update fastp params

        * Update guess_amplicons and guess_guides to remove max_paired_end_reads_overlap

        * Implement trimming of single end reads

        * Merge (and trim) reads in CRISPRessoCORE with fastp

        * Modify error handling to account for fastp errors

        * Replace flash and trimmomatic with fastp in Docker dependencies

        * Update LICENSE.txt with fastp info

        * Remove min and max amplicon length (no longer needed)

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Implement trimming with fastp in CRISPRessoPooled

        * Implemend merging (and trimming) with fastp in CRISPRessoPooled

        * Fixed minor fastp errors

        * Move read filtering to after merging in CRISPResso (#39)

        * Move read filtering to after merging

        This is in an effort to be consistent with the behavior and results of
        CRISPRessoPooled.

        * Properly assign the correct file names for read filtering

        * Add space around operators

        * GitHub actions on pr (#51)

        * Run integration tests on pull_request

        * Run pytest on pull_request

        * Run pylint on pull_request

        * Run tests on PR only when opening PR (#53)

        * Update reports (#52)

        * Update report changes

        * Switch branch of integration test repo

        * Remove extraneous `crispresso_data_path`

        * Point integration tests back to master

        * Update where the test point to

        * Fix 'Prime-edited' key not found (#32)

        * Move 'Prime-edited' amplicon name check

        By moving this, it will check if there is an amplicon named
        'Prime-edited' (which is a reserved name) even if the
        `prime_editing_pegRNA_extension_seq` parameter is empty.

        * Only search for scaffold integration when pegRNA extension seq is provided

        * Remove spaces at the end of lines

        * Docker size (#49)

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * Introduce pandas sorting in CRISPRessoCompare (#47)

        * New makefile commands

        * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

        * Extract out split_interleaved_fastq function to CRISPRessoShared

        * Implement splitting interleaved fastq files in CRISPRessoPooled

        * Suppress split_interleaved_input from CRISPRessoWGS parameters

        * Suppress other parameters in CRISPRessoWGS

        * Move where interleaved fastq files are split to be trimmed properly

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * On push no branches

        * On push no branches

        * All in one file

        * Fix yml errors

        * Rename jobs

        * Remove old workflow files

        * Remove paths

        * Run jobs in parallel

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * 3.4->2.08

        * Put ttf-mscorefonts-installer back above apt-get clean

        * restore slash, replace fastp with trimmomatic and flash, add autoremove step

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * initial readme modifications

        * Updated readme to remove deprecated commands, updated help text to reflect new version and fastp

        * Pointing test branch back at master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>

        * Guardrails clean history (#34)

        * Include guardrail functions

        * Add CRISPRessoReports subtree

        * Refactor to use CRISPRessoReports module

        * Include guardrail functions

        * Functional guardrails, needs reports update

        * Add guardrail partial

        * fix guardrials partial

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * Introduce pandas sorting in CRISPRessoCompare (#47)

        * New makefile commands

        * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

        * Extract out split_interleaved_fastq function to CRISPRessoShared

        * Implement splitting interleaved fastq files in CRISPRessoPooled

        * Suppress split_interleaved_input from CRISPRessoWGS parameters

        * Suppress other parameters in CRISPRessoWGS

        * Move where interleaved fastq files are split to be trimmed properly

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * On push no branches

        * On push no branches

        * All in one file

        * Fix yml errors

        * Rename jobs

        * Remove old workflow files

        * Remove paths

        * Run jobs in parallel

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Update C cythonized files

        * Add exact numbers to guardrails printouts

        * Remove extraneous whitespace from CRISPRessoCOREResources.pyx

        * Fix calculation of `total_mods` from being negative

        The issue was that `all_deletion_coordinates` just tells you how many deletions
        were present, but not how long the deletion is.

        * Changes to message

        * Remove old tag

        * Point tests at guardrails

        * Restore C2 pro check

        * Save message with guardrail name

        * Point tests repo at master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

        * Fix case sensitivity in Prime Editing mode (#54)

        * Move read filtering to after merging in CRISPResso (#39)

        * Move read filtering to after merging

        This is in an effort to be consistent with the behavior and results of
        CRISPRessoPooled.

        * Properly assign the correct file names for read filtering

        * Add space around operators

        * GitHub actions on pr (#51)

        * Run integration tests on pull_request

        * Run pytest on pull_request

        * Run pylint on pull_request

        * Run tests on PR only when opening PR (#53)

        * Update reports (#52)

        * Update report changes

        * Switch branch of integration test repo

        * Remove extraneous `crispresso_data_path`

        * Point integration tests back to master

        * Make all amplicons in amplicon_seq_arr uppercase

        This fixes https://github.com/pinellolab/CRISPResso2/issues/396

        * Allow RNA values to be provided for prime_editing_pegRNA_scaffold_seq

        * Fix 'Prime-edited' key not found (#32)

        * Move 'Prime-edited' amplicon name check

        By moving this, it will check if there is an amplicon named
        'Prime-edited' (which is a reserved name) even if the
        `prime_editing_pegRNA_extension_seq` parameter is empty.

        * Only search for scaffold integration when pegRNA extension seq is provided

        * Remove spaces at the end of lines

        * Docker size (#49)

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_nu…

Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
mbowcut2 added a commit that referenced this issue Nov 8, 2024
* Passing sgRNA sequences to regular and Batch D3 plots (#73)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------

Co-authored-by: Cole Lyman <cole@colelyman.com>

* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------

Co-authored-by: Sam <snic9004@gmail.com>

* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------

Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Cole Lyman <cole@colelyman.com>

* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------

Co-authored-by: Cole Lyman <cole@colelyman.com>

* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------

Co-authored-by: Sam <snic9004@gmail.com>

* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------

Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Cole Lyman <cole@colelyman.com>

* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Update integration_tests.yml to point back at master

---------

Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Cole Lyman <cole@colelyman.com>

* Push new releases to ECR (#74)

* Create aws_ecr.yml (#1)

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* us-east-1

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Fix d3 sgRNA sequences (#76)

* Pass correct sgRNA_sequences to d3 plot

* Pass correct sgRNA sequence to prime editor plot for d3

* Resize plotly (#75)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------

Co-authored-by: Cole Lyman <cole@colelyman.com>

* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------

Co-authored-by: Sam <snic9004@gmail.com>

* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------

Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Cole Lyman <cole@colelyman.com>

* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------

Co-authored-by: Cole Lyman <cole@colelyman.com>

* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------

Co-authored-by: Sam <snic9004@gmail.com>

* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------

Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Cole Lyman <cole@colelyman.com>

* Remove token from integration tests file

* Pass div id for plotly

* Remove debug

---------

Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Cole Lyman <cole@colelyman.com>

---------

Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
mbowcut2 added a commit that referenced this issue Nov 8, 2024
* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Passing sgRNA sequences to regular and Batch D3 plots (#73)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Update integration_tests.yml to point back at master

---------





* Push new releases to ECR (#74)

* Create aws_ecr.yml (#1)

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* us-east-1

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Pass div id for plotly

* Remove debug

* Don't use thread pool with 1 process

* Fix logger issue

* Catchup

* Remove extra print statements

* Restrict generation of multiprocessing pool to when n_processes > 1

* Switch test branch to version bump

* Fix variable name error

* Change test branch back to master

* Fix CRISPRessoAggregate bug and other improvements (#95)

* D3-Enhancements (#78)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Fix typo and move flexiguide to debug (#77)

* Change flexiguide output to debug level

* Fix typo in fastp merged output file name

* Adding id tags for d3 script enhancements

* pointing to test branch

* Add amplicon_name parameter to allele heatmap and line plots

* Add function to extract quantification window regions from include_idxs

* Scale the quantification window according to the coordinates of the sgRNA plot

* added c2pro check, added space in args.json

* Correct the quantification window indexes for multiple guides

* Fix name of nucleotide conversion plot when guides are not the same

* Fix jinja variables that aren't found

* Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

* Remove unneeded variable and extra whitespace

* Switch test branch to master

---------





* Add amplicon_name to plot functions

* Add sgRNA sequences to nucleotide quilt parameters in Aggregate

* Add custom_colors to Aggregate plot functions

* Update Aggregate and make_aggregate_report to have logger and tool

* Write command_used to Aggregate .json info file

* Point to new test branch and add Aggregate run

* Make the order of Aggregate runs explicit

* Sort all instances of crispresso2_folder_info in Aggregate

* Sort df_summary_quantification df in Aggregate

* Try sorting with a list of single column

* Update to correct test branch

* Move to master test branch

---------





---------

Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
mbowcut2 added a commit that referenced this issue Nov 8, 2024
* Fix CRISPRessoAggregate bug and other improvements (#95)

* D3-Enhancements (#78)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Fix typo and move flexiguide to debug (#77)

* Change flexiguide output to debug level

* Fix typo in fastp merged output file name

* Adding id tags for d3 script enhancements

* pointing to test branch

* Add amplicon_name parameter to allele heatmap and line plots

* Add function to extract quantification window regions from include_idxs

* Scale the quantification window according to the coordinates of the sgRNA plot

* added c2pro check, added space in args.json

* Correct the quantification window indexes for multiple guides

* Fix name of nucleotide conversion plot when guides are not the same

* Fix jinja variables that aren't found

* Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

* Remove unneeded variable and extra whitespace

* Switch test branch to master

---------





* Add amplicon_name to plot functions

* Add sgRNA sequences to nucleotide quilt parameters in Aggregate

* Add custom_colors to Aggregate plot functions

* Update Aggregate and make_aggregate_report to have logger and tool

* Write command_used to Aggregate .json info file

* Point to new test branch and add Aggregate run

* Make the order of Aggregate runs explicit

* Sort all instances of crispresso2_folder_info in Aggregate

* Sort df_summary_quantification df in Aggregate

* Try sorting with a list of single column

* Update to correct test branch

* Move to master test branch

---------





* Squashed commit of the following:

commit 6ec98a05ee70f85b5aa0ac15ab6094b7f1f20d08
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Tue Aug 13 16:44:39 2024 -0600

    dict key changes

commit 7cfd5acf06da4eb6f49453144ee1fed1e1488a7a
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Thu Aug 8 15:30:31 2024 -0600

    added C2PRO install check back

commit bfb0003329ea61b5c79c7e1df8d9a73ec5a508db
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Fri Aug 2 13:08:12 2024 -0600

    fixed key error conditionals

commit 84444e7480605206cb3efa4a0db675c55e717304
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Fri Aug 2 09:22:44 2024 -0600

    use local jinja_paritals file

commit 71dd12786fec6c4aba0170a3bfd9022b06f5eede
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Wed Jul 31 14:10:29 2024 -0600

    Squashed commit of the following:

    commit 5e3b30515c4bc437127e7fb21f53cb0bd511c4ca
    Author: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
    Date:   Mon Jul 22 09:31:44 2024 -0600

        D3-Enhancements (#78)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        * Provide sgRNA_sequences to plot_nucleotide_quilt plots

        * Passing sgRNA_sequences to plot

        * Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

        * Add max-height to Batch report samples

        * Change testing branch

        * Fix wrong check for large Batch plots

        * Fix typo and move flexiguide to debug (#77)

        * Change flexiguide output to debug level

        * Fix typo in fastp merged output file name

        * Adding id tags for d3 script enhancements

        * pointing to test branch

        * Add amplicon_name parameter to allele heatmap and line plots

        * Add function to extract quantification window regions from include_idxs

        * Scale the quantification window according to the coordinates of the sgRNA plot

        * added c2pro check, added space in args.json

        * Correct the quantification window indexes for multiple guides

        * Fix name of nucleotide conversion plot when guides are not the same

        * Fix jinja variables that aren't found

        * Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

        * Remove unneeded variable and extra whitespace

        * Switch test branch to master

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

    commit 09e5d9720ad21e44fc7916d71bde3fd7a9dfa7ef
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu Jul 18 14:31:54 2024 -0600

        Asymmetrical cut point (#457)

        * add cut_point_ind to plot_alleles_heatmap for asymmetrical plotting

        * Cole asymmetrical cut point (#453)

        * Pin versions of numpy and matplotlib in CI environment (#84) (#452)

        * Reduce duplication and implement cut_point_ind in plot_alleles_heatmap_hist

        ---------

        Co-authored-by: Cole Lyman <Cole@colelyman.com>

    commit 8d92972694ddff629dad844a6ad100459f69751d
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu Jul 18 14:29:40 2024 -0600

        Cole/update args (#85) (#456)

    commit 44f692ecabf5e2eb96ee0cfd7bae62343da7810c
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Mon Jul 15 16:17:29 2024 -0600

        Implement new pooled mixed-mode default behavior (#454)

        * changes for pooled mixed-mode default (#83)

        * changes for pooled mixed-mode default

        * deprecated old arg

        * added integration tests for mixed mode

        * fixed test target

        * updated test name

        * pinned numpy

        * Fix integration tests yml

        * pinning matplotlib

        * added print to CI tests

        * changed mixed mode info string

        * Remove pooled-mixed-mode-align-to-genome step from Github Actions

        * Update demultiplex_genome_wide parameter and help

        * Convert args.json to unix line endings

        * Add Pooled mixed mode demux run

        * Update the name of the argument in Pooled

        * Point integration tests back to master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Revert change to pooled mixed mode info statement (#86)

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit 79b482b55a0e8edbc03ec22bd2714bade1e90323
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Tue Jul 9 12:53:23 2024 -0600

        Pin versions of numpy and matplotlib in CI environment (#84) (#452)

    commit 80dc1bdd72d50f989717bfc5f8156bc3495c45f4
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu May 30 14:07:42 2024 -0600

        Add padding to image

    commit 381755daf0939aaf2745df0a802c809633aff47d
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu May 30 13:59:57 2024 -0600

        White background for schematic for dark mode

    commit d649db71e610bd8840fbb8d46fadb07789b67390
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Fri May 24 12:45:53 2024 -0600

        Fix typo and move flexiguide to debug (#77) (#438)

        * Change flexiguide output to debug level

        * Fix typo in fastp merged output file name

    commit 71181f50ef2b39015523b1a71d9fd1bf0dce14eb
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Mon May 13 13:34:00 2024 -0600

        Prefix the release Docker tag with a `v` (#434)

    commit d2c2be18a6bb64b0e742cc24c4665980a24324bc
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Mon May 13 09:41:32 2024 -0600

        Showing sgRNA sequences on hover in CRISPRessoPro (#432)

        * Passing sgRNA sequences to regular and Batch D3 plots (#73)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        * Provide sgRNA_sequences to plot_nucleotide_quilt plots

        * Passing sgRNA_sequences to plot

        * Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

        * Add max-height to Batch report samples

        * Change testing branch

        * Fix wrong check for large Batch plots

        * Update integration_tests.yml to point back at master

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Push new releases to ECR (#74)

        * Create aws_ecr.yml (#1)

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * us-east-1

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Fix d3 sgRNA sequences (#76)

        * Pass correct sgRNA_sequences to d3 plot

        * Pass correct sgRNA sequence to prime editor plot for d3

        * Resize plotly (#75)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        * Pass div id for plotly

        * Remove debug

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit 1c504274818b6b17fb60620d48fd92cb2e50566d
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu May 9 14:16:25 2024 -0600

        Fix plots and improve plot error handling (#431)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit acb2ea8e26dff4cd11f71301b344f81b1cec9040
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu May 2 13:49:33 2024 -0600

        Use recent docker image for CircleCI testing that includes updated pandas

    commit 38fd76dbd7ce2087468f9f454b548777de959a68
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Wed May 1 16:42:28 2024 -0600

        Cole/fix status file name (#69) (#430)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

    commit 3ec22e5fd09e432c9997d30e5f9ee51a2cc00d7b
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Wed May 1 13:08:11 2024 -0600

        Remove linked space in readme

    commit 340a4e16795a5e500411e11572ec267525985009
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Wed May 1 13:07:14 2024 -0600

        Fix batch mode pandas warning. (#70) (#429)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit 1bc9e906f0ded81f80761d1ec375ee50a4f882a9
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Fri Apr 26 16:26:27 2024 -0600

        Bump version to 2.3.1 and change default CRISPRessoPooled behavior to change in 2.3.2 (#428)

    commit 5638a1f6ffa973231f23422e9c757fa8cd4af7cc
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Wed Apr 24 18:00:43 2024 -0600

        Spelling fixes

    commit d6011f29db16d8fc1c1e7222457b7f9a1f671de6
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Wed Apr 24 09:33:53 2024 -0600

        Extract `jinja_partials` and fix CRISPRessoPooled fastp errors (#425)

        * Updated README (#64)

        * Updating README to fix argument, email, and formatting

        * removing superfluous files

        * Add link to CRISPRessoPro, move CRISPRessoPro section to end, and fix JSON formatting

        * Remove link to CRISPRessoPro

        * Replace Docker badge with link to tags

        * Add bullet points to Guardrails section and improve formatting

        * Fix typo and removed colons from guardrails

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Extract jinja_partials  (#65)

        * Extract jinja_partials code

        * Remove Plotly dependency from setup.py

        * Fix CRISPRessoPooled flash errors (#68)

        * Fix replacing flash intermediate files with fastp intermediate files

        This also moves where the files are added to `files_to_remove` up to
        near where they are created.

        * Update to run test branch with paired end Pooled test

        * Add pooled-paired-sim test to integration tests

        * Replace flash and trimmomatic with fastp and remove plotly from Github Actions environment

        * Change test branch back to master

        ---------

        Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>

    commit f4858a30c43374f54058b3ad9c1e965e1ab7fb46
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Tue Apr 23 17:00:28 2024 -0600

        Updated README (#64) (#424)

        * Updating README to fix argument, email, and formatting

        * removing superfluous files

        * Add link to CRISPRessoPro, move CRISPRessoPro section to end, and fix JSON formatting

        * Remove link to CRISPRessoPro

        * Replace Docker badge with link to tags

        * Add bullet points to Guardrails section and improve formatting

        * Fix typo and removed colons from guardrails

        ---------

        Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>

    commit c3dbff0fccd44b0b1a9c246dd2aa629ddc515787
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Mon Apr 22 11:24:59 2024 -0600

        Update CRISPRessoPooledCORE.py (#423)

        Fix bug in error reporting if duplicate names are present

    commit 20903c14877e5166b1b8a7b50b8fcab450ea3ca6
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu Apr 18 16:55:39 2024 -0600

        Remove extra imports from CRISPRessoCore (#67) (#422)

    commit 4aae57e5be475cd717792265bee36a71a99425de
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu Apr 18 10:00:19 2024 -0600

        Cole/refactor jinja undefined (#66) (#421)

        * Replace Jinja2 PackageLoader with FileSystemLoader

        The PackageLoader doesn't work with a fairly recent version of Jinja2 (3.0.1)
        and Python 3.9. Replacing with FileSystemLoader work with the older version and
        the latest version.

        * Fix undefined variable `amplicon_name` in report template

        * Refactor logging Jinja2 undefined variable warnings

        * Revert plot_11a update

        * Update intedration test branch

        * Update jinja to warn on undefined but not fail. Fix all undefined warnings

        * Fix github integration tests ref

        * One more undefined variable

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>

    commit 768c3c05bf1786a2a32e135b6e145cd6503c3db1
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Tue Apr 9 17:30:10 2024 -0600

        Fix Jinja2 undefined variables (#63) (#417)

        * Replace Jinja2 PackageLoader with FileSystemLoader

        The PackageLoader doesn't work with a fairly recent version of Jinja2 (3.0.1)
        and Python 3.9. Replacing with FileSystemLoader work with the older version and
        the latest version.

        * Fix undefined variable `amplicon_name` in report template

        * Revert plot_11a update

        * Update intedration test branch

        * Update branch for integration tests

    commit 7e18f08cc1ac5f247a0fd1bbb394ccd9b0a07c2e
    Author: Han Dai <github@daihan.me>
    Date:   Fri Apr 5 18:36:41 2024 -0400

        fix: change all U+00A0 to U+0020 (#400)

    commit 235dc29c0cd0fcca2e999148d4660acf00b07221
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Fri Apr 5 16:36:16 2024 -0600

        Fastp, args as data, guardrails, and PE fix (#415)

        * Change CRISPResso_status.txt format to JSON (#46)

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * add json read for status file

        * changed Formatter to json format

        * fixed json access variable name: message

        * changed  perentage_complete to numeric

        * changed status file to .json

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * New makefile commands

        * changed file to .json

        * changed status to json file

        * Make JSON human readable by adding new lines

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * Introduce pandas sorting in CRISPRessoCompare (#47)

        * New makefile commands

        * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

        * Extract out split_interleaved_fastq function to CRISPRessoShared

        * Implement splitting interleaved fastq files in CRISPRessoPooled

        * Suppress split_interleaved_input from CRISPRessoWGS parameters

        * Suppress other parameters in CRISPRessoWGS

        * Move where interleaved fastq files are split to be trimmed properly

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * On push no branches

        * On push no branches

        * All in one file

        * Fix yml errors

        * Rename jobs

        * Remove old workflow files

        * Remove paths

        * Run jobs in parallel

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Move read filtering to after merging in CRISPResso (#39)

        * Move read filtering to after merging

        This is in an effort to be consistent with the behavior and results of
        CRISPRessoPooled.

        * Properly assign the correct file names for read filtering

        * Add space around operators

        * GitHub actions on pr (#51)

        * Run integration tests on pull_request

        * Run pytest on pull_request

        * Run pylint on pull_request

        * Run tests on PR only when opening PR (#53)

        * Update reports (#52)

        * Update report changes

        * Switch branch of integration test repo

        * Remove extraneous `crispresso_data_path`

        * Point integration tests back to master

        * point to test branch

        * pointed CI config to testing branch

        * Update integration_tests.yml

        point to master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>
        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>

        * Trevor/fastp integration (#50)

        * Update check_program to check versions and create check_fastq function

        * Update fastq arg, implement fastp in get_most_frequent_reads

        * Bump version to 2.3.0

        * Deprecate Flash and Trimmomatic parameters, and update fastp params

        * Update guess_amplicons and guess_guides to remove max_paired_end_reads_overlap

        * Implement trimming of single end reads

        * Merge (and trim) reads in CRISPRessoCORE with fastp

        * Modify error handling to account for fastp errors

        * Replace flash and trimmomatic with fastp in Docker dependencies

        * Update LICENSE.txt with fastp info

        * Remove min and max amplicon length (no longer needed)

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Implement trimming with fastp in CRISPRessoPooled

        * Implemend merging (and trimming) with fastp in CRISPRessoPooled

        * Fixed minor fastp errors

        * Move read filtering to after merging in CRISPResso (#39)

        * Move read filtering to after merging

        This is in an effort to be consistent with the behavior and results of
        CRISPRessoPooled.

        * Properly assign the correct file names for read filtering

        * Add space around operators

        * GitHub actions on pr (#51)

        * Run integration tests on pull_request

        * Run pytest on pull_request

        * Run pylint on pull_request

        * Run tests on PR only when opening PR (#53)

        * Update reports (#52)

        * Update report changes

        * Switch branch of integration test repo

        * Remove extraneous `crispresso_data_path`

        * Point integration tests back to master

        * Update where the test point to

        * Fix 'Prime-edited' key not found (#32)

        * Move 'Prime-edited' amplicon name check

        By moving this, it will check if there is an amplicon named
        'Prime-edited' (which is a reserved name) even if the
        `prime_editing_pegRNA_extension_seq` parameter is empty.

        * Only search for scaffold integration when pegRNA extension seq is provided

        * Remove spaces at the end of lines

        * Docker size (#49)

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * Introduce pandas sorting in CRISPRessoCompare (#47)

        * New makefile commands

        * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

        * Extract out split_interleaved_fastq function to CRISPRessoShared

        * Implement splitting interleaved fastq files in CRISPRessoPooled

        * Suppress split_interleaved_input from CRISPRessoWGS parameters

        * Suppress other parameters in CRISPRessoWGS

        * Move where interleaved fastq files are split to be trimmed properly

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * On push no branches

        * On push no branches

        * All in one file

        * Fix yml errors

        * Rename jobs

        * Remove old workflow files

        * Remove paths

        * Run jobs in parallel

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * 3.4->2.08

        * Put ttf-mscorefonts-installer back above apt-get clean

        * restore slash, replace fastp with trimmomatic and flash, add autoremove step

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * initial readme modifications

        * Updated readme to remove deprecated commands, updated help text to reflect new version and fastp

        * Pointing test branch back at master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>

        * Guardrails clean history (#34)

        * Include guardrail functions

        * Add CRISPRessoReports subtree

        * Refactor to use CRISPRessoReports module

        * Include guardrail functions

        * Functional guardrails, needs reports update

        * Add guardrail partial

        * fix guardrials partial

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * Introduce pandas sorting in CRISPRessoCompare (#47)

        * New makefile commands

        * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

        * Extract out split_interleaved_fastq function to CRISPRessoShared

        * Implement splitting interleaved fastq files in CRISPRessoPooled

        * Suppress split_interleaved_input from CRISPRessoWGS parameters

        * Suppress other parameters in CRISPRessoWGS

        * Move where interleaved fastq files are split to be trimmed properly

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * On push no branches

        * On push no branches

        * All in one file

        * Fix yml errors

        * Rename jobs

        * Remove old workflow files

        * Remove paths

        * Run jobs in parallel

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Update C cythonized files

        * Add exact numbers to guardrails printouts

        * Remove extraneous whitespace from CRISPRessoCOREResources.pyx

        * Fix calculation of `total_mods` from being negative

        The issue was that `all_deletion_coordinates` just tells you how many deletions
        were present, but not how long the deletion is.

        * Changes to message

        * Remove old tag

        * Point tests at guardrails

        * Restore C2 pro check

        * Save message with guardrail name

        * Point tests repo at master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

        * Fix case sensitivity in Prime Editing mode (#54)

        * Move read filtering to after merging in CRISPResso (#39)

        * Move read filtering to after merging

        This is in an effort to be consistent with the behavior and results of
        CRISPRessoPooled.

        * Properly assign the correct file names for read filtering

        * Add space around operators

        * GitHub actions on pr (#51)

        * Run integration tests on pull_request

        * Run pytest on pull_request

        * Run pylint on pull_request

        * Run tests on PR only when opening PR (#53)

        * Update reports (#52)

        * Update report changes

        * Switch branch of integration test repo

        * Remove extraneous `crispresso_data_path`

        * Point integration tests back to master

        * Make all amplicons in amplicon_seq_arr uppercase

        This fixes https://github.com/pinellolab/CRISPResso2/issues/396

        * Allow RNA values to be provided for prime_editing_pegRNA_scaffold_seq

        * Fix 'Prime-edited' key not found (#32)

        * Move 'Prime-edited' amplicon name check

        By moving this, it will check if there is an amplicon named
        'Prime-edited' (which is a reserved name) even if the
        `prime_editing_pegRNA_extension_seq` parameter is empty.

        * Only search for scaffold integration when pegRNA extension seq is provided

        * Remove spaces at the end of lines

        * Docker size (#49)

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ----…

Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
mbowcut2 added a commit that referenced this issue Nov 8, 2024
* Initial parallization work

* Initial parallization work

* Fleshed out process function, added tracking for manager dictionary

* lots of debugging of the process function

* parallelization achieved

* Improved boundary function

* Removing prints

* removing old code

* Failed cache generator function

* Created single thread seq_cache generator

* initial functioning parallelization

* adding data to variantCache and N_ constants

* more edits

* replacing old aln stats

* Fixed return values

* changing output file

* fixed boundary error

* Adding more tracking of timings

* optimized stat tracking considerably

* removing imports and unneccesary checks

* more logging

* Unbalancing the processes to allow for better locking interactions

* Adding amplicon name to plotting

* fixing stat tracking

* removed old code

* pin numpy

* Pin versions of numpy and matplotlib in CI environment (#84)

* D3-Enhancements (#78)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Fix typo and move flexiguide to debug (#77)

* Change flexiguide output to debug level

* Fix typo in fastp merged output file name

* Adding id tags for d3 script enhancements

* pointing to test branch

* Add amplicon_name parameter to allele heatmap and line plots

* Add function to extract quantification window regions from include_idxs

* Scale the quantification window according to the coordinates of the sgRNA plot

* added c2pro check, added space in args.json

* Correct the quantification window indexes for multiple guides

* Fix name of nucleotide conversion plot when guides are not the same

* Fix jinja variables that aren't found

* Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

* Remove unneeded variable and extra whitespace

* Switch test branch to master

---------





* old changes

* removed some superfluous code

* removed some superfluous code

* reverting to old flow if one process

* Enriched aln_stats with aligned read_length

* fixed declaration bug

* Fixing stats tracking for irregular reads

* fixed stat tracking by multiplying variant count

* adding some function explanations, cleaning up code, optimizing lock updating

* Cleaned up code, potentially breaks tests

* Revert "Cleaned up code, potentially breaks tests"

This reverts commit e7cfe3d.

* Revert "adding some function explanations, cleaning up code, optimizing lock updating"

This reverts commit 96deee2.

* replaced test branch, reverted to working version, added .upate() inside lock

* checking old cython branch

* reverting breaking changes

* pointing back at my test branch, unreverting non breaking changes

* timing log change

* pin numpy

* Pin versions of numpy and matplotlib in CI environment (#84)

* pointing to correct test branch

* removed debug statement

* memory tracking

* Replaced seq_cache with variantCache for improved memory

* removing print statements

* commenting out memory tracking

* Removing one extra loop to improve processing time

* optimizing info, removing old functions and print statements

* removing empty string handling

* removed empty string, reworded comments

* removing uneccesary files

* Literally minding my p's and q's

* Reset .c files

* Editing function comments, renaming managerCache, removing debug print statement

* Initial attempt at creating a temp_variant file for memory optimization

* update

* reading and writing to tsv to save on memory

* Merged in memory optimization branch

* Updating pytests for equal boundary changes

* Refactored write out fastq, improved tests

* Replace zcat (#94)

* D3-Enhancements (#78)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Fix typo and move flexiguide to debug (#77)

* Change flexiguide output to debug level

* Fix typo in fastp merged output file name

* Adding id tags for d3 script enhancements

* pointing to test branch

* Add amplicon_name parameter to allele heatmap and line plots

* Add function to extract quantification window regions from include_idxs

* Scale the quantification window according to the coordinates of the sgRNA plot

* added c2pro check, added space in args.json

* Correct the quantification window indexes for multiple guides

* Fix name of nucleotide conversion plot when guides are not the same

* Fix jinja variables that aren't found

* Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

* Remove unneeded variable and extra whitespace

* Switch test branch to master

---------





* Replace zcat with gunzip -c in `get_most_frequent_reads`

---------





* Limiting plotting processes

* Fix CRISPRessoAggregate bug and other improvements (#95)

* D3-Enhancements (#78)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Fix typo and move flexiguide to debug (#77)

* Change flexiguide output to debug level

* Fix typo in fastp merged output file name

* Adding id tags for d3 script enhancements

* pointing to test branch

* Add amplicon_name parameter to allele heatmap and line plots

* Add function to extract quantification window regions from include_idxs

* Scale the quantification window according to the coordinates of the sgRNA plot

* added c2pro check, added space in args.json

* Correct the quantification window indexes for multiple guides

* Fix name of nucleotide conversion plot when guides are not the same

* Fix jinja variables that aren't found

* Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

* Remove unneeded variable and extra whitespace

* Switch test branch to master

---------





* Add amplicon_name to plot functions

* Add sgRNA sequences to nucleotide quilt parameters in Aggregate

* Add custom_colors to Aggregate plot functions

* Update Aggregate and make_aggregate_report to have logger and tool

* Write command_used to Aggregate .json info file

* Point to new test branch and add Aggregate run

* Make the order of Aggregate runs explicit

* Sort all instances of crispresso2_folder_info in Aggregate

* Sort df_summary_quantification df in Aggregate

* Try sorting with a list of single column

* Update to correct test branch

* Move to master test branch

---------





* Add BAM test cases to Github Actions integration tests

* Add "Run" to step names

* Point to trevor/fastq-testing test branch

* Add in Aggregate test case to Github Actions

* parallelizing process_single_fastq_write_bam_out function

* wip parallelization for process_bam function

* Fixing write out bug

* Functional bam input parallelized function

* cleaned up code, removed old versions of functions

* trying again

* Remove duplicate c2_tests

* Remove commented out copy c2_tests

* Updating file access to close on error, removing unnecesary bam appendings

* Add `percent_complete`to info statements (#102)

* Fix CRISPRessoAggregate bug and other improvements (#95) (pinellolab#470)

* D3-Enhancements (#78)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Fix typo and move flexiguide to debug (#77)

* Change flexiguide output to debug level

* Fix typo in fastp merged output file name

* Adding id tags for d3 script enhancements

* pointing to test branch

* Add amplicon_name parameter to allele heatmap and line plots

* Add function to extract quantification window regions from include_idxs

* Scale the quantification window according to the coordinates of the sgRNA plot

* added c2pro check, added space in args.json

* Correct the quantification window indexes for multiple guides

* Fix name of nucleotide conversion plot when guides are not the same

* Fix jinja variables that aren't found

* Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

* Remove unneeded variable and extra whitespace

* Switch test branch to master

---------





* Add amplicon_name to plot functions

* Add sgRNA sequences to nucleotide quilt parameters in Aggregate

* Add custom_colors to Aggregate plot functions

* Update Aggregate and make_aggregate_report to have logger and tool

* Write command_used to Aggregate .json info file

* Point to new test branch and add Aggregate run

* Make the order of Aggregate runs explicit

* Sort all instances of crispresso2_folder_info in Aggregate

* Sort df_summary_quantification df in Aggregate

* Try sorting with a list of single column

* Update to correct test branch

* Move to master test branch

---------





* Display percentages in the CLI output (#88) (pinellolab#473)

* D3-Enhancements (#78)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Fix typo and move flexiguide to debug (#77)

* Change flexiguide output to debug level

* Fix typo in fastp merged output file name

* Adding id tags for d3 script enhancements

* pointing to test branch

* Add amplicon_name parameter to allele heatmap and line plots

* Add function to extract quantification window regions from include_idxs

* Scale the quantification window according to the coordinates of the sgRNA plot

* added c2pro check, added space in args.json

* Correct the quantification window indexes for multiple guides

* Fix name of nucleotide conversion plot when guides are not the same

* Fix jinja variables that aren't found

* Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

* Remove unneeded variable and extra whitespace

* Switch test branch to master

---------





* Display percentages in the CLI output

---------





* No pool (#79) (pinellolab#474)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Passing sgRNA sequences to regular and Batch D3 plots (#73)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Update integration_tests.yml to point back at master

---------





* Push new releases to ECR (#74)

* Create aws_ecr.yml (#1)

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* us-east-1

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Pass div id for plotly

* Remove debug

* Don't use thread pool with 1 process

* Fix logger issue

* Catchup

* Remove extra print statements

* Restrict generation of multiprocessing pool to when n_processes > 1

* Switch test branch to version bump

* Fix variable name error

* Change test branch back to master

* Fix CRISPRessoAggregate bug and other improvements (#95)

* D3-Enhancements (#78)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Fix typo and move flexiguide to debug (#77)

* Change flexiguide output to debug level

* Fix typo in fastp merged output file name

* Adding id tags for d3 script enhancements

* pointing to test branch

* Add amplicon_name parameter to allele heatmap and line plots

* Add function to extract quantification window regions from include_idxs

* Scale the quantification window according to the coordinates of the sgRNA plot

* added c2pro check, added space in args.json

* Correct the quantification window indexes for multiple guides

* Fix name of nucleotide conversion plot when guides are not the same

* Fix jinja variables that aren't found

* Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

* Remove unneeded variable and extra whitespace

* Switch test branch to master

---------





* Add amplicon_name to plot functions

* Add sgRNA sequences to nucleotide quilt parameters in Aggregate

* Add custom_colors to Aggregate plot functions

* Update Aggregate and make_aggregate_report to have logger and tool

* Write command_used to Aggregate .json info file

* Point to new test branch and add Aggregate run

* Make the order of Aggregate runs explicit

* Sort all instances of crispresso2_folder_info in Aggregate

* Sort df_summary_quantification df in Aggregate

* Try sorting with a list of single column

* Update to correct test branch

* Move to master test branch

---------





---------





* Add percent_complete to subprocess alignment

* Remove extraneous spaces

* Added more percent_complete statements to info blocks

---------





* Updated functions with newly merged progress percent from Cole

* Minor code cleanup, removal of comments and print statements etc

* Pointing test branch back at CRISPResso2_tests master branch

* Added checking for failed processes causing not all reads to be counted

---------

Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Cole Lyman <Cole@colelyman.com>
Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
mbowcut2 added a commit that referenced this issue Nov 8, 2024
* Mckay/c2pro reports test (#99)

* Fix CRISPRessoAggregate bug and other improvements (#95)

* D3-Enhancements (#78)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Fix typo and move flexiguide to debug (#77)

* Change flexiguide output to debug level

* Fix typo in fastp merged output file name

* Adding id tags for d3 script enhancements

* pointing to test branch

* Add amplicon_name parameter to allele heatmap and line plots

* Add function to extract quantification window regions from include_idxs

* Scale the quantification window according to the coordinates of the sgRNA plot

* added c2pro check, added space in args.json

* Correct the quantification window indexes for multiple guides

* Fix name of nucleotide conversion plot when guides are not the same

* Fix jinja variables that aren't found

* Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

* Remove unneeded variable and extra whitespace

* Switch test branch to master

---------





* Add amplicon_name to plot functions

* Add sgRNA sequences to nucleotide quilt parameters in Aggregate

* Add custom_colors to Aggregate plot functions

* Update Aggregate and make_aggregate_report to have logger and tool

* Write command_used to Aggregate .json info file

* Point to new test branch and add Aggregate run

* Make the order of Aggregate runs explicit

* Sort all instances of crispresso2_folder_info in Aggregate

* Sort df_summary_quantification df in Aggregate

* Try sorting with a list of single column

* Update to correct test branch

* Move to master test branch

---------





* Squashed commit of the following:

commit 6ec98a05ee70f85b5aa0ac15ab6094b7f1f20d08
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Tue Aug 13 16:44:39 2024 -0600

    dict key changes

commit 7cfd5acf06da4eb6f49453144ee1fed1e1488a7a
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Thu Aug 8 15:30:31 2024 -0600

    added C2PRO install check back

commit bfb0003329ea61b5c79c7e1df8d9a73ec5a508db
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Fri Aug 2 13:08:12 2024 -0600

    fixed key error conditionals

commit 84444e7480605206cb3efa4a0db675c55e717304
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Fri Aug 2 09:22:44 2024 -0600

    use local jinja_paritals file

commit 71dd12786fec6c4aba0170a3bfd9022b06f5eede
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Wed Jul 31 14:10:29 2024 -0600

    Squashed commit of the following:

    commit 5e3b30515c4bc437127e7fb21f53cb0bd511c4ca
    Author: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
    Date:   Mon Jul 22 09:31:44 2024 -0600

        D3-Enhancements (#78)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        * Provide sgRNA_sequences to plot_nucleotide_quilt plots

        * Passing sgRNA_sequences to plot

        * Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

        * Add max-height to Batch report samples

        * Change testing branch

        * Fix wrong check for large Batch plots

        * Fix typo and move flexiguide to debug (#77)

        * Change flexiguide output to debug level

        * Fix typo in fastp merged output file name

        * Adding id tags for d3 script enhancements

        * pointing to test branch

        * Add amplicon_name parameter to allele heatmap and line plots

        * Add function to extract quantification window regions from include_idxs

        * Scale the quantification window according to the coordinates of the sgRNA plot

        * added c2pro check, added space in args.json

        * Correct the quantification window indexes for multiple guides

        * Fix name of nucleotide conversion plot when guides are not the same

        * Fix jinja variables that aren't found

        * Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

        * Remove unneeded variable and extra whitespace

        * Switch test branch to master

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

    commit 09e5d9720ad21e44fc7916d71bde3fd7a9dfa7ef
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu Jul 18 14:31:54 2024 -0600

        Asymmetrical cut point (#457)

        * add cut_point_ind to plot_alleles_heatmap for asymmetrical plotting

        * Cole asymmetrical cut point (#453)

        * Pin versions of numpy and matplotlib in CI environment (#84) (#452)

        * Reduce duplication and implement cut_point_ind in plot_alleles_heatmap_hist

        ---------

        Co-authored-by: Cole Lyman <Cole@colelyman.com>

    commit 8d92972694ddff629dad844a6ad100459f69751d
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu Jul 18 14:29:40 2024 -0600

        Cole/update args (#85) (#456)

    commit 44f692ecabf5e2eb96ee0cfd7bae62343da7810c
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Mon Jul 15 16:17:29 2024 -0600

        Implement new pooled mixed-mode default behavior (#454)

        * changes for pooled mixed-mode default (#83)

        * changes for pooled mixed-mode default

        * deprecated old arg

        * added integration tests for mixed mode

        * fixed test target

        * updated test name

        * pinned numpy

        * Fix integration tests yml

        * pinning matplotlib

        * added print to CI tests

        * changed mixed mode info string

        * Remove pooled-mixed-mode-align-to-genome step from Github Actions

        * Update demultiplex_genome_wide parameter and help

        * Convert args.json to unix line endings

        * Add Pooled mixed mode demux run

        * Update the name of the argument in Pooled

        * Point integration tests back to master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Revert change to pooled mixed mode info statement (#86)

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit 79b482b55a0e8edbc03ec22bd2714bade1e90323
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Tue Jul 9 12:53:23 2024 -0600

        Pin versions of numpy and matplotlib in CI environment (#84) (#452)

    commit 80dc1bdd72d50f989717bfc5f8156bc3495c45f4
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu May 30 14:07:42 2024 -0600

        Add padding to image

    commit 381755daf0939aaf2745df0a802c809633aff47d
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu May 30 13:59:57 2024 -0600

        White background for schematic for dark mode

    commit d649db71e610bd8840fbb8d46fadb07789b67390
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Fri May 24 12:45:53 2024 -0600

        Fix typo and move flexiguide to debug (#77) (#438)

        * Change flexiguide output to debug level

        * Fix typo in fastp merged output file name

    commit 71181f50ef2b39015523b1a71d9fd1bf0dce14eb
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Mon May 13 13:34:00 2024 -0600

        Prefix the release Docker tag with a `v` (#434)

    commit d2c2be18a6bb64b0e742cc24c4665980a24324bc
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Mon May 13 09:41:32 2024 -0600

        Showing sgRNA sequences on hover in CRISPRessoPro (#432)

        * Passing sgRNA sequences to regular and Batch D3 plots (#73)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        * Provide sgRNA_sequences to plot_nucleotide_quilt plots

        * Passing sgRNA_sequences to plot

        * Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

        * Add max-height to Batch report samples

        * Change testing branch

        * Fix wrong check for large Batch plots

        * Update integration_tests.yml to point back at master

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Push new releases to ECR (#74)

        * Create aws_ecr.yml (#1)

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * us-east-1

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Fix d3 sgRNA sequences (#76)

        * Pass correct sgRNA_sequences to d3 plot

        * Pass correct sgRNA sequence to prime editor plot for d3

        * Resize plotly (#75)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        * Pass div id for plotly

        * Remove debug

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit 1c504274818b6b17fb60620d48fd92cb2e50566d
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu May 9 14:16:25 2024 -0600

        Fix plots and improve plot error handling (#431)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit acb2ea8e26dff4cd11f71301b344f81b1cec9040
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu May 2 13:49:33 2024 -0600

        Use recent docker image for CircleCI testing that includes updated pandas

    commit 38fd76dbd7ce2087468f9f454b548777de959a68
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Wed May 1 16:42:28 2024 -0600

        Cole/fix status file name (#69) (#430)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

    commit 3ec22e5fd09e432c9997d30e5f9ee51a2cc00d7b
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Wed May 1 13:08:11 2024 -0600

        Remove linked space in readme

    commit 340a4e16795a5e500411e11572ec267525985009
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Wed May 1 13:07:14 2024 -0600

        Fix batch mode pandas warning. (#70) (#429)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit 1bc9e906f0ded81f80761d1ec375ee50a4f882a9
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Fri Apr 26 16:26:27 2024 -0600

        Bump version to 2.3.1 and change default CRISPRessoPooled behavior to change in 2.3.2 (#428)

    commit 5638a1f6ffa973231f23422e9c757fa8cd4af7cc
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Wed Apr 24 18:00:43 2024 -0600

        Spelling fixes

    commit d6011f29db16d8fc1c1e7222457b7f9a1f671de6
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Wed Apr 24 09:33:53 2024 -0600

        Extract `jinja_partials` and fix CRISPRessoPooled fastp errors (#425)

        * Updated README (#64)

        * Updating README to fix argument, email, and formatting

        * removing superfluous files

        * Add link to CRISPRessoPro, move CRISPRessoPro section to end, and fix JSON formatting

        * Remove link to CRISPRessoPro

        * Replace Docker badge with link to tags

        * Add bullet points to Guardrails section and improve formatting

        * Fix typo and removed colons from guardrails

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Extract jinja_partials  (#65)

        * Extract jinja_partials code

        * Remove Plotly dependency from setup.py

        * Fix CRISPRessoPooled flash errors (#68)

        * Fix replacing flash intermediate files with fastp intermediate files

        This also moves where the files are added to `files_to_remove` up to
        near where they are created.

        * Update to run test branch with paired end Pooled test

        * Add pooled-paired-sim test to integration tests

        * Replace flash and trimmomatic with fastp and remove plotly from Github Actions environment

        * Change test branch back to master

        ---------

        Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>

    commit f4858a30c43374f54058b3ad9c1e965e1ab7fb46
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Tue Apr 23 17:00:28 2024 -0600

        Updated README (#64) (#424)

        * Updating README to fix argument, email, and formatting

        * removing superfluous files

        * Add link to CRISPRessoPro, move CRISPRessoPro section to end, and fix JSON formatting

        * Remove link to CRISPRessoPro

        * Replace Docker badge with link to tags

        * Add bullet points to Guardrails section and improve formatting

        * Fix typo and removed colons from guardrails

        ---------

        Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>

    commit c3dbff0fccd44b0b1a9c246dd2aa629ddc515787
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Mon Apr 22 11:24:59 2024 -0600

        Update CRISPRessoPooledCORE.py (#423)

        Fix bug in error reporting if duplicate names are present

    commit 20903c14877e5166b1b8a7b50b8fcab450ea3ca6
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu Apr 18 16:55:39 2024 -0600

        Remove extra imports from CRISPRessoCore (#67) (#422)

    commit 4aae57e5be475cd717792265bee36a71a99425de
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu Apr 18 10:00:19 2024 -0600

        Cole/refactor jinja undefined (#66) (#421)

        * Replace Jinja2 PackageLoader with FileSystemLoader

        The PackageLoader doesn't work with a fairly recent version of Jinja2 (3.0.1)
        and Python 3.9. Replacing with FileSystemLoader work with the older version and
        the latest version.

        * Fix undefined variable `amplicon_name` in report template

        * Refactor logging Jinja2 undefined variable warnings

        * Revert plot_11a update

        * Update intedration test branch

        * Update jinja to warn on undefined but not fail. Fix all undefined warnings

        * Fix github integration tests ref

        * One more undefined variable

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>

    commit 768c3c05bf1786a2a32e135b6e145cd6503c3db1
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Tue Apr 9 17:30:10 2024 -0600

        Fix Jinja2 undefined variables (#63) (#417)

        * Replace Jinja2 PackageLoader with FileSystemLoader

        The PackageLoader doesn't work with a fairly recent version of Jinja2 (3.0.1)
        and Python 3.9. Replacing with FileSystemLoader work with the older version and
        the latest version.

        * Fix undefined variable `amplicon_name` in report template

        * Revert plot_11a update

        * Update intedration test branch

        * Update branch for integration tests

    commit 7e18f08cc1ac5f247a0fd1bbb394ccd9b0a07c2e
    Author: Han Dai <github@daihan.me>
    Date:   Fri Apr 5 18:36:41 2024 -0400

        fix: change all U+00A0 to U+0020 (#400)

    commit 235dc29c0cd0fcca2e999148d4660acf00b07221
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Fri Apr 5 16:36:16 2024 -0600

        Fastp, args as data, guardrails, and PE fix (#415)

        * Change CRISPResso_status.txt format to JSON (#46)

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * add json read for status file

        * changed Formatter to json format

        * fixed json access variable name: message

        * changed  perentage_complete to numeric

        * changed status file to .json

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * New makefile commands

        * changed file to .json

        * changed status to json file

        * Make JSON human readable by adding new lines

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * Introduce pandas sorting in CRISPRessoCompare (#47)

        * New makefile commands

        * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

        * Extract out split_interleaved_fastq function to CRISPRessoShared

        * Implement splitting interleaved fastq files in CRISPRessoPooled

        * Suppress split_interleaved_input from CRISPRessoWGS parameters

        * Suppress other parameters in CRISPRessoWGS

        * Move where interleaved fastq files are split to be trimmed properly

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * On push no branches

        * On push no branches

        * All in one file

        * Fix yml errors

        * Rename jobs

        * Remove old workflow files

        * Remove paths

        * Run jobs in parallel

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Move read filtering to after merging in CRISPResso (#39)

        * Move read filtering to after merging

        This is in an effort to be consistent with the behavior and results of
        CRISPRessoPooled.

        * Properly assign the correct file names for read filtering

        * Add space around operators

        * GitHub actions on pr (#51)

        * Run integration tests on pull_request

        * Run pytest on pull_request

        * Run pylint on pull_request

        * Run tests on PR only when opening PR (#53)

        * Update reports (#52)

        * Update report changes

        * Switch branch of integration test repo

        * Remove extraneous `crispresso_data_path`

        * Point integration tests back to master

        * point to test branch

        * pointed CI config to testing branch

        * Update integration_tests.yml

        point to master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>
        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>

        * Trevor/fastp integration (#50)

        * Update check_program to check versions and create check_fastq function

        * Update fastq arg, implement fastp in get_most_frequent_reads

        * Bump version to 2.3.0

        * Deprecate Flash and Trimmomatic parameters, and update fastp params

        * Update guess_amplicons and guess_guides to remove max_paired_end_reads_overlap

        * Implement trimming of single end reads

        * Merge (and trim) reads in CRISPRessoCORE with fastp

        * Modify error handling to account for fastp errors

        * Replace flash and trimmomatic with fastp in Docker dependencies

        * Update LICENSE.txt with fastp info

        * Remove min and max amplicon length (no longer needed)

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Implement trimming with fastp in CRISPRessoPooled

        * Implemend merging (and trimming) with fastp in CRISPRessoPooled

        * Fixed minor fastp errors

        * Move read filtering to after merging in CRISPResso (#39)

        * Move read filtering to after merging

        This is in an effort to be consistent with the behavior and results of
        CRISPRessoPooled.

        * Properly assign the correct file names for read filtering

        * Add space around operators

        * GitHub actions on pr (#51)

        * Run integration tests on pull_request

        * Run pytest on pull_request

        * Run pylint on pull_request

        * Run tests on PR only when opening PR (#53)

        * Update reports (#52)

        * Update report changes

        * Switch branch of integration test repo

        * Remove extraneous `crispresso_data_path`

        * Point integration tests back to master

        * Update where the test point to

        * Fix 'Prime-edited' key not found (#32)

        * Move 'Prime-edited' amplicon name check

        By moving this, it will check if there is an amplicon named
        'Prime-edited' (which is a reserved name) even if the
        `prime_editing_pegRNA_extension_seq` parameter is empty.

        * Only search for scaffold integration when pegRNA extension seq is provided

        * Remove spaces at the end of lines

        * Docker size (#49)

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * Introduce pandas sorting in CRISPRessoCompare (#47)

        * New makefile commands

        * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

        * Extract out split_interleaved_fastq function to CRISPRessoShared

        * Implement splitting interleaved fastq files in CRISPRessoPooled

        * Suppress split_interleaved_input from CRISPRessoWGS parameters

        * Suppress other parameters in CRISPRessoWGS

        * Move where interleaved fastq files are split to be trimmed properly

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * On push no branches

        * On push no branches

        * All in one file

        * Fix yml errors

        * Rename jobs

        * Remove old workflow files

        * Remove paths

        * Run jobs in parallel

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * 3.4->2.08

        * Put ttf-mscorefonts-installer back above apt-get clean

        * restore slash, replace fastp with trimmomatic and flash, add autoremove step

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * initial readme modifications

        * Updated readme to remove deprecated commands, updated help text to reflect new version and fastp

        * Pointing test branch back at master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>

        * Guardrails clean history (#34)

        * Include guardrail functions

        * Add CRISPRessoReports subtree

        * Refactor to use CRISPRessoReports module

        * Include guardrail functions

        * Functional guardrails, needs reports update

        * Add guardrail partial

        * fix guardrials partial

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * Introduce pandas sorting in CRISPRessoCompare (#47)

        * New makefile commands

        * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

        * Extract out split_interleaved_fastq function to CRISPRessoShared

        * Implement splitting interleaved fastq files in CRISPRessoPooled

        * Suppress split_interleaved_input from CRISPRessoWGS parameters

        * Suppress other parameters in CRISPRessoWGS

        * Move where interleaved fastq files are split to be trimmed properly

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * On push no branches

        * On push no branches

        * All in one file

        * Fix yml errors

        * Rename jobs

        * Remove old workflow files

        * Remove paths

        * Run jobs in parallel

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Update C cythonized files

        * Add exact numbers to guardrails printouts

        * Remove extraneous whitespace from CRISPRessoCOREResources.pyx

        * Fix calculation of `total_mods` from being negative

        The issue was that `all_deletion_coordinates` just tells you how many deletions
        were present, but not how long the deletion is.

        * Changes to message

        * Remove old tag

        * Point tests at guardrails

        * Restore C2 pro check

        * Save message with guardrail name

        * Point tests repo at master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

        * Fix case sensitivity in Prime Editing mode (#54)

        * Move read filtering to after merging in CRISPResso (#39)

        * Move read filtering to after merging

        This is in an effort to be consistent with the behavior and results of
        CRISPRessoPooled.

        * Properly assign the correct file names for read filtering

        * Add space around operators

        * GitHub actions on pr (#51)

        * Run integration tests on pull_request

        * Run pytest on pull_request

        * Run pylint on pull_request

        * Run tests on PR only when opening PR (#53)

        * Update reports (#52)

        * Update report changes

        * Switch branch of integration test repo

        * Remove extraneous `crispresso_data_path`

        * Point integration tests back to master

        * Make all amplicons in amplicon_seq_arr uppercase

        This fixes https://github.com/pinellolab/CRISPResso2/issues/396

        * Allow RNA values to be provided for prime_editing_pegRNA_scaffold_seq

        * Fix 'Prime-edited' key not found (#32)

        * Move 'Prime-edited' amplicon name check

        By moving this, it will check if there is an amplicon named
        'Prime-edited' (which is a reserved name) even if the
        `prime_editing_pegRNA_extension_seq` parameter is empty.

        * Only search for scaffold integration when pegRNA extension seq is provided

        * Remove spaces at the end of lines

        * Docker size (#49)

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_nu…

Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
mbowcut2 added a commit that referenced this issue Nov 8, 2024
* Fix CRISPRessoAggregate bug and other improvements (#95)

* D3-Enhancements (#78)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Fix typo and move flexiguide to debug (#77)

* Change flexiguide output to debug level

* Fix typo in fastp merged output file name

* Adding id tags for d3 script enhancements

* pointing to test branch

* Add amplicon_name parameter to allele heatmap and line plots

* Add function to extract quantification window regions from include_idxs

* Scale the quantification window according to the coordinates of the sgRNA plot

* added c2pro check, added space in args.json

* Correct the quantification window indexes for multiple guides

* Fix name of nucleotide conversion plot when guides are not the same

* Fix jinja variables that aren't found

* Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

* Remove unneeded variable and extra whitespace

* Switch test branch to master

---------





* Add amplicon_name to plot functions

* Add sgRNA sequences to nucleotide quilt parameters in Aggregate

* Add custom_colors to Aggregate plot functions

* Update Aggregate and make_aggregate_report to have logger and tool

* Write command_used to Aggregate .json info file

* Point to new test branch and add Aggregate run

* Make the order of Aggregate runs explicit

* Sort all instances of crispresso2_folder_info in Aggregate

* Sort df_summary_quantification df in Aggregate

* Try sorting with a list of single column

* Update to correct test branch

* Move to master test branch

---------





* Squashed commit of the following:

commit 6ec98a05ee70f85b5aa0ac15ab6094b7f1f20d08
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Tue Aug 13 16:44:39 2024 -0600

    dict key changes

commit 7cfd5acf06da4eb6f49453144ee1fed1e1488a7a
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Thu Aug 8 15:30:31 2024 -0600

    added C2PRO install check back

commit bfb0003329ea61b5c79c7e1df8d9a73ec5a508db
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Fri Aug 2 13:08:12 2024 -0600

    fixed key error conditionals

commit 84444e7480605206cb3efa4a0db675c55e717304
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Fri Aug 2 09:22:44 2024 -0600

    use local jinja_paritals file

commit 71dd12786fec6c4aba0170a3bfd9022b06f5eede
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Wed Jul 31 14:10:29 2024 -0600

    Squashed commit of the following:

    commit 5e3b30515c4bc437127e7fb21f53cb0bd511c4ca
    Author: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
    Date:   Mon Jul 22 09:31:44 2024 -0600

        D3-Enhancements (#78)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        * Provide sgRNA_sequences to plot_nucleotide_quilt plots

        * Passing sgRNA_sequences to plot

        * Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

        * Add max-height to Batch report samples

        * Change testing branch

        * Fix wrong check for large Batch plots

        * Fix typo and move flexiguide to debug (#77)

        * Change flexiguide output to debug level

        * Fix typo in fastp merged output file name

        * Adding id tags for d3 script enhancements

        * pointing to test branch

        * Add amplicon_name parameter to allele heatmap and line plots

        * Add function to extract quantification window regions from include_idxs

        * Scale the quantification window according to the coordinates of the sgRNA plot

        * added c2pro check, added space in args.json

        * Correct the quantification window indexes for multiple guides

        * Fix name of nucleotide conversion plot when guides are not the same

        * Fix jinja variables that aren't found

        * Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

        * Remove unneeded variable and extra whitespace

        * Switch test branch to master

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

    commit 09e5d9720ad21e44fc7916d71bde3fd7a9dfa7ef
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu Jul 18 14:31:54 2024 -0600

        Asymmetrical cut point (#457)

        * add cut_point_ind to plot_alleles_heatmap for asymmetrical plotting

        * Cole asymmetrical cut point (#453)

        * Pin versions of numpy and matplotlib in CI environment (#84) (#452)

        * Reduce duplication and implement cut_point_ind in plot_alleles_heatmap_hist

        ---------

        Co-authored-by: Cole Lyman <Cole@colelyman.com>

    commit 8d92972694ddff629dad844a6ad100459f69751d
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu Jul 18 14:29:40 2024 -0600

        Cole/update args (#85) (#456)

    commit 44f692ecabf5e2eb96ee0cfd7bae62343da7810c
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Mon Jul 15 16:17:29 2024 -0600

        Implement new pooled mixed-mode default behavior (#454)

        * changes for pooled mixed-mode default (#83)

        * changes for pooled mixed-mode default

        * deprecated old arg

        * added integration tests for mixed mode

        * fixed test target

        * updated test name

        * pinned numpy

        * Fix integration tests yml

        * pinning matplotlib

        * added print to CI tests

        * changed mixed mode info string

        * Remove pooled-mixed-mode-align-to-genome step from Github Actions

        * Update demultiplex_genome_wide parameter and help

        * Convert args.json to unix line endings

        * Add Pooled mixed mode demux run

        * Update the name of the argument in Pooled

        * Point integration tests back to master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Revert change to pooled mixed mode info statement (#86)

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit 79b482b55a0e8edbc03ec22bd2714bade1e90323
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Tue Jul 9 12:53:23 2024 -0600

        Pin versions of numpy and matplotlib in CI environment (#84) (#452)

    commit 80dc1bdd72d50f989717bfc5f8156bc3495c45f4
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu May 30 14:07:42 2024 -0600

        Add padding to image

    commit 381755daf0939aaf2745df0a802c809633aff47d
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu May 30 13:59:57 2024 -0600

        White background for schematic for dark mode

    commit d649db71e610bd8840fbb8d46fadb07789b67390
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Fri May 24 12:45:53 2024 -0600

        Fix typo and move flexiguide to debug (#77) (#438)

        * Change flexiguide output to debug level

        * Fix typo in fastp merged output file name

    commit 71181f50ef2b39015523b1a71d9fd1bf0dce14eb
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Mon May 13 13:34:00 2024 -0600

        Prefix the release Docker tag with a `v` (#434)

    commit d2c2be18a6bb64b0e742cc24c4665980a24324bc
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Mon May 13 09:41:32 2024 -0600

        Showing sgRNA sequences on hover in CRISPRessoPro (#432)

        * Passing sgRNA sequences to regular and Batch D3 plots (#73)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        * Provide sgRNA_sequences to plot_nucleotide_quilt plots

        * Passing sgRNA_sequences to plot

        * Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

        * Add max-height to Batch report samples

        * Change testing branch

        * Fix wrong check for large Batch plots

        * Update integration_tests.yml to point back at master

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Push new releases to ECR (#74)

        * Create aws_ecr.yml (#1)

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * us-east-1

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Fix d3 sgRNA sequences (#76)

        * Pass correct sgRNA_sequences to d3 plot

        * Pass correct sgRNA sequence to prime editor plot for d3

        * Resize plotly (#75)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        * Pass div id for plotly

        * Remove debug

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit 1c504274818b6b17fb60620d48fd92cb2e50566d
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu May 9 14:16:25 2024 -0600

        Fix plots and improve plot error handling (#431)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit acb2ea8e26dff4cd11f71301b344f81b1cec9040
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu May 2 13:49:33 2024 -0600

        Use recent docker image for CircleCI testing that includes updated pandas

    commit 38fd76dbd7ce2087468f9f454b548777de959a68
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Wed May 1 16:42:28 2024 -0600

        Cole/fix status file name (#69) (#430)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

    commit 3ec22e5fd09e432c9997d30e5f9ee51a2cc00d7b
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Wed May 1 13:08:11 2024 -0600

        Remove linked space in readme

    commit 340a4e16795a5e500411e11572ec267525985009
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Wed May 1 13:07:14 2024 -0600

        Fix batch mode pandas warning. (#70) (#429)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit 1bc9e906f0ded81f80761d1ec375ee50a4f882a9
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Fri Apr 26 16:26:27 2024 -0600

        Bump version to 2.3.1 and change default CRISPRessoPooled behavior to change in 2.3.2 (#428)

    commit 5638a1f6ffa973231f23422e9c757fa8cd4af7cc
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Wed Apr 24 18:00:43 2024 -0600

        Spelling fixes

    commit d6011f29db16d8fc1c1e7222457b7f9a1f671de6
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Wed Apr 24 09:33:53 2024 -0600

        Extract `jinja_partials` and fix CRISPRessoPooled fastp errors (#425)

        * Updated README (#64)

        * Updating README to fix argument, email, and formatting

        * removing superfluous files

        * Add link to CRISPRessoPro, move CRISPRessoPro section to end, and fix JSON formatting

        * Remove link to CRISPRessoPro

        * Replace Docker badge with link to tags

        * Add bullet points to Guardrails section and improve formatting

        * Fix typo and removed colons from guardrails

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Extract jinja_partials  (#65)

        * Extract jinja_partials code

        * Remove Plotly dependency from setup.py

        * Fix CRISPRessoPooled flash errors (#68)

        * Fix replacing flash intermediate files with fastp intermediate files

        This also moves where the files are added to `files_to_remove` up to
        near where they are created.

        * Update to run test branch with paired end Pooled test

        * Add pooled-paired-sim test to integration tests

        * Replace flash and trimmomatic with fastp and remove plotly from Github Actions environment

        * Change test branch back to master

        ---------

        Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>

    commit f4858a30c43374f54058b3ad9c1e965e1ab7fb46
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Tue Apr 23 17:00:28 2024 -0600

        Updated README (#64) (#424)

        * Updating README to fix argument, email, and formatting

        * removing superfluous files

        * Add link to CRISPRessoPro, move CRISPRessoPro section to end, and fix JSON formatting

        * Remove link to CRISPRessoPro

        * Replace Docker badge with link to tags

        * Add bullet points to Guardrails section and improve formatting

        * Fix typo and removed colons from guardrails

        ---------

        Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>

    commit c3dbff0fccd44b0b1a9c246dd2aa629ddc515787
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Mon Apr 22 11:24:59 2024 -0600

        Update CRISPRessoPooledCORE.py (#423)

        Fix bug in error reporting if duplicate names are present

    commit 20903c14877e5166b1b8a7b50b8fcab450ea3ca6
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu Apr 18 16:55:39 2024 -0600

        Remove extra imports from CRISPRessoCore (#67) (#422)

    commit 4aae57e5be475cd717792265bee36a71a99425de
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu Apr 18 10:00:19 2024 -0600

        Cole/refactor jinja undefined (#66) (#421)

        * Replace Jinja2 PackageLoader with FileSystemLoader

        The PackageLoader doesn't work with a fairly recent version of Jinja2 (3.0.1)
        and Python 3.9. Replacing with FileSystemLoader work with the older version and
        the latest version.

        * Fix undefined variable `amplicon_name` in report template

        * Refactor logging Jinja2 undefined variable warnings

        * Revert plot_11a update

        * Update intedration test branch

        * Update jinja to warn on undefined but not fail. Fix all undefined warnings

        * Fix github integration tests ref

        * One more undefined variable

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>

    commit 768c3c05bf1786a2a32e135b6e145cd6503c3db1
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Tue Apr 9 17:30:10 2024 -0600

        Fix Jinja2 undefined variables (#63) (#417)

        * Replace Jinja2 PackageLoader with FileSystemLoader

        The PackageLoader doesn't work with a fairly recent version of Jinja2 (3.0.1)
        and Python 3.9. Replacing with FileSystemLoader work with the older version and
        the latest version.

        * Fix undefined variable `amplicon_name` in report template

        * Revert plot_11a update

        * Update intedration test branch

        * Update branch for integration tests

    commit 7e18f08cc1ac5f247a0fd1bbb394ccd9b0a07c2e
    Author: Han Dai <github@daihan.me>
    Date:   Fri Apr 5 18:36:41 2024 -0400

        fix: change all U+00A0 to U+0020 (#400)

    commit 235dc29c0cd0fcca2e999148d4660acf00b07221
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Fri Apr 5 16:36:16 2024 -0600

        Fastp, args as data, guardrails, and PE fix (#415)

        * Change CRISPResso_status.txt format to JSON (#46)

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * add json read for status file

        * changed Formatter to json format

        * fixed json access variable name: message

        * changed  perentage_complete to numeric

        * changed status file to .json

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * New makefile commands

        * changed file to .json

        * changed status to json file

        * Make JSON human readable by adding new lines

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * Introduce pandas sorting in CRISPRessoCompare (#47)

        * New makefile commands

        * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

        * Extract out split_interleaved_fastq function to CRISPRessoShared

        * Implement splitting interleaved fastq files in CRISPRessoPooled

        * Suppress split_interleaved_input from CRISPRessoWGS parameters

        * Suppress other parameters in CRISPRessoWGS

        * Move where interleaved fastq files are split to be trimmed properly

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * On push no branches

        * On push no branches

        * All in one file

        * Fix yml errors

        * Rename jobs

        * Remove old workflow files

        * Remove paths

        * Run jobs in parallel

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Move read filtering to after merging in CRISPResso (#39)

        * Move read filtering to after merging

        This is in an effort to be consistent with the behavior and results of
        CRISPRessoPooled.

        * Properly assign the correct file names for read filtering

        * Add space around operators

        * GitHub actions on pr (#51)

        * Run integration tests on pull_request

        * Run pytest on pull_request

        * Run pylint on pull_request

        * Run tests on PR only when opening PR (#53)

        * Update reports (#52)

        * Update report changes

        * Switch branch of integration test repo

        * Remove extraneous `crispresso_data_path`

        * Point integration tests back to master

        * point to test branch

        * pointed CI config to testing branch

        * Update integration_tests.yml

        point to master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>
        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>

        * Trevor/fastp integration (#50)

        * Update check_program to check versions and create check_fastq function

        * Update fastq arg, implement fastp in get_most_frequent_reads

        * Bump version to 2.3.0

        * Deprecate Flash and Trimmomatic parameters, and update fastp params

        * Update guess_amplicons and guess_guides to remove max_paired_end_reads_overlap

        * Implement trimming of single end reads

        * Merge (and trim) reads in CRISPRessoCORE with fastp

        * Modify error handling to account for fastp errors

        * Replace flash and trimmomatic with fastp in Docker dependencies

        * Update LICENSE.txt with fastp info

        * Remove min and max amplicon length (no longer needed)

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Implement trimming with fastp in CRISPRessoPooled

        * Implemend merging (and trimming) with fastp in CRISPRessoPooled

        * Fixed minor fastp errors

        * Move read filtering to after merging in CRISPResso (#39)

        * Move read filtering to after merging

        This is in an effort to be consistent with the behavior and results of
        CRISPRessoPooled.

        * Properly assign the correct file names for read filtering

        * Add space around operators

        * GitHub actions on pr (#51)

        * Run integration tests on pull_request

        * Run pytest on pull_request

        * Run pylint on pull_request

        * Run tests on PR only when opening PR (#53)

        * Update reports (#52)

        * Update report changes

        * Switch branch of integration test repo

        * Remove extraneous `crispresso_data_path`

        * Point integration tests back to master

        * Update where the test point to

        * Fix 'Prime-edited' key not found (#32)

        * Move 'Prime-edited' amplicon name check

        By moving this, it will check if there is an amplicon named
        'Prime-edited' (which is a reserved name) even if the
        `prime_editing_pegRNA_extension_seq` parameter is empty.

        * Only search for scaffold integration when pegRNA extension seq is provided

        * Remove spaces at the end of lines

        * Docker size (#49)

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * Introduce pandas sorting in CRISPRessoCompare (#47)

        * New makefile commands

        * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

        * Extract out split_interleaved_fastq function to CRISPRessoShared

        * Implement splitting interleaved fastq files in CRISPRessoPooled

        * Suppress split_interleaved_input from CRISPRessoWGS parameters

        * Suppress other parameters in CRISPRessoWGS

        * Move where interleaved fastq files are split to be trimmed properly

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * On push no branches

        * On push no branches

        * All in one file

        * Fix yml errors

        * Rename jobs

        * Remove old workflow files

        * Remove paths

        * Run jobs in parallel

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * 3.4->2.08

        * Put ttf-mscorefonts-installer back above apt-get clean

        * restore slash, replace fastp with trimmomatic and flash, add autoremove step

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * initial readme modifications

        * Updated readme to remove deprecated commands, updated help text to reflect new version and fastp

        * Pointing test branch back at master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>

        * Guardrails clean history (#34)

        * Include guardrail functions

        * Add CRISPRessoReports subtree

        * Refactor to use CRISPRessoReports module

        * Include guardrail functions

        * Functional guardrails, needs reports update

        * Add guardrail partial

        * fix guardrials partial

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * Introduce pandas sorting in CRISPRessoCompare (#47)

        * New makefile commands

        * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

        * Extract out split_interleaved_fastq function to CRISPRessoShared

        * Implement splitting interleaved fastq files in CRISPRessoPooled

        * Suppress split_interleaved_input from CRISPRessoWGS parameters

        * Suppress other parameters in CRISPRessoWGS

        * Move where interleaved fastq files are split to be trimmed properly

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * On push no branches

        * On push no branches

        * All in one file

        * Fix yml errors

        * Rename jobs

        * Remove old workflow files

        * Remove paths

        * Run jobs in parallel

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Update C cythonized files

        * Add exact numbers to guardrails printouts

        * Remove extraneous whitespace from CRISPRessoCOREResources.pyx

        * Fix calculation of `total_mods` from being negative

        The issue was that `all_deletion_coordinates` just tells you how many deletions
        were present, but not how long the deletion is.

        * Changes to message

        * Remove old tag

        * Point tests at guardrails

        * Restore C2 pro check

        * Save message with guardrail name

        * Point tests repo at master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

        * Fix case sensitivity in Prime Editing mode (#54)

        * Move read filtering to after merging in CRISPResso (#39)

        * Move read filtering to after merging

        This is in an effort to be consistent with the behavior and results of
        CRISPRessoPooled.

        * Properly assign the correct file names for read filtering

        * Add space around operators

        * GitHub actions on pr (#51)

        * Run integration tests on pull_request

        * Run pytest on pull_request

        * Run pylint on pull_request

        * Run tests on PR only when opening PR (#53)

        * Update reports (#52)

        * Update report changes

        * Switch branch of integration test repo

        * Remove extraneous `crispresso_data_path`

        * Point integration tests back to master

        * Make all amplicons in amplicon_seq_arr uppercase

        This fixes https://github.com/pinellolab/CRISPResso2/issues/396

        * Allow RNA values to be provided for prime_editing_pegRNA_scaffold_seq

        * Fix 'Prime-edited' key not found (#32)

        * Move 'Prime-edited' amplicon name check

        By moving this, it will check if there is an amplicon named
        'Prime-edited' (which is a reserved name) even if the
        `prime_editing_pegRNA_extension_seq` parameter is empty.

        * Only search for scaffold integration when pegRNA extension seq is provided

        * Remove spaces at the end of lines

        * Docker size (#49)

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ----…

Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
mbowcut2 added a commit that referenced this issue Nov 8, 2024
* Initial parallization work

* Initial parallization work

* Fleshed out process function, added tracking for manager dictionary

* lots of debugging of the process function

* parallelization achieved

* Improved boundary function

* Removing prints

* removing old code

* Failed cache generator function

* Created single thread seq_cache generator

* initial functioning parallelization

* adding data to variantCache and N_ constants

* more edits

* replacing old aln stats

* Fixed return values

* changing output file

* fixed boundary error

* Adding more tracking of timings

* optimized stat tracking considerably

* removing imports and unneccesary checks

* more logging

* Unbalancing the processes to allow for better locking interactions

* Adding amplicon name to plotting

* fixing stat tracking

* removed old code

* pin numpy

* Pin versions of numpy and matplotlib in CI environment (#84)

* D3-Enhancements (#78)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Fix typo and move flexiguide to debug (#77)

* Change flexiguide output to debug level

* Fix typo in fastp merged output file name

* Adding id tags for d3 script enhancements

* pointing to test branch

* Add amplicon_name parameter to allele heatmap and line plots

* Add function to extract quantification window regions from include_idxs

* Scale the quantification window according to the coordinates of the sgRNA plot

* added c2pro check, added space in args.json

* Correct the quantification window indexes for multiple guides

* Fix name of nucleotide conversion plot when guides are not the same

* Fix jinja variables that aren't found

* Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

* Remove unneeded variable and extra whitespace

* Switch test branch to master

---------





* old changes

* removed some superfluous code

* removed some superfluous code

* reverting to old flow if one process

* Enriched aln_stats with aligned read_length

* fixed declaration bug

* Fixing stats tracking for irregular reads

* fixed stat tracking by multiplying variant count

* adding some function explanations, cleaning up code, optimizing lock updating

* Cleaned up code, potentially breaks tests

* Revert "Cleaned up code, potentially breaks tests"

This reverts commit e7cfe3d.

* Revert "adding some function explanations, cleaning up code, optimizing lock updating"

This reverts commit 96deee2.

* replaced test branch, reverted to working version, added .upate() inside lock

* checking old cython branch

* reverting breaking changes

* pointing back at my test branch, unreverting non breaking changes

* timing log change

* pin numpy

* Pin versions of numpy and matplotlib in CI environment (#84)

* pointing to correct test branch

* removed debug statement

* memory tracking

* Replaced seq_cache with variantCache for improved memory

* removing print statements

* commenting out memory tracking

* Removing one extra loop to improve processing time

* optimizing info, removing old functions and print statements

* removing empty string handling

* removed empty string, reworded comments

* removing uneccesary files

* Literally minding my p's and q's

* Reset .c files

* Editing function comments, renaming managerCache, removing debug print statement

* Initial attempt at creating a temp_variant file for memory optimization

* update

* reading and writing to tsv to save on memory

* Merged in memory optimization branch

* Updating pytests for equal boundary changes

* Refactored write out fastq, improved tests

* Replace zcat (#94)

* D3-Enhancements (#78)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Fix typo and move flexiguide to debug (#77)

* Change flexiguide output to debug level

* Fix typo in fastp merged output file name

* Adding id tags for d3 script enhancements

* pointing to test branch

* Add amplicon_name parameter to allele heatmap and line plots

* Add function to extract quantification window regions from include_idxs

* Scale the quantification window according to the coordinates of the sgRNA plot

* added c2pro check, added space in args.json

* Correct the quantification window indexes for multiple guides

* Fix name of nucleotide conversion plot when guides are not the same

* Fix jinja variables that aren't found

* Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

* Remove unneeded variable and extra whitespace

* Switch test branch to master

---------





* Replace zcat with gunzip -c in `get_most_frequent_reads`

---------





* Limiting plotting processes

* Fix CRISPRessoAggregate bug and other improvements (#95)

* D3-Enhancements (#78)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Fix typo and move flexiguide to debug (#77)

* Change flexiguide output to debug level

* Fix typo in fastp merged output file name

* Adding id tags for d3 script enhancements

* pointing to test branch

* Add amplicon_name parameter to allele heatmap and line plots

* Add function to extract quantification window regions from include_idxs

* Scale the quantification window according to the coordinates of the sgRNA plot

* added c2pro check, added space in args.json

* Correct the quantification window indexes for multiple guides

* Fix name of nucleotide conversion plot when guides are not the same

* Fix jinja variables that aren't found

* Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

* Remove unneeded variable and extra whitespace

* Switch test branch to master

---------





* Add amplicon_name to plot functions

* Add sgRNA sequences to nucleotide quilt parameters in Aggregate

* Add custom_colors to Aggregate plot functions

* Update Aggregate and make_aggregate_report to have logger and tool

* Write command_used to Aggregate .json info file

* Point to new test branch and add Aggregate run

* Make the order of Aggregate runs explicit

* Sort all instances of crispresso2_folder_info in Aggregate

* Sort df_summary_quantification df in Aggregate

* Try sorting with a list of single column

* Update to correct test branch

* Move to master test branch

---------





* Add BAM test cases to Github Actions integration tests

* Add "Run" to step names

* Point to trevor/fastq-testing test branch

* Add in Aggregate test case to Github Actions

* parallelizing process_single_fastq_write_bam_out function

* wip parallelization for process_bam function

* Fixing write out bug

* Functional bam input parallelized function

* cleaned up code, removed old versions of functions

* trying again

* Remove duplicate c2_tests

* Remove commented out copy c2_tests

* Updating file access to close on error, removing unnecesary bam appendings

* Add `percent_complete`to info statements (#102)

* Fix CRISPRessoAggregate bug and other improvements (#95) (pinellolab#470)

* D3-Enhancements (#78)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Fix typo and move flexiguide to debug (#77)

* Change flexiguide output to debug level

* Fix typo in fastp merged output file name

* Adding id tags for d3 script enhancements

* pointing to test branch

* Add amplicon_name parameter to allele heatmap and line plots

* Add function to extract quantification window regions from include_idxs

* Scale the quantification window according to the coordinates of the sgRNA plot

* added c2pro check, added space in args.json

* Correct the quantification window indexes for multiple guides

* Fix name of nucleotide conversion plot when guides are not the same

* Fix jinja variables that aren't found

* Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

* Remove unneeded variable and extra whitespace

* Switch test branch to master

---------





* Add amplicon_name to plot functions

* Add sgRNA sequences to nucleotide quilt parameters in Aggregate

* Add custom_colors to Aggregate plot functions

* Update Aggregate and make_aggregate_report to have logger and tool

* Write command_used to Aggregate .json info file

* Point to new test branch and add Aggregate run

* Make the order of Aggregate runs explicit

* Sort all instances of crispresso2_folder_info in Aggregate

* Sort df_summary_quantification df in Aggregate

* Try sorting with a list of single column

* Update to correct test branch

* Move to master test branch

---------





* Display percentages in the CLI output (#88) (pinellolab#473)

* D3-Enhancements (#78)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Fix typo and move flexiguide to debug (#77)

* Change flexiguide output to debug level

* Fix typo in fastp merged output file name

* Adding id tags for d3 script enhancements

* pointing to test branch

* Add amplicon_name parameter to allele heatmap and line plots

* Add function to extract quantification window regions from include_idxs

* Scale the quantification window according to the coordinates of the sgRNA plot

* added c2pro check, added space in args.json

* Correct the quantification window indexes for multiple guides

* Fix name of nucleotide conversion plot when guides are not the same

* Fix jinja variables that aren't found

* Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

* Remove unneeded variable and extra whitespace

* Switch test branch to master

---------





* Display percentages in the CLI output

---------





* No pool (#79) (pinellolab#474)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Passing sgRNA sequences to regular and Batch D3 plots (#73)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Update integration_tests.yml to point back at master

---------





* Push new releases to ECR (#74)

* Create aws_ecr.yml (#1)

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* us-east-1

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Update aws_ecr.yml

* Pass div id for plotly

* Remove debug

* Don't use thread pool with 1 process

* Fix logger issue

* Catchup

* Remove extra print statements

* Restrict generation of multiprocessing pool to when n_processes > 1

* Switch test branch to version bump

* Fix variable name error

* Change test branch back to master

* Fix CRISPRessoAggregate bug and other improvements (#95)

* D3-Enhancements (#78)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Fix typo and move flexiguide to debug (#77)

* Change flexiguide output to debug level

* Fix typo in fastp merged output file name

* Adding id tags for d3 script enhancements

* pointing to test branch

* Add amplicon_name parameter to allele heatmap and line plots

* Add function to extract quantification window regions from include_idxs

* Scale the quantification window according to the coordinates of the sgRNA plot

* added c2pro check, added space in args.json

* Correct the quantification window indexes for multiple guides

* Fix name of nucleotide conversion plot when guides are not the same

* Fix jinja variables that aren't found

* Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

* Remove unneeded variable and extra whitespace

* Switch test branch to master

---------





* Add amplicon_name to plot functions

* Add sgRNA sequences to nucleotide quilt parameters in Aggregate

* Add custom_colors to Aggregate plot functions

* Update Aggregate and make_aggregate_report to have logger and tool

* Write command_used to Aggregate .json info file

* Point to new test branch and add Aggregate run

* Make the order of Aggregate runs explicit

* Sort all instances of crispresso2_folder_info in Aggregate

* Sort df_summary_quantification df in Aggregate

* Try sorting with a list of single column

* Update to correct test branch

* Move to master test branch

---------





---------





* Add percent_complete to subprocess alignment

* Remove extraneous spaces

* Added more percent_complete statements to info blocks

---------





* Updated functions with newly merged progress percent from Cole

* Minor code cleanup, removal of comments and print statements etc

* Pointing test branch back at CRISPResso2_tests master branch

* Added checking for failed processes causing not all reads to be counted

---------

Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Cole Lyman <Cole@colelyman.com>
Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
mbowcut2 added a commit that referenced this issue Nov 8, 2024
* Mckay/c2pro reports test (#99)

* Fix CRISPRessoAggregate bug and other improvements (#95)

* D3-Enhancements (#78)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Fix typo and move flexiguide to debug (#77)

* Change flexiguide output to debug level

* Fix typo in fastp merged output file name

* Adding id tags for d3 script enhancements

* pointing to test branch

* Add amplicon_name parameter to allele heatmap and line plots

* Add function to extract quantification window regions from include_idxs

* Scale the quantification window according to the coordinates of the sgRNA plot

* added c2pro check, added space in args.json

* Correct the quantification window indexes for multiple guides

* Fix name of nucleotide conversion plot when guides are not the same

* Fix jinja variables that aren't found

* Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

* Remove unneeded variable and extra whitespace

* Switch test branch to master

---------





* Add amplicon_name to plot functions

* Add sgRNA sequences to nucleotide quilt parameters in Aggregate

* Add custom_colors to Aggregate plot functions

* Update Aggregate and make_aggregate_report to have logger and tool

* Write command_used to Aggregate .json info file

* Point to new test branch and add Aggregate run

* Make the order of Aggregate runs explicit

* Sort all instances of crispresso2_folder_info in Aggregate

* Sort df_summary_quantification df in Aggregate

* Try sorting with a list of single column

* Update to correct test branch

* Move to master test branch

---------





* Squashed commit of the following:

commit 6ec98a05ee70f85b5aa0ac15ab6094b7f1f20d08
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Tue Aug 13 16:44:39 2024 -0600

    dict key changes

commit 7cfd5acf06da4eb6f49453144ee1fed1e1488a7a
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Thu Aug 8 15:30:31 2024 -0600

    added C2PRO install check back

commit bfb0003329ea61b5c79c7e1df8d9a73ec5a508db
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Fri Aug 2 13:08:12 2024 -0600

    fixed key error conditionals

commit 84444e7480605206cb3efa4a0db675c55e717304
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Fri Aug 2 09:22:44 2024 -0600

    use local jinja_paritals file

commit 71dd12786fec6c4aba0170a3bfd9022b06f5eede
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Wed Jul 31 14:10:29 2024 -0600

    Squashed commit of the following:

    commit 5e3b30515c4bc437127e7fb21f53cb0bd511c4ca
    Author: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
    Date:   Mon Jul 22 09:31:44 2024 -0600

        D3-Enhancements (#78)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        * Provide sgRNA_sequences to plot_nucleotide_quilt plots

        * Passing sgRNA_sequences to plot

        * Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

        * Add max-height to Batch report samples

        * Change testing branch

        * Fix wrong check for large Batch plots

        * Fix typo and move flexiguide to debug (#77)

        * Change flexiguide output to debug level

        * Fix typo in fastp merged output file name

        * Adding id tags for d3 script enhancements

        * pointing to test branch

        * Add amplicon_name parameter to allele heatmap and line plots

        * Add function to extract quantification window regions from include_idxs

        * Scale the quantification window according to the coordinates of the sgRNA plot

        * added c2pro check, added space in args.json

        * Correct the quantification window indexes for multiple guides

        * Fix name of nucleotide conversion plot when guides are not the same

        * Fix jinja variables that aren't found

        * Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

        * Remove unneeded variable and extra whitespace

        * Switch test branch to master

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

    commit 09e5d9720ad21e44fc7916d71bde3fd7a9dfa7ef
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu Jul 18 14:31:54 2024 -0600

        Asymmetrical cut point (#457)

        * add cut_point_ind to plot_alleles_heatmap for asymmetrical plotting

        * Cole asymmetrical cut point (#453)

        * Pin versions of numpy and matplotlib in CI environment (#84) (#452)

        * Reduce duplication and implement cut_point_ind in plot_alleles_heatmap_hist

        ---------

        Co-authored-by: Cole Lyman <Cole@colelyman.com>

    commit 8d92972694ddff629dad844a6ad100459f69751d
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu Jul 18 14:29:40 2024 -0600

        Cole/update args (#85) (#456)

    commit 44f692ecabf5e2eb96ee0cfd7bae62343da7810c
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Mon Jul 15 16:17:29 2024 -0600

        Implement new pooled mixed-mode default behavior (#454)

        * changes for pooled mixed-mode default (#83)

        * changes for pooled mixed-mode default

        * deprecated old arg

        * added integration tests for mixed mode

        * fixed test target

        * updated test name

        * pinned numpy

        * Fix integration tests yml

        * pinning matplotlib

        * added print to CI tests

        * changed mixed mode info string

        * Remove pooled-mixed-mode-align-to-genome step from Github Actions

        * Update demultiplex_genome_wide parameter and help

        * Convert args.json to unix line endings

        * Add Pooled mixed mode demux run

        * Update the name of the argument in Pooled

        * Point integration tests back to master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Revert change to pooled mixed mode info statement (#86)

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit 79b482b55a0e8edbc03ec22bd2714bade1e90323
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Tue Jul 9 12:53:23 2024 -0600

        Pin versions of numpy and matplotlib in CI environment (#84) (#452)

    commit 80dc1bdd72d50f989717bfc5f8156bc3495c45f4
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu May 30 14:07:42 2024 -0600

        Add padding to image

    commit 381755daf0939aaf2745df0a802c809633aff47d
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu May 30 13:59:57 2024 -0600

        White background for schematic for dark mode

    commit d649db71e610bd8840fbb8d46fadb07789b67390
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Fri May 24 12:45:53 2024 -0600

        Fix typo and move flexiguide to debug (#77) (#438)

        * Change flexiguide output to debug level

        * Fix typo in fastp merged output file name

    commit 71181f50ef2b39015523b1a71d9fd1bf0dce14eb
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Mon May 13 13:34:00 2024 -0600

        Prefix the release Docker tag with a `v` (#434)

    commit d2c2be18a6bb64b0e742cc24c4665980a24324bc
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Mon May 13 09:41:32 2024 -0600

        Showing sgRNA sequences on hover in CRISPRessoPro (#432)

        * Passing sgRNA sequences to regular and Batch D3 plots (#73)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        * Provide sgRNA_sequences to plot_nucleotide_quilt plots

        * Passing sgRNA_sequences to plot

        * Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

        * Add max-height to Batch report samples

        * Change testing branch

        * Fix wrong check for large Batch plots

        * Update integration_tests.yml to point back at master

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Push new releases to ECR (#74)

        * Create aws_ecr.yml (#1)

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * us-east-1

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Fix d3 sgRNA sequences (#76)

        * Pass correct sgRNA_sequences to d3 plot

        * Pass correct sgRNA sequence to prime editor plot for d3

        * Resize plotly (#75)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        * Pass div id for plotly

        * Remove debug

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit 1c504274818b6b17fb60620d48fd92cb2e50566d
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu May 9 14:16:25 2024 -0600

        Fix plots and improve plot error handling (#431)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit acb2ea8e26dff4cd11f71301b344f81b1cec9040
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu May 2 13:49:33 2024 -0600

        Use recent docker image for CircleCI testing that includes updated pandas

    commit 38fd76dbd7ce2087468f9f454b548777de959a68
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Wed May 1 16:42:28 2024 -0600

        Cole/fix status file name (#69) (#430)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

    commit 3ec22e5fd09e432c9997d30e5f9ee51a2cc00d7b
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Wed May 1 13:08:11 2024 -0600

        Remove linked space in readme

    commit 340a4e16795a5e500411e11572ec267525985009
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Wed May 1 13:07:14 2024 -0600

        Fix batch mode pandas warning. (#70) (#429)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit 1bc9e906f0ded81f80761d1ec375ee50a4f882a9
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Fri Apr 26 16:26:27 2024 -0600

        Bump version to 2.3.1 and change default CRISPRessoPooled behavior to change in 2.3.2 (#428)

    commit 5638a1f6ffa973231f23422e9c757fa8cd4af7cc
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Wed Apr 24 18:00:43 2024 -0600

        Spelling fixes

    commit d6011f29db16d8fc1c1e7222457b7f9a1f671de6
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Wed Apr 24 09:33:53 2024 -0600

        Extract `jinja_partials` and fix CRISPRessoPooled fastp errors (#425)

        * Updated README (#64)

        * Updating README to fix argument, email, and formatting

        * removing superfluous files

        * Add link to CRISPRessoPro, move CRISPRessoPro section to end, and fix JSON formatting

        * Remove link to CRISPRessoPro

        * Replace Docker badge with link to tags

        * Add bullet points to Guardrails section and improve formatting

        * Fix typo and removed colons from guardrails

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Extract jinja_partials  (#65)

        * Extract jinja_partials code

        * Remove Plotly dependency from setup.py

        * Fix CRISPRessoPooled flash errors (#68)

        * Fix replacing flash intermediate files with fastp intermediate files

        This also moves where the files are added to `files_to_remove` up to
        near where they are created.

        * Update to run test branch with paired end Pooled test

        * Add pooled-paired-sim test to integration tests

        * Replace flash and trimmomatic with fastp and remove plotly from Github Actions environment

        * Change test branch back to master

        ---------

        Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>

    commit f4858a30c43374f54058b3ad9c1e965e1ab7fb46
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Tue Apr 23 17:00:28 2024 -0600

        Updated README (#64) (#424)

        * Updating README to fix argument, email, and formatting

        * removing superfluous files

        * Add link to CRISPRessoPro, move CRISPRessoPro section to end, and fix JSON formatting

        * Remove link to CRISPRessoPro

        * Replace Docker badge with link to tags

        * Add bullet points to Guardrails section and improve formatting

        * Fix typo and removed colons from guardrails

        ---------

        Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>

    commit c3dbff0fccd44b0b1a9c246dd2aa629ddc515787
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Mon Apr 22 11:24:59 2024 -0600

        Update CRISPRessoPooledCORE.py (#423)

        Fix bug in error reporting if duplicate names are present

    commit 20903c14877e5166b1b8a7b50b8fcab450ea3ca6
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu Apr 18 16:55:39 2024 -0600

        Remove extra imports from CRISPRessoCore (#67) (#422)

    commit 4aae57e5be475cd717792265bee36a71a99425de
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu Apr 18 10:00:19 2024 -0600

        Cole/refactor jinja undefined (#66) (#421)

        * Replace Jinja2 PackageLoader with FileSystemLoader

        The PackageLoader doesn't work with a fairly recent version of Jinja2 (3.0.1)
        and Python 3.9. Replacing with FileSystemLoader work with the older version and
        the latest version.

        * Fix undefined variable `amplicon_name` in report template

        * Refactor logging Jinja2 undefined variable warnings

        * Revert plot_11a update

        * Update intedration test branch

        * Update jinja to warn on undefined but not fail. Fix all undefined warnings

        * Fix github integration tests ref

        * One more undefined variable

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>

    commit 768c3c05bf1786a2a32e135b6e145cd6503c3db1
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Tue Apr 9 17:30:10 2024 -0600

        Fix Jinja2 undefined variables (#63) (#417)

        * Replace Jinja2 PackageLoader with FileSystemLoader

        The PackageLoader doesn't work with a fairly recent version of Jinja2 (3.0.1)
        and Python 3.9. Replacing with FileSystemLoader work with the older version and
        the latest version.

        * Fix undefined variable `amplicon_name` in report template

        * Revert plot_11a update

        * Update intedration test branch

        * Update branch for integration tests

    commit 7e18f08cc1ac5f247a0fd1bbb394ccd9b0a07c2e
    Author: Han Dai <github@daihan.me>
    Date:   Fri Apr 5 18:36:41 2024 -0400

        fix: change all U+00A0 to U+0020 (#400)

    commit 235dc29c0cd0fcca2e999148d4660acf00b07221
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Fri Apr 5 16:36:16 2024 -0600

        Fastp, args as data, guardrails, and PE fix (#415)

        * Change CRISPResso_status.txt format to JSON (#46)

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * add json read for status file

        * changed Formatter to json format

        * fixed json access variable name: message

        * changed  perentage_complete to numeric

        * changed status file to .json

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * New makefile commands

        * changed file to .json

        * changed status to json file

        * Make JSON human readable by adding new lines

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * Introduce pandas sorting in CRISPRessoCompare (#47)

        * New makefile commands

        * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

        * Extract out split_interleaved_fastq function to CRISPRessoShared

        * Implement splitting interleaved fastq files in CRISPRessoPooled

        * Suppress split_interleaved_input from CRISPRessoWGS parameters

        * Suppress other parameters in CRISPRessoWGS

        * Move where interleaved fastq files are split to be trimmed properly

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * On push no branches

        * On push no branches

        * All in one file

        * Fix yml errors

        * Rename jobs

        * Remove old workflow files

        * Remove paths

        * Run jobs in parallel

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Move read filtering to after merging in CRISPResso (#39)

        * Move read filtering to after merging

        This is in an effort to be consistent with the behavior and results of
        CRISPRessoPooled.

        * Properly assign the correct file names for read filtering

        * Add space around operators

        * GitHub actions on pr (#51)

        * Run integration tests on pull_request

        * Run pytest on pull_request

        * Run pylint on pull_request

        * Run tests on PR only when opening PR (#53)

        * Update reports (#52)

        * Update report changes

        * Switch branch of integration test repo

        * Remove extraneous `crispresso_data_path`

        * Point integration tests back to master

        * point to test branch

        * pointed CI config to testing branch

        * Update integration_tests.yml

        point to master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>
        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>

        * Trevor/fastp integration (#50)

        * Update check_program to check versions and create check_fastq function

        * Update fastq arg, implement fastp in get_most_frequent_reads

        * Bump version to 2.3.0

        * Deprecate Flash and Trimmomatic parameters, and update fastp params

        * Update guess_amplicons and guess_guides to remove max_paired_end_reads_overlap

        * Implement trimming of single end reads

        * Merge (and trim) reads in CRISPRessoCORE with fastp

        * Modify error handling to account for fastp errors

        * Replace flash and trimmomatic with fastp in Docker dependencies

        * Update LICENSE.txt with fastp info

        * Remove min and max amplicon length (no longer needed)

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Implement trimming with fastp in CRISPRessoPooled

        * Implemend merging (and trimming) with fastp in CRISPRessoPooled

        * Fixed minor fastp errors

        * Move read filtering to after merging in CRISPResso (#39)

        * Move read filtering to after merging

        This is in an effort to be consistent with the behavior and results of
        CRISPRessoPooled.

        * Properly assign the correct file names for read filtering

        * Add space around operators

        * GitHub actions on pr (#51)

        * Run integration tests on pull_request

        * Run pytest on pull_request

        * Run pylint on pull_request

        * Run tests on PR only when opening PR (#53)

        * Update reports (#52)

        * Update report changes

        * Switch branch of integration test repo

        * Remove extraneous `crispresso_data_path`

        * Point integration tests back to master

        * Update where the test point to

        * Fix 'Prime-edited' key not found (#32)

        * Move 'Prime-edited' amplicon name check

        By moving this, it will check if there is an amplicon named
        'Prime-edited' (which is a reserved name) even if the
        `prime_editing_pegRNA_extension_seq` parameter is empty.

        * Only search for scaffold integration when pegRNA extension seq is provided

        * Remove spaces at the end of lines

        * Docker size (#49)

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * Introduce pandas sorting in CRISPRessoCompare (#47)

        * New makefile commands

        * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

        * Extract out split_interleaved_fastq function to CRISPRessoShared

        * Implement splitting interleaved fastq files in CRISPRessoPooled

        * Suppress split_interleaved_input from CRISPRessoWGS parameters

        * Suppress other parameters in CRISPRessoWGS

        * Move where interleaved fastq files are split to be trimmed properly

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * On push no branches

        * On push no branches

        * All in one file

        * Fix yml errors

        * Rename jobs

        * Remove old workflow files

        * Remove paths

        * Run jobs in parallel

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * 3.4->2.08

        * Put ttf-mscorefonts-installer back above apt-get clean

        * restore slash, replace fastp with trimmomatic and flash, add autoremove step

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * initial readme modifications

        * Updated readme to remove deprecated commands, updated help text to reflect new version and fastp

        * Pointing test branch back at master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>

        * Guardrails clean history (#34)

        * Include guardrail functions

        * Add CRISPRessoReports subtree

        * Refactor to use CRISPRessoReports module

        * Include guardrail functions

        * Functional guardrails, needs reports update

        * Add guardrail partial

        * fix guardrials partial

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * Introduce pandas sorting in CRISPRessoCompare (#47)

        * New makefile commands

        * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

        * Extract out split_interleaved_fastq function to CRISPRessoShared

        * Implement splitting interleaved fastq files in CRISPRessoPooled

        * Suppress split_interleaved_input from CRISPRessoWGS parameters

        * Suppress other parameters in CRISPRessoWGS

        * Move where interleaved fastq files are split to be trimmed properly

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * On push no branches

        * On push no branches

        * All in one file

        * Fix yml errors

        * Rename jobs

        * Remove old workflow files

        * Remove paths

        * Run jobs in parallel

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Update C cythonized files

        * Add exact numbers to guardrails printouts

        * Remove extraneous whitespace from CRISPRessoCOREResources.pyx

        * Fix calculation of `total_mods` from being negative

        The issue was that `all_deletion_coordinates` just tells you how many deletions
        were present, but not how long the deletion is.

        * Changes to message

        * Remove old tag

        * Point tests at guardrails

        * Restore C2 pro check

        * Save message with guardrail name

        * Point tests repo at master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

        * Fix case sensitivity in Prime Editing mode (#54)

        * Move read filtering to after merging in CRISPResso (#39)

        * Move read filtering to after merging

        This is in an effort to be consistent with the behavior and results of
        CRISPRessoPooled.

        * Properly assign the correct file names for read filtering

        * Add space around operators

        * GitHub actions on pr (#51)

        * Run integration tests on pull_request

        * Run pytest on pull_request

        * Run pylint on pull_request

        * Run tests on PR only when opening PR (#53)

        * Update reports (#52)

        * Update report changes

        * Switch branch of integration test repo

        * Remove extraneous `crispresso_data_path`

        * Point integration tests back to master

        * Make all amplicons in amplicon_seq_arr uppercase

        This fixes https://github.com/pinellolab/CRISPResso2/issues/396

        * Allow RNA values to be provided for prime_editing_pegRNA_scaffold_seq

        * Fix 'Prime-edited' key not found (#32)

        * Move 'Prime-edited' amplicon name check

        By moving this, it will check if there is an amplicon named
        'Prime-edited' (which is a reserved name) even if the
        `prime_editing_pegRNA_extension_seq` parameter is empty.

        * Only search for scaffold integration when pegRNA extension seq is provided

        * Remove spaces at the end of lines

        * Docker size (#49)

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_nu…

Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
Colelyman added a commit that referenced this issue Dec 13, 2024
* Mckay/halt on plot fail (#103)

* Mckay/c2pro reports test (#99)

* Fix CRISPRessoAggregate bug and other improvements (#95)

* D3-Enhancements (#78)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------

Co-authored-by: Cole Lyman <cole@colelyman.com>

* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------

Co-authored-by: Sam <snic9004@gmail.com>

* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------

Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Cole Lyman <cole@colelyman.com>

* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------

Co-authored-by: Cole Lyman <cole@colelyman.com>

* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------

Co-authored-by: Sam <snic9004@gmail.com>

* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------

Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Cole Lyman <cole@colelyman.com>

* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Fix typo and move flexiguide to debug (#77)

* Change flexiguide output to debug level

* Fix typo in fastp merged output file name

* Adding id tags for d3 script enhancements

* pointing to test branch

* Add amplicon_name parameter to allele heatmap and line plots

* Add function to extract quantification window regions from include_idxs

* Scale the quantification window according to the coordinates of the sgRNA plot

* added c2pro check, added space in args.json

* Correct the quantification window indexes for multiple guides

* Fix name of nucleotide conversion plot when guides are not the same

* Fix jinja variables that aren't found

* Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

* Remove unneeded variable and extra whitespace

* Switch test branch to master

---------

Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Cole Lyman <cole@colelyman.com>

* Add amplicon_name to plot functions

* Add sgRNA sequences to nucleotide quilt parameters in Aggregate

* Add custom_colors to Aggregate plot functions

* Update Aggregate and make_aggregate_report to have logger and tool

* Write command_used to Aggregate .json info file

* Point to new test branch and add Aggregate run

* Make the order of Aggregate runs explicit

* Sort all instances of crispresso2_folder_info in Aggregate

* Sort df_summary_quantification df in Aggregate

* Try sorting with a list of single column

* Update to correct test branch

* Move to master test branch

---------

Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

* Squashed commit of the following:

commit 6ec98a05ee70f85b5aa0ac15ab6094b7f1f20d08
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Tue Aug 13 16:44:39 2024 -0600

    dict key changes

commit 7cfd5acf06da4eb6f49453144ee1fed1e1488a7a
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Thu Aug 8 15:30:31 2024 -0600

    added C2PRO install check back

commit bfb0003329ea61b5c79c7e1df8d9a73ec5a508db
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Fri Aug 2 13:08:12 2024 -0600

    fixed key error conditionals

commit 84444e7480605206cb3efa4a0db675c55e717304
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Fri Aug 2 09:22:44 2024 -0600

    use local jinja_paritals file

commit 71dd12786fec6c4aba0170a3bfd9022b06f5eede
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Wed Jul 31 14:10:29 2024 -0600

    Squashed commit of the following:

    commit 5e3b30515c4bc437127e7fb21f53cb0bd511c4ca
    Author: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
    Date:   Mon Jul 22 09:31:44 2024 -0600

        D3-Enhancements (#78)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        * Provide sgRNA_sequences to plot_nucleotide_quilt plots

        * Passing sgRNA_sequences to plot

        * Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

        * Add max-height to Batch report samples

        * Change testing branch

        * Fix wrong check for large Batch plots

        * Fix typo and move flexiguide to debug (#77)

        * Change flexiguide output to debug level

        * Fix typo in fastp merged output file name

        * Adding id tags for d3 script enhancements

        * pointing to test branch

        * Add amplicon_name parameter to allele heatmap and line plots

        * Add function to extract quantification window regions from include_idxs

        * Scale the quantification window according to the coordinates of the sgRNA plot

        * added c2pro check, added space in args.json

        * Correct the quantification window indexes for multiple guides

        * Fix name of nucleotide conversion plot when guides are not the same

        * Fix jinja variables that aren't found

        * Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

        * Remove unneeded variable and extra whitespace

        * Switch test branch to master

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

    commit 09e5d9720ad21e44fc7916d71bde3fd7a9dfa7ef
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu Jul 18 14:31:54 2024 -0600

        Asymmetrical cut point (#457)

        * add cut_point_ind to plot_alleles_heatmap for asymmetrical plotting

        * Cole asymmetrical cut point (#453)

        * Pin versions of numpy and matplotlib in CI environment (#84) (#452)

        * Reduce duplication and implement cut_point_ind in plot_alleles_heatmap_hist

        ---------

        Co-authored-by: Cole Lyman <Cole@colelyman.com>

    commit 8d92972694ddff629dad844a6ad100459f69751d
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu Jul 18 14:29:40 2024 -0600

        Cole/update args (#85) (#456)

    commit 44f692ecabf5e2eb96ee0cfd7bae62343da7810c
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Mon Jul 15 16:17:29 2024 -0600

        Implement new pooled mixed-mode default behavior (#454)

        * changes for pooled mixed-mode default (#83)

        * changes for pooled mixed-mode default

        * deprecated old arg

        * added integration tests for mixed mode

        * fixed test target

        * updated test name

        * pinned numpy

        * Fix integration tests yml

        * pinning matplotlib

        * added print to CI tests

        * changed mixed mode info string

        * Remove pooled-mixed-mode-align-to-genome step from Github Actions

        * Update demultiplex_genome_wide parameter and help

        * Convert args.json to unix line endings

        * Add Pooled mixed mode demux run

        * Update the name of the argument in Pooled

        * Point integration tests back to master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Revert change to pooled mixed mode info statement (#86)

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit 79b482b55a0e8edbc03ec22bd2714bade1e90323
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Tue Jul 9 12:53:23 2024 -0600

        Pin versions of numpy and matplotlib in CI environment (#84) (#452)

    commit 80dc1bdd72d50f989717bfc5f8156bc3495c45f4
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu May 30 14:07:42 2024 -0600

        Add padding to image

    commit 381755daf0939aaf2745df0a802c809633aff47d
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu May 30 13:59:57 2024 -0600

        White background for schematic for dark mode

    commit d649db71e610bd8840fbb8d46fadb07789b67390
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Fri May 24 12:45:53 2024 -0600

        Fix typo and move flexiguide to debug (#77) (#438)

        * Change flexiguide output to debug level

        * Fix typo in fastp merged output file name

    commit 71181f50ef2b39015523b1a71d9fd1bf0dce14eb
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Mon May 13 13:34:00 2024 -0600

        Prefix the release Docker tag with a `v` (#434)

    commit d2c2be18a6bb64b0e742cc24c4665980a24324bc
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Mon May 13 09:41:32 2024 -0600

        Showing sgRNA sequences on hover in CRISPRessoPro (#432)

        * Passing sgRNA sequences to regular and Batch D3 plots (#73)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        * Provide sgRNA_sequences to plot_nucleotide_quilt plots

        * Passing sgRNA_sequences to plot

        * Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

        * Add max-height to Batch report samples

        * Change testing branch

        * Fix wrong check for large Batch plots

        * Update integration_tests.yml to point back at master

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Push new releases to ECR (#74)

        * Create aws_ecr.yml (#1)

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * us-east-1

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Fix d3 sgRNA sequences (#76)

        * Pass correct sgRNA_sequences to d3 plot

        * Pass correct sgRNA sequence to prime editor plot for d3

        * Resize plotly (#75)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        * Pass div id for plotly

        * Remove debug

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit 1c504274818b6b17fb60620d48fd92cb2e50566d
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu May 9 14:16:25 2024 -0600

        Fix plots and improve plot error handling (#431)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit acb2ea8e26dff4cd11f71301b344f81b1cec9040
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu May 2 13:49:33 2024 -0600

        Use recent docker image for CircleCI testing that includes updated pandas

    commit 38fd76dbd7ce2087468f9f454b548777de959a68
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Wed May 1 16:42:28 2024 -0600

        Cole/fix status file name (#69) (#430)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

    commit 3ec22e5fd09e432c9997d30e5f9ee51a2cc00d7b
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Wed May 1 13:08:11 2024 -0600

        Remove linked space in readme

    commit 340a4e16795a5e500411e11572ec267525985009
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Wed May 1 13:07:14 2024 -0600

        Fix batch mode pandas warning. (#70) (#429)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit 1bc9e906f0ded81f80761d1ec375ee50a4f882a9
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Fri Apr 26 16:26:27 2024 -0600

        Bump version to 2.3.1 and change default CRISPRessoPooled behavior to change in 2.3.2 (#428)

    commit 5638a1f6ffa973231f23422e9c757fa8cd4af7cc
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Wed Apr 24 18:00:43 2024 -0600

        Spelling fixes

    commit d6011f29db16d8fc1c1e7222457b7f9a1f671de6
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Wed Apr 24 09:33:53 2024 -0600

        Extract `jinja_partials` and fix CRISPRessoPooled fastp errors (#425)

        * Updated README (#64)

        * Updating README to fix argument, email, and formatting

        * removing superfluous files

        * Add link to CRISPRessoPro, move CRISPRessoPro section to end, and fix JSON formatting

        * Remove link to CRISPRessoPro

        * Replace Docker badge with link to tags

        * Add bullet points to Guardrails section and improve formatting

        * Fix typo and removed colons from guardrails

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Extract jinja_partials  (#65)

        * Extract jinja_partials code

        * Remove Plotly dependency from setup.py

        * Fix CRISPRessoPooled flash errors (#68)

        * Fix replacing flash intermediate files with fastp intermediate files

        This also moves where the files are added to `files_to_remove` up to
        near where they are created.

        * Update to run test branch with paired end Pooled test

        * Add pooled-paired-sim test to integration tests

        * Replace flash and trimmomatic with fastp and remove plotly from Github Actions environment

        * Change test branch back to master

        ---------

        Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>

    commit f4858a30c43374f54058b3ad9c1e965e1ab7fb46
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Tue Apr 23 17:00:28 2024 -0600

        Updated README (#64) (#424)

        * Updating README to fix argument, email, and formatting

        * removing superfluous files

        * Add link to CRISPRessoPro, move CRISPRessoPro section to end, and fix JSON formatting

        * Remove link to CRISPRessoPro

        * Replace Docker badge with link to tags

        * Add bullet points to Guardrails section and improve formatting

        * Fix typo and removed colons from guardrails

        ---------

        Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>

    commit c3dbff0fccd44b0b1a9c246dd2aa629ddc515787
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Mon Apr 22 11:24:59 2024 -0600

        Update CRISPRessoPooledCORE.py (#423)

        Fix bug in error reporting if duplicate names are present

    commit 20903c14877e5166b1b8a7b50b8fcab450ea3ca6
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu Apr 18 16:55:39 2024 -0600

        Remove extra imports from CRISPRessoCore (#67) (#422)

    commit 4aae57e5be475cd717792265bee36a71a99425de
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu Apr 18 10:00:19 2024 -0600

        Cole/refactor jinja undefined (#66) (#421)

        * Replace Jinja2 PackageLoader with FileSystemLoader

        The PackageLoader doesn't work with a fairly recent version of Jinja2 (3.0.1)
        and Python 3.9. Replacing with FileSystemLoader work with the older version and
        the latest version.

        * Fix undefined variable `amplicon_name` in report template

        * Refactor logging Jinja2 undefined variable warnings

        * Revert plot_11a update

        * Update intedration test branch

        * Update jinja to warn on undefined but not fail. Fix all undefined warnings

        * Fix github integration tests ref

        * One more undefined variable

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>

    commit 768c3c05bf1786a2a32e135b6e145cd6503c3db1
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Tue Apr 9 17:30:10 2024 -0600

        Fix Jinja2 undefined variables (#63) (#417)

        * Replace Jinja2 PackageLoader with FileSystemLoader

        The PackageLoader doesn't work with a fairly recent version of Jinja2 (3.0.1)
        and Python 3.9. Replacing with FileSystemLoader work with the older version and
        the latest version.

        * Fix undefined variable `amplicon_name` in report template

        * Revert plot_11a update

        * Update intedration test branch

        * Update branch for integration tests

    commit 7e18f08cc1ac5f247a0fd1bbb394ccd9b0a07c2e
    Author: Han Dai <github@daihan.me>
    Date:   Fri Apr 5 18:36:41 2024 -0400

        fix: change all U+00A0 to U+0020 (#400)

    commit 235dc29c0cd0fcca2e999148d4660acf00b07221
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Fri Apr 5 16:36:16 2024 -0600

        Fastp, args as data, guardrails, and PE fix (#415)

        * Change CRISPResso_status.txt format to JSON (#46)

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * add json read for status file

        * changed Formatter to json format

        * fixed json access variable name: message

        * changed  perentage_complete to numeric

        * changed status file to .json

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * New makefile commands

        * changed file to .json

        * changed status to json file

        * Make JSON human readable by adding new lines

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * Introduce pandas sorting in CRISPRessoCompare (#47)

        * New makefile commands

        * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

        * Extract out split_interleaved_fastq function to CRISPRessoShared

        * Implement splitting interleaved fastq files in CRISPRessoPooled

        * Suppress split_interleaved_input from CRISPRessoWGS parameters

        * Suppress other parameters in CRISPRessoWGS

        * Move where interleaved fastq files are split to be trimmed properly

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * On push no branches

        * On push no branches

        * All in one file

        * Fix yml errors

        * Rename jobs

        * Remove old workflow files

        * Remove paths

        * Run jobs in parallel

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Move read filtering to after merging in CRISPResso (#39)

        * Move read filtering to after merging

        This is in an effort to be consistent with the behavior and results of
        CRISPRessoPooled.

        * Properly assign the correct file names for read filtering

        * Add space around operators

        * GitHub actions on pr (#51)

        * Run integration tests on pull_request

        * Run pytest on pull_request

        * Run pylint on pull_request

        * Run tests on PR only when opening PR (#53)

        * Update reports (#52)

        * Update report changes

        * Switch branch of integration test repo

        * Remove extraneous `crispresso_data_path`

        * Point integration tests back to master

        * point to test branch

        * pointed CI config to testing branch

        * Update integration_tests.yml

        point to master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>
        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>

        * Trevor/fastp integration (#50)

        * Update check_program to check versions and create check_fastq function

        * Update fastq arg, implement fastp in get_most_frequent_reads

        * Bump version to 2.3.0

        * Deprecate Flash and Trimmomatic parameters, and update fastp params

        * Update guess_amplicons and guess_guides to remove max_paired_end_reads_overlap

        * Implement trimming of single end reads

        * Merge (and trim) reads in CRISPRessoCORE with fastp

        * Modify error handling to account for fastp errors

        * Replace flash and trimmomatic with fastp in Docker dependencies

        * Update LICENSE.txt with fastp info

        * Remove min and max amplicon length (no longer needed)

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Implement trimming with fastp in CRISPRessoPooled

        * Implemend merging (and trimming) with fastp in CRISPRessoPooled

        * Fixed minor fastp errors

        * Move read filtering to after merging in CRISPResso (#39)

        * Move read filtering to after merging

        This is in an effort to be consistent with the behavior and results of
        CRISPRessoPooled.

        * Properly assign the correct file names for read filtering

        * Add space around operators

        * GitHub actions on pr (#51)

        * Run integration tests on pull_request

        * Run pytest on pull_request

        * Run pylint on pull_request

        * Run tests on PR only when opening PR (#53)

        * Update reports (#52)

        * Update report changes

        * Switch branch of integration test repo

        * Remove extraneous `crispresso_data_path`

        * Point integration tests back to master

        * Update where the test point to

        * Fix 'Prime-edited' key not found (#32)

        * Move 'Prime-edited' amplicon name check

        By moving this, it will check if there is an amplicon named
        'Prime-edited' (which is a reserved name) even if the
        `prime_editing_pegRNA_extension_seq` parameter is empty.

        * Only search for scaffold integration when pegRNA extension seq is provided

        * Remove spaces at the end of lines

        * Docker size (#49)

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * Introduce pandas sorting in CRISPRessoCompare (#47)

        * New makefile commands

        * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

        * Extract out split_interleaved_fastq function to CRISPRessoShared

        * Implement splitting interleaved fastq files in CRISPRessoPooled

        * Suppress split_interleaved_input from CRISPRessoWGS parameters

        * Suppress other parameters in CRISPRessoWGS

        * Move where interleaved fastq files are split to be trimmed properly

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * On push no branches

        * On push no branches

        * All in one file

        * Fix yml errors

        * Rename jobs

        * Remove old workflow files

        * Remove paths

        * Run jobs in parallel

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * 3.4->2.08

        * Put ttf-mscorefonts-installer back above apt-get clean

        * restore slash, replace fastp with trimmomatic and flash, add autoremove step

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * initial readme modifications

        * Updated readme to remove deprecated commands, updated help text to reflect new version and fastp

        * Pointing test branch back at master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>

        * Guardrails clean history (#34)

        * Include guardrail functions

        * Add CRISPRessoReports subtree

        * Refactor to use CRISPRessoReports module

        * Include guardrail functions

        * Functional guardrails, needs reports update

        * Add guardrail partial

        * fix guardrials partial

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * Introduce pandas sorting in CRISPRessoCompare (#47)

        * New makefile commands

        * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

        * Extract out split_interleaved_fastq function to CRISPRessoShared

        * Implement splitting interleaved fastq files in CRISPRessoPooled

        * Suppress split_interleaved_input from CRISPRessoWGS parameters

        * Suppress other parameters in CRISPRessoWGS

        * Move where interleaved fastq files are split to be trimmed properly

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * On push no branches

        * On push no branches

        * All in one file

        * Fix yml errors

        * Rename jobs

        * Remove old workflow files

        * Remove paths

        * Run jobs in parallel

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Update C cythonized files

        * Add exact numbers to guardrails printouts

        * Remove extraneous whitespace from CRISPRessoCOREResources.pyx

        * Fix calculation of `total_mods` from being negative

        The issue was that `all_deletion_coordinates` just tells you how many deletions
        were present, but not how long the deletion is.

        * Changes to message

        * Remove old tag

        * Point tests at guardrails

        * Restore C2 pro check

        * Save message with guardrail name

        * Point tests repo at master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

        * Fix case sensitivity in Prime Editing mode (#54)

        * Move read filtering to after merging in CRISPResso (#39)

        * Move read filtering to after merging

        This is in an effort to be consistent with the behavior and results of
        CRISPRessoPooled.

        * Properly assign the correct file names for read filtering

        * Add space around operators

        * GitHub actions on pr (#51)

        * Run integration tests on pull_request

        * Run pytest on pull_request

        * Run pylint on pull_request

        * Run tests on PR only when opening PR (#53)

        * Update reports (#52)

        * Update report changes

        * Switch branch of integration test repo

        * Remove extraneous `crispresso_data_path`

        * Point integration tests back to master

        * Make all amplicons in amplicon_seq_arr uppercase

        This fixes https://github.com/pinellolab/CRISPResso2/issues/396

        * Allow RNA values to be provided for prime_editing_pegRNA_scaffold_seq

        * Fix 'Prime-edited' key not found (#32)

        * Move 'Prime-edited' amplicon name check

        By moving this, it will check if there is an amplicon named
        'Prime-edited' (which is a reserved name) even if the
        `prime_editing_pegRNA_extension_seq` parameter is empty.

        * Only search for scaffold integration when pegRNA extension seq is provided

        * Remove spaces at the end of lines

        * Docker size (#49)

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

…
Colelyman added a commit that referenced this issue Dec 13, 2024
* Mckay/halt on plot fail (#103)

* Mckay/c2pro reports test (#99)

* Fix CRISPRessoAggregate bug and other improvements (#95)

* D3-Enhancements (#78)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Fix typo and move flexiguide to debug (#77)

* Change flexiguide output to debug level

* Fix typo in fastp merged output file name

* Adding id tags for d3 script enhancements

* pointing to test branch

* Add amplicon_name parameter to allele heatmap and line plots

* Add function to extract quantification window regions from include_idxs

* Scale the quantification window according to the coordinates of the sgRNA plot

* added c2pro check, added space in args.json

* Correct the quantification window indexes for multiple guides

* Fix name of nucleotide conversion plot when guides are not the same

* Fix jinja variables that aren't found

* Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

* Remove unneeded variable and extra whitespace

* Switch test branch to master

---------





* Add amplicon_name to plot functions

* Add sgRNA sequences to nucleotide quilt parameters in Aggregate

* Add custom_colors to Aggregate plot functions

* Update Aggregate and make_aggregate_report to have logger and tool

* Write command_used to Aggregate .json info file

* Point to new test branch and add Aggregate run

* Make the order of Aggregate runs explicit

* Sort all instances of crispresso2_folder_info in Aggregate

* Sort df_summary_quantification df in Aggregate

* Try sorting with a list of single column

* Update to correct test branch

* Move to master test branch

---------





* Squashed commit of the following:

commit 6ec98a05ee70f85b5aa0ac15ab6094b7f1f20d08
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Tue Aug 13 16:44:39 2024 -0600

    dict key changes

commit 7cfd5acf06da4eb6f49453144ee1fed1e1488a7a
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Thu Aug 8 15:30:31 2024 -0600

    added C2PRO install check back

commit bfb0003329ea61b5c79c7e1df8d9a73ec5a508db
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Fri Aug 2 13:08:12 2024 -0600

    fixed key error conditionals

commit 84444e7480605206cb3efa4a0db675c55e717304
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Fri Aug 2 09:22:44 2024 -0600

    use local jinja_paritals file

commit 71dd12786fec6c4aba0170a3bfd9022b06f5eede
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Wed Jul 31 14:10:29 2024 -0600

    Squashed commit of the following:

    commit 5e3b30515c4bc437127e7fb21f53cb0bd511c4ca
    Author: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
    Date:   Mon Jul 22 09:31:44 2024 -0600

        D3-Enhancements (#78)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        * Provide sgRNA_sequences to plot_nucleotide_quilt plots

        * Passing sgRNA_sequences to plot

        * Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

        * Add max-height to Batch report samples

        * Change testing branch

        * Fix wrong check for large Batch plots

        * Fix typo and move flexiguide to debug (#77)

        * Change flexiguide output to debug level

        * Fix typo in fastp merged output file name

        * Adding id tags for d3 script enhancements

        * pointing to test branch

        * Add amplicon_name parameter to allele heatmap and line plots

        * Add function to extract quantification window regions from include_idxs

        * Scale the quantification window according to the coordinates of the sgRNA plot

        * added c2pro check, added space in args.json

        * Correct the quantification window indexes for multiple guides

        * Fix name of nucleotide conversion plot when guides are not the same

        * Fix jinja variables that aren't found

        * Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

        * Remove unneeded variable and extra whitespace

        * Switch test branch to master

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

    commit 09e5d9720ad21e44fc7916d71bde3fd7a9dfa7ef
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu Jul 18 14:31:54 2024 -0600

        Asymmetrical cut point (#457)

        * add cut_point_ind to plot_alleles_heatmap for asymmetrical plotting

        * Cole asymmetrical cut point (#453)

        * Pin versions of numpy and matplotlib in CI environment (#84) (#452)

        * Reduce duplication and implement cut_point_ind in plot_alleles_heatmap_hist

        ---------

        Co-authored-by: Cole Lyman <Cole@colelyman.com>

    commit 8d92972694ddff629dad844a6ad100459f69751d
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu Jul 18 14:29:40 2024 -0600

        Cole/update args (#85) (#456)

    commit 44f692ecabf5e2eb96ee0cfd7bae62343da7810c
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Mon Jul 15 16:17:29 2024 -0600

        Implement new pooled mixed-mode default behavior (#454)

        * changes for pooled mixed-mode default (#83)

        * changes for pooled mixed-mode default

        * deprecated old arg

        * added integration tests for mixed mode

        * fixed test target

        * updated test name

        * pinned numpy

        * Fix integration tests yml

        * pinning matplotlib

        * added print to CI tests

        * changed mixed mode info string

        * Remove pooled-mixed-mode-align-to-genome step from Github Actions

        * Update demultiplex_genome_wide parameter and help

        * Convert args.json to unix line endings

        * Add Pooled mixed mode demux run

        * Update the name of the argument in Pooled

        * Point integration tests back to master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Revert change to pooled mixed mode info statement (#86)

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit 79b482b55a0e8edbc03ec22bd2714bade1e90323
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Tue Jul 9 12:53:23 2024 -0600

        Pin versions of numpy and matplotlib in CI environment (#84) (#452)

    commit 80dc1bdd72d50f989717bfc5f8156bc3495c45f4
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu May 30 14:07:42 2024 -0600

        Add padding to image

    commit 381755daf0939aaf2745df0a802c809633aff47d
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu May 30 13:59:57 2024 -0600

        White background for schematic for dark mode

    commit d649db71e610bd8840fbb8d46fadb07789b67390
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Fri May 24 12:45:53 2024 -0600

        Fix typo and move flexiguide to debug (#77) (#438)

        * Change flexiguide output to debug level

        * Fix typo in fastp merged output file name

    commit 71181f50ef2b39015523b1a71d9fd1bf0dce14eb
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Mon May 13 13:34:00 2024 -0600

        Prefix the release Docker tag with a `v` (#434)

    commit d2c2be18a6bb64b0e742cc24c4665980a24324bc
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Mon May 13 09:41:32 2024 -0600

        Showing sgRNA sequences on hover in CRISPRessoPro (#432)

        * Passing sgRNA sequences to regular and Batch D3 plots (#73)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        * Provide sgRNA_sequences to plot_nucleotide_quilt plots

        * Passing sgRNA_sequences to plot

        * Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

        * Add max-height to Batch report samples

        * Change testing branch

        * Fix wrong check for large Batch plots

        * Update integration_tests.yml to point back at master

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Push new releases to ECR (#74)

        * Create aws_ecr.yml (#1)

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * us-east-1

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Fix d3 sgRNA sequences (#76)

        * Pass correct sgRNA_sequences to d3 plot

        * Pass correct sgRNA sequence to prime editor plot for d3

        * Resize plotly (#75)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        * Pass div id for plotly

        * Remove debug

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit 1c504274818b6b17fb60620d48fd92cb2e50566d
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu May 9 14:16:25 2024 -0600

        Fix plots and improve plot error handling (#431)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit acb2ea8e26dff4cd11f71301b344f81b1cec9040
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu May 2 13:49:33 2024 -0600

        Use recent docker image for CircleCI testing that includes updated pandas

    commit 38fd76dbd7ce2087468f9f454b548777de959a68
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Wed May 1 16:42:28 2024 -0600

        Cole/fix status file name (#69) (#430)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

    commit 3ec22e5fd09e432c9997d30e5f9ee51a2cc00d7b
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Wed May 1 13:08:11 2024 -0600

        Remove linked space in readme

    commit 340a4e16795a5e500411e11572ec267525985009
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Wed May 1 13:07:14 2024 -0600

        Fix batch mode pandas warning. (#70) (#429)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit 1bc9e906f0ded81f80761d1ec375ee50a4f882a9
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Fri Apr 26 16:26:27 2024 -0600

        Bump version to 2.3.1 and change default CRISPRessoPooled behavior to change in 2.3.2 (#428)

    commit 5638a1f6ffa973231f23422e9c757fa8cd4af7cc
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Wed Apr 24 18:00:43 2024 -0600

        Spelling fixes

    commit d6011f29db16d8fc1c1e7222457b7f9a1f671de6
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Wed Apr 24 09:33:53 2024 -0600

        Extract `jinja_partials` and fix CRISPRessoPooled fastp errors (#425)

        * Updated README (#64)

        * Updating README to fix argument, email, and formatting

        * removing superfluous files

        * Add link to CRISPRessoPro, move CRISPRessoPro section to end, and fix JSON formatting

        * Remove link to CRISPRessoPro

        * Replace Docker badge with link to tags

        * Add bullet points to Guardrails section and improve formatting

        * Fix typo and removed colons from guardrails

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Extract jinja_partials  (#65)

        * Extract jinja_partials code

        * Remove Plotly dependency from setup.py

        * Fix CRISPRessoPooled flash errors (#68)

        * Fix replacing flash intermediate files with fastp intermediate files

        This also moves where the files are added to `files_to_remove` up to
        near where they are created.

        * Update to run test branch with paired end Pooled test

        * Add pooled-paired-sim test to integration tests

        * Replace flash and trimmomatic with fastp and remove plotly from Github Actions environment

        * Change test branch back to master

        ---------

        Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>

    commit f4858a30c43374f54058b3ad9c1e965e1ab7fb46
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Tue Apr 23 17:00:28 2024 -0600

        Updated README (#64) (#424)

        * Updating README to fix argument, email, and formatting

        * removing superfluous files

        * Add link to CRISPRessoPro, move CRISPRessoPro section to end, and fix JSON formatting

        * Remove link to CRISPRessoPro

        * Replace Docker badge with link to tags

        * Add bullet points to Guardrails section and improve formatting

        * Fix typo and removed colons from guardrails

        ---------

        Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>

    commit c3dbff0fccd44b0b1a9c246dd2aa629ddc515787
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Mon Apr 22 11:24:59 2024 -0600

        Update CRISPRessoPooledCORE.py (#423)

        Fix bug in error reporting if duplicate names are present

    commit 20903c14877e5166b1b8a7b50b8fcab450ea3ca6
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu Apr 18 16:55:39 2024 -0600

        Remove extra imports from CRISPRessoCore (#67) (#422)

    commit 4aae57e5be475cd717792265bee36a71a99425de
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu Apr 18 10:00:19 2024 -0600

        Cole/refactor jinja undefined (#66) (#421)

        * Replace Jinja2 PackageLoader with FileSystemLoader

        The PackageLoader doesn't work with a fairly recent version of Jinja2 (3.0.1)
        and Python 3.9. Replacing with FileSystemLoader work with the older version and
        the latest version.

        * Fix undefined variable `amplicon_name` in report template

        * Refactor logging Jinja2 undefined variable warnings

        * Revert plot_11a update

        * Update intedration test branch

        * Update jinja to warn on undefined but not fail. Fix all undefined warnings

        * Fix github integration tests ref

        * One more undefined variable

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>

    commit 768c3c05bf1786a2a32e135b6e145cd6503c3db1
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Tue Apr 9 17:30:10 2024 -0600

        Fix Jinja2 undefined variables (#63) (#417)

        * Replace Jinja2 PackageLoader with FileSystemLoader

        The PackageLoader doesn't work with a fairly recent version of Jinja2 (3.0.1)
        and Python 3.9. Replacing with FileSystemLoader work with the older version and
        the latest version.

        * Fix undefined variable `amplicon_name` in report template

        * Revert plot_11a update

        * Update intedration test branch

        * Update branch for integration tests

    commit 7e18f08cc1ac5f247a0fd1bbb394ccd9b0a07c2e
    Author: Han Dai <github@daihan.me>
    Date:   Fri Apr 5 18:36:41 2024 -0400

        fix: change all U+00A0 to U+0020 (#400)

    commit 235dc29c0cd0fcca2e999148d4660acf00b07221
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Fri Apr 5 16:36:16 2024 -0600

        Fastp, args as data, guardrails, and PE fix (#415)

        * Change CRISPResso_status.txt format to JSON (#46)

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * add json read for status file

        * changed Formatter to json format

        * fixed json access variable name: message

        * changed  perentage_complete to numeric

        * changed status file to .json

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * New makefile commands

        * changed file to .json

        * changed status to json file

        * Make JSON human readable by adding new lines

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * Introduce pandas sorting in CRISPRessoCompare (#47)

        * New makefile commands

        * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

        * Extract out split_interleaved_fastq function to CRISPRessoShared

        * Implement splitting interleaved fastq files in CRISPRessoPooled

        * Suppress split_interleaved_input from CRISPRessoWGS parameters

        * Suppress other parameters in CRISPRessoWGS

        * Move where interleaved fastq files are split to be trimmed properly

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * On push no branches

        * On push no branches

        * All in one file

        * Fix yml errors

        * Rename jobs

        * Remove old workflow files

        * Remove paths

        * Run jobs in parallel

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Move read filtering to after merging in CRISPResso (#39)

        * Move read filtering to after merging

        This is in an effort to be consistent with the behavior and results of
        CRISPRessoPooled.

        * Properly assign the correct file names for read filtering

        * Add space around operators

        * GitHub actions on pr (#51)

        * Run integration tests on pull_request

        * Run pytest on pull_request

        * Run pylint on pull_request

        * Run tests on PR only when opening PR (#53)

        * Update reports (#52)

        * Update report changes

        * Switch branch of integration test repo

        * Remove extraneous `crispresso_data_path`

        * Point integration tests back to master

        * point to test branch

        * pointed CI config to testing branch

        * Update integration_tests.yml

        point to master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>
        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>

        * Trevor/fastp integration (#50)

        * Update check_program to check versions and create check_fastq function

        * Update fastq arg, implement fastp in get_most_frequent_reads

        * Bump version to 2.3.0

        * Deprecate Flash and Trimmomatic parameters, and update fastp params

        * Update guess_amplicons and guess_guides to remove max_paired_end_reads_overlap

        * Implement trimming of single end reads

        * Merge (and trim) reads in CRISPRessoCORE with fastp

        * Modify error handling to account for fastp errors

        * Replace flash and trimmomatic with fastp in Docker dependencies

        * Update LICENSE.txt with fastp info

        * Remove min and max amplicon length (no longer needed)

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Implement trimming with fastp in CRISPRessoPooled

        * Implemend merging (and trimming) with fastp in CRISPRessoPooled

        * Fixed minor fastp errors

        * Move read filtering to after merging in CRISPResso (#39)

        * Move read filtering to after merging

        This is in an effort to be consistent with the behavior and results of
        CRISPRessoPooled.

        * Properly assign the correct file names for read filtering

        * Add space around operators

        * GitHub actions on pr (#51)

        * Run integration tests on pull_request

        * Run pytest on pull_request

        * Run pylint on pull_request

        * Run tests on PR only when opening PR (#53)

        * Update reports (#52)

        * Update report changes

        * Switch branch of integration test repo

        * Remove extraneous `crispresso_data_path`

        * Point integration tests back to master

        * Update where the test point to

        * Fix 'Prime-edited' key not found (#32)

        * Move 'Prime-edited' amplicon name check

        By moving this, it will check if there is an amplicon named
        'Prime-edited' (which is a reserved name) even if the
        `prime_editing_pegRNA_extension_seq` parameter is empty.

        * Only search for scaffold integration when pegRNA extension seq is provided

        * Remove spaces at the end of lines

        * Docker size (#49)

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * Introduce pandas sorting in CRISPRessoCompare (#47)

        * New makefile commands

        * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

        * Extract out split_interleaved_fastq function to CRISPRessoShared

        * Implement splitting interleaved fastq files in CRISPRessoPooled

        * Suppress split_interleaved_input from CRISPRessoWGS parameters

        * Suppress other parameters in CRISPRessoWGS

        * Move where interleaved fastq files are split to be trimmed properly

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * On push no branches

        * On push no branches

        * All in one file

        * Fix yml errors

        * Rename jobs

        * Remove old workflow files

        * Remove paths

        * Run jobs in parallel

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * 3.4->2.08

        * Put ttf-mscorefonts-installer back above apt-get clean

        * restore slash, replace fastp with trimmomatic and flash, add autoremove step

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * initial readme modifications

        * Updated readme to remove deprecated commands, updated help text to reflect new version and fastp

        * Pointing test branch back at master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>

        * Guardrails clean history (#34)

        * Include guardrail functions

        * Add CRISPRessoReports subtree

        * Refactor to use CRISPRessoReports module

        * Include guardrail functions

        * Functional guardrails, needs reports update

        * Add guardrail partial

        * fix guardrials partial

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * Introduce pandas sorting in CRISPRessoCompare (#47)

        * New makefile commands

        * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

        * Extract out split_interleaved_fastq function to CRISPRessoShared

        * Implement splitting interleaved fastq files in CRISPRessoPooled

        * Suppress split_interleaved_input from CRISPRessoWGS parameters

        * Suppress other parameters in CRISPRessoWGS

        * Move where interleaved fastq files are split to be trimmed properly

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * On push no branches

        * On push no branches

        * All in one file

        * Fix yml errors

        * Rename jobs

        * Remove old workflow files

        * Remove paths

        * Run jobs in parallel

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Update C cythonized files

        * Add exact numbers to guardrails printouts

        * Remove extraneous whitespace from CRISPRessoCOREResources.pyx

        * Fix calculation of `total_mods` from being negative

        The issue was that `all_deletion_coordinates` just tells you how many deletions
        were present, but not how long the deletion is.

        * Changes to message

        * Remove old tag

        * Point tests at guardrails

        * Restore C2 pro check

        * Save message with guardrail name

        * Point tests repo at master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

        * Fix case sensitivity in Prime Editing mode (#54)

        * Move read filtering to after merging in CRISPResso (#39)

        * Move read filtering to after merging

        This is in an effort to be consistent with the behavior and results of
        CRISPRessoPooled.

        * Properly assign the correct file names for read filtering

        * Add space around operators

        * GitHub actions on pr (#51)

        * Run integration tests on pull_request

        * Run pytest on pull_request

        * Run pylint on pull_request

        * Run tests on PR only when opening PR (#53)

        * Update reports (#52)

        * Update report changes

        * Switch branch of integration test repo

        * Remove extraneous `crispresso_data_path`

        * Point integration tests back to master

        * Make all amplicons in amplicon_seq_arr uppercase

        This fixes https://github.com/pinellolab/CRISPResso2/issues/396

        * Allow RNA values to be provided for prime_editing_pegRNA_scaffold_seq

        * Fix 'Prime-edited' key not found (#32)

        * Move 'Prime-edited' amplicon name check

        By moving this, it will check if there is an amplicon named
        'Prime-edited' (which is a reserved name) even if the
        `prime_editing_pegRNA_extension_seq` parameter is empty.

        * Only search for scaffold integration when pegRNA extension seq is provided

        * Remove spaces at the end of lines

        * Docker size (#49)

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

…

Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
Co-authored-by: Trevor Martin <trevormartinj7@gmail.com>
Colelyman added a commit that referenced this issue Dec 14, 2024
* slots implementation

* D3-Enhancements (#78)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------

Co-authored-by: Cole Lyman <cole@colelyman.com>

* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------

Co-authored-by: Sam <snic9004@gmail.com>

* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------

Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Cole Lyman <cole@colelyman.com>

* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------

Co-authored-by: Cole Lyman <cole@colelyman.com>

* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------

Co-authored-by: Sam <snic9004@gmail.com>

* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------

Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Cole Lyman <cole@colelyman.com>

* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Fix typo and move flexiguide to debug (#77)

* Change flexiguide output to debug level

* Fix typo in fastp merged output file name

* Adding id tags for d3 script enhancements

* pointing to test branch

* Add amplicon_name parameter to allele heatmap and line plots

* Add function to extract quantification window regions from include_idxs

* Scale the quantification window according to the coordinates of the sgRNA plot

* added c2pro check, added space in args.json

* Correct the quantification window indexes for multiple guides

* Fix name of nucleotide conversion plot when guides are not the same

* Fix jinja variables that aren't found

* Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

* Remove unneeded variable and extra whitespace

* Switch test branch to master

---------

Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Cole Lyman <cole@colelyman.com>

* Add missing keys to slots class

* Remove extra return statement from find_indels_substitutions

* Round percentage complete in CLI and add initial 0% complete (#100) (#477)

* Reduce memory usage for allele plots (#478)

* Create new function to plot memory reduced alleles table plot

* Round percentage complete in CLI and add initial 0% complete (#100)

---------

Co-authored-by: Kendell Clement <k.clement.dev@gmail.com>

* Mckay/c2pro reports test (#99) (#479)

* Fix CRISPRessoAggregate bug and other improvements (#95)

* D3-Enhancements (#78)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Fix typo and move flexiguide to debug (#77)

* Change flexiguide output to debug level

* Fix typo in fastp merged output file name

* Adding id tags for d3 script enhancements

* pointing to test branch

* Add amplicon_name parameter to allele heatmap and line plots

* Add function to extract quantification window regions from include_idxs

* Scale the quantification window according to the coordinates of the sgRNA plot

* added c2pro check, added space in args.json

* Correct the quantification window indexes for multiple guides

* Fix name of nucleotide conversion plot when guides are not the same

* Fix jinja variables that aren't found

* Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

* Remove unneeded variable and extra whitespace

* Switch test branch to master

---------





* Add amplicon_name to plot functions

* Add sgRNA sequences to nucleotide quilt parameters in Aggregate

* Add custom_colors to Aggregate plot functions

* Update Aggregate and make_aggregate_report to have logger and tool

* Write command_used to Aggregate .json info file

* Point to new test branch and add Aggregate run

* Make the order of Aggregate runs explicit

* Sort all instances of crispresso2_folder_info in Aggregate

* Sort df_summary_quantification df in Aggregate

* Try sorting with a list of single column

* Update to correct test branch

* Move to master test branch

---------





* Squashed commit of the following:

commit 6ec98a05ee70f85b5aa0ac15ab6094b7f1f20d08
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Tue Aug 13 16:44:39 2024 -0600

    dict key changes

commit 7cfd5acf06da4eb6f49453144ee1fed1e1488a7a
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Thu Aug 8 15:30:31 2024 -0600

    added C2PRO install check back

commit bfb0003329ea61b5c79c7e1df8d9a73ec5a508db
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Fri Aug 2 13:08:12 2024 -0600

    fixed key error conditionals

commit 84444e7480605206cb3efa4a0db675c55e717304
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Fri Aug 2 09:22:44 2024 -0600

    use local jinja_paritals file

commit 71dd12786fec6c4aba0170a3bfd9022b06f5eede
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Wed Jul 31 14:10:29 2024 -0600

    Squashed commit of the following:

    commit 5e3b30515c4bc437127e7fb21f53cb0bd511c4ca
    Author: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
    Date:   Mon Jul 22 09:31:44 2024 -0600

        D3-Enhancements (#78)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        * Provide sgRNA_sequences to plot_nucleotide_quilt plots

        * Passing sgRNA_sequences to plot

        * Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

        * Add max-height to Batch report samples

        * Change testing branch

        * Fix wrong check for large Batch plots

        * Fix typo and move flexiguide to debug (#77)

        * Change flexiguide output to debug level

        * Fix typo in fastp merged output file name

        * Adding id tags for d3 script enhancements

        * pointing to test branch

        * Add amplicon_name parameter to allele heatmap and line plots

        * Add function to extract quantification window regions from include_idxs

        * Scale the quantification window according to the coordinates of the sgRNA plot

        * added c2pro check, added space in args.json

        * Correct the quantification window indexes for multiple guides

        * Fix name of nucleotide conversion plot when guides are not the same

        * Fix jinja variables that aren't found

        * Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

        * Remove unneeded variable and extra whitespace

        * Switch test branch to master

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

    commit 09e5d9720ad21e44fc7916d71bde3fd7a9dfa7ef
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu Jul 18 14:31:54 2024 -0600

        Asymmetrical cut point (#457)

        * add cut_point_ind to plot_alleles_heatmap for asymmetrical plotting

        * Cole asymmetrical cut point (#453)

        * Pin versions of numpy and matplotlib in CI environment (#84) (#452)

        * Reduce duplication and implement cut_point_ind in plot_alleles_heatmap_hist

        ---------

        Co-authored-by: Cole Lyman <Cole@colelyman.com>

    commit 8d92972694ddff629dad844a6ad100459f69751d
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu Jul 18 14:29:40 2024 -0600

        Cole/update args (#85) (#456)

    commit 44f692ecabf5e2eb96ee0cfd7bae62343da7810c
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Mon Jul 15 16:17:29 2024 -0600

        Implement new pooled mixed-mode default behavior (#454)

        * changes for pooled mixed-mode default (#83)

        * changes for pooled mixed-mode default

        * deprecated old arg

        * added integration tests for mixed mode

        * fixed test target

        * updated test name

        * pinned numpy

        * Fix integration tests yml

        * pinning matplotlib

        * added print to CI tests

        * changed mixed mode info string

        * Remove pooled-mixed-mode-align-to-genome step from Github Actions

        * Update demultiplex_genome_wide parameter and help

        * Convert args.json to unix line endings

        * Add Pooled mixed mode demux run

        * Update the name of the argument in Pooled

        * Point integration tests back to master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Revert change to pooled mixed mode info statement (#86)

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit 79b482b55a0e8edbc03ec22bd2714bade1e90323
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Tue Jul 9 12:53:23 2024 -0600

        Pin versions of numpy and matplotlib in CI environment (#84) (#452)

    commit 80dc1bdd72d50f989717bfc5f8156bc3495c45f4
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu May 30 14:07:42 2024 -0600

        Add padding to image

    commit 381755daf0939aaf2745df0a802c809633aff47d
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu May 30 13:59:57 2024 -0600

        White background for schematic for dark mode

    commit d649db71e610bd8840fbb8d46fadb07789b67390
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Fri May 24 12:45:53 2024 -0600

        Fix typo and move flexiguide to debug (#77) (#438)

        * Change flexiguide output to debug level

        * Fix typo in fastp merged output file name

    commit 71181f50ef2b39015523b1a71d9fd1bf0dce14eb
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Mon May 13 13:34:00 2024 -0600

        Prefix the release Docker tag with a `v` (#434)

    commit d2c2be18a6bb64b0e742cc24c4665980a24324bc
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Mon May 13 09:41:32 2024 -0600

        Showing sgRNA sequences on hover in CRISPRessoPro (#432)

        * Passing sgRNA sequences to regular and Batch D3 plots (#73)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        * Provide sgRNA_sequences to plot_nucleotide_quilt plots

        * Passing sgRNA_sequences to plot

        * Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

        * Add max-height to Batch report samples

        * Change testing branch

        * Fix wrong check for large Batch plots

        * Update integration_tests.yml to point back at master

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Push new releases to ECR (#74)

        * Create aws_ecr.yml (#1)

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * us-east-1

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Fix d3 sgRNA sequences (#76)

        * Pass correct sgRNA_sequences to d3 plot

        * Pass correct sgRNA sequence to prime editor plot for d3

        * Resize plotly (#75)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        * Pass div id for plotly

        * Remove debug

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit 1c504274818b6b17fb60620d48fd92cb2e50566d
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu May 9 14:16:25 2024 -0600

        Fix plots and improve plot error handling (#431)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit acb2ea8e26dff4cd11f71301b344f81b1cec9040
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu May 2 13:49:33 2024 -0600

        Use recent docker image for CircleCI testing that includes updated pandas

    commit 38fd76dbd7ce2087468f9f454b548777de959a68
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Wed May 1 16:42:28 2024 -0600

        Cole/fix status file name (#69) (#430)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

    commit 3ec22e5fd09e432c9997d30e5f9ee51a2cc00d7b
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Wed May 1 13:08:11 2024 -0600

        Remove linked space in readme

    commit 340a4e16795a5e500411e11572ec267525985009
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Wed May 1 13:07:14 2024 -0600

        Fix batch mode pandas warning. (#70) (#429)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit 1bc9e906f0ded81f80761d1ec375ee50a4f882a9
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Fri Apr 26 16:26:27 2024 -0600

        Bump version to 2.3.1 and change default CRISPRessoPooled behavior to change in 2.3.2 (#428)

    commit 5638a1f6ffa973231f23422e9c757fa8cd4af7cc
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Wed Apr 24 18:00:43 2024 -0600

        Spelling fixes

    commit d6011f29db16d8fc1c1e7222457b7f9a1f671de6
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Wed Apr 24 09:33:53 2024 -0600

        Extract `jinja_partials` and fix CRISPRessoPooled fastp errors (#425)

        * Updated README (#64)

        * Updating README to fix argument, email, and formatting

        * removing superfluous files

        * Add link to CRISPRessoPro, move CRISPRessoPro section to end, and fix JSON formatting

        * Remove link to CRISPRessoPro

        * Replace Docker badge with link to tags

        * Add bullet points to Guardrails section and improve formatting

        * Fix typo and removed colons from guardrails

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Extract jinja_partials  (#65)

        * Extract jinja_partials code

        * Remove Plotly dependency from setup.py

        * Fix CRISPRessoPooled flash errors (#68)

        * Fix replacing flash intermediate files with fastp intermediate files

        This also moves where the files are added to `files_to_remove` up to
        near where they are created.

        * Update to run test branch with paired end Pooled test

        * Add pooled-paired-sim test to integration tests

        * Replace flash and trimmomatic with fastp and remove plotly from Github Actions environment

        * Change test branch back to master

        ---------

        Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>

    commit f4858a30c43374f54058b3ad9c1e965e1ab7fb46
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Tue Apr 23 17:00:28 2024 -0600

        Updated README (#64) (#424)

        * Updating README to fix argument, email, and formatting

        * removing superfluous files

        * Add link to CRISPRessoPro, move CRISPRessoPro section to end, and fix JSON formatting

        * Remove link to CRISPRessoPro

        * Replace Docker badge with link to tags

        * Add bullet points to Guardrails section and improve formatting

        * Fix typo and removed colons from guardrails

        ---------

        Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>

    commit c3dbff0fccd44b0b1a9c246dd2aa629ddc515787
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Mon Apr 22 11:24:59 2024 -0600

        Update CRISPRessoPooledCORE.py (#423)

        Fix bug in error reporting if duplicate names are present

    commit 20903c14877e5166b1b8a7b50b8fcab450ea3ca6
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu Apr 18 16:55:39 2024 -0600

        Remove extra imports from CRISPRessoCore (#67) (#422)

    commit 4aae57e5be475cd717792265bee36a71a99425de
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu Apr 18 10:00:19 2024 -0600

        Cole/refactor jinja undefined (#66) (#421)

        * Replace Jinja2 PackageLoader with FileSystemLoader

        The PackageLoader doesn't work with a fairly recent version of Jinja2 (3.0.1)
        and Python 3.9. Replacing with FileSystemLoader work with the older version and
        the latest version.

        * Fix undefined variable `amplicon_name` in report template

        * Refactor logging Jinja2 undefined variable warnings

        * Revert plot_11a update

        * Update intedration test branch

        * Update jinja to warn on undefined but not fail. Fix all undefined warnings

        * Fix github integration tests ref

        * One more undefined variable

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>

    commit 768c3c05bf1786a2a32e135b6e145cd6503c3db1
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Tue Apr 9 17:30:10 2024 -0600

        Fix Jinja2 undefined variables (#63) (#417)

        * Replace Jinja2 PackageLoader with FileSystemLoader

        The PackageLoader doesn't work with a fairly recent version of Jinja2 (3.0.1)
        and Python 3.9. Replacing with FileSystemLoader work with the older version and
        the latest version.

        * Fix undefined variable `amplicon_name` in report template

        * Revert plot_11a update

        * Update intedration test branch

        * Update branch for integration tests

    commit 7e18f08cc1ac5f247a0fd1bbb394ccd9b0a07c2e
    Author: Han Dai <github@daihan.me>
    Date:   Fri Apr 5 18:36:41 2024 -0400

        fix: change all U+00A0 to U+0020 (#400)

    commit 235dc29c0cd0fcca2e999148d4660acf00b07221
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Fri Apr 5 16:36:16 2024 -0600

        Fastp, args as data, guardrails, and PE fix (#415)

        * Change CRISPResso_status.txt format to JSON (#46)

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * add json read for status file

        * changed Formatter to json format

        * fixed json access variable name: message

        * changed  perentage_complete to numeric

        * changed status file to .json

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * New makefile commands

        * changed file to .json

        * changed status to json file

        * Make JSON human readable by adding new lines

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * Introduce pandas sorting in CRISPRessoCompare (#47)

        * New makefile commands

        * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

        * Extract out split_interleaved_fastq function to CRISPRessoShared

        * Implement splitting interleaved fastq files in CRISPRessoPooled

        * Suppress split_interleaved_input from CRISPRessoWGS parameters

        * Suppress other parameters in CRISPRessoWGS

        * Move where interleaved fastq files are split to be trimmed properly

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * On push no branches

        * On push no branches

        * All in one file

        * Fix yml errors

        * Rename jobs

        * Remove old workflow files

        * Remove paths

        * Run jobs in parallel

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Move read filtering to after merging in CRISPResso (#39)

        * Move read filtering to after merging

        This is in an effort to be consistent with the behavior and results of
        CRISPRessoPooled.

        * Properly assign the correct file names for read filtering

        * Add space around operators

        * GitHub actions on pr (#51)

        * Run integration tests on pull_request

        * Run pytest on pull_request

        * Run pylint on pull_request

        * Run tests on PR only when opening PR (#53)

        * Update reports (#52)

        * Update report changes

        * Switch branch of integration test repo

        * Remove extraneous `crispresso_data_path`

        * Point integration tests back to master

        * point to test branch

        * pointed CI config to testing branch

        * Update integration_tests.yml

        point to master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>
        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>

        * Trevor/fastp integration (#50)

        * Update check_program to check versions and create check_fastq function

        * Update fastq arg, implement fastp in get_most_frequent_reads

        * Bump version to 2.3.0

        * Deprecate Flash and Trimmomatic parameters, and update fastp params

        * Update guess_amplicons and guess_guides to remove max_paired_end_reads_overlap

        * Implement trimming of single end reads

        * Merge (and trim) reads in CRISPRessoCORE with fastp

        * Modify error handling to account for fastp errors

        * Replace flash and trimmomatic with fastp in Docker dependencies

        * Update LICENSE.txt with fastp info

        * Remove min and max amplicon length (no longer needed)

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Implement trimming with fastp in CRISPRessoPooled

        * Implemend merging (and trimming) with fastp in CRISPRessoPooled

        * Fixed minor fastp errors

        * Move read filtering to after merging in CRISPResso (#39)

        * Move read filtering to after merging

        This is in an effort to be consistent with the behavior and results of
        CRISPRessoPooled.

        * Properly assign the correct file names for read filtering

        * Add space around operators

        * GitHub actions on pr (#51)

        * Run integration tests on pull_request

        * Run pytest on pull_request

        * Run pylint on pull_request

        * Run tests on PR only when opening PR (#53)

        * Update reports (#52)

        * Update report changes

        * Switch branch of integration test repo

        * Remove extraneous `crispresso_data_path`

        * Point integration tests back to master

        * Update where the test point to

        * Fix 'Prime-edited' key not found (#32)

        * Move 'Prime-edited' amplicon name check

        By moving this, it will check if there is an amplicon named
        'Prime-edited' (which is a reserved name) even if the
        `prime_editing_pegRNA_extension_seq` parameter is empty.

        * Only search for scaffold integration when pegRNA extension seq is provided

        * Remove spaces at the end of lines

        * Docker size (#49)

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * Introduce pandas sorting in CRISPRessoCompare (#47)

        * New makefile commands

        * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

        * Extract out split_interleaved_fastq function to CRISPRessoShared

        * Implement splitting interleaved fastq files in CRISPRessoPooled

        * Suppress split_interleaved_input from CRISPRessoWGS parameters

        * Suppress other parameters in CRISPRessoWGS

        * Move where interleaved fastq files are split to be trimmed properly

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * On push no branches

        * On push no branches

        * All in one file

        * Fix yml errors

        * Rename jobs

        * Remove old workflow files

        * Remove paths

        * Run jobs in parallel

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * 3.4->2.08

        * Put ttf-mscorefonts-installer back above apt-get clean

        * restore slash, replace fastp with trimmomatic and flash, add autoremove step

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * initial readme modifications

        * Updated readme to remove deprecated commands, updated help text to reflect new version and fastp

        * Pointing test branch back at master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>

        * Guardrails clean history (#34)

        * Include guardrail functions

        * Add CRISPRessoReports subtree

        * Refactor to use CRISPRessoReports module

        * Include guardrail functions

        * Functional guardrails, needs reports update

        * Add guardrail partial

        * fix guardrials partial

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * Introduce pandas sorting in CRISPRessoCompare (#47)

        * New makefile commands

        * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

        * Extract out split_interleaved_fastq function to CRISPRessoShared

        * Implement splitting interleaved fastq files in CRISPRessoPooled

        * Suppress split_interleaved_input from CRISPRessoWGS parameters

        * Suppress other parameters in CRISPRessoWGS

        * Move where interleaved fastq files are split to be trimmed properly

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * On push no branches

        * On push no branches

        * All in one file

        * Fix yml errors

        * Rename jobs

        * Remove old workflow files

        * Remove paths

        * Run jobs in parallel

        ---------

        Co-autho…
Colelyman added a commit that referenced this issue Jan 10, 2025
* slots implementation

* D3-Enhancements (#78)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Fix typo and move flexiguide to debug (#77)

* Change flexiguide output to debug level

* Fix typo in fastp merged output file name

* Adding id tags for d3 script enhancements

* pointing to test branch

* Add amplicon_name parameter to allele heatmap and line plots

* Add function to extract quantification window regions from include_idxs

* Scale the quantification window according to the coordinates of the sgRNA plot

* added c2pro check, added space in args.json

* Correct the quantification window indexes for multiple guides

* Fix name of nucleotide conversion plot when guides are not the same

* Fix jinja variables that aren't found

* Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

* Remove unneeded variable and extra whitespace

* Switch test branch to master

---------





* Add missing keys to slots class

* Remove extra return statement from find_indels_substitutions

* Round percentage complete in CLI and add initial 0% complete (#100) (#477)

* Reduce memory usage for allele plots (#478)

* Create new function to plot memory reduced alleles table plot

* Round percentage complete in CLI and add initial 0% complete (#100)

---------



* Mckay/c2pro reports test (#99) (#479)

* Fix CRISPRessoAggregate bug and other improvements (#95)

* D3-Enhancements (#78)

* Sam/try plots (#71)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Point test to try-plots

* Fix d3 not showing and plotly mixing with matplotlib

* Use logger for warnings and debug statements

* Point tests back at master

---------




* Sam/fix plots (#72)

* Fix batch mode pandas warning. (#70)

* refactor to call method on DataFrame, rather than Series.
Removes warning.

* Fix pandas future warning in CRISPRessoWGS

---------



* Functional

* Cole/fix status file name (#69)

* Update config file logging messages

This removes printing the exception (which is essentially a duplicate),
and adds a condition if no config file was provided. Also changes `json`
to `config` so that it is more clear.

* Fix divide by zero when no amplicons are present in Batch mode

* Don't append file_prefix to status file name

* Place status files in output directories

* Update tests branch for file_prefix addition

* Load D3 and plotly figures with pro with multiple amplicons

* Update batch

* Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

Before this fix, when using a file_prefix the second run that was compared
would not be displayed as a data in the first figure of the report.

* Import CRISPRessoPro instead of importing the version

When installed via conda, the version is not available

* Remove `get_amplicon_output` unused function from CRISPRessoCompare

Also remove unused argparse import

* Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

* Allow for matching of multiple guides in the same amplicon

* Fix pandas FutureWarning

* Change test branch back to master

---------



* Try catch all futures

* Fix test fail plots

* Fix d3 not showing and plotly mixing with matplotlib

---------




* Remove token from integration tests file

* Provide sgRNA_sequences to plot_nucleotide_quilt plots

* Passing sgRNA_sequences to plot

* Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

* Add max-height to Batch report samples

* Change testing branch

* Fix wrong check for large Batch plots

* Fix typo and move flexiguide to debug (#77)

* Change flexiguide output to debug level

* Fix typo in fastp merged output file name

* Adding id tags for d3 script enhancements

* pointing to test branch

* Add amplicon_name parameter to allele heatmap and line plots

* Add function to extract quantification window regions from include_idxs

* Scale the quantification window according to the coordinates of the sgRNA plot

* added c2pro check, added space in args.json

* Correct the quantification window indexes for multiple guides

* Fix name of nucleotide conversion plot when guides are not the same

* Fix jinja variables that aren't found

* Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

* Remove unneeded variable and extra whitespace

* Switch test branch to master

---------





* Add amplicon_name to plot functions

* Add sgRNA sequences to nucleotide quilt parameters in Aggregate

* Add custom_colors to Aggregate plot functions

* Update Aggregate and make_aggregate_report to have logger and tool

* Write command_used to Aggregate .json info file

* Point to new test branch and add Aggregate run

* Make the order of Aggregate runs explicit

* Sort all instances of crispresso2_folder_info in Aggregate

* Sort df_summary_quantification df in Aggregate

* Try sorting with a list of single column

* Update to correct test branch

* Move to master test branch

---------





* Squashed commit of the following:

commit 6ec98a05ee70f85b5aa0ac15ab6094b7f1f20d08
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Tue Aug 13 16:44:39 2024 -0600

    dict key changes

commit 7cfd5acf06da4eb6f49453144ee1fed1e1488a7a
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Thu Aug 8 15:30:31 2024 -0600

    added C2PRO install check back

commit bfb0003329ea61b5c79c7e1df8d9a73ec5a508db
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Fri Aug 2 13:08:12 2024 -0600

    fixed key error conditionals

commit 84444e7480605206cb3efa4a0db675c55e717304
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Fri Aug 2 09:22:44 2024 -0600

    use local jinja_paritals file

commit 71dd12786fec6c4aba0170a3bfd9022b06f5eede
Author: mbowcut2 <mbowcut@gmail.com>
Date:   Wed Jul 31 14:10:29 2024 -0600

    Squashed commit of the following:

    commit 5e3b30515c4bc437127e7fb21f53cb0bd511c4ca
    Author: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
    Date:   Mon Jul 22 09:31:44 2024 -0600

        D3-Enhancements (#78)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        * Provide sgRNA_sequences to plot_nucleotide_quilt plots

        * Passing sgRNA_sequences to plot

        * Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

        * Add max-height to Batch report samples

        * Change testing branch

        * Fix wrong check for large Batch plots

        * Fix typo and move flexiguide to debug (#77)

        * Change flexiguide output to debug level

        * Fix typo in fastp merged output file name

        * Adding id tags for d3 script enhancements

        * pointing to test branch

        * Add amplicon_name parameter to allele heatmap and line plots

        * Add function to extract quantification window regions from include_idxs

        * Scale the quantification window according to the coordinates of the sgRNA plot

        * added c2pro check, added space in args.json

        * Correct the quantification window indexes for multiple guides

        * Fix name of nucleotide conversion plot when guides are not the same

        * Fix jinja variables that aren't found

        * Fix multiple guide errors where the wrong sgRNA sequence was associated in d3 plot

        * Remove unneeded variable and extra whitespace

        * Switch test branch to master

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

    commit 09e5d9720ad21e44fc7916d71bde3fd7a9dfa7ef
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu Jul 18 14:31:54 2024 -0600

        Asymmetrical cut point (#457)

        * add cut_point_ind to plot_alleles_heatmap for asymmetrical plotting

        * Cole asymmetrical cut point (#453)

        * Pin versions of numpy and matplotlib in CI environment (#84) (#452)

        * Reduce duplication and implement cut_point_ind in plot_alleles_heatmap_hist

        ---------

        Co-authored-by: Cole Lyman <Cole@colelyman.com>

    commit 8d92972694ddff629dad844a6ad100459f69751d
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu Jul 18 14:29:40 2024 -0600

        Cole/update args (#85) (#456)

    commit 44f692ecabf5e2eb96ee0cfd7bae62343da7810c
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Mon Jul 15 16:17:29 2024 -0600

        Implement new pooled mixed-mode default behavior (#454)

        * changes for pooled mixed-mode default (#83)

        * changes for pooled mixed-mode default

        * deprecated old arg

        * added integration tests for mixed mode

        * fixed test target

        * updated test name

        * pinned numpy

        * Fix integration tests yml

        * pinning matplotlib

        * added print to CI tests

        * changed mixed mode info string

        * Remove pooled-mixed-mode-align-to-genome step from Github Actions

        * Update demultiplex_genome_wide parameter and help

        * Convert args.json to unix line endings

        * Add Pooled mixed mode demux run

        * Update the name of the argument in Pooled

        * Point integration tests back to master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Revert change to pooled mixed mode info statement (#86)

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit 79b482b55a0e8edbc03ec22bd2714bade1e90323
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Tue Jul 9 12:53:23 2024 -0600

        Pin versions of numpy and matplotlib in CI environment (#84) (#452)

    commit 80dc1bdd72d50f989717bfc5f8156bc3495c45f4
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu May 30 14:07:42 2024 -0600

        Add padding to image

    commit 381755daf0939aaf2745df0a802c809633aff47d
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu May 30 13:59:57 2024 -0600

        White background for schematic for dark mode

    commit d649db71e610bd8840fbb8d46fadb07789b67390
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Fri May 24 12:45:53 2024 -0600

        Fix typo and move flexiguide to debug (#77) (#438)

        * Change flexiguide output to debug level

        * Fix typo in fastp merged output file name

    commit 71181f50ef2b39015523b1a71d9fd1bf0dce14eb
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Mon May 13 13:34:00 2024 -0600

        Prefix the release Docker tag with a `v` (#434)

    commit d2c2be18a6bb64b0e742cc24c4665980a24324bc
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Mon May 13 09:41:32 2024 -0600

        Showing sgRNA sequences on hover in CRISPRessoPro (#432)

        * Passing sgRNA sequences to regular and Batch D3 plots (#73)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        * Provide sgRNA_sequences to plot_nucleotide_quilt plots

        * Passing sgRNA_sequences to plot

        * Refactor check for determining when to use CRISPREssoPro or matplotlib for Batch plots

        * Add max-height to Batch report samples

        * Change testing branch

        * Fix wrong check for large Batch plots

        * Update integration_tests.yml to point back at master

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Push new releases to ECR (#74)

        * Create aws_ecr.yml (#1)

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * us-east-1

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Update aws_ecr.yml

        * Fix d3 sgRNA sequences (#76)

        * Pass correct sgRNA_sequences to d3 plot

        * Pass correct sgRNA sequence to prime editor plot for d3

        * Resize plotly (#75)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        * Pass div id for plotly

        * Remove debug

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit 1c504274818b6b17fb60620d48fd92cb2e50566d
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu May 9 14:16:25 2024 -0600

        Fix plots and improve plot error handling (#431)

        * Sam/try plots (#71)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Point test to try-plots

        * Fix d3 not showing and plotly mixing with matplotlib

        * Use logger for warnings and debug statements

        * Point tests back at master

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Sam/fix plots (#72)

        * Fix batch mode pandas warning. (#70)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Functional

        * Cole/fix status file name (#69)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

        * Try catch all futures

        * Fix test fail plots

        * Fix d3 not showing and plotly mixing with matplotlib

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Remove token from integration tests file

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit acb2ea8e26dff4cd11f71301b344f81b1cec9040
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Thu May 2 13:49:33 2024 -0600

        Use recent docker image for CircleCI testing that includes updated pandas

    commit 38fd76dbd7ce2087468f9f454b548777de959a68
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Wed May 1 16:42:28 2024 -0600

        Cole/fix status file name (#69) (#430)

        * Update config file logging messages

        This removes printing the exception (which is essentially a duplicate),
        and adds a condition if no config file was provided. Also changes `json`
        to `config` so that it is more clear.

        * Fix divide by zero when no amplicons are present in Batch mode

        * Don't append file_prefix to status file name

        * Place status files in output directories

        * Update tests branch for file_prefix addition

        * Load D3 and plotly figures with pro with multiple amplicons

        * Update batch

        * Fix bug in CRISPRessoCompare with pointing to report datas with file_prefix

        Before this fix, when using a file_prefix the second run that was compared
        would not be displayed as a data in the first figure of the report.

        * Import CRISPRessoPro instead of importing the version

        When installed via conda, the version is not available

        * Remove `get_amplicon_output` unused function from CRISPRessoCompare

        Also remove unused argparse import

        * Implement `get_matching_allele_files` in CRISPRessoCompare and accompanying unit tests

        * Allow for matching of multiple guides in the same amplicon

        * Fix pandas FutureWarning

        * Change test branch back to master

        ---------

        Co-authored-by: Sam <snic9004@gmail.com>

    commit 3ec22e5fd09e432c9997d30e5f9ee51a2cc00d7b
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Wed May 1 13:08:11 2024 -0600

        Remove linked space in readme

    commit 340a4e16795a5e500411e11572ec267525985009
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Wed May 1 13:07:14 2024 -0600

        Fix batch mode pandas warning. (#70) (#429)

        * refactor to call method on DataFrame, rather than Series.
        Removes warning.

        * Fix pandas future warning in CRISPRessoWGS

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>

    commit 1bc9e906f0ded81f80761d1ec375ee50a4f882a9
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Fri Apr 26 16:26:27 2024 -0600

        Bump version to 2.3.1 and change default CRISPRessoPooled behavior to change in 2.3.2 (#428)

    commit 5638a1f6ffa973231f23422e9c757fa8cd4af7cc
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Wed Apr 24 18:00:43 2024 -0600

        Spelling fixes

    commit d6011f29db16d8fc1c1e7222457b7f9a1f671de6
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Wed Apr 24 09:33:53 2024 -0600

        Extract `jinja_partials` and fix CRISPRessoPooled fastp errors (#425)

        * Updated README (#64)

        * Updating README to fix argument, email, and formatting

        * removing superfluous files

        * Add link to CRISPRessoPro, move CRISPRessoPro section to end, and fix JSON formatting

        * Remove link to CRISPRessoPro

        * Replace Docker badge with link to tags

        * Add bullet points to Guardrails section and improve formatting

        * Fix typo and removed colons from guardrails

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Extract jinja_partials  (#65)

        * Extract jinja_partials code

        * Remove Plotly dependency from setup.py

        * Fix CRISPRessoPooled flash errors (#68)

        * Fix replacing flash intermediate files with fastp intermediate files

        This also moves where the files are added to `files_to_remove` up to
        near where they are created.

        * Update to run test branch with paired end Pooled test

        * Add pooled-paired-sim test to integration tests

        * Replace flash and trimmomatic with fastp and remove plotly from Github Actions environment

        * Change test branch back to master

        ---------

        Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>

    commit f4858a30c43374f54058b3ad9c1e965e1ab7fb46
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Tue Apr 23 17:00:28 2024 -0600

        Updated README (#64) (#424)

        * Updating README to fix argument, email, and formatting

        * removing superfluous files

        * Add link to CRISPRessoPro, move CRISPRessoPro section to end, and fix JSON formatting

        * Remove link to CRISPRessoPro

        * Replace Docker badge with link to tags

        * Add bullet points to Guardrails section and improve formatting

        * Fix typo and removed colons from guardrails

        ---------

        Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>

    commit c3dbff0fccd44b0b1a9c246dd2aa629ddc515787
    Author: Kendell Clement <k.clement.dev@gmail.com>
    Date:   Mon Apr 22 11:24:59 2024 -0600

        Update CRISPRessoPooledCORE.py (#423)

        Fix bug in error reporting if duplicate names are present

    commit 20903c14877e5166b1b8a7b50b8fcab450ea3ca6
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu Apr 18 16:55:39 2024 -0600

        Remove extra imports from CRISPRessoCore (#67) (#422)

    commit 4aae57e5be475cd717792265bee36a71a99425de
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Thu Apr 18 10:00:19 2024 -0600

        Cole/refactor jinja undefined (#66) (#421)

        * Replace Jinja2 PackageLoader with FileSystemLoader

        The PackageLoader doesn't work with a fairly recent version of Jinja2 (3.0.1)
        and Python 3.9. Replacing with FileSystemLoader work with the older version and
        the latest version.

        * Fix undefined variable `amplicon_name` in report template

        * Refactor logging Jinja2 undefined variable warnings

        * Revert plot_11a update

        * Update intedration test branch

        * Update jinja to warn on undefined but not fail. Fix all undefined warnings

        * Fix github integration tests ref

        * One more undefined variable

        ---------

        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>

    commit 768c3c05bf1786a2a32e135b6e145cd6503c3db1
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Tue Apr 9 17:30:10 2024 -0600

        Fix Jinja2 undefined variables (#63) (#417)

        * Replace Jinja2 PackageLoader with FileSystemLoader

        The PackageLoader doesn't work with a fairly recent version of Jinja2 (3.0.1)
        and Python 3.9. Replacing with FileSystemLoader work with the older version and
        the latest version.

        * Fix undefined variable `amplicon_name` in report template

        * Revert plot_11a update

        * Update intedration test branch

        * Update branch for integration tests

    commit 7e18f08cc1ac5f247a0fd1bbb394ccd9b0a07c2e
    Author: Han Dai <github@daihan.me>
    Date:   Fri Apr 5 18:36:41 2024 -0400

        fix: change all U+00A0 to U+0020 (#400)

    commit 235dc29c0cd0fcca2e999148d4660acf00b07221
    Author: Cole Lyman <Cole@colelyman.com>
    Date:   Fri Apr 5 16:36:16 2024 -0600

        Fastp, args as data, guardrails, and PE fix (#415)

        * Change CRISPResso_status.txt format to JSON (#46)

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * add json read for status file

        * changed Formatter to json format

        * fixed json access variable name: message

        * changed  perentage_complete to numeric

        * changed status file to .json

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * New makefile commands

        * changed file to .json

        * changed status to json file

        * Make JSON human readable by adding new lines

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * Introduce pandas sorting in CRISPRessoCompare (#47)

        * New makefile commands

        * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

        * Extract out split_interleaved_fastq function to CRISPRessoShared

        * Implement splitting interleaved fastq files in CRISPRessoPooled

        * Suppress split_interleaved_input from CRISPRessoWGS parameters

        * Suppress other parameters in CRISPRessoWGS

        * Move where interleaved fastq files are split to be trimmed properly

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * On push no branches

        * On push no branches

        * All in one file

        * Fix yml errors

        * Rename jobs

        * Remove old workflow files

        * Remove paths

        * Run jobs in parallel

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Move read filtering to after merging in CRISPResso (#39)

        * Move read filtering to after merging

        This is in an effort to be consistent with the behavior and results of
        CRISPRessoPooled.

        * Properly assign the correct file names for read filtering

        * Add space around operators

        * GitHub actions on pr (#51)

        * Run integration tests on pull_request

        * Run pytest on pull_request

        * Run pylint on pull_request

        * Run tests on PR only when opening PR (#53)

        * Update reports (#52)

        * Update report changes

        * Switch branch of integration test repo

        * Remove extraneous `crispresso_data_path`

        * Point integration tests back to master

        * point to test branch

        * pointed CI config to testing branch

        * Update integration_tests.yml

        point to master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>
        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>

        * Trevor/fastp integration (#50)

        * Update check_program to check versions and create check_fastq function

        * Update fastq arg, implement fastp in get_most_frequent_reads

        * Bump version to 2.3.0

        * Deprecate Flash and Trimmomatic parameters, and update fastp params

        * Update guess_amplicons and guess_guides to remove max_paired_end_reads_overlap

        * Implement trimming of single end reads

        * Merge (and trim) reads in CRISPRessoCORE with fastp

        * Modify error handling to account for fastp errors

        * Replace flash and trimmomatic with fastp in Docker dependencies

        * Update LICENSE.txt with fastp info

        * Remove min and max amplicon length (no longer needed)

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Implement trimming with fastp in CRISPRessoPooled

        * Implemend merging (and trimming) with fastp in CRISPRessoPooled

        * Fixed minor fastp errors

        * Move read filtering to after merging in CRISPResso (#39)

        * Move read filtering to after merging

        This is in an effort to be consistent with the behavior and results of
        CRISPRessoPooled.

        * Properly assign the correct file names for read filtering

        * Add space around operators

        * GitHub actions on pr (#51)

        * Run integration tests on pull_request

        * Run pytest on pull_request

        * Run pylint on pull_request

        * Run tests on PR only when opening PR (#53)

        * Update reports (#52)

        * Update report changes

        * Switch branch of integration test repo

        * Remove extraneous `crispresso_data_path`

        * Point integration tests back to master

        * Update where the test point to

        * Fix 'Prime-edited' key not found (#32)

        * Move 'Prime-edited' amplicon name check

        By moving this, it will check if there is an amplicon named
        'Prime-edited' (which is a reserved name) even if the
        `prime_editing_pegRNA_extension_seq` parameter is empty.

        * Only search for scaffold integration when pegRNA extension seq is provided

        * Remove spaces at the end of lines

        * Docker size (#49)

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * Introduce pandas sorting in CRISPRessoCompare (#47)

        * New makefile commands

        * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

        * Extract out split_interleaved_fastq function to CRISPRessoShared

        * Implement splitting interleaved fastq files in CRISPRessoPooled

        * Suppress split_interleaved_input from CRISPRessoWGS parameters

        * Suppress other parameters in CRISPRessoWGS

        * Move where interleaved fastq files are split to be trimmed properly

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * On push no branches

        * On push no branches

        * All in one file

        * Fix yml errors

        * Rename jobs

        * Remove old workflow files

        * Remove paths

        * Run jobs in parallel

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * 3.4->2.08

        * Put ttf-mscorefonts-installer back above apt-get clean

        * restore slash, replace fastp with trimmomatic and flash, add autoremove step

        ---------

        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * initial readme modifications

        * Updated readme to remove deprecated commands, updated help text to reflect new version and fastp

        * Pointing test branch back at master

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>
        Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
        Co-authored-by: Samuel Nichols <Snic9004@gmail.com>

        * Guardrails clean history (#34)

        * Include guardrail functions

        * Add CRISPRessoReports subtree

        * Refactor to use CRISPRessoReports module

        * Include guardrail functions

        * Functional guardrails, needs reports update

        * Add guardrail partial

        * fix guardrials partial

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * GitHub actions integration tests (#48)

        * GitHub actions clean (#40)

        * Create pytest.yml

        * Create pylint.yml

        * Create .pylintrc

        * Create test_env.yml

        * Full path

        * Remove conda install

        * Replace path

        * Pytest tests

        * pip -e

        * Create integration_tests.yml

        * Simplify name

        * CRISPRESSO2_DIR environment variable

        * Up one dir

        * ls workspace

        * Install CRISPResso and ydiff

        * Clone repo instead of checkout

        * submodule

        * ls

        * CRISPResso2_copy

        * ls

        * Update env

        * Simplify

        * Pull from githubactions branch

        * Pull githubactions repo

        * Checkout githubactions

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * Run tests individually

        * Pin plotly version

        * Run all tests even if one fails

        * Test on another branch

        * Switch branch with token

        * Update integration_tests.yml

        * Introduce pandas sorting in CRISPRessoCompare (#47)

        * New makefile commands

        * Fix interleaved fastq input in CRISPRessoPooled and suppress CRISPRessoWGS params (#42)

        * Extract out split_interleaved_fastq function to CRISPRessoShared

        * Implement splitting interleaved fastq files in CRISPRessoPooled

        * Suppress split_interleaved_input from CRISPRessoWGS parameters

        * Suppress other parameters in CRISPRessoWGS

        * Move where interleaved fastq files are split to be trimmed properly

        * Bug Fix - 367 (#35)

        * - Fixed references to ref_names_for_pe

        * removed extra tabs

        * trying to match empty line, no tabs

        * - changed references to ref_names[0]

        * Mckay/pd warnings (#45)

        * refactor errors='ignore' to try except

        * refactored integer slice to iloc[]

        * moved to_numeric try except to function

        * Refactor to_numeric_ignore_errors to to_numeric_ignore_columns

        This change is slightly cleaner because it addresses the root issue that some
        columns are strings (and can therefore not be converted to numeric types). Now
        if an error does occur when converting the dfs to numeric types it won't be
        swallowed up.

        * Add documentation to to_numeric_ignore_columns

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        ---------

        Co-authored-by: Cole Lyman <cole@colelyman.com>

        * On push no branches

        * On push no branches

        * All in one file

        * Fix yml errors

        * Rename jobs

        * Remove old workflow files

        * Remove paths

        * Run jobs in parallel

        ---------

        Co-autho…

Co-authored-by: mbowcut2 <55161542+mbowcut2@users.noreply.github.com>
Co-authored-by: Trevor Martin <60452953+trevormartinj7@users.noreply.github.com>
Co-authored-by: Samuel Nichols <Snic9004@gmail.com>
Co-authored-by: Kendell Clement <k.clement.dev@gmail.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants