Skip to content

865699871/Cross_landscape_HOR_clustering

Repository files navigation

DOI

Cross-landscape HOR clustering analysis

find_pattern.pyis the main workflow of the clustering analysis.

consensus_sequence.py is used to build the target HOR consensus sequence.

processNeedle.py is used to reformat HOR DNA sequences into 0-1 vectors.

recentExpand.py calculate the HOR exact matching number within a sliding window.

Plot.py generate the HOR clustering result tracks.

Software Dependencies

  • Kalign v3.3.5
  • Needle in EMBOSS v6.6.0