This repository was created as a means of source control for the DNASCANNER project. It is authored by Ananya Aditi Singh, Desh Iyer and Alakto Choudhury.
- Switched from regular `.txt` files to `fasta` files, which store not just one sequence but multiple sequences along with metadata.
- By extension, the program now works on more than one DNA sequence at once. All the inputs are stored in `sample.fasta`.
- The sliding window function remains the same, with slight tweaks for compatibility with the `.fasta` file.
- Output from `main.ipynb` is stored in JSON format in `output.json`.
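
As a rough illustration of the `.fasta` to JSON flow described above, here is a minimal sketch using only the Python standard library. It is not the code from `main.ipynb`: the function name `read_fasta`, the record layout (`header`, `sequence`) and the example records are all assumptions made for this example.

```python
import json


def read_fasta(lines):
    """Parse FASTA-formatted lines into a list of records.

    Hypothetical helper: each record is a dict with the header text
    (without the leading '>') and the joined, upper-cased sequence.
    """
    records = []
    header, chunks = None, []
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            # A new header closes the previous record, if any.
            if header is not None:
                records.append({"header": header, "sequence": "".join(chunks)})
            header, chunks = line[1:], []
        elif header is not None:
            # Sequence data may be wrapped over several lines.
            chunks.append(line.upper())
    if header is not None:
        records.append({"header": header, "sequence": "".join(chunks)})
    return records


# Made-up example input; in the project this would come from sample.fasta.
example = """>seq1 hypothetical example record
ACGTAC
GTTA
>seq2 another hypothetical record
GGCCA
""".splitlines()

records = read_fasta(example)

# Serialise to JSON text; writing this string to output.json is left out here.
as_json = json.dumps(records, indent=2)
```

To read a real file, the same function can be given an open file handle, for example `read_fasta(open("sample.fasta"))`, since it only iterates over lines.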
- We started by writing a Python script named `sliding-window.py` to illustrate the sliding window technique.
- We then scaled this up to `main.ipynb`, which handles a much larger sequence of DNA that we generated using an online tool. This sequence is stored in the text file `dna.txt` and is imported as and when required.
- In addition, the data structure that stores all of the data pertaining to the nucleotides is written to `output.json` for future reference.
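
The sliding window technique mentioned above can be sketched roughly as follows. This is an illustrative example, not the contents of `sliding-window.py`: the function names, the window size, the step and the choice to count nucleotides per window are all assumptions.

```python
from collections import Counter


def sliding_windows(sequence, window_size, step=1):
    """Yield (start_index, window) pairs over the sequence.

    Only full-length windows are produced, so a sequence shorter
    than window_size yields nothing.
    """
    if window_size <= 0 or step <= 0:
        raise ValueError("window_size and step must be positive")
    for start in range(0, len(sequence) - window_size + 1, step):
        yield start, sequence[start:start + window_size]


def window_counts(sequence, window_size, step=1):
    """Count nucleotides in each window (hypothetical per-window statistic)."""
    return [
        {"start": start, "window": window, "counts": dict(Counter(window))}
        for start, window in sliding_windows(sequence, window_size, step)
    ]


# Made-up sequence; the project reads its sequence from a file instead.
results = window_counts("ACGTAC", window_size=4)
```

Each entry in `results` is a plain dictionary, so the list can be passed directly to `json.dump` if it needs to be saved.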
The docs for this project can be found in a Google Doc here. This daily log includes an agenda, code snippets and screenshots, with an explanation of every block of code.