Grass-KaKs

Grass-KaKs is used to calculate the ratio of the number of nonsynonymous substitutions per non-synonymous site (Ka) to the number of synonymous substitutions per synonymous site (Ks) per gene in multiple grass species.

Dependencies

The code only works under Linux system.
Install Phylogenetic Analysis by Maximum Likelihood (PAML) in your environmental path.
Syntenic gene list with multiple species (example could be found in here).
Relevant CDS sequence with primary transcript of your analyzed species. Name of fasta file should be idential with name in the header file of syntenic gene list (e.g. sorghum3.fa, setaria2.fa and maize4.fa).

Run the analysis

This is an example to run analysis for three species including maize, sorghum and setaria.

 python grass-kaks-generator.py -s sorghum3 maize4_1 setaria2 -i syntenic_list_example.csv -m

-s, species name you want to analyze, it should be idential as the header in syntenic gene list file.
-i, syntenic gene list.
-m, since maize contains a whole genome duplication, set this flag when you include maize in the analysis. maize1 and maize2 should be run separately.

The header for these three species is Maize,dn,ds,dn/ds,Sorghum,dn,ds,dn/ds,Seteria,dn,ds,dn/ds

Several things you need to notice

each subgenome of WGD (Whole Genome Duplication) Species is analyzed separately
unflag -m if you do not contain species of maize
clean up codon_alignment folder after you run
modify tree function in codon_align.py based on relations among your species
customize parameters in codeml.ctl as your requirements. Detailed setting could be found in here
modify extract_codeml.py to extract dn(Ka), ds(Ks) and dn/ds(Ka/Ks) ratio for corresponding species
change the header for your input species.
with more than 3 species, you need to define background and foreground species, also should label your interested branches

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
codon_alignment		codon_alignment
LICENSE		LICENSE
README.md		README.md
__init__.py		__init__.py
codeml.ctl		codeml.ctl
codon_align.py		codon_align.py
codon_align.pyc		codon_align.pyc
extract_codeml.py		extract_codeml.py
fasta_align.py		fasta_align.py
fasta_align.pyc		fasta_align.pyc
gene_extraction.py		gene_extraction.py
gene_extraction.pyc		gene_extraction.pyc
grass-kaks-generator.py		grass-kaks-generator.py
maize4.fa		maize4.fa
pal2nal.pl		pal2nal.pl
setaria2.fa		setaria2.fa
sorghum3.fa		sorghum3.fa
syntenic_list_example.csv		syntenic_list_example.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Grass-KaKs

Dependencies

Run the analysis

This is an example to run analysis for three species including maize, sorghum and setaria.

Several things you need to notice

About

Releases

Packages

Languages

License

shanwai1234/Grass-KaKs

Folders and files

Latest commit

History

Repository files navigation

Grass-KaKs

Dependencies

Run the analysis

This is an example to run analysis for three species including maize, sorghum and setaria.

Several things you need to notice

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages