You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
#!/usr/bin/env python"""# Original Source: # https://github.com/cruizperez/MicrobeAnnotator/blob/master/microbeannotator/pipeline/ko_mapper.py# Forked on 2023.10.18If you use this software, please cite the original publication: Ruiz-Perez, C.A., Conrad, R.E. & Konstantinidis, K.T. MicrobeAnnotator: a user-friendly, comprehensive functional annotation pipeline for microbial genomes. BMC Bioinformatics 22, 11 (2021). https://doi.org/10.1186/s12859-020-03940-5######################################################################### Author: Carlos A. Ruiz Perez# Email: cruizperez3@gatech.edu# Intitution: Georgia Institute of Technology# Version: 1.0.0# Date: Nov 13, 2020# Description: Maps protein KO information with their respective modules# and calculates the completeness percentage of each module present.########################################################################"""################################################################################"""---0.0 Import Modules---"""
Also in the help menu: Josh L. Espinoza's fork from MicrobeAnnotator. Please cite the following: https://doi.org/10.1186/s12859-020-03940-5
This module here makes a lot of sense to be categorized as bifurcating: M00075
![image](https://private-user-images.githubusercontent.com/9061708/331720140-e8c18a88-e100-49c3-9e39-9b97b10e59d9.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3Mzk5MjgyNzUsIm5iZiI6MTczOTkyNzk3NSwicGF0aCI6Ii85MDYxNzA4LzMzMTcyMDE0MC1lOGMxOGE4OC1lMTAwLTQ5YzMtOWUzOS05Yjk3YjEwZTU5ZDkucG5nP1gtQW16LUFsZ29yaXRobT1BV1M0LUhNQUMtU0hBMjU2JlgtQW16LUNyZWRlbnRpYWw9QUtJQVZDT0RZTFNBNTNQUUs0WkElMkYyMDI1MDIxOSUyRnVzLWVhc3QtMSUyRnMzJTJGYXdzNF9yZXF1ZXN0JlgtQW16LURhdGU9MjAyNTAyMTlUMDExOTM1WiZYLUFtei1FeHBpcmVzPTMwMCZYLUFtei1TaWduYXR1cmU9N2Q5ZGNmMWUyYTE2MzIwM2RmN2NhMTU0MTJkZjBiODM0OTY0ZDFlZDljOTFlNjI5OGExOTdlNDdjZjc3MTgyMCZYLUFtei1TaWduZWRIZWFkZXJzPWhvc3QifQ.VbqLHqQZxRG0ZXjYHF356AtlOZ4ZhmdyVQZQvam-o_Y)
However, this module is also in the bifurcating list and it seems to be fairly linear: M00832
![image](https://private-user-images.githubusercontent.com/9061708/331720292-8b871c1f-d6c4-410d-ad04-3bb09b10ebf4.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3Mzk5MjgyNzUsIm5iZiI6MTczOTkyNzk3NSwicGF0aCI6Ii85MDYxNzA4LzMzMTcyMDI5Mi04Yjg3MWMxZi1kNmM0LTQxMGQtYWQwNC0zYmIwOWIxMGViZjQucG5nP1gtQW16LUFsZ29yaXRobT1BV1M0LUhNQUMtU0hBMjU2JlgtQW16LUNyZWRlbnRpYWw9QUtJQVZDT0RZTFNBNTNQUUs0WkElMkYyMDI1MDIxOSUyRnVzLWVhc3QtMSUyRnMzJTJGYXdzNF9yZXF1ZXN0JlgtQW16LURhdGU9MjAyNTAyMTlUMDExOTM1WiZYLUFtei1FeHBpcmVzPTMwMCZYLUFtei1TaWduYXR1cmU9N2Y4NDM2NTA5ZGZmNzBlODAyZWViZTE3MjQ2MzRjOGRjYmQzZmZjMDE4MmZjMTNlYjUyZTQ5YTlhZTFhMGFiYSZYLUFtei1TaWduZWRIZWFkZXJzPWhvc3QifQ.amgTzTG0adfcTK0y9MiIhmENyWiZ64Xt84eQnQ5UqRg)
Is there a rule-based scheme used determine module categorization (e.g., regular, bifurcating, or structural)?
I would like to update it with the 481 modules here: https://www.genome.jp/kegg/docs/module_statistics.html
Also, disclaimer that I forked and modified your module completion ratio calculations in my VEBA package (with proper citations of course!):
https://github.com/jolespin/veba/blob/9532bee1a5e57dd0968e2e04b21abcaede7fcccf/bin/scripts/module_completion_ratios.py#L4
Prefixed with this in the script:
Also in the help menu:
Josh L. Espinoza's fork from MicrobeAnnotator. Please cite the following: https://doi.org/10.1186/s12859-020-03940-5
And of course in the publication which should be accepted soon: https://www.biorxiv.org/content/10.1101/2024.03.08.583560v2
I'd be happy to continue development on the KEGG module completion but would need some guidance on how you determine the module categories.
The text was updated successfully, but these errors were encountered: