Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Request: don't rearrange sequence order #45

Open
SethMusker opened this issue Jan 13, 2023 · 2 comments
Open

Request: don't rearrange sequence order #45

SethMusker opened this issue Jan 13, 2023 · 2 comments

Comments

@SethMusker
Copy link

Hi Robert,

Is it possible to keep the order of the sequences in the output file the same as in the input? (I've been getting this with the latest windows and linux pre-compiled binaries, with -align, with each producing a different, seemingly random order).

Cheers,
Seth

p.s. I've been really happy with the quality of alignments from muscle5!

@rcedgar
Copy link
Owner

rcedgar commented Jan 13, 2023

Thanks for the feedback. I'm aware of the "seemingly random" order issue. I was thinking the output should be sorted to bring similar sequences together (e.g. this can be done with a post-order traversal of the guide tree), but preserving the input order also makes sense as an option. On my list, but not sure when I will be able to work on it. In the mean time it's a few lines of python to re-arrange the sequences following the order of the input FASTA or a text file with a list of sequence identifiers.

@permia
Copy link

permia commented Feb 17, 2024

The esl-alimanip (in HMMER software) can solve the two situation easily. The option --tree in esl-alimanip may be used to bring similar sequences together. The option --reorder may be used to keep the order as the input fasta.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants