Multi index again #326

Merged
merged 17 commits into from
Mar 3, 2025

Collaborator

@ekg ekg commented Mar 2, 2025

Fixes and improves index saving and loading, including building and loading indexes for a target subset (wfmash -b).

ekg added 17 commits February 27, 2025 17:55
…usage

This commit introduces a robust mechanism for maintaining consistent sequence ID assignments during wfmash index creation and usage. Key changes include:

- Added export/import methods for sequence ID mappings in SequenceIdManager
- Modified addSequence to use a nextId counter for new sequences
- Updated index header reading/writing to preserve sequence ID information
- Ensured sequence ID consistency across different runs and filtering scenarios

The implementation allows:
- Preserving original sequence IDs when creating/using indexes
- Adding new sequences without disrupting existing ID assignments
- Handling cases with multiple input files and sequence subsets
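The ID bookkeeping described above can be sketched roughly as follows. This is a minimal illustration, not the code from this PR: the class name `SequenceIdManagerSketch`, the method names `exportMapping`, `importMapping`, and `getId`, and the choice to reject conflicting IDs on import are all assumptions. Only `addSequence` and the idea of a `nextId` counter come from the commit message.

```cpp
#include <cstdint>
#include <map>
#include <stdexcept>
#include <string>
#include <utility>
#include <vector>

// Hypothetical sketch; NOT the wfmash SequenceIdManager.
class SequenceIdManagerSketch {
public:
    using SeqId = std::uint32_t;
    using Mapping = std::vector<std::pair<std::string, SeqId>>;

    // Return the existing ID for a known name, otherwise assign the next unused ID.
    SeqId addSequence(const std::string& name) {
        auto it = nameToId_.find(name);
        if (it != nameToId_.end()) return it->second;
        const SeqId id = nextId_++;
        nameToId_.emplace(name, id);
        return id;
    }

    // Look up an ID; the error message names the missing sequence.
    SeqId getId(const std::string& name) const {
        auto it = nameToId_.find(name);
        if (it == nameToId_.end())
            throw std::runtime_error("sequence not found in ID mapping: " + name);
        return it->second;
    }

    // Export (name, id) pairs, for example to store them in an index header.
    Mapping exportMapping() const {
        Mapping out(nameToId_.begin(), nameToId_.end());
        return out;
    }

    // Import saved pairs and keep nextId_ above every imported ID, so that
    // sequences added later never collide with IDs stored in the index.
    // Assumption: a name that is already mapped to a different ID is an error.
    void importMapping(const Mapping& mapping) {
        for (const auto& entry : mapping) {
            auto it = nameToId_.find(entry.first);
            if (it != nameToId_.end() && it->second != entry.second)
                throw std::runtime_error("conflicting ID for sequence: " + entry.first);
            nameToId_[entry.first] = entry.second;
            if (entry.second >= nextId_) nextId_ = entry.second + 1;
        }
    }

private:
    std::map<std::string, SeqId> nameToId_;
    SeqId nextId_ = 0;
};
```

In this sketch, a run that loads only a subset of the indexed sequences still sees their original IDs, and any sequence added afterwards receives an ID above the largest imported one.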

Resolves potential mapping issues when working with complex sequence sets.

This commit enhances the robustness of sequence ID management by:
1. Adding more detailed error messages for missing sequences
2. Improving ID mapping import and merging
3. Adding prefix-based sequence filtering
4. Providing better logging for sequence-related issues
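Prefix-based sequence filtering (item 3) might look something like the sketch below. The helper names `hasPrefix` and `filterByPrefix` are hypothetical, and treating an empty prefix list as "keep everything" is an assumption; the commit message does not say how wfmash handles that case.

```cpp
#include <string>
#include <vector>

// True if `name` starts with `prefix` (hypothetical helper).
inline bool hasPrefix(const std::string& name, const std::string& prefix) {
    return name.compare(0, prefix.size(), prefix) == 0;
}

// Keep the names that start with at least one of the given prefixes.
// Assumption: an empty prefix list means no filtering is applied.
inline std::vector<std::string> filterByPrefix(
        const std::vector<std::string>& names,
        const std::vector<std::string>& prefixes) {
    if (prefixes.empty()) return names;
    std::vector<std::string> kept;
    for (const auto& name : names) {
        for (const auto& prefix : prefixes) {
            if (hasPrefix(name, prefix)) {
                kept.push_back(name);
                break;  // one matching prefix is enough
            }
        }
    }
    return kept;
}
```

The input order is preserved, so IDs assigned to the filtered names afterwards depend only on the order of the original list.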
@ekg ekg merged commit 5b22848 into main Mar 3, 2025
1 check passed