Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Try edlib for faster duplex matching #28

Open
olawa opened this issue Dec 15, 2022 · 1 comment
Open

Try edlib for faster duplex matching #28

olawa opened this issue Dec 15, 2022 · 1 comment

Comments

@olawa
Copy link

olawa commented Dec 15, 2022

Hi,

suggest to try edlib in place of parasail for alignment of pairs to get a speed up for longer comparisons. Potentially this can recover more good pairs if you go from 250 bp to full/long comparison. I tried it with 5kb and got ~15x speedup while keeping almost the same number of good matches.

@onordesjo
Copy link

It's definitely worth a try, thanks for the suggestion. Initially we cared about getting template end and complement start match up really well, but getting more certain that we want to match up the full sequences. We would have to be careful about not allowing too short partial matches (SINEs/LINEs) as well though which will take some thinking about. If it looks ok I wouldn't be surprised if we switch to edlib.

Cheers

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants