Improve coordinate concordance #965

Open
CBeelen opened this issue May 19, 2023 · 0 comments
@CBeelen
Contributor

CBeelen commented May 19, 2023

The coordinate concordance was calculated relative to reference coordinates in 7.15, and will be calculated relative to conseq coordinates in 7.16. However, both approaches share one drawback: we cannot calculate a concordance for insertions (with reference coordinates) or for deletions (with conseq coordinates), because those positions have nothing to be compared to.
An alternative would be to extend both sequences to the same length, that is, to fill up all deletions in both sequences with x characters, and to calculate the concordance for these two padded sequences. The concordance would then decrease equally for insertions and deletions in the conseq. For a deletion, we would be able to tell because the window over which we average the concordance would extend into the deletion at its edge.
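To make the padding idea concrete, here is a minimal sketch. It assumes the two sequences have already been pairwise aligned with `-` as the gap character; the function name `padded_concordance`, the default window size, and the rule that an `x` never counts as a match are illustrative assumptions, not the existing implementation.

```python
def padded_concordance(aligned_ref, aligned_conseq, window=5, pad="x", gap="-"):
    """Windowed concordance over two gap-padded, equal-length sequences.

    Hypothetical helper: inputs are assumed to be pairwise-aligned strings
    of equal length, with `gap` marking insertions and deletions.
    """
    if len(aligned_ref) != len(aligned_conseq):
        raise ValueError("aligned sequences must have the same length")

    # Fill every gap in both sequences with the padding character.
    ref = aligned_ref.replace(gap, pad)
    conseq = aligned_conseq.replace(gap, pad)

    # A column matches only if both bases agree and neither is padding.
    matches = [int(r == c and r != pad) for r, c in zip(ref, conseq)]

    # Average the matches over a centred window, truncated at the ends.
    half = window // 2
    scores = []
    for i in range(len(matches)):
        start = max(0, i - half)
        end = min(len(matches), i + half + 1)
        scores.append(sum(matches[start:end]) / (end - start))
    return scores
```

With this sketch, swapping the two arguments gives the same scores, so a deletion in the conseq lowers the concordance exactly as much as an insertion of the same size would.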
The advantage is that insertions and deletions would be treated "symmetrically". The difficulty will be keeping track of the true conseq coordinates and assigning the computed concordances to the correct positions. We will also have to decide what to do with unmatched pieces of sequence; currently, we do not compare them to anything and leave their concordance at 0.
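One possible way to handle the coordinate bookkeeping is sketched below. It assumes per-column scores computed on the padded alignment and 1-based conseq positions; the name `assign_to_conseq_positions` and the choice to drop scores in deletion columns are assumptions for illustration only.

```python
def assign_to_conseq_positions(aligned_conseq, column_scores, gap="-"):
    """Map per-column scores of a padded alignment back to conseq positions.

    Hypothetical helper: columns where the conseq has a gap (a deletion)
    have no conseq coordinate of their own, so their scores are skipped.
    """
    if len(aligned_conseq) != len(column_scores):
        raise ValueError("need exactly one score per alignment column")

    by_position = {}
    conseq_pos = 0  # 1-based position in the unpadded conseq (assumed)
    for base, score in zip(aligned_conseq, column_scores):
        if base == gap:
            continue  # deletion column: nothing to assign the score to
        conseq_pos += 1
        by_position[conseq_pos] = score
    return by_position
```

The deletion still shows up in the result, because the windowed scores of the neighbouring conseq positions already include the padded columns.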

@CBeelen CBeelen modified the milestones: near future, 7.17 May 19, 2023