Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Frequent reporting of a null allele for 10X data #114

Open
vincentwalter opened this issue Nov 29, 2023 · 0 comments
Open

Frequent reporting of a null allele for 10X data #114

vincentwalter opened this issue Nov 29, 2023 · 0 comments

Comments

@vincentwalter
Copy link

Thanks for creating arcasHLA!

Since 10X references that arcasHLA does a good job at genotyping I tried to use it with scRNA data.

I extracted reads using the following command:

arcasHLA extract \
    --single --unmapped \
    sample_alignments.bam

And then genotyped:

 arcasHLA genotype \
    -g A,B,C,DQA1,DQB1,DRB1 \
    -p caucasian \
    --single \
    sample_alignments.extracted.fq.gz

This results in the following genotypes being assigned:

locus subject_1 subject_2 subject_3 subject_4 subject_5
A A*01:01:140,A*11:303 A*24:608N,A*24:608N A*01:01:143,A*03:01:103 A*29:01:01,A*29:01:01 A*03:01:119,A*24:608N
B B*07:386N,B*07:386N B*07:386N,B*40:01:02 B*07:386N,B*57:01:01 B*07:386N,B*07:386N B*07:386N,B*07:386N
C C*07:01:106,C*07:02:104 C*03:392N,C*07:02:101 C*07:02:128,C*07:02:128 C*15:05:02,C*15:05:02 C*07:02:101,C*07:02:101
DQA1 DQA1*03:01:11,DQA1*03:03:01 DQA1*01:02:01,DQA1*01:02:01 DQA1*01:03:01,DQA1*01:05:01 DQA1*01:01:01,DQA1*01:05:01 DQA1*05:05:01,DQA1*01:02:01
DQB1 DQB1*03:02:01,DQB1*03:01:01 DQB1*06:304N,DQB1*06:02:01 DQB1*06:352,DQB1*05:01:01 DQB1*05:01:01,DQB1*05:01:01 DQB1*03:01:46,DQB1*06:02:01
DRB1 DRB1*04:01:01,DRB1*04:01:01 DRB1*08:01:01,DRB1*08:01:01 DRB1*10:01:01,DRB1*13:01:01 DRB1*10:01:01,DRB1*01:02:01 DRB1*11:04:01,DRB1*15:01:01

Since the subjects were selected because they share a CD8+ T cell response to a certain antigen I'm not surprised that they share class B alleles. However, I don't think it's plausible that they all share B*07:386N, since it's a null allele. In case of subjects 1, 4 & 5 the result even implies that B*07:386N makes up both alleles.

Do you have an explanation for this?
Is there a way to exclude null alleles from the reference?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant