Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How trust4 filters barcodes from cdr3.out and anna.fa to barcode_report? #275

Open
yuyuleung opened this issue May 31, 2024 · 4 comments
Open

Comments

@yuyuleung
Copy link

Dear Dr. Li,

I am looking into the raw outputs from trsut4, like cdr3.out, anno.fa, barcode_report, and barcode_airr, so that I can learn how my data looks like (why there is a little number of complete VDJ). However, I am confused, how the trust-barcoderep.pl and trust-airr.pl actually work to filter invalid contigs and finally got barcode_report and barcode_airr. As I have quick look into these two scripts, I guess that the contigs with parital CDR3 (cdr3_score = 0) , out_of_frame CDR3 (len%3!=0), low cdr3 abundance supported will be filtered. And fianlly the contigs with valid CDR3 should be kept.

However, when I compare the set of barcodes between cdr3.out and barcode_report, I am confused because there are several barcodes that appear to be valid CDR3 contigs but are filtered out from cdr3.out.

To better understand how to process the raw assembly output, could you please explain it more detailed?

Many thanks.
Yuyu

Following file is partial cdr3.out of the barcode 812_669, but this barcode is filtered in barcode_report.
barcode_in_cdr3_not_in_report.txt

@mourisl
Copy link
Collaborator

mourisl commented May 31, 2024

It should not be filtered. I can see the barcode when directly running trust-barcoderep on this file. Is it possible to share the original cdr3.out file? You can send me the file through email (Li.Song@dartmouth.edu).

@yuyuleung
Copy link
Author

Dear Dr. Li,

thank you for your answer. I have double check it, it does in the barcode_report. I am sorry about it.

However, can you tell me more about ow the trust-barcoderep.pl and trust-airr.pl actually work to filter invalid contigs and finally got barcode_report and barcode_airr? Cause I have compared the barcode-set after filtering parital CDR3 (cdr3_score = 0) , out_of_frame CDR3 (len%3!=0), low cdr3 abundance from cdr3.out, the resulting barcode-set is not exactly as the set of barcode-report and barcode-airr.

Thank you so much.
Yuyu

@mourisl
Copy link
Collaborator

mourisl commented Jun 3, 2024

There is an extra filter for gdT. If there are some good abT or B contigs assembled, even without CDR3, those gdT barcode may be regarded as false positives. If you know the barcode that is missing in the barcode report, you can check their chain type in the cdr3.out file.

The trust-airr is based on the barcode_report file, and I don't think it has extra filters.

@yuyuleung
Copy link
Author

Oki, I will look into them. Thank you so much :)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants