Are the OCR evaluation model (ASTER/CRNN) and TATT trained end-to-end? #8

Open
HansonnnCheung opened this issue May 30, 2022 · 3 comments
Comments

@HansonnnCheung

Are the OCR evaluation model (ASTER/CRNN) and TATT trained end-to-end? Or is the SR model run first, with its output then fed into the OCR model?
For example, in the code below:
    def __getitem__(self, index):
        ...
        ...
        label_str = str_filt(word, self.voc_type)
        return img_HR, img_lr, img_HRy, img_lry, label_str

Does “label_str” participate in the training of the whole model?

@mjq11302010044
Owner

@HansonnnCheung Not really. TATT follows the settings of TPGSR, i.e., label_str is not used in training.
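
For illustration, here is a minimal sketch (not the TATT or TPGSR code) of a pipeline in which the dataset returns label_str but the training step never touches it. `ToyPairDataset`, `toy_sr_model`, `train_step`, and `train` are hypothetical names, the "images" are plain lists of floats, and the one-parameter model is only a stand-in for a real SR network.

```python
# Illustrative sketch only: a toy dataset and loop, not the TATT or TPGSR code.


class ToyPairDataset:
    """Hypothetical dataset returning (img_hr, img_lr, label_str) per sample."""

    def __init__(self, samples):
        self.samples = samples  # list of (hr_pixels, lr_pixels, word)

    def __len__(self):
        return len(self.samples)

    def __getitem__(self, index):
        img_hr, img_lr, word = self.samples[index]
        label_str = word.lower()  # stand-in for str_filt(word, voc_type)
        return img_hr, img_lr, label_str


def toy_sr_model(img_lr, scale):
    """Hypothetical one-parameter 'SR model': multiplies every pixel by scale."""
    return [scale * p for p in img_lr]


def train_step(dataset, scale, lr=0.05):
    """One gradient-descent step on the mean squared pixel error."""
    grad = 0.0
    for i in range(len(dataset)):
        # label_str is loaded with the batch but plays no part in the loss.
        img_hr, img_lr, _label_str = dataset[i]
        pred = toy_sr_model(img_lr, scale)
        # d/d(scale) of mean((scale * x - t)^2) = mean(2 * (scale * x - t) * x)
        grad += sum(
            2 * (p - t) * x for p, t, x in zip(pred, img_hr, img_lr)
        ) / len(img_hr)
    return scale - lr * grad / len(dataset)


def train(dataset, steps, scale=1.0):
    """Run several training steps and return the learned scale."""
    for _ in range(steps):
        scale = train_step(dataset, scale)
    return scale
```

In this sketch, swapping the words in the samples for arbitrary strings leaves the learned parameter unchanged, because the label only matters once a recognizer is used to score the output at evaluation time.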

@HansonnnCheung
Author

Thank you for your answer. Where does the “teaching_loss” come from during training? Is "TEXT_Prior" generated by an OCR model? @mjq11302010044

In addition, if "label_str" does not participate in training, is it necessary to pass the value of AAA as an input in the DataLoader?

@HansonnnCheung
Author

Do you mean that “TPGSR and its settings” refers to the TPGSR model published on your home page? @mjq11302010044
