Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

difference between DRTC-L and DRCT-XL #15

Open
Ar0Kim opened this issue Jun 30, 2024 · 2 comments
Open

difference between DRTC-L and DRCT-XL #15

Ar0Kim opened this issue Jun 30, 2024 · 2 comments

Comments

@Ar0Kim
Copy link

Ar0Kim commented Jun 30, 2024

스크린샷 2024-06-30 183500
Hi, thank you for sharing your project. I have a question about the differences between DRTC-L and DRCT-XL. In the paper, it is mentioned that DRCT-L is also pretrained on the ImageNet dataset, but on the GitHub page, it is indicated as "pretrained on ImageNet" only under DRCT-XL. Secondly, to train DRCT-L according to your paper, for SRX4, should I first use train_DRCT-L_SRx4_ImageNet_from_scratch.yml to train and obtain the pretrained model, and then use train_DRCT-L_SRx4_finetune_from_ImageNet_pretrain.yml for the final training?
image

@ming053l
Copy link
Owner

ming053l commented Jul 1, 2024

hi, the difference between Large and X-Large version is the depth of RDG.
XL version have 14 RDG, L have 12.

Secondly, to train DRCT-L according to your paper, for SRX4, should I first use train_DRCT-L_SRx4_ImageNet_from_scratch.yml to train and obtain the pretrained model, and then use train_DRCT-L_SRx4_finetune_from_ImageNet_pretrain.yml for the final training?

Yes, you need to pre-train DRCT on large scale dataset ('imagenet') and then fine-tune it on your own dataset like DF2K

@Ar0Kim
Copy link
Author

Ar0Kim commented Jul 1, 2024

thank you for the reply. it's so helpful :)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants