Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

About the model size #1

Open
PrShi113 opened this issue Sep 28, 2023 · 2 comments
Open

About the model size #1

PrShi113 opened this issue Sep 28, 2023 · 2 comments

Comments

@PrShi113
Copy link

Hi,

Thanks for releasing code of D2Former. It is a very interesting work!

I tried to calculate the number of the parameters of TSCNet in your "generator" file. But I found the model size was 3.28M, rather than 0.87M in your paper.

I wonder if I forgot something...Look forward to hearing from you.

@alibabasglab
Copy link
Owner

Hi,

Thanks for your interests. Regarding your query, we used the following code in the 'train.py' file for computing the number of parameters:

self.model = TSCNet(num_channel=32, num_features=self.n_fft // 2 + 1).cuda()
summary(self.model, [(1, 2, args.cut_len//self.hop+1, int(self.n_fft/2)+1)])

You may follow the same procedure to obtain the number of model parameters.

Thank you.

@PrShi113
Copy link
Author

PrShi113 commented Oct 2, 2023

Thank you for your rapid response!

One more question, I am using a RTX-2080 8G GPU for training, but find CUDA out of memory. Could you inform me the GPUs you used to train your D2Former recipe? Many thanks!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants