Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

can i use 128 as framesize when using model , if not, how can I do it ? #18

Open
zhuhao528 opened this issue Aug 30, 2023 · 4 comments
Open

Comments

@zhuhao528
Copy link

zhuhao528 commented Aug 30, 2023

我可以在使用模型的时候使用帧长度为128吗,我看现有的模型是使用的帧长度是160。如果在使用模型的时候使用128帧长不合适,那么需要改网络训练的帧长,重新训练吗?

@YongyuG
Copy link
Owner

YongyuG commented Sep 6, 2023 via email

@zhuhao528
Copy link
Author

zhuhao528 commented Sep 6, 2023

ok thanks,I read a blog that helped me,this is the address https://codeantenna.com/a/pEQArF4zmR

@zhuhao528
Copy link
Author

zhuhao528 commented Sep 9, 2023

when modifing the frame_size to 128, should i modify eband5ms array as same time?
for example:
/for 16K speech frame_size 160 files/ // hz[i]/4*(sample_rate/win_len) = hz[i]/200
// static const opus_int16 eband5ms[] = {
///0 200 400 600 800 1k 1.2 1.4 1.6 2k 2.4 2.8 3.2 4k 4.8 5.6 6.8 8k 9.6 12k 15.6 20k/
// 0, 1, 2, 3, 4, 5, 6, 7, 8, 10, 12, 14, 16, 20, 24, 28, 34, 40, 48, 60, 78, 100
//};

/for 16K speech frame_size 128 files/ // hz[i]/4*(sample_rate/win_len) = hz[i]/250 向下取整
static const opus_int16 eband5ms[] = {
/0 200 400 600 800 1k 1.2 1.4 1.6 2k 2.4 2.8 3.2 4k 4.8 5.6 6.8 8k 9.6 12k 15.6 20k/
0, 0, 1, 2, 3, 4, 4, 5, 6, 8, 9, 11, 12, 16, 19, 22, 27, 32, 38, 48, 62, 80
};

I round up when I calculate,is this right?Frequency range for boys is 100-200hz,girl is 200-350Hz,will this have a detrimental effect?

@YongyuG
Copy link
Owner

YongyuG commented Sep 25, 2023

when modifing the frame_size to 128, should i modify eband5ms array as same time? for example: /for 16K speech frame_size 160 files/ // hz[i]/4*(sample_rate/win_len) = hz[i]/200 // static const opus_int16 eband5ms[] = { ///0 200 400 600 800 1k 1.2 1.4 1.6 2k 2.4 2.8 3.2 4k 4.8 5.6 6.8 8k 9.6 12k 15.6 20k/ // 0, 1, 2, 3, 4, 5, 6, 7, 8, 10, 12, 14, 16, 20, 24, 28, 34, 40, 48, 60, 78, 100 //};

/for 16K speech frame_size 128 files/ // hz[i]/4*(sample_rate/win_len) = hz[i]/250 向下取整 static const opus_int16 eband5ms[] = { /0 200 400 600 800 1k 1.2 1.4 1.6 2k 2.4 2.8 3.2 4k 4.8 5.6 6.8 8k 9.6 12k 15.6 20k/ 0, 0, 1, 2, 3, 4, 4, 5, 6, 8, 9, 11, 12, 16, 19, 22, 27, 32, 38, 48, 62, 80 };

I round up when I calculate,is this right?Frequency range for boys is 100-200hz,girl is 200-350Hz,will this have a detrimental effect?

when modifing the frame_size to 128, should i modify eband5ms array as same time? for example: /for 16K speech frame_size 160 files/ // hz[i]/4*(sample_rate/win_len) = hz[i]/200 // static const opus_int16 eband5ms[] = { ///0 200 400 600 800 1k 1.2 1.4 1.6 2k 2.4 2.8 3.2 4k 4.8 5.6 6.8 8k 9.6 12k 15.6 20k/ // 0, 1, 2, 3, 4, 5, 6, 7, 8, 10, 12, 14, 16, 20, 24, 28, 34, 40, 48, 60, 78, 100 //};

/for 16K speech frame_size 128 files/ // hz[i]/4*(sample_rate/win_len) = hz[i]/250 向下取整 static const opus_int16 eband5ms[] = { /0 200 400 600 800 1k 1.2 1.4 1.6 2k 2.4 2.8 3.2 4k 4.8 5.6 6.8 8k 9.6 12k 15.6 20k/ 0, 0, 1, 2, 3, 4, 4, 5, 6, 8, 9, 11, 12, 16, 19, 22, 27, 32, 38, 48, 62, 80 };

I round up when I calculate,is this right?Frequency range for boys is 100-200hz,girl is 200-350Hz,will this have a detrimental effect?

no need actually, but you can do it as a experimentt.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants