-
Notifications
You must be signed in to change notification settings - Fork 34
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Help Needed #10
Comments
It seems that your image is of different size. The network suppose your image's hight is 32, and the width is multiplies of 8 by default. The CNN backbone of the network compress image's width by 1/4 in arch So if your image's width is 80, the size of Or you can train the network with The |
Thanks for your quick response. If i decode this prediction |
Do you mean sequence length can be up to 99? CTC loss introduce a blank token ϵ to get around not knowing the alignment between the input and the output. When you infer an image's content, you should collapse repeats and remove ϵ tokens. So the decoded output size is not fixed. It depends on your input image and your trained networks. You can refer Awni's article here. If your datasets is small, the mean and std of the datasets is significant, and the |
Is there any particular reason to use softmax? i did not see original paper mention it Never mind. i got it. It served as input for CTClOSS |
@ronghui19 Yes, I didn't know what is the result when remove CTC loss from the crnn network, which I mean only combine the log softmax in the procedure of back propagation. I'm testing this. |
@zhiqwang In file crnn.py, there is an init_network function, as far as i am concerned, you may forget to set pretrained network frozen. There should be something like this: |
@ronghui19 Current, there is not any pre-trained model to be used in the CNN backbone. So I did not set the learning rate of this part. |
I am training a CRNN model in pytorch
max_seq_length=99
number_of_alphabets=96
batch_size=16
output=CRNN(image)
what should be the expected shape of output?
Secondly, should we apply softmax in CRNN after fully connected layer?
Any help would be appreciated. Thanks
The text was updated successfully, but these errors were encountered: