Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Can't recognize consecutive same charactors #9

Closed
RaisingSun opened this issue Nov 14, 2017 · 4 comments
Closed

Can't recognize consecutive same charactors #9

RaisingSun opened this issue Nov 14, 2017 · 4 comments

Comments

@RaisingSun
Copy link

Hi, I read your excellent paper and use your code to do some experiment. But I found it can not recognize the consecutive charactor when they are same. For example, "good" will be recognized as "god".
Could you please help me about this problem?
Thanks

@weinman
Copy link
Owner

weinman commented Nov 14, 2017

The model should learn to emit the blank character between repeats so the CTC decoding doesn't collapse them.

@weinman weinman closed this as completed Nov 14, 2017
@RaisingSun
Copy link
Author

Thanks for your response. You mean I have to add blank character between repeats in label?

@Parshwa27
Copy link

@RaisingSun I have the same issue. Can you help?

@weinman
Copy link
Owner

weinman commented Mar 13, 2019

@RaisingSun No, you do not add the blank in the label. If your training schedule is sufficient, it will learn to emit the blank so as to force the repeat character after collapsing. Training the model is very sensitive to local minima. See e.g. #42 (comment)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants