Are there plans to support Traditional Chinese characters? #156

Closed
fangkuann opened this issue Jun 8, 2020 · 3 comments

Comments

@fangkuann

fangkuann commented Jun 8, 2020

At the moment, the error rate is quite high when the subtitles are in Traditional Chinese.

@LDOUBLEV
Collaborator

LDOUBLEV commented Jun 8, 2020

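The high error rate on Traditional Chinese is mainly because the recognition network's training data contains relatively few Traditional Chinese characters. Support for Traditional Chinese is not currently scheduled. You can fine-tune the Chinese model we released, adding Traditional Chinese data to the training set.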
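
One preparatory step before such a fine-tune is checking whether the character dictionary covers the Traditional Chinese labels at all, since a recognizer can only output characters that are in its dictionary. The following is a minimal sketch of that check in plain Python. It does not use any PaddleOCR API; the function name `missing_characters` and the toy dictionary and labels are hypothetical and stand in for a real dictionary file and label set.

```python
from collections import Counter


def missing_characters(labels, dictionary):
    """Count label characters that the dictionary does not cover."""
    known = set(dictionary)
    missing = Counter()
    for text in labels:
        for ch in text:
            if ch not in known:
                missing[ch] += 1
    return missing


# Hypothetical toy data: a Simplified-only dictionary and mixed labels.
dictionary = ["学", "习", "语", "言"]
labels = ["學習", "语言", "語言"]

missing = missing_characters(labels, dictionary)

# Append the uncovered characters so the fine-tuned model can emit them.
extended_dictionary = dictionary + sorted(missing)

print(dict(missing))
print(len(extended_dictionary))
```

Characters that appear in `missing` would have to be added to the dictionary, and the output layer resized to match, before training on the new data can help.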

@fangkuann
Author

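OK. English words currently come out of recognition stuck together, with no spaces between them. Can word segmentation be applied to this?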

@guanshuicheng

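The space between English words can be added to the dictionary as a character of its own and included in training. I trained a CRNN this way before, and it worked very well.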
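
The idea above can be sketched as follows: the space gets its own class index in the dictionary, so labels keep their word boundaries and a CTC-style decoder can emit them. This is an illustrative sketch only, not the commenter's code and not PaddleOCR's implementation. The helper names (`build_dictionary`, `encode`, `ctc_greedy_decode`), the lowercase-only alphabet, and the choice of index 0 for the CTC blank are all assumptions.

```python
def build_dictionary(chars, add_space=True):
    """Return the list of output characters, with the space as its own entry."""
    dictionary = list(chars)
    if add_space and " " not in dictionary:
        dictionary.append(" ")
    return dictionary


def encode(text, dictionary):
    """Map a label string to class indices; index 0 is reserved for the blank."""
    index = {ch: i + 1 for i, ch in enumerate(dictionary)}
    return [index[ch] for ch in text]


def ctc_greedy_decode(frame_ids, dictionary):
    """Collapse repeated indices and drop blanks (index 0)."""
    out = []
    prev = 0
    for i in frame_ids:
        if i != 0 and i != prev:
            out.append(dictionary[i - 1])
        prev = i
    return "".join(out)


dictionary = build_dictionary("abcdefghijklmnopqrstuvwxyz")

# Training target: the space is encoded like any other character.
target = encode("hello world", dictionary)

# Hypothetical per-frame argmax output of a trained recognizer.
frames = [8, 8, 0, 5, 12, 0, 12, 15, 27, 27, 23, 15, 18, 12, 4]
decoded = ctc_greedy_decode(frames, dictionary)
print(decoded)
```

With the space in the dictionary, the training labels also need to contain the spaces, otherwise the model has no examples to learn the new class from.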

an1018 pushed a commit to an1018/PaddleOCR that referenced this issue Aug 17, 2022
* add blazeface nas demo

* fix format

* fix configs

* fix reduce_rate