skip sfx when scanning images #16576
Unanswered
nhannguyen0411
asked this question in
Q&A
Replies: 1 comment
-
|
Hello, in this case, you might consider performing a secondary recognition. For example, for all text areas recognized as numbers, use a text recognition model without the lang="en" parameter to recognize them again. Here is the documentation for using the text recognition model. https://www.paddleocr.ai/main/en/version3.x/module_usage/text_recognition.html |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Hi everyone, I'm new to PPOCRV5, currently I'm having a problem using PPOCRV5 to scan text on images. When I add the parameter lang="en", it automatically scans kanji/kana as numbers, but if I remove lang="en", the kanji/kana are kept but other characters are recognized incorrectly. Now I want to remove the sfx recognition or keep the kanji/kana when leaving lang="en". Does anyone have a way? Thanks.
python: 3.11
paddleocr: latest
lang="en"

without lang="en"

Beta Was this translation helpful? Give feedback.
All reactions