Skip to content

改进图片 OCR 提取文本结果中的多余字符 #7109

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
88250 opened this issue Jan 18, 2023 · 3 comments
Closed

改进图片 OCR 提取文本结果中的多余字符 #7109

88250 opened this issue Jan 18, 2023 · 3 comments
Assignees
Milestone

Comments

@88250
Copy link
Member

88250 commented Jan 18, 2023

  • 不可见字符
  • 中文之间的空格
@88250 88250 added this to the 2.7.1 milestone Jan 18, 2023
@88250 88250 self-assigned this Jan 18, 2023
@88250
Copy link
Member Author

88250 commented Jan 18, 2023

6d4aa07

88250 added a commit that referenced this issue Jan 18, 2023

Unverified

This user has not yet uploaded their public signing key.
@88250 88250 closed this as completed Jan 18, 2023
@Zuoqiu-Yingyi
Copy link
Contributor

@88250 请问若需要对已进行 OCR 的图片重新进行 OCR, 是直接重建索引还是删除 siyuan.db 后再重建索引

@88250
Copy link
Member Author

88250 commented Jan 24, 2023

删除 data/assets/ocr-texts.json 或者打开这个文件自行调整内容。

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants