[BUG] text in Chinese will contain space which cause omnisearch cannot find t #35

liangzh-404 · 2023-08-07T15:35:24Z

Problem description:
When extracting text in Chinese, the result contains spaces, so you can't find it in Omnisearch (unless you add the spaces to the query manually).
Here is an image I tested with:

The result in the cache is:
{"path":".obsidian/plugins/text-extractor/cache/802469d99ce6f82bfe4e7c007322d468.json","text":"万东立刻来了思路写起了大纲 , 大致剧情如下。【女主被匪徒绑架 , 男主为救女主被匪徒击中要害丧失了生育能力。女主知道后十分的愧疚 , 便把自己的子宫移植给了男主","libVersion":"0.2.2","langs":"chi_sim+eng"}
As you can see, it contains spaces between characters, which causes Omnisearch to stop finding the text.

Only after I add the spaces manually does the search return the OCR result.
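
A possible workaround, sketched here only as an illustration: post-process the extracted text and drop the whitespace that sits between CJK characters or next to the punctuation the OCR emits, while leaving spaces between Latin words alone. This is not the plugin's code; the function name `strip_cjk_spaces` and the character ranges are assumptions chosen for the example, and the ranges are not exhaustive.

```python
import re

# Assumed, non-exhaustive ranges: CJK punctuation, CJK ideographs
# (Extension A and the main block), and fullwidth forms.
_CJK = r"\u3000-\u303f\u3400-\u4dbf\u4e00-\u9fff\uff00-\uffef"
# ASCII punctuation that OCR may emit in place of fullwidth marks.
_PUNCT = r",.;:!?"

# Spaces/tabs with a CJK character or punctuation mark on BOTH sides.
_GAP = re.compile(rf"(?<=[{_CJK}{_PUNCT}])[ \t]+(?=[{_CJK}{_PUNCT}])")


def strip_cjk_spaces(text: str) -> str:
    """Remove whitespace inserted between CJK characters by OCR."""
    return _GAP.sub("", text)


if __name__ == "__main__":
    sample = "万东立刻来了思路写起了大纲 , 大致剧情如下。"
    print(strip_cjk_spaces(sample))
```

Because both neighbours must be CJK or punctuation, a space between a Chinese character and a Latin word, or between two Latin words, is kept.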

Your environment:

Plugin version: 0.4.6
Obsidian version: 1.4.2
Operating system: 14.0
Number of images/PDFs in your vault (approx.): very small
Other plugins that may be related to the issue:
omnisearch

albert748 · 2024-04-10T12:00:02Z

the same issue here

CamWam · 2024-10-09T07:14:51Z

same here
