Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Whitespace preservation #11

Open
aznotins opened this issue Sep 19, 2024 · 2 comments
Open

Whitespace preservation #11

aznotins opened this issue Sep 19, 2024 · 2 comments

Comments

@aznotins
Copy link
Collaborator

There are still problematic cases with white space preservation in the output.

  • valoda (LAT1)
  • un „vislabāk pārvaldītā valoda”
morphotagger.sh -vert -whitespace-marker << EOF
(LAT1)
valoda (LAT1)
„vislabāk pārvaldītā valoda”
un „vislabāk pārvaldītā valoda”
EOF
(	zb	(
<g />
LAT1	xx	Lat1
<g />
)	zb	)

valoda	ncfsn4	valoda
(	zb	(
LAT1	xx	Lat1
<g />
)	zb	)

„	zq	"
<g />
vislabāk	rsn	labi
pārvaldītā	vmnpdfsnpsypn	pārvaldīt
valoda	ncfsn4	valoda
<g />
”	zq	"

un	cc	un
„	zq	"
vislabāk	rsn	labi
pārvaldītā	vmnpdfsnpsypn	pārvaldīt
valoda	ncfsn4	valoda
<g />
”	zq	"

@viestursjl

@ghost

This comment was marked as spam.

@viestursjl
Copy link
Contributor

If I recall correctly the bug for this stems from PeterisP/morphology, where the original bugfix that I wrote didn't end up getting merged into the main repository. Created a new pull request for the bugfix.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants