Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Reduce synchronized contention during name transliteration #242

Closed
msbarry opened this issue May 30, 2022 · 0 comments · Fixed by #246
Closed

Reduce synchronized contention during name transliteration #242

msbarry opened this issue May 30, 2022 · 0 comments · Fixed by #246
Assignees

Comments

@msbarry
Copy link
Contributor

msbarry commented May 30, 2022

After fixing the expression matching performance improvement from #206, CPU utilization during pass 2 node handling dropped, but node processing time remained about the same (4 minutes on a 64 core machine). Digging into this it turns out to be that ICU4J transliteration is synchronized across threads. Setting --transliterate=false removes the thread contention and reduces overall processing time by 2-3 minutes.

See if there's a way to do ICU4J name transliteration without requiring global synchronization.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging a pull request may close this issue.

1 participant