match / character #381
Replies: 6 comments
-
Hey @pierrevdb, Meilisearch generates tokens from the text by splitting it on a list of separators, which you can find on our documentation page. This list of separators is currently not customizable, but you can upvote this feature on our roadmap page; that will help us prioritize it.
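To illustrate why a term containing '/' ends up matched as two separate pieces, here is a toy model of separator-based splitting. It is only a sketch: `ASSUMED_SEPARATORS` is a made-up subset for illustration (the real list is on the documentation page), `toy_tokenize` is a hypothetical helper, and Meilisearch's actual tokenizer is considerably more involved.

```python
import re

# Hypothetical subset of separators, for illustration only.
# The real list is the one published in the Meilisearch documentation.
ASSUMED_SEPARATORS = [" ", "-", "_", "/", ".", ","]


def toy_tokenize(text, separators=ASSUMED_SEPARATORS):
    """Split text on any of the given separators and drop empty pieces."""
    pattern = "|".join(re.escape(sep) for sep in separators)
    return [piece for piece in re.split(pattern, text) if piece]


# With '/' treated as a separator, the reference becomes two tokens.
print(toy_tokenize("INV/2021"))

# If '/' is not in the separator list, the reference stays whole.
print(toy_tokenize("INV/2021", separators=[" "]))
```

In this model, a query for `INV/2021` is really a query for `INV` and `2021`, which is consistent with both halves being highlighted separately.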
-
Hello @pierrevdb
-
Thanks guys. Not sure how I missed that page of the documentation. For our application we need to search many reference fields that contain document numbers. These almost always contain chars that are listed as soft/hard spaces, so being able to work around these would be advantageous. It is also conceivable that simply removing the chars for indexing would result in collisions that would render the search results ambiguous or less accurate, so that is not really an option. I will see whether giving proximity a higher priority has the desired effect for us, and will look at the roadmap as @Kerollmops suggested for the relevant features around customising/ignoring tokens.
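The collision concern can be shown with a few lines of Python. The document numbers below are invented for illustration, and `strip_chars` is a hypothetical helper, not anything Meilisearch provides.

```python
def strip_chars(reference, chars="/-."):
    """Remove the given characters from a reference string."""
    return "".join(ch for ch in reference if ch not in chars)


# Invented document numbers, for illustration only.
refs = ["12/345", "123/45", "A-1/2", "A1-2"]

# Group the original references by their stripped form.
by_stripped = {}
for ref in refs:
    by_stripped.setdefault(strip_chars(ref), []).append(ref)

# Any stripped form shared by more than one reference is a collision.
collisions = {key: group for key, group in by_stripped.items() if len(group) > 1}
print(collisions)
```

Distinct references such as `12/345` and `123/45` collapse to the same indexed string, so a search could no longer tell them apart.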
-
Hello everyone 👋 We just released a 🧪 prototype that allows customizing tokenization and we'd love your feedback.
How to get the prototype?
Using docker, use the following command:
From source, compile Meilisearch on the
How to use the prototype?
You can find all the details in the PR. Feedback and bug reporting when using this prototype are encouraged! Thanks in advance for your involvement. It means a lot to us ❤️
-
Hello everyone 👋 We have just released the first RC (release candidate) of Meilisearch containing this new feature! You can test it by using:
You are welcome to leave your feedback in this discussion. If you encounter any bugs, please report them here. 🎉 The official, stable release containing this change will be available on September 25th, 2023.
-
Hey folks 👋 v1.4.0 has been released! 🦓 You can now customize tokenization by adding or removing tokens from the list of separator and non-separator tokens. ✨ Note:
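For the original '/' question, a settings update along these lines may be what is needed. This is a sketch under assumptions, not official usage: the setting names `nonSeparatorTokens` and `separatorTokens` and the `PATCH /indexes/{index_uid}/settings` route are my understanding of the v1.4 settings API and should be checked against the documentation; the base URL, the index uid `documents`, and the helper `build_settings_request` are hypothetical. The code only builds the request and does not send it.

```python
import json
import urllib.request


def build_settings_request(base_url, index_uid, non_separators=(), separators=(), api_key=None):
    """Build (but do not send) a settings update for tokenization.

    Assumes the setting names 'nonSeparatorTokens' and 'separatorTokens'
    and the PATCH /indexes/{index_uid}/settings route; verify both in the docs.
    """
    payload = {}
    if non_separators:
        payload["nonSeparatorTokens"] = list(non_separators)
    if separators:
        payload["separatorTokens"] = list(separators)

    headers = {"Content-Type": "application/json"}
    if api_key:
        headers["Authorization"] = f"Bearer {api_key}"

    return urllib.request.Request(
        f"{base_url}/indexes/{index_uid}/settings",
        data=json.dumps(payload).encode("utf-8"),
        headers=headers,
        method="PATCH",
    )


# Hypothetical local instance and index uid.
req = build_settings_request("http://localhost:7700", "documents", non_separators=["/"])
print(req.get_method(), req.full_url)
print(req.data.decode("utf-8"))

# To actually apply it against a running instance:
# urllib.request.urlopen(req)
```

Treating '/' as a non-separator should keep a reference like `INV/2021` as a single token, so the whole term can match.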
-
When searching for a term that contains a '/', the text before and the text after the '/' are matched separately, but the entire term is not matched as a whole.
I couldn't find any documentation relating to this specific issue.
Steps to reproduce
Using the built-in web interface produces the same result with the text before and after the '/' highlighted.
Expected behaviour
Should match the entire term
Meilisearch version: [e.g. v0.23.1]