-
Notifications
You must be signed in to change notification settings - Fork 11
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
The term 'aws' should be ignored by the search algorithm #828
Comments
I'm not sure we can implement this without breaking a lot of other title-based searches such as |
I believe there's a difference between "aws-X" and "aws X". The first is one term 'aws-X', the second term is two words: 'aws' and 'X'. If a user types the word "aws" (not the prefix "aws", but a term that equals to "aws") we should ignore it |
The issue is it conflicts entirely with how the search engine works. All search terms are "tokenized", meaning that they are separated into a list of segments. For example, if I search I will give this suggestion a try but I believe it will still cause problematic edge cases like before. I would also argue that we are getting acceptable and relevant results with the current behavior. Libraries that match the fields the strongest appear first, while looser matches have lower relevance scores. If you look at other search engines, it feels like the first 10-20% of results are strongly relevant, and beyond that point results are only tangentially related |
This issue is now marked as stale because it hasn't seen activity for a while. Add a comment or it will be closed soon. If you wish to exclude this issue from being marked as stale, add the "backlog" label. |
Some visitors search for 'AWS codepipeline', they get 688 results. Most of them are not relevant.
If you search instead for 'codepipline' you get 27 relevant results.
The text was updated successfully, but these errors were encountered: