Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

What is the meaning of sas in the results? #5

Open
chgao96 opened this issue Jul 25, 2017 · 3 comments
Open

What is the meaning of sas in the results? #5

chgao96 opened this issue Jul 25, 2017 · 3 comments

Comments

@chgao96
Copy link

chgao96 commented Jul 25, 2017

I am really confused about that. Hope for a response.

@rdevooght
Copy link
Owner

Hi,

Are you talking about the "sps@10" ? Otherwise I don't see what you mean by "sas".
The sps@k is a measure that we designed (this is why you haven't seen it yet), which is the percentage of user for which the method is able to predict the first item in the test set (given k trials).
You can see it as a recall@k where only the first item in the test set is considered to be a positive result.

The goal of the sps is to measure whether the method is able to predict the next item that a user will interact with. This is indeed what we measure, because for each user the items are sorted by the timestamp of the user-item interactions, and therefore the first item of the test set is the next item with which the user interacted.

Is it clearer ?

@chgao96
Copy link
Author

chgao96 commented Aug 18, 2017

Thank you for a response.
Yeah, it is "sps". I made a spelling mistake.
Well, I understand the meaning of "sps". But, I have never seen a published paper taking it as a metric. So could you offer any reference? Thanks a lot.
Besides, what is the full name of "sps"?

@rdevooght
Copy link
Owner

It stands for short-term prediction success.
This is actually a measure we came up with because we wanted to measure how well can the model predict the next user interaction, which is something never measured in recommender system research.
I am pretty sure that this measure exists under another name somewhere in the sequence prediction field, since it is such a simple measure, but I could not tell you where.

We could have used the perplexity instead, which is widely used to evaluate language model.
It would have been more familiar to those coming from the sequence prediction field, but I believe that the sps is more intuitive for those coming from the recommender system field, which is why we chose it.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants