processing dataset errors #6
Comments
I have the same errors. I am using Amazon Review 2018 and downloaded both the review data and the metadata. The video domain matches the paper's statistics, but the movie domain's numbers are inconsistent: I get 311,143 / 86,678, which is not the same as the paper's 297,498 / 59,944.
@xingjinshuo We used the Luxury_Beauty dataset, not All_Beauty. The preprocessing thresholds vary for each dataset; I believe I did not implement automatic threshold selection based on the dataset. You should check the value of the threshold, which is the minimum number of interactions.
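The threshold described above is typically applied as an iterative k-core filter: users and items with fewer than the minimum number of interactions are dropped, and counts are recomputed until the dataset is stable. A minimal sketch of that idea (the function and variable names here are illustrative, not taken from the repo's `data_preprocess.py`):

```python
from collections import Counter

def kcore_filter(interactions, min_inter=4):
    """Iteratively drop users/items with fewer than `min_inter` interactions.

    `interactions` is a list of (user, item) pairs. Dropping a user can
    push an item below the threshold (and vice versa), so we loop until
    no pair is removed.
    """
    while True:
        user_cnt = Counter(u for u, _ in interactions)
        item_cnt = Counter(i for _, i in interactions)
        kept = [(u, i) for u, i in interactions
                if user_cnt[u] >= min_inter and item_cnt[i] >= min_inter]
        if len(kept) == len(interactions):
            return kept
        interactions = kept
```

Because the loop runs to a fixed point, a slightly different threshold can cascade into a very different final user/item count, which would explain the mismatches with the paper's statistics reported in this thread.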
@ghdtjr I understand, thanks for your answer. Is "Toys" the "Toys and Games" category in the Amazon review dataset?
@xingjinshuo You're right. The "Toys" dataset means "Toys and Games".
Hello, your paper mentions that you select 30K users (the paper writes 3K, which may be a typo), but the whole dataset after 4-core filtering is larger than 30K. Do you select the users randomly, or by some other method? Thank you for your reply!
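If the users are indeed subsampled after 4-core filtering, as the question above asks (the repo does not confirm this), one common approach is a seeded random sample of user IDs. A hedged sketch, with all names hypothetical:

```python
import random

def sample_users(interactions, n_users=30000, seed=0):
    """Keep interactions belonging to a random subset of at most `n_users` users.

    Illustrative only: the paper/repo may use a different selection
    procedure. Sorting before sampling makes the result reproducible
    for a fixed seed regardless of input order of the pairs.
    """
    users = sorted({u for u, _ in interactions})
    rng = random.Random(seed)
    keep = set(rng.sample(users, min(n_users, len(users))))
    return [(u, i) for u, i in interactions if u in keep]
```

Note that sampling users after k-core filtering can leave some items below the core threshold again, so a pipeline may need to re-run the filter afterwards.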
Where can I find the Beauty and Toys datasets? I found All_Beauty and Toys_and_Games at https://datarepo.eng.ucsd.edu/mcauley_group/data/amazon_v2/categoryFiles/, but the number of users and items obtained after processing with data_preprocess.py is inconsistent with the article.