-
Notifications
You must be signed in to change notification settings - Fork 97
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Shall I use cosine or dot product to calculate the similarity between user & item? #17
Comments
Hi, if you care only about similarity, not popularity, you can use cosine similarity. If you care about popularity of items, you can use dot product |
Thanks very much,chtran. And what's more ,shall I use the item bias when I calculate the score? would the result be like this: score = user_factor dot_product item_factor + item_bias or just: ? |
And btw, it seemed that qmf does not support user bias? |
Any suggestions for this? Thanks very much |
Hi @jackyhawk, Yes, if you train your model with item biases, you should also use them when making predictions. -Alberto |
Thanks very much albietz. and it would be like this? And i've just paste the test metrics of my real training output( about 5 million,0.5 million items, and about 0.5 billion click,which are our users behavior within latest 30 days of (after filtering some outliers) ) And what about the train loss? it seems that value of [train loss] betweem 0.05~0.08 works for me. ==================================================== 18:29:02.830051 26919 MetricsEngine.cpp:41] begin metrics: epoch 9: recorded metric test_avg_auc = 0.91341,log_:1 |
And what's more, as for the input data file, should the old data in the beginning or the new data in the beginning? Thanks very much |
And what's more, Shall I use more (longer-period user's behaviro ) data as the training dataset to improve the precison & recall @10? and currently, i just use 100 as the nfactor dimension, shall I also increase that value? Thanks very much |
Any suggestions for that? Thanks very much |
I tried to increase the dimension of latent factor (from 100 to 200 and then to 300), it seemed that the results got better |
Any suggestions for this? Thanks very much |
Shall I user consine or dot product to calculate the similarity between user & item?
(since there would be some negative latent fatore for user & item, is it still suitable to cosine?
thanks very much
The text was updated successfully, but these errors were encountered: