You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This is a request that comes up on a regular basis. When we call the citations() and references() operators, the returned scores are not useful, in the sense that they neither reflect the original scores from the inner query nor they reflect the number of times the documents in the inner query were cited by the returned documents. We would like to enable the latter.
For example, if I search for author:"accomazzi, a" in the astronomy colleciton I will find about 200+ documents. If I ask for the their citations via citations(author:"accomazzi, a") the generated list has a ranking which is somewhat meaningless. Instead, we would like to see at the top the papers that cite the original inner query most frequently, which in this case would be:
Ideally we should take one step forward and consider implementing a hybrid score controlled by an optional parameter, as we have done for the reviews() operator:
The optional parameter (let's call it textWeightRatio) would control how much weight is given to the scores coming from the documents retrieved by the inner query, so that we can compute a final score for each citing paper j this way:
final_score(j) = SUM (1 + textWeightRatio * innerScore(i) / maxInnerScore)
where innerScore(i) is the relevance score computed for document i which matches the inner query, and SUM is computed over all citations to the inner set. maxInnerscore is the highest score from the inner query. When textWeightRatio is 0 (default), the final score is simply the number of citations document j has to the documents selected by the inner query.
The text was updated successfully, but these errors were encountered:
This is a request that comes up on a regular basis. When we call the citations() and references() operators, the returned scores are not useful, in the sense that they neither reflect the original scores from the inner query nor they reflect the number of times the documents in the inner query were cited by the returned documents. We would like to enable the latter.
For example, if I search for
author:"accomazzi, a"
in the astronomy colleciton I will find about 200+ documents. If I ask for the their citations viacitations(author:"accomazzi, a")
the generated list has a ranking which is somewhat meaningless. Instead, we would like to see at the top the papers that cite the original inner query most frequently, which in this case would be:Ideally we should take one step forward and consider implementing a hybrid score controlled by an optional parameter, as we have done for the
reviews()
operator:montysolr/montysolr/src/main/java/org/apache/lucene/queryparser/flexible/aqp/builders/AqpAdsabsSubQueryProvider.java
Line 802 in 811eee1
The optional parameter (let's call it
textWeightRatio
) would control how much weight is given to the scores coming from the documents retrieved by the inner query, so that we can compute a final score for each citing paperj
this way:where
innerScore(i)
is the relevance score computed for documenti
which matches the inner query, and SUM is computed over all citations to the inner set.maxInnerscore
is the highest score from the inner query. WhentextWeightRatio
is 0 (default), the final score is simply the number of citations documentj
has to the documents selected by the inner query.The text was updated successfully, but these errors were encountered: