Fix Specter cache #136

haroldrubio · 2022-11-10T17:25:19Z

This PR no longer attempts to find and remove all old instances of publications in the Redis cache, and instead sets an expiration date whenever inserting into Redis.
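
A minimal sketch of the write-with-expiry approach described above, for illustration only. `RecordingConnection` and `cache_embedding` are hypothetical names that are not part of the repository; the stand-in class only mimics the `tensorset` and `expire` calls that appear in the diff, so the sketch runs without a Redis server.

```python
ONE_MONTH_SECONDS = 2629746  # TTL value used in this PR


class RecordingConnection:
    """Hypothetical stand-in for the Redis client; records calls in memory."""

    def __init__(self):
        self.tensors = {}
        self.ttls = {}

    def tensorset(self, key, tensor):
        self.tensors[key] = tensor

    def expire(self, name, time):
        self.ttls[name] = time


def cache_embedding(con, paper_id, mdate, embedding, ttl=ONE_MONTH_SECONDS):
    """Store one embedding under an mdate-versioned key and give it a TTL."""
    cache_key = paper_id + "_" + str(mdate)
    # The PR wraps the embedding in np.array; it is passed through here
    # to keep the sketch free of third-party imports.
    con.tensorset(key=cache_key, tensor=embedding)
    # A modified publication gets a new mdate and therefore a new key.
    # The stale key is never deleted explicitly; it simply expires.
    con.expire(cache_key, ttl)
    return cache_key


con = RecordingConnection()
old_key = cache_embedding(con, "paper1", 1000, [0.1, 0.2])
new_key = cache_embedding(con, "paper1", 2000, [0.3, 0.4])
```

Under this scheme both keys coexist until the older one times out, which trades some temporary extra memory for not having to search the keyspace on every update.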

melisabok · 2022-11-14T23:10:48Z

expertise/models/multifacet_recommender/specter.py

@@ -116,6 +116,7 @@ def _maybe_print_to_console_and_file(self,
            paper_id = prediction_json['paper_id']
            cache_key = paper_id + "_" + str(self._metadata[paper_id]['mdate'])
            self._redis_con.tensorset(key=cache_key, tensor=np.array(prediction_json['embedding']))
+            self._redis_con.expire(cache_key, 2629746) ## Expire after 1 month


Can we set this in the config file?

Hmm, it can be set in the model_params of the config.json, but I don't think the models will have access to the Flask config.

ok, let's keep it as it is
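
For reference, a sketch of where the constant may come from and how an override through model_params could look. The PR does not say how 2629746 was chosen; it happens to equal one twelfth of an average Gregorian year (365.2425 days) in seconds. The `cache_ttl_seconds` key and the `cache_ttl_from_params` helper are hypothetical and do not exist in the project.

```python
SECONDS_PER_DAY = 24 * 60 * 60
DAYS_PER_GREGORIAN_YEAR = 365.2425

# One twelfth of an average Gregorian year, in seconds.
DEFAULT_CACHE_TTL = round(DAYS_PER_GREGORIAN_YEAR * SECONDS_PER_DAY / 12)


def cache_ttl_from_params(model_params):
    """Return the TTL in seconds, falling back to the one-month default.

    `cache_ttl_seconds` is a hypothetical key, not one the project defines.
    """
    ttl = int(model_params.get("cache_ttl_seconds", DEFAULT_CACHE_TTL))
    if ttl <= 0:
        raise ValueError("cache TTL must be a positive number of seconds")
    return ttl
```

With no override, `cache_ttl_from_params({})` falls back to the same value the diff hard-codes.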

melisabok · 2022-11-14T23:11:33Z

expertise/models/multifacet_recommender/specter.py

@@ -211,7 +212,6 @@ def set_archives_dataset(self, archives_dataset):
                                "authors": [profile_id],
                                "mdate": pub_mdate
                            }
-                        self._remove_keys_from_cache(publication["id"])


So this is the call that is taking time? Is tensorset fast enough?

Yes, tensorset is fast, but the _remove_keys_from_cache function scans through all the keys to find matches, which is quite slow.

How slow is it? I'm still confused as to why Redis is slow here. Are there too many keys in the database?

I'm not sure what the self._remove_keys_from_cache method does, but its performance also depends on whether SCAN or KEYS is used.
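
To illustrate the cost being discussed, here is a sketch of a pattern-based cleanup. It is an assumption about the general shape of such a function, not the actual `_remove_keys_from_cache` implementation. `FakeRedis` and `remove_stale_entries` are hypothetical names; the stand-in mimics redis-py's `scan_iter(match=...)` and `delete(...)` and counts how many keys are examined. In Redis, both KEYS and a full SCAN pass visit the entire keyspace, and the MATCH filter is applied only after keys are retrieved. SCAN differs in that it works incrementally and does not block the server for the whole pass.

```python
import fnmatch


class FakeRedis:
    """In-memory stand-in exposing the two redis-py calls used below."""

    def __init__(self, keys):
        self.store = dict.fromkeys(keys, b"")
        self.keys_examined = 0

    def scan_iter(self, match=None):
        # Like SCAN, every key is visited; the pattern only filters output.
        for key in list(self.store):
            self.keys_examined += 1
            if match is None or fnmatch.fnmatchcase(key, match):
                yield key

    def delete(self, *names):
        return sum(self.store.pop(name, None) is not None for name in names)


def remove_stale_entries(con, paper_id):
    """Hypothetical cleanup: delete every `<paper_id>_*` key."""
    removed = 0
    for key in con.scan_iter(match=paper_id + "_*"):
        removed += con.delete(key)
    return removed


con = FakeRedis([f"paper{i}_1000" for i in range(10_000)])
removed = remove_stale_entries(con, "paper42")
```

In this sketch a single key is removed, yet all 10,000 keys are examined. Repeating that for every publication makes the total work grow with the number of publications times the size of the cache, which is the cost the expiry approach avoids.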

haroldrubio added 3 commits November 7, 2022 07:05

Dont remove keys

ef80e29

Expire after 1 month

972d55f

Remove check for old key

0e9d9d3

haroldrubio self-assigned this Nov 10, 2022

melisabok reviewed Nov 14, 2022


melisabok requested a review from carlosmondra November 14, 2022 23:11


haroldrubio commented Nov 10, 2022

melisabok Nov 14, 2022

haroldrubio Nov 15, 2022

melisabok Nov 15, 2022

melisabok Nov 14, 2022

haroldrubio Nov 15, 2022

carlosmondra Nov 15, 2022
