[feat] Make saving a model easier when using HvdAllToAllEmbedding by adding a save-function overwriting patch in tf_save_restore_patch.py. #362
Conversation
```python
try:
  import horovod.tensorflow as hvd
  try:
    hvd.rank()
  except:
    hvd = None
except:
```
```python
try:
  import horovod.tensorflow as hvd
  hvd.rank()
except:
  hvd = None
```
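A slightly stricter variant is also possible, as a hedged sketch: catching only the expected exceptions (assuming `hvd.rank()` raises `ValueError` when Horovod is installed but `hvd.init()` has not been called) avoids masking unrelated errors behind a bare `except`:

```python
try:
  import horovod.tensorflow as hvd
  hvd.rank()  # raises if Horovod is installed but not initialized
except (ImportError, ValueError):
  hvd = None
```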
Fixed
```python
    model,
    filepath,
    overwrite,
    include_optimizer,
```
It would be better to add comments for each of the important input arguments.
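For illustration, a hedged sketch of what such comments might look like; the function name is hypothetical, since the diff elides it:

```python
def _de_keras_save_model(  # hypothetical name; the diff only shows the arguments
    model,              # the tf.keras model being saved
    filepath,           # destination path for the saved model
    overwrite,          # whether to overwrite an existing file at filepath
    include_optimizer,  # whether to serialize optimizer state as well
    *args,
    **kwargs):
  ...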
fixed
```python
        *args,
        **kwargs)

def _traverse_emb_layers_and_save(hvd_rank):
```
Do we have adequate UT cases to cover this function?
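For reference, a minimal, self-contained sketch of the shape such a test could take (with a stand-in Dense layer, since an HvdAllToAllEmbedding layer needs a Horovod runtime; this is not the PR's actual test):

```python
import numpy as np
import tensorflow as tf

class SaveRestoreTest(tf.test.TestCase):

  def test_save_and_restore_roundtrip(self):
    # Stand-in model; the real test would build a model with an
    # HvdAllToAllEmbedding layer and run under horovodrun on several ranks.
    model = tf.keras.Sequential(
        [tf.keras.layers.Dense(4, input_shape=(8,))])
    x = np.random.rand(2, 8).astype(np.float32)
    expected = model.predict(x)
    path = self.get_temp_dir()
    model.save(path)
    restored = tf.keras.models.load_model(path)
    self.assertAllClose(expected, restored.predict(x))

if __name__ == '__main__':
  tf.test.main()
```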
Done
```python
@@ -106,6 +107,10 @@ def __init__(self, root_rank=0, device='', local_variables=None):
      self.register_local_var(var)

  @deprecated(
```
Is the warning always triggered? It's recommended to show it only when users actually refer to the AllToAllEmbedding.
It is only shown when the class's `__init__` or `__new__` is called.
This callback class was designed only for Horovod all-to-all embedding saving; it is no longer needed now that the new saving patch function exists. See TensorFlow's deprecation wrapper:
```python
def deprecated_wrapper(func_or_class):
  """Deprecation wrapper."""
  if isinstance(func_or_class, type):
    # If a class is deprecated, you actually want to wrap the constructor.
    cls = func_or_class
    if cls.__new__ is object.__new__:
      # If a class defaults to its parent's constructor, wrap that instead.
      func = cls.__init__
      constructor_name = '__init__'
      decorators, _ = tf_decorator.unwrap(func)
      for decorator in decorators:
        if decorator.decorator_name == 'deprecated':
          # If the parent is already deprecated, there's nothing to do.
          return cls
    else:
      func = cls.__new__
      constructor_name = '__new__'
  else:
    cls = None
    constructor_name = None
    func = func_or_class
```
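So, per this wrapper, the warning fires when the decorated class's constructor runs, not at import time. A minimal illustration (the class name here is hypothetical):

```python
from tensorflow.python.util.deprecation import deprecated

@deprecated(None, "Use the save patch in tf_save_restore_patch.py instead.")
class DEHvdBroadcastGlobalVariablesCallback(object):  # hypothetical name
  pass

# Importing/decorating alone logs nothing; the warning is emitted here:
cb = DEHvdBroadcastGlobalVariablesCallback()
```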
…by adding a save-function overwriting patch in tf_save_restore_patch.py. Also fixes some import bugs in tf_save_restore_patch.py, and adds a save-and-restore test for HvdAllToAllEmbedding.
LGTM
Make saving a model easier when using HvdAllToAllEmbedding by adding a save-function overwriting patch in tf_save_restore_patch.py.
Also fix some import bugs in tf_save_restore_patch.py.
Also fix the example in the demo, where the Python code for Keras Horovod synchronous training was wrong.
Description
I have overridden the Keras save function, so it is no longer necessary to save the embedding shards explicitly: calling model.save or keras.models.save_model on each rank is enough. However, tf.saved_model.save is not supported.
tf.saved_model.save could also be supported in theory, but since the object passed to it is not necessarily a Keras object, I have not implemented it for the moment; this can be discussed further.
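As a hedged usage sketch of the intended workflow (the model and path are placeholders, not the PR's code):

```python
import horovod.tensorflow as hvd
import tensorflow as tf

hvd.init()

# Placeholder model; the real one would contain HvdAllToAllEmbedding layers.
model = tf.keras.Sequential([tf.keras.layers.Dense(1, input_shape=(4,))])

# With the patch in tf_save_restore_patch.py applied, every rank calls the
# ordinary Keras save; the patched function writes each rank's embedding
# shard, so no explicit per-shard saving is needed.
model.save("/tmp/my_model")  # placeholder path

# tf.saved_model.save(model, "/tmp/my_model")  # not supported by this patch
```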
How Has This Been Tested?
Added a test with HvdAllToAllEmbedding.
It follows the demo at demo/dynamic_embedding/movielens-1m-keras-with-horovod.
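A multi-rank run of such a test can be launched with Horovod's CLI, e.g. `horovodrun -np 2 python <test_script>.py` (the rank count and script name here are illustrative).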