Pull request for the checkpoint conversion of BERT(486) #761
base: master
Conversation
@vulkomilev, I've submitted a quick review. Please take a look. I had a few questions/suggestions:
- Have you tried running this script for all presets? Does it work fine?
- The `check_output` function has not been called in `main()`.
- Could you please run the formatter/linter? It seems like this script has not been formatted. You have to follow the steps listed here: https://github.com/keras-team/keras-nlp/blob/master/CONTRIBUTING.md#formatting-code
    return keras_nlp_output


def main(_):
We should call `check_outputs()` in `main()`.
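For illustration, a minimal sketch of what calling it from `main()` could look like; the `download_model` helper and the `FLAGS.preset` flag are assumptions based on the surrounding diff, not the script's actual names:

```python
def main(_):
    # `FLAGS.preset` and `download_model` are assumed names for illustration.
    preset = FLAGS.preset
    vocab_path, checkpoint_path, config_path = download_model(preset)
    model = convert_checkpoints(preset, checkpoint_path, config_path)
    # Run the numerics check before declaring the conversion done.
    keras_nlp_output = check_output(model, vocab_path)
    print("KerasNLP output mean:", keras_nlp_output.numpy().mean())
```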
    return vocab_path, checkpoint_path, config_path, weights, model


def convert_checkpoints(preset, weights, model):
The `if...elif...else` logic still looks a bit complicated. Instead of using two sources - TF MG (the TensorFlow Model Garden) and the original BERT repo - we're planning to just use the original BERT repo. That might make this whole block simpler. Let me do that for you!
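For what it's worth, a rough sketch of the shape this could take once only the original BERT repo checkpoints are used; the single weight assignment below is illustrative only, and the full layer-by-layer variable mapping is omitted:

```python
import tensorflow as tf
import keras_nlp

def convert_checkpoints(preset, checkpoint_path):
    # Read raw variables from the original TF1-style BERT checkpoint.
    reader = tf.train.load_checkpoint(checkpoint_path)
    model = keras_nlp.models.BertBackbone.from_preset(preset, load_weights=False)

    # One illustrative assignment; the real script would loop over every
    # layer with the full variable-name mapping.
    model.get_layer("token_embedding").embeddings.assign(
        reader.get_tensor("bert/embeddings/word_embeddings")
    )
    return model
```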
@vulkomilev it looks like a lot of the comments above were resolved without being addressed. Did you mean to push a change to this branch?
I still haven't pushed the changes; will do in about an hour.
    return vocab_path, checkpoint_path, config_path


def convert_checkpoints(preset,checkpoint_path,config_dict):
Make sure to run our format script; this does not look like it has been black-formatted in many places (the contributor guide has instructions).
model = keras_nlp.models.BertBackbone.from_preset("bert_tiny_en_uncased",
    load_weights=True)  # keras_nlp.models.BertBase(vocabulary_size=VOCAB_SIZE)
model.summary()
if preset in ['bert_base_en_uncased', 'bert_base_en']:
I don't think we should need all these if cases based on preset. Why do we need to add all these?
It's probably easiest to write this for checkpoints from a single source. The official BERT repo has all checkpoints we need. So you could model on https://github.com/keras-team/keras-nlp/blob/master/tools/checkpoint_conversion/bert_tiny_uncased_en.ipynb, for example.
Here we don't download from a fixed source: `f"""https://storage.googleapis.com/bert_models/2020_02_20/{MODEL_SUFFIX}_{MODEL_SPEC_STR}.zip"""`
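If the intent is to always pull from that fixed location, something like the following could work (a sketch: the `MODEL_SUFFIX`/`MODEL_SPEC_STR` values are examples, and whether every preset is served from the 2020_02_20 path is an assumption):

```python
from tensorflow import keras

# Example values only; the script would derive these from the preset name.
MODEL_SUFFIX = "uncased"
MODEL_SPEC_STR = "L-12_H-768_A-12"

zip_path = keras.utils.get_file(
    fname=f"{MODEL_SUFFIX}_{MODEL_SPEC_STR}.zip",
    origin=(
        "https://storage.googleapis.com/bert_models/2020_02_20/"
        f"{MODEL_SUFFIX}_{MODEL_SPEC_STR}.zip"
    ),
    extract=True,
)
# The archive holds vocab.txt, bert_config.json and bert_model.ckpt.*, which
# become the vocab_path, config_path and checkpoint_path used by the converter.
```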
@mattdangerw - I will push some changes later today, will use ckpts from a single source
@abheesht17 any update on this?
FLAGS = flags.FLAGS

PRESET_MAP = {
    "bert_base_cased": {'base':"roberta.base",
Are these big dicts actually used anywhere? They don't seem to be.
What you probably need is a mapping from our preset names to checkpoint files, and that's it!
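For example, something as small as the mapping below would probably do; the spec strings here are guesses based on the download URL pattern quoted earlier in this thread and would need to be checked against the actual checkpoint names:

```python
# Hypothetical mapping: keras_nlp preset name -> (MODEL_SUFFIX, MODEL_SPEC_STR)
# used to build the storage.googleapis.com checkpoint URL.
PRESET_MAP = {
    "bert_tiny_en_uncased": ("uncased", "L-2_H-128_A-2"),
    "bert_base_en_uncased": ("uncased", "L-12_H-768_A-12"),
    "bert_base_en": ("cased", "L-12_H-768_A-12"),
}
```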
They are used in the configuration of the model. Where can I find the list of checkpoint files to map them?
@vulkomilev, do you mind if I push changes on top of yours? There are some modifications to make here, especially w.r.t. the single-source BERT ckpts.
Yeah, no problem.
Hi @abheesht17, I am encountering an error with the conversion. What is the dimension of the last layer for bert_base_uncased? For Hugging Face it is 768.
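For reference, a quick way to inspect the output dimension on both sides (a sketch; it assumes `bert_base_en_uncased` is the preset matching Hugging Face's `bert-base-uncased` and that the `transformers` package is installed):

```python
import numpy as np
import keras_nlp
from transformers import TFBertModel

backbone = keras_nlp.models.BertBackbone.from_preset("bert_base_en_uncased")
hf_model = TFBertModel.from_pretrained("bert-base-uncased")
print("HF hidden size:", hf_model.config.hidden_size)  # 768 for bert-base-uncased

# Push a dummy batch through the KerasNLP backbone and inspect the output shape.
batch = {
    "token_ids": np.ones((1, 12), dtype="int32"),
    "segment_ids": np.zeros((1, 12), dtype="int32"),
    "padding_mask": np.ones((1, 12), dtype="int32"),
}
outputs = backbone(batch)
print("KerasNLP sequence output shape:", outputs["sequence_output"].shape)
```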
bump
Made a new release, but I have a problem with the shapes. Can someone check it?
@mattdangerw @abheesht17 bump