Add presets for Electra and checkpoint conversion script #1384

pranavvp16 · 2024-01-03T07:55:42Z

I have uploaded the weights on personal google cloud bucket. The from_preset method works properly in my local setup, but it throws some error in google collab notebook.

mattdangerw · 2024-01-04T21:40:09Z

Please format your code with ./shell/format.sh, also looks like we have a pretty simple merge conflict to resolve.

mattdangerw

Looks overall good! Left a few comments. See Kaggle comment below.

keras_nlp/models/electra/electra_backbone.py

keras_nlp/models/electra/electra_presets.py

keras_nlp/models/electra/electra_backbone.py

mattdangerw · 2024-01-06T00:06:11Z

keras_nlp/models/electra/electra_presets.py

+            "lowercase": False,
+        },
+        # TODO: Upload weights on GCS.
+        "weights_url": "https://storage.googleapis.com/pranav-keras/electra-base-generator/model.weights.h5",


We actually just moved all our weights over to Kaggle. https://github.com/keras-team/keras-nlp/releases/tag/v0.7.0

This will make it easier to upload models long term, but let me get back to you next week on exact steps for upload. If you have a kaggle username, could you reply here with it?

kaggle here is my kaggle username

keras_nlp/models/electra/electra_preprocessor.py

pranavvp16 · 2024-01-17T15:56:08Z

Sorry for the delay I'll make the above following requested changes, also I have left my kaggle username above

pranavvp16 · 2024-03-18T20:27:46Z

I have made all the changes as suggested

mattdangerw

This looks great! Just let me know where the final assets to copy over are and I will pull this in.

mattdangerw · 2024-03-20T18:28:59Z

keras_nlp/models/electra/electra_presets.py

+            "path": "electra",
+            "model_card": "https://github.com/google-research/electra",
+        },
+        "kaggle_handle": "kaggle://pranavprajapati16/electra/keras/electra_base_discriminator_en/1",


I don't see anything at https://www.kaggle.com/models/pranavprajapati16/electra.

You should now have the ability to make models public, can you do so? Or is the actual model here? https://www.kaggle.com/models/pranavprajapati16/electra_base_discriminator_en (in which case these links are still wrong).

Let me know where to get the proper assets and I will copy to the Keras org.

Sorry the model was private just made it public. https://www.kaggle.com/models/pranavprajapati16/electra

Thanks! Uploading now! I can just patch the new links into this PR and land. I'll ping here if I run into any issues.

mattdangerw · 2024-03-23T00:44:21Z

Actually does look like there is an error here. It looks like the tokenizer should be configured to lowercase input, but is not. This is leading to some test failures.

E.g. input_data=["The quick brown fox."], -> an UNK token at the beginning of the sequence and failing tests.

Can you take a look and confirm that we should be lowercasing input for all electra presets? If so I can go ahead an make the changes here, there will be some annoying renames to stick to our conversions--we should call the variants uncased_en instead of _en.

mattdangerw · 2024-03-23T00:45:31Z

Also, could you try converting the large versions? And adding presets? We should probably include those.

https://huggingface.co/collections/google/electra-release-64ff6e8b18830fabea30a1ab

# Conflicts: # keras_nlp/models/electra/electra_tokenizer.py

pranavvp16 · 2024-03-23T21:41:43Z

regarding the tokenizer I found this config for one of the presets. Also I have uploaded the weights for large models

mattdangerw · 2024-03-25T22:57:11Z

Thanks! I'll get large uploaded, and fix up the lowercasing issues.

mattdangerw · 2024-03-26T22:16:09Z

OK think this is working! Going to pull this in. @pranavvp16 please let us know if you spot any issues. We should probably test this out with an end to end colab to make sure things are working as intended.

mattdangerw · 2024-03-26T22:17:24Z

Not that I changed all the preset names to include "uncased" in keeping with bert conventions. electra_small_generator_uncased_en

…1384) * Added ElectraBackbone * Added backbone tests for ELECTRA * Fix config * Add model import to __init__ * add electra tokenizer * add tests for tokenizer * add __init__ file * add tokenizer and backbone to models __init__ * Fix Failing tokenization test * Add example on usage of the tokenizer with custom vocabulary * Add conversion script to convert weights from checkpoint * Add electra preprocessor * Add presets and tests * Add presets config with model weights * Add checkpoint conversion script * Name conversion for electra models * Update naming conventions according to preset names * Fix failing tokenizer tests * Update checkpoint conversion script according to kaggle * Add validate function * Kaggle preset * update preset link * Add electra presets * Complete run_small_preset test for electra * Add large variations of electra in presets * Fix case issues with electra presets * Fix format --------- Co-authored-by: Matt Watson <[email protected]>

pranavvp16 and others added 20 commits October 29, 2023 19:17

Added ElectraBackbone

f812c39

Merge branch 'keras-team:master' into electra

879020a

Added backbone tests for ELECTRA

c2aa9bd

Fix config

79df89f

Add model import to __init__

7bc3697

add electra tokenizer

b7bcfcf

add tests for tokenizer

8d9dd15

add __init__ file

273075a

add tokenizer and backbone to models __init__

bfbf648

Merge branch 'master' into electra

a79deb1

Fix Failing tokenization test

538d938

Merge remote-tracking branch 'origin/electra' into electra

eb8baa5

Merge branch 'keras-team:master' into electra

b3f81d5

Add example on usage of the tokenizer with custom vocabulary

47c9119

Merge branch 'keras-team:master' into electra

ec9f683

Add conversion script to convert weights from checkpoint

e3bad73

Add electra preprocessor

148913d

Add presets and tests

06dfae9

Add presets config with model weights

3b72d15

Add checkpoint conversion script

fcdcbbb

mattdangerw self-requested a review January 4, 2024 21:40

mattdangerw requested changes Jan 6, 2024

View reviewed changes

pranavvp16 and others added 4 commits January 21, 2024 20:08

Name conversion for electra models

d025883

Update naming conventions according to preset names

97b94ee

Merge branch 'master' into electra

316a15a

Fix failing tokenizer tests

b52d8b5

pranavvp16 mentioned this pull request Feb 2, 2024

Add Electra Weights to Kaggle Models #1422

Open

Merge branch 'keras-team:master' into electra

2e038eb

Complete run_small_preset test for electra

b268e26

pranavvp16 force-pushed the electra branch from dada198 to b268e26 Compare March 19, 2024 04:55

pranavvp16 requested a review from mattdangerw March 19, 2024 16:36

mattdangerw approved these changes Mar 20, 2024

View reviewed changes

pranavvp16 requested a review from mattdangerw March 20, 2024 20:10

mattdangerw added the kokoro:force-run Runs Tests on GPU label Mar 22, 2024

kokoro-team removed the kokoro:force-run Runs Tests on GPU label Mar 22, 2024

mattdangerw added the kokoro:force-run Runs Tests on GPU label Mar 23, 2024

kokoro-team removed the kokoro:force-run Runs Tests on GPU label Mar 23, 2024

mattdangerw force-pushed the electra branch from 60bde7c to 9c8c6f6 Compare March 23, 2024 00:36

pranavvp16 force-pushed the electra branch 2 times, most recently from 3780fe9 to 2889350 Compare March 23, 2024 21:07

Add large variations of electra in presets

0411151

pranavvp16 force-pushed the electra branch from 2889350 to 0411151 Compare March 23, 2024 21:08

Merge remote-tracking branch 'origin/master' into electra

fa9a2f2

# Conflicts: # keras_nlp/models/electra/electra_tokenizer.py

Fix case issues with electra presets

0bb7b64

mattdangerw added the kokoro:force-run Runs Tests on GPU label Mar 26, 2024

kokoro-team removed the kokoro:force-run Runs Tests on GPU label Mar 26, 2024

Fix format

c49e4ac

mattdangerw added the kokoro:force-run Runs Tests on GPU label Mar 26, 2024

kokoro-team removed the kokoro:force-run Runs Tests on GPU label Mar 26, 2024

mattdangerw merged commit a6700eb into keras-team:master Mar 26, 2024
8 of 10 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add presets for Electra and checkpoint conversion script #1384

Add presets for Electra and checkpoint conversion script #1384

pranavvp16 commented Jan 3, 2024

mattdangerw commented Jan 4, 2024

mattdangerw left a comment

mattdangerw Jan 6, 2024

pranavvp16 Jan 17, 2024

pranavvp16 commented Jan 17, 2024

pranavvp16 commented Mar 18, 2024

mattdangerw left a comment

mattdangerw Mar 20, 2024

pranavvp16 Mar 20, 2024

mattdangerw Mar 22, 2024

mattdangerw commented Mar 23, 2024

mattdangerw commented Mar 23, 2024 •

edited

Loading

pranavvp16 commented Mar 23, 2024

mattdangerw commented Mar 25, 2024

mattdangerw commented Mar 26, 2024

mattdangerw commented Mar 26, 2024

Add presets for Electra and checkpoint conversion script #1384

Add presets for Electra and checkpoint conversion script #1384

Conversation

pranavvp16 commented Jan 3, 2024

mattdangerw commented Jan 4, 2024

mattdangerw left a comment

Choose a reason for hiding this comment

mattdangerw Jan 6, 2024

Choose a reason for hiding this comment

pranavvp16 Jan 17, 2024

Choose a reason for hiding this comment

pranavvp16 commented Jan 17, 2024

pranavvp16 commented Mar 18, 2024

mattdangerw left a comment

Choose a reason for hiding this comment

mattdangerw Mar 20, 2024

Choose a reason for hiding this comment

pranavvp16 Mar 20, 2024

Choose a reason for hiding this comment

mattdangerw Mar 22, 2024

Choose a reason for hiding this comment

mattdangerw commented Mar 23, 2024

mattdangerw commented Mar 23, 2024 • edited Loading

pranavvp16 commented Mar 23, 2024

mattdangerw commented Mar 25, 2024

mattdangerw commented Mar 26, 2024

mattdangerw commented Mar 26, 2024

mattdangerw commented Mar 23, 2024 •

edited

Loading