Fix gemma rms_normalization's use of epsilon #1472

cpsauer · 2024-02-27T08:40:53Z

Hi wonderful Keras folks,

I was browsing the new Gemma source and noticed that the RMSNorm code didn't use the epsilon parameter it takes in. This fixes that.

While we're here, I'm curious what drove the 1+scale multiplier (instead of just initializing scale to 1). Would love to learn if you're down to share.

Thanks,
Chris
(ex-Googler)
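
For readers following along, here is a minimal NumPy sketch of the two points raised above. It is an illustration only, not the KerasNLP code: the function name `rms_norm` and the default `epsilon=1e-6` are assumptions made for this example. It shows where epsilon enters an RMS normalization, and that a zero-initialized `scale` used as `1 + scale` gives the same output at initialization as a ones-initialized `scale` used directly.

```python
import numpy as np


def rms_norm(x, scale, epsilon=1e-6):
    """Hypothetical RMS normalization over the last axis (illustrative only)."""
    # Mean of squares over the feature axis.
    mean_sq = np.mean(np.square(x), axis=-1, keepdims=True)
    # epsilon keeps the denominator non-zero when the input is all zeros.
    normed = x / np.sqrt(mean_sq + epsilon)
    # With scale initialized to zeros, (1 + scale) starts as an identity multiplier.
    return normed * (1.0 + scale)


# All-zero input: without epsilon this would be 0 / 0.
zeros_out = rms_norm(np.zeros((2, 4)), np.zeros(4))

# At initialization, 1 + zeros matches multiplying by ones directly.
x = np.array([[3.0, 4.0]])
with_offset = rms_norm(x, np.zeros(2), epsilon=0.0)
with_ones = x / np.sqrt(np.mean(np.square(x), axis=-1, keepdims=True)) * np.ones(2)
```

The sketch only shows that the two parameterizations agree at initialization; it does not explain why one was preferred.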

mattdangerw

Thanks, this looks good to me. I don't actually know why 1 + scale was chosen, and I'd love to know myself. If I find out, I'll let you know!

cpsauer · 2024-02-28T07:44:54Z

Thanks, Matt! Glad I'm not the only one curious.
(Also, I looked you up on LinkedIn, and it looks like we took similar academic tracks. Go Stanford :)

mattdangerw approved these changes Feb 27, 2024

mattdangerw merged commit 81de50a into keras-team:master Feb 28, 2024
6 checks passed