feat: modify rope for llama-3 and support llama-3.2 #131

Merged: rheasukthanker merged 7 commits into main from fix_llama on Oct 10, 2024

Conversation

@rheasukthanker (Collaborator) commented Oct 8, 2024

Reference Issues/PRs

Closes #130 and #129

What does this implement/fix? Explain your changes.

Fixes RoPE for Llama-3 models. Using the whittle API in eval-harness, we can now match the Llama-3.1-8B performance obtained through the Hugging Face API.
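
For background, Llama-3.1/3.2 checkpoints rescale the RoPE inverse frequencies with a piecewise scheme controlled by the config fields factor, low_freq_factor, high_freq_factor, and original_max_position_embeddings. The sketch below illustrates that adjustment under the default Llama-3.1 parameters; the function name is hypothetical and the actual whittle implementation in this PR may be structured differently.

```python
import math

import torch


def llama3_scale_inv_freq(
    inv_freq: torch.Tensor,
    factor: float = 8.0,
    low_freq_factor: float = 1.0,
    high_freq_factor: float = 4.0,
    original_max_position_embeddings: int = 8192,
) -> torch.Tensor:
    """Rescale RoPE inverse frequencies in the Llama-3.1/3.2 style.

    Short wavelengths (high frequencies) are left untouched, long wavelengths
    (low frequencies) are divided by `factor`, and the band in between is
    linearly interpolated between the two regimes.
    """
    low_freq_wavelen = original_max_position_embeddings / low_freq_factor
    high_freq_wavelen = original_max_position_embeddings / high_freq_factor

    wavelen = 2 * math.pi / inv_freq
    # Interpolation weight for the medium-frequency band (0 at the
    # low-frequency boundary, 1 at the high-frequency boundary).
    smooth = (original_max_position_embeddings / wavelen - low_freq_factor) / (
        high_freq_factor - low_freq_factor
    )
    medium = (1 - smooth) * inv_freq / factor + smooth * inv_freq

    return torch.where(
        wavelen > low_freq_wavelen,  # low frequency: scale down
        inv_freq / factor,
        torch.where(wavelen < high_freq_wavelen, inv_freq, medium),
    )
```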

Minimal Example / How should this PR be tested?

Added a test for Llama 3.2.
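
As a rough illustration of the sort of sanity check such a test can perform (hypothetical; it reuses the llama3_scale_inv_freq sketch above and a rope_theta of 500000 as used by Llama-3 style configs, and is not the test added by this PR):

```python
import math

import torch

head_dim = 64           # e.g. a Llama-3.2-style head dimension (illustrative)
rope_theta = 500_000.0  # RoPE base used by Llama-3 style configs

inv_freq = 1.0 / (rope_theta ** (torch.arange(0, head_dim, 2).float() / head_dim))
scaled = llama3_scale_inv_freq(inv_freq)  # sketch from above

wavelen = 2 * math.pi / inv_freq
high_freq_wavelen = 8192 / 4.0  # original_max_position_embeddings / high_freq_factor
low_freq_wavelen = 8192 / 1.0   # original_max_position_embeddings / low_freq_factor

# High-frequency components should pass through unchanged ...
assert torch.allclose(scaled[wavelen < high_freq_wavelen],
                      inv_freq[wavelen < high_freq_wavelen])
# ... and low-frequency components should be divided by the scaling factor.
assert torch.allclose(scaled[wavelen > low_freq_wavelen],
                      inv_freq[wavelen > low_freq_wavelen] / 8.0)
```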

@rheasukthanker rheasukthanker marked this pull request as draft October 8, 2024 14:26
@rheasukthanker rheasukthanker marked this pull request as ready for review October 8, 2024 14:46
@rheasukthanker rheasukthanker marked this pull request as draft October 8, 2024 14:54
@rheasukthanker rheasukthanker marked this pull request as ready for review October 8, 2024 15:16
Review comment on the following diff hunk:

from whittle.models.gpt.blocks import Block
from whittle.modules.embedding import Embedding
from whittle.modules.layernorm import LayerNorm
from whittle.modules.linear import Linear
from whittle.modules.rmsnorm import RMSNorm


- class GPT(nn.Module):
+ class GPT(torch.nn.Module):
Collaborator:
we import torch.nn above, so we can use nn.Module here

Collaborator Author (rheasukthanker):
Fixed this

@rheasukthanker merged commit 4e54c6e into main on Oct 10, 2024
7 checks passed
@rheasukthanker deleted the fix_llama branch on October 10, 2024 12:10
Successfully merging this pull request may close these issues: Support Meta-llama-3.2