
Split sharded Llama dataset exporting and loading in export scripts #327

Merged
sogartar merged 4 commits into nod-ai:main from sharded-llama-dataset-exporting on Oct 25, 2024

Conversation

sogartar
Contributor

Separate the two steps. We need the exported IRPA files for the IREE module anyway.

llama_config.tensor_parallelism_size = attn_q_weight.shard_count
llama_config = LlamaModelConfig(
hp,
tensor_parallelism_size=args.tensor_parallelism_size,
Contributor

This will not work: it assumes the passed argument is guaranteed to match the IRPA file. We should look into plumbing the sharding into the saved hyperparameters and then extracting it from there. I also dislike the hack on line 84, but the solution is not to introduce mismatches.
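Roughly something like this (a sketch only; it assumes the dataset object used by these scripts exposes a properties dict, as in the snippet below, plus save/load helpers, and the path variables are placeholders):

from sharktank.types import Dataset  # assumed location of the Dataset type

# Export side: record the sharding in the dataset properties so the
# loading script does not have to trust a CLI flag.
dataset.properties["tensor_parallelism_size"] = args.tensor_parallelism_size
dataset.save(output_irpa_path)  # output_irpa_path is a placeholder

# Load side: read it back out of the IRPA file.
dataset = Dataset.load(irpa_path)  # irpa_path is a placeholder
tensor_parallelism_size = dataset.properties["tensor_parallelism_size"]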

Contributor Author

OK, I will add it to the exported IRPA hyperparameters.

Contributor Author

I fixed it.

llama_config.tensor_parallelism_size = attn_q_weight.shard_count
llama_config = LlamaModelConfig(
hp,
tensor_parallelism_size=dataset.properties["tensor_parallelism_size"],
Contributor

Add a check before this for the case where tensor_parallelism_size has no value, and default to 1 there. We should make sure the old non-sharded IRPA files still work.

llama_config.tensor_parallelism_size = attn_q_weight.shard_count
llama_config = LlamaModelConfig(
hp,
tensor_parallelism_size=dataset.properties["tensor_parallelism_size"]
Contributor

Just declare it outside of llama_config. Giant comprehensions are bad for readability.
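E.g., combining this with the default-to-1 suggestion above (a sketch; the remaining constructor arguments stay as in the existing call):

# Default to 1 so old non-sharded IRPA files keep working.
tensor_parallelism_size = int(dataset.properties.get("tensor_parallelism_size", 1))
llama_config = LlamaModelConfig(
    hp,
    tensor_parallelism_size=tensor_parallelism_size,
    # ... other arguments unchanged ...
)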

Contributor Author

Done

Contributor Author

Done

sogartar force-pushed the sharded-llama-dataset-exporting branch from 2c22d38 to 38c3410 on October 25, 2024 at 07:48
sogartar force-pushed the sharded-llama-dataset-exporting branch from 3fbb144 to 1dc7897 on October 25, 2024 at 19:10
sogartar enabled auto-merge (squash) on October 25, 2024 at 19:11
sogartar merged commit 1aeb3a8 into nod-ai:main on Oct 25, 2024
3 checks passed