
If LoRAX is based on Punica kernels, will it be able to support LoRA adapters for Mistral NeMo 12B? #549

Open
tensimixt opened this issue Jul 20, 2024 · 2 comments

Comments

@tensimixt

Feature request

If LoRAX is based on Punica kernels, will it be able to support LoRA adapters for Mistral NeMo 12B, which has a vocab size > 130k?
Currently, vLLM, for example, doesn't support vocab_size > 128512 when enable_lora=True.
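
For reference, here is a minimal sketch of how that limit shows up in vLLM (the model ID and adapter path are placeholders, and the exact check/error message may vary between vLLM versions):

```python
# Minimal sketch: LoRA-enabled vLLM with a large-vocab model.
# Mistral NeMo 12B has a ~131k-token vocabulary, which is above the
# 128512 limit mentioned above, so this is expected to be rejected.
from vllm import LLM, SamplingParams
from vllm.lora.request import LoRARequest

llm = LLM(
    model="mistralai/Mistral-Nemo-Instruct-2407",
    enable_lora=True,   # with LoRA enabled, vLLM checks the vocab size
    max_lora_rank=16,
)

outputs = llm.generate(
    ["Hello, how are you?"],
    SamplingParams(max_tokens=32),
    # adapter name and path are placeholders
    lora_request=LoRARequest("my-adapter", 1, "/path/to/lora-adapter"),
)
print(outputs[0].outputs[0].text)
```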

I think that if Hugging Face and LoRAX are also based on Punica kernels, they will have the same limitation. Or does this limitation not exist for TGI and LoRAX?

Thank you!

Motivation

Be able to run inference with Mistral NeMo + a LoRA adapter (in a multi-LoRA world).

Your contribution

I checked various deployment providers and found this limitation.

@Nero10578

Did you figure out whether Mistral NeMo 12B works with LoRA adapters in LoRAX? It still does not work with vLLM or Aphrodite, and I am looking for alternatives.

@preduct0r

Did you find any alternatives to vLLM? I still struggle with the problem of serving Mistral NeMo with a LoRA adapter.
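
If it helps, here is a rough sketch of how one could try this against LoRAX. The container command, port, and adapter ID are assumptions based on the LoRAX docs, and whether the >130k vocab actually works there is exactly what this issue is asking:

```python
# Rough sketch of querying a LoRAX server with a per-request LoRA adapter.
# Assumes a LoRAX server is already running, e.g. something like:
#   docker run --gpus all -p 8080:80 ghcr.io/predibase/lorax:latest \
#       --model-id mistralai/Mistral-Nemo-Instruct-2407
from lorax import Client

client = Client("http://127.0.0.1:8080")

# Base model only.
print(client.generate("Hello, how are you?", max_new_tokens=32).generated_text)

# Same prompt, routed through a LoRA adapter loaded on the fly.
print(
    client.generate(
        "Hello, how are you?",
        max_new_tokens=32,
        adapter_id="your-org/your-nemo-lora-adapter",  # placeholder adapter repo
    ).generated_text
)
```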
