Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Model Request] Jiutian 13.9b MoE model #2935

Open
20241123 opened this issue Sep 23, 2024 · 1 comment
Open

[Model Request] Jiutian 13.9b MoE model #2935

20241123 opened this issue Sep 23, 2024 · 1 comment

Comments

@20241123
Copy link

🐛 Bug

I am trying to work with Jiutian 13.9b MoE model.But getting error in model compilation step.

To Reproduce

Steps to reproduce the behavior:

pip install --pre -U -f https://mlc.ai/wheels mlc-chat-nightly-cu122 mlc-ai-nightly-cu122
git lfs install
git clone https://huggingface.co/clemenschen/Jiutian-139MoE-chat-4bit
mlc_llm convert_weight Jiutian-139MoE-chat --quantization q4f16_1 -o Jiutian-139MoE-chat_mlc

Results in:

raise ValueError(f"Unknown model type: {model_type}. Available ones: {list(MODELS.keys())}")
ValueError: Unknown model type: jiutian. Available ones: ['llama', 'mistral', 'gemma', 'gemma2', 'gpt2', 'mixtral', 'gpt_neox', 'gpt_bigcode', 'phi-msft', 'phi', 'phi3', 'phi3_v', 'qwen', 'qwen2', 'qwen2_moe', 'stablelm', 'baichuan', 'internlm', 'internlm2', 'rwkv5', 'orion', 'llava', 'rwkv6', 'chatglm', 'eagle', 'bert', 'medusa', 'starcoder2', 'cohere', 'minicpm']

Expected behavior

Should convert the weights. It works fine with qwen and qwen2.

Environment

Operating system: Ubuntu
Device: v100
How you installed MLC-LLM: pip AND source
How you installed TVM-Unity: pip
Python version: 3.10
GPU driver version (if applicable): 5.7

@20241123 20241123 added the bug Confirmed bugs label Sep 23, 2024
@MasterJH5574 MasterJH5574 added new-models and removed bug Confirmed bugs labels Sep 23, 2024
@MasterJH5574 MasterJH5574 changed the title [Bug] Jiutian 13.9b MoE model compilation not working [Model Request] Jiutian 13.9b MoE model Sep 23, 2024
@MasterJH5574
Copy link
Member

Hi @20241123 thanks for the question. This model is in a new architecture that MLC does not yet support. Contributions are welcome :-)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
Development

No branches or pull requests

2 participants