Skip to content

Commit

Permalink
Add GGUF Qwen 2 (#860)
Browse files Browse the repository at this point in the history
* Add GGUF Qwen 2

* Update readme
  • Loading branch information
EricLBuehler authored Oct 16, 2024
1 parent 25174b4 commit d82306d
Show file tree
Hide file tree
Showing 7 changed files with 459 additions and 1 deletion.
11 changes: 11 additions & 0 deletions Cargo.lock

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

1 change: 1 addition & 0 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -415,6 +415,7 @@ If you do not specify the architecture, an attempt will be made to use the model
- `phi2`
- `phi3`
- `starcoder2`
- `qwen2`
**With adapters:**
Expand Down
1 change: 1 addition & 0 deletions mistralrs-core/src/gguf/mod.rs
Original file line number Diff line number Diff line change
Expand Up @@ -26,6 +26,7 @@ pub enum GGUFArchitecture {
Phi2,
Phi3,
Starcoder2,
Qwen2,
}

// Wraps from_str() for some convenience:
Expand Down
1 change: 1 addition & 0 deletions mistralrs-core/src/models/mod.rs
Original file line number Diff line number Diff line change
Expand Up @@ -9,6 +9,7 @@ pub(crate) mod phi3_5_moe;
pub(crate) mod quantized_llama;
pub(crate) mod quantized_phi2;
pub(crate) mod quantized_phi3;
pub(crate) mod quantized_qwen2;
pub(crate) mod quantized_starcoder2;
pub(crate) mod qwen2;
pub(crate) mod starcoder2;
Loading

0 comments on commit d82306d

Please sign in to comment.