Pull requests: HabanaAI/vllm-fork

Pull requests list

Add option to disable duplicates in topk
#464 opened Nov 6, 2024 by kdamaszk
AWQ Support
#458 opened Nov 4, 2024 by maktukmak
Dev/afierka/mss acc fix
#456 opened Nov 4, 2024 by afierka-intel Draft
Config hidden layer number to run in 1 lazy graph
#451 opened Nov 1, 2024 by libinta
to make repetition penalty faster
#442 opened Oct 29, 2024 by ccrhx4
Add HPU information to collect_env script
#430 opened Oct 25, 2024 by michalkuligowski
GPTQ Support
#421 opened Oct 23, 2024 by maktukmak
Create run-lm-eval-mmlu.sh
#399 opened Oct 16, 2024 by michalkuligowski Draft
Optimize LoRA mask creation (label: habana)
#285 opened Sep 13, 2024 by SanjuCSudhakaran Draft
[build] Changes for RH build (label: external)
#190 opened Aug 15, 2024 by Xaenalt