
Issues: HabanaAI/vllm-fork


Issues list

[Usage]: How to run FP8 inference
#453 opened Nov 3, 2024 by warlock135
1 task done
[Bug]: the generated text on BFloat16 is not as good as that on Float32. (label: bug)
#443 opened Oct 29, 2024 by ccrhx4
1 task done
[Bug]: Engine loop has died (label: bug)
#419 opened Oct 23, 2024 by warlock135
1 task done
[Bug]: MQLLMEngine dies after a period of inactivity (label: bug)
#416 opened Oct 23, 2024 by Xaenalt
1 task done
[RFC]: change VLLM_DECODE_BLOCK_BUCKET_* design to fit small AND large batch size at one warmup (label: intel)
#328 opened Sep 24, 2024 by ccrhx4
1 task done
[Misc]: issue with loading weights from safetensors files (label: external)
#211 opened Aug 28, 2024 by huijjj
[Usage]: The prompt bucket shape will not impact the performance (label: intel)
#209 opened Aug 28, 2024 by JunxiChhen
[Feature]: support pipeline parallelism inference in vllm (label: intel)
#205 opened Aug 27, 2024 by Zjq9409
[Feature]: Compile warmup take too long (label: intel)
#201 opened Aug 26, 2024 by Zjq9409
[Bug]: benchmark_latency.py cannot exit when using tp (labels: bug, intel)
#197 opened Aug 21, 2024 by JunxiChhen
[Usage]: vllm can't run qwen 32B inference (label: external)
#193 opened Aug 17, 2024 by kunger97
[Bug]: Device type HPU is not supported for torch.Generator() api (labels: bug, habana)
#183 opened Aug 14, 2024 by sungwook-son