Fix for Sparsity Persist #2323

Merged
merged 5 commits into from
Jun 11, 2024

@Satrat Satrat commented Jun 7, 2024

While debugging the Marlin24 kernels, I found that the sparsity structure was not being maintained correctly: the vLLM check for sparsity structure was failing. After this GPTQ fix, the problem went away. I pulled this code from the nm-AutoGPTQ codebase.
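
For readers unfamiliar with the failure mode, here is a minimal sketch of how a weight update can break a sparsity structure and how re-applying the pruning mask preserves it. It assumes the structure in question is 2:4 semi-structured sparsity (at least two zeros in every group of four consecutive weights), which the "24" in Marlin24 suggests. The helper names `is_2_4_sparse` and `update_preserving_mask` are hypothetical, used only for illustration; this is not the code from this PR, from nm-AutoGPTQ, or from vLLM's check.

```python
def is_2_4_sparse(row, group=4, min_zeros=2):
    """Return True if every group of `group` consecutive values
    contains at least `min_zeros` exact zeros (assumed 2:4 pattern)."""
    if len(row) % group != 0:
        return False
    return all(
        sum(1 for v in row[i:i + group] if v == 0) >= min_zeros
        for i in range(0, len(row), group)
    )


def update_preserving_mask(row, delta):
    """Apply an additive update, but keep pruned (zero) weights at zero.
    This stands in for re-applying the sparsity mask after an update
    such as a GPTQ-style error-compensation step."""
    return [0.0 if w == 0 else w + d for w, d in zip(row, delta)]


# A row that already satisfies the assumed 2:4 pattern.
pruned = [0.5, 0.0, 0.0, -1.2, 0.0, 0.3, 0.7, 0.0]
# A dense update, e.g. quantization error spread over remaining weights.
delta = [0.1] * len(pruned)

# Naive update: pruned positions become non-zero, so the pattern is lost.
naive = [w + d for w, d in zip(pruned, delta)]
# Masked update: pruned positions stay zero, so the pattern survives.
masked = update_preserving_mask(pruned, delta)

print(is_2_4_sparse(pruned), is_2_4_sparse(naive), is_2_4_sparse(masked))
```

A structure check of this kind fails on `naive` and passes on `masked`, which matches the symptom described above: a check that fails until the quantization step stops writing to pruned positions.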

@dbogunowicz
Contributor

@Satrat Nice find! Do you think this may also be what confused me over the last few days?

dbogunowicz previously approved these changes Jun 10, 2024
rahul-tuli previously approved these changes Jun 10, 2024

@Satrat Satrat dismissed stale reviews from rahul-tuli and dbogunowicz via 33aa7da June 10, 2024 15:28
@bfineran bfineran merged commit e255b17 into main Jun 11, 2024
17 checks passed
@bfineran bfineran deleted the sa/marlin_24 branch June 11, 2024 19:04
5 participants