Commit

stick to regular 2:4 sparsity, avoid marlin kernel
Signed-off-by: Brian Dellabetta <[email protected]>
brian-dellabetta committed Feb 10, 2025
1 parent 3462e17 commit 7b39550
Showing 3 changed files with 8 additions and 18 deletions.
8 changes: 8 additions & 0 deletions tests/e2e/vLLM/configs/sparse_24_qwen.yaml
@@ -0,0 +1,8 @@
cadence: "nightly"
test_type: "regression"
model: Qwen/Qwen2.5-0.5B
recipe: tests/e2e/vLLM/recipes/Sparse_2of4/recipe_sparse_2of4.yaml
scheme: sparse2of4_only
dataset_id: garage-bAInd/Open-Platypus
dataset_split: train
save_compressed: False
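
For context on the `sparse2of4_only` scheme in the config above: 2:4 sparsity is commonly understood to mean that every contiguous group of four weights holds at most two nonzero values. The sketch below illustrates that pattern in plain Python. The helper names `is_2of4_sparse` and `prune_2of4` are hypothetical and not part of this repository, and the magnitude-based pruning shown here is an assumption made for illustration; the recipe referenced in the config may select weights differently.

```python
def is_2of4_sparse(row, group=4, max_nonzero=2):
    """Return True if every contiguous group of `group` values
    contains at most `max_nonzero` nonzero entries."""
    if len(row) % group != 0:
        # A row that does not divide evenly into groups cannot follow the pattern.
        return False
    for start in range(0, len(row), group):
        block = row[start:start + group]
        if sum(1 for v in block if v != 0) > max_nonzero:
            return False
    return True


def prune_2of4(row, group=4, keep=2):
    """Zero the smallest-magnitude values so each full group keeps `keep` entries.

    Magnitude pruning is an illustrative assumption, not the recipe's method.
    """
    out = list(row)
    full = len(out) - len(out) % group  # ignore any trailing partial group
    for start in range(0, full, group):
        # Indices in this group, ordered from largest to smallest magnitude.
        order = sorted(range(start, start + group),
                       key=lambda i: abs(out[i]), reverse=True)
        for i in order[keep:]:
            out[i] = 0.0
    return out


weights = [0.1, -0.9, 0.5, 0.05, 1.0, 2.0, 3.0, 4.0]
pruned = prune_2of4(weights)
print(pruned, is_2of4_sparse(pruned))
```

A check like `is_2of4_sparse` could be applied row by row to a weight matrix to confirm that a checkpoint follows the pattern before it is handed to an inference engine.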
9 changes: 0 additions & 9 deletions tests/e2e/vLLM/configs/w4a16_2of4_channel_quant_qwen.yaml

This file was deleted.

9 changes: 0 additions & 9 deletions tests/e2e/vLLM/configs/w4a16_2of4_grouped_quant_qwen.yaml

This file was deleted.
