Skip to content

Commit 94a955e

Browse files
committed
fix
Signed-off-by: yiliu30 <[email protected]>
1 parent 59ed49a commit 94a955e

File tree

1 file changed

+7
-0
lines changed
  • examples/pytorch/nlp/huggingface_models/language-modeling/quantization/auto_round

1 file changed

+7
-0
lines changed

examples/pytorch/nlp/huggingface_models/language-modeling/quantization/auto_round/README.md

Lines changed: 7 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -25,6 +25,13 @@ python quantize.py --model $QWEN_MODEL -t qwen_mxfp4 --use_autoround_format --ou
2525
python quantize.py --model $DS_MODEL -t qwen_mxfp4 --use_autoround_format --output_dir ./qmodels
2626
```
2727

28+
## Evaluation
29+
```bash
30+
git clone https://github.com/yiliu30/vllm-fork/tree/
31+
cd vllm-fork
32+
git checkout fused-moe-ar
33+
VLLM_USE_PRECOMPILED=1 pip install --editable . -vvv
34+
```
2835

2936
### Prompt Tests
3037

0 commit comments

Comments
 (0)