Skip to content

Commit 0ac9c2a

Browse files
committed
update
Signed-off-by: yiliu30 <[email protected]>
1 parent d1086a1 commit 0ac9c2a

File tree

1 file changed

+3
-0
lines changed
  • examples/pytorch/nlp/huggingface_models/language-modeling/quantization/auto_round

1 file changed

+3
-0
lines changed

examples/pytorch/nlp/huggingface_models/language-modeling/quantization/auto_round/README.md

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -4,6 +4,9 @@
44
```bash
55
export MODEL=Qwen/Qwen3-235B-A22B
66
```
7+
> [!TIP]
8+
> For quicker experimentation (shorter quantization and evaluation time, lower memory),
9+
> you can start with the smaller `Qwen/Qwen3-30B-A3B` model before moving to larger variants.
710
811
- MXFP8
912
```bash

0 commit comments

Comments
 (0)