Replies: 1 comment
-
Hey @lkwq007, we still haven't ported any of the quantization utilities. Quantization APIs might be different in NNX to leverage the ability to do model surgery. I'd imagine having a proper |
-
I was wondering if there are any examples demonstrating how to use aqt with flax.nnx, specifically for converting a pretrained model into a quantized version. Using `nnx.bridge.ToLinen` to wrap the model and then doing the quantization seems to be a feasible approach, but is there a better way to achieve this? Thanks.
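For context, here is a minimal sketch of what "converting a pretrained model into a quantized version" could mean numerically, written in plain JAX. It does not use the aqt or flax.nnx APIs. The `params` dict, `quantize_int8`, and `dequantize` are hypothetical names introduced only for illustration, and symmetric per-output-channel int8 quantization is an assumed scheme, not necessarily what aqt applies.

```python
import jax
import jax.numpy as jnp


def quantize_int8(w):
    """Symmetric per-output-channel int8 quantization of a 2D kernel (assumed scheme)."""
    # One scale per output column, chosen so the largest magnitude maps to 127.
    max_abs = jnp.max(jnp.abs(w), axis=0, keepdims=True)
    # Guard against all-zero columns to avoid dividing by zero.
    scale = jnp.where(max_abs > 0, max_abs / 127.0, 1.0)
    q = jnp.clip(jnp.round(w / scale), -127, 127).astype(jnp.int8)
    return {"q": q, "scale": scale}


def dequantize(qw):
    """Recover an approximate float32 kernel from the int8 values and scales."""
    return qw["q"].astype(jnp.float32) * qw["scale"]


# Hypothetical "pretrained" parameters: a plain dict standing in for a real checkpoint.
key_a, key_b = jax.random.split(jax.random.PRNGKey(0))
params = {
    "dense1": jax.random.normal(key_a, (16, 32)),
    "dense2": jax.random.normal(key_b, (32, 8)),
}

# Quantize every kernel, then dequantize to measure the round-trip error.
quantized = {name: quantize_int8(w) for name, w in params.items()}
restored = {name: dequantize(qw) for name, qw in quantized.items()}

max_err = max(float(jnp.max(jnp.abs(params[n] - restored[n]))) for n in params)
print("max abs reconstruction error:", max_err)
```

With round-to-nearest, each weight's reconstruction error should be at most about half of its column's scale. In an NNX setting, the same transformation would presumably be applied to each layer's kernel during model surgery, but that wiring is not shown here.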