How do I enforce bfloat16 activations in my model? #4723
With the new NNX API it's not clear to me how I can enforce that the activations of my NNX model are in a different dtype than the model's weights. Are there best practices for mixed-precision training?
Answered by wittenator (May 13, 2025):
Just for reference: I missed that there are `dtype` and `param_dtype` parameters on each layer. `dtype` specifies the activation (compute) dtype and `param_dtype` the dtype of the weights.