Skip to content

Conversation

@vbaddi
Copy link
Contributor

@vbaddi vbaddi commented Aug 6, 2025

No description provided.

Signed-off-by: vbaddi <[email protected]>
@vbaddi vbaddi added the enhancement New feature or request label Aug 7, 2025
@MT-Paugurs
Copy link

Good day, Qefficient dev team,
we are using 8x AI100Ultra cards locally and are very interested into trying out this model. Any estimates for the release date?

Signed-off-by: Mamta Singh <[email protected]>
@ochougul
Copy link
Contributor

ochougul commented Nov 3, 2025

Good day, Qefficient dev team, we are using 8x AI100Ultra cards locally and are very interested into trying out this model. Any estimates for the release date?

Hi @MT-Paugurs , Thanks for showing interest.
Feel free to use this branch to give a quick run using examples/gpt_oss.py script.
Currently the model is optimized for decode-only mode.
We are working on enabling the prefill+decode mode. And will be released soon.

Can you please tell, which sdk version you are using?

@ochougul ochougul force-pushed the add_gpt_oss branch 2 times, most recently from 084412a to 21a6620 Compare November 3, 2025 16:47
@quic-mamta quic-mamta marked this pull request as draft November 3, 2025 17:23
@PaugursA
Copy link

PaugursA commented Nov 3, 2025

Good day, Qefficient dev team, we are using 8x AI100Ultra cards locally and are very interested into trying out this model. Any estimates for the release date?

Hi @MT-Paugurs , Thanks for showing interest.

Feel free to use this branch to give a quick run using examples/gpt_oss.py script.

Currently the model is optimized for decode-only mode.

We are working on enabling the prefill+decode mode. And will be released soon.

Can you please tell, which sdk version you are using?

Hi! Currently using Platform/Apps SDKs 1.20.4.0.

Thanks for the info!

Paugurs

@quic-mamta quic-mamta closed this Nov 5, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

enhancement New feature or request

Projects

None yet

Development

Successfully merging this pull request may close these issues.

9 participants