Major Changes
- Bianries Update July 2025 by @martindevans in #1225
- docs: update the API references. by @AsakusaRinne in #1213, #1214
Bug Fixes
- Conversation
-1
instead of_end
by @martindevans in #1197 - Preserve cut off generated text by @OoLunar in #1202
- Fix for getting CUDA Version inside nvidia/cuda containers by @Crelex in #1222
Other Changes
- Update to stable Microsoft.Extensions.AI.Abstractions by @stephentoub in #1190
- Memory efficient context handling by @zsogitbe in #1183
- Expose
DecodeSpecialTokens
throughIInferenceParams
forStatelessExecutor
by @jacob-mink-1996 in #1203 - Unseal the DefaultSamplingPipeline by @m0nsky in #1208
- Updated main workflow from windows-2019 to windows-2022 by @martindevans in #1216, #1218
- Multi modal (Just libraries) by @SignalRT in #1220
New Contributors
- @omahs made their first contribution in #1196
- @OoLunar made their first contribution in #1202
- @leopardracer made their first contribution in #1206
- @jacob-mink-1996 made their first contribution in #1203
- @kilavvy made their first contribution in #1207
- @vtjl10 made their first contribution in #1209
- @zeevick10 made their first contribution in #1210
- @maximevtush made their first contribution in #1211
- @Crelex made their first contribution in #1222
Full Changelog: v0.24.0...v0.25.0