Skip to content

Pull requests: openvinotoolkit/openvino.genai

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Update Tokenizers Submodule category: tokenizers Tokenizer class or submodule update
#2767 opened Sep 24, 2025 by apaniukov Loading…
1 of 3 tasks
Set xfail for gguf reader tests on windows. category: GGUF GGUF file reader
#2766 opened Sep 24, 2025 by popovaan Loading…
Implement Adaptive R-KV mode for cache eviction category: continuous batching Continuous batching category: CPP API Changes in GenAI C++ public headers category: Python API Python API for GenAI
#2762 opened Sep 23, 2025 by vshampor Draft
[CI] [GHA] Remove redundant args from the coverity command category: GHA CI based on Github actions
#2761 opened Sep 23, 2025 by akashchi Loading… 2025.4
[Embeddings] Add last token pooling category: CPP API Changes in GenAI C++ public headers category: GGUF GGUF file reader category: Python API Python API for GenAI category: RAG RAG pipeline components category: tests dependencies no-match-files
#2757 opened Sep 22, 2025 by as-suvorov Loading… 2025.4
Bump timm from 1.0.19 to 1.0.20 in /tests/python_tests category: GGUF GGUF file reader category: tests dependencies dependencies Pull requests that update a dependency file python Pull requests that update python code
#2753 opened Sep 22, 2025 by dependabot bot Loading…
Remove unused get_model_kv_cache_precision() category: continuous batching Continuous batching
#2750 opened Sep 19, 2025 by Wovchena Loading…
[GHA] Shell command built fix category: GHA CI based on Github actions
#2749 opened Sep 19, 2025 by mryzhov Loading…
Bump datasets from 3.6.0 to 4.1.1 in /tools/who_what_benchmark category: WWB PR changes WWB dependencies Pull requests that update a dependency file python Pull requests that update python code
#2748 opened Sep 19, 2025 by dependabot bot Loading…
Bump datasets from 3.6.0 to 4.1.1 in /tests/python_tests category: GGUF GGUF file reader category: tests dependencies dependencies Pull requests that update a dependency file python Pull requests that update python code
#2747 opened Sep 19, 2025 by dependabot bot Loading…
Using pytest cache instead of ov cache env variable category: continuous batching Continuous batching category: GGUF GGUF file reader category: GHA CI based on Github actions category: LLM LLM pipeline (stateful, static) category: sampling Sampling / Decoding algorithms category: tokenizers Tokenizer class or submodule update category: visual language Visual language pipeline category: whisper Whisper pipeline
#2744 opened Sep 18, 2025 by sgonorov Draft
eagle3 cb impl with top-1 proposal category: cmake / build Cmake scripts category: continuous batching Continuous batching category: CPP API Changes in GenAI C++ public headers category: llm_bench Label for tool/llm_bench folder category: LLM LLM pipeline (stateful, static) category: LoRA Low rank adapters category: sampling Sampling / Decoding algorithms category: speculative decoding Speculative decoding no-match-files
#2740 opened Sep 17, 2025 by songbell Loading…
add_request() to support token_type_ids with prompt category: continuous batching Continuous batching category: GGUF GGUF file reader category: visual language Visual language pipeline
#2738 opened Sep 17, 2025 by zhaohb Loading…
[VLM] Add nanoLLaVA category: CPP API Changes in GenAI C++ public headers category: GGUF GGUF file reader category: GH Pages Docs Github Pages documentation category: Python API Python API for GenAI category: visual language Visual language pipeline
#2733 opened Sep 15, 2025 by popovaan Loading…
[llm_bench] Add reranking pipeline category: GGUF GGUF file reader category: llm_bench Label for tool/llm_bench folder
#2728 opened Sep 12, 2025 by sbalandi Loading…
chang gpu_block_size to 256 category: continuous batching Continuous batching
#2727 opened Sep 12, 2025 by ceciliapeng2011 Draft
WWB Text Generation with LoRA category: WWB PR changes WWB
#2723 opened Sep 11, 2025 by likholat Loading…
Expose get_original_chat_template method in Tokenizer category: CPP API Changes in GenAI C++ public headers category: GHA CI based on Github actions category: Python API Python API for GenAI category: tokenizers Tokenizer class or submodule update
#2722 opened Sep 11, 2025 by mzegla Loading…
ProTip! Exclude everything labeled bug with -label:bug.