-
Notifications
You must be signed in to change notification settings - Fork 285
Pull requests: openvinotoolkit/openvino.genai
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Update Tokenizers Submodule
category: tokenizers
Tokenizer class or submodule update
#2767
opened Sep 24, 2025 by
apaniukov
Loading…
1 of 3 tasks
Set xfail for gguf reader tests on windows.
category: GGUF
GGUF file reader
#2766
opened Sep 24, 2025 by
popovaan
Loading…
Implement Adaptive R-KV mode for cache eviction
category: continuous batching
Continuous batching
category: CPP API
Changes in GenAI C++ public headers
category: Python API
Python API for GenAI
Update transformers to 4.53.3
category: GGUF
GGUF file reader
category: samples dependencies
category: tests dependencies
#2760
opened Sep 23, 2025 by
as-suvorov
•
Queued
Enable model caching for Whisper pipeline on GPU and NPU
category: Whisper samples
GenAI Whisper samples
#2759
opened Sep 23, 2025 by
luke-lin-vmc
Loading…
[GenAI] Pass customized position_ids to continues batching pipeline.
category: continuous batching
Continuous batching
category: CPP API
Changes in GenAI C++ public headers
category: prompt lookup
Prompt look-up decoding
category: Python API
Python API for GenAI
category: speculative decoding
Speculative decoding
no-match-files
[Embeddings] Add last token pooling
category: CPP API
Changes in GenAI C++ public headers
category: GGUF
GGUF file reader
category: Python API
Python API for GenAI
category: RAG
RAG pipeline components
category: tests dependencies
no-match-files
Bump timm from 1.0.19 to 1.0.20 in /tests/python_tests
category: GGUF
GGUF file reader
category: tests dependencies
dependencies
Pull requests that update a dependency file
python
Pull requests that update python code
#2753
opened Sep 22, 2025 by
dependabot
bot
Loading…
Remove unused get_model_kv_cache_precision()
category: continuous batching
Continuous batching
#2750
opened Sep 19, 2025 by
Wovchena
Loading…
[GHA] Shell command built fix
category: GHA
CI based on Github actions
#2749
opened Sep 19, 2025 by
mryzhov
Loading…
Bump datasets from 3.6.0 to 4.1.1 in /tools/who_what_benchmark
category: WWB
PR changes WWB
dependencies
Pull requests that update a dependency file
python
Pull requests that update python code
#2748
opened Sep 19, 2025 by
dependabot
bot
Loading…
Bump datasets from 3.6.0 to 4.1.1 in /tests/python_tests
category: GGUF
GGUF file reader
category: tests dependencies
dependencies
Pull requests that update a dependency file
python
Pull requests that update python code
#2747
opened Sep 19, 2025 by
dependabot
bot
Loading…
[DO NOT MERGE][ONLY FOR TESTING] Updated chat_sample.py to validate StatefulLLMPipeline
category: LLM samples
GenAI LLM samples
do_not_merge
do_not_review
#2745
opened Sep 18, 2025 by
AsyaPronina
Loading…
Using pytest cache instead of ov cache env variable
category: continuous batching
Continuous batching
category: GGUF
GGUF file reader
category: GHA
CI based on Github actions
category: LLM
LLM pipeline (stateful, static)
category: sampling
Sampling / Decoding algorithms
category: tokenizers
Tokenizer class or submodule update
category: visual language
Visual language pipeline
category: whisper
Whisper pipeline
eagle3 cb impl with top-1 proposal
category: cmake / build
Cmake scripts
category: continuous batching
Continuous batching
category: CPP API
Changes in GenAI C++ public headers
category: llm_bench
Label for tool/llm_bench folder
category: LLM
LLM pipeline (stateful, static)
category: LoRA
Low rank adapters
category: sampling
Sampling / Decoding algorithms
category: speculative decoding
Speculative decoding
no-match-files
#2740
opened Sep 17, 2025 by
songbell
Loading…
add_request() to support token_type_ids with prompt
category: continuous batching
Continuous batching
category: GGUF
GGUF file reader
category: visual language
Visual language pipeline
#2738
opened Sep 17, 2025 by
zhaohb
Loading…
C API: implemented VlmPipeline
category: C API
category: cmake / build
Cmake scripts
no-match-files
#2735
opened Sep 16, 2025 by
zhaohb
Loading…
[VLM] Add nanoLLaVA
category: CPP API
Changes in GenAI C++ public headers
category: GGUF
GGUF file reader
category: GH Pages Docs
Github Pages documentation
category: Python API
Python API for GenAI
category: visual language
Visual language pipeline
#2733
opened Sep 15, 2025 by
popovaan
Loading…
[VLMPipeline] Run embed models on GPU when run LM on NPU
#2730
opened Sep 12, 2025 by
JohnLeFeng
Loading…
[llm_bench] Add reranking pipeline
category: GGUF
GGUF file reader
category: llm_bench
Label for tool/llm_bench folder
#2728
opened Sep 12, 2025 by
sbalandi
Loading…
chang gpu_block_size to 256
category: continuous batching
Continuous batching
#2727
opened Sep 12, 2025 by
ceciliapeng2011
•
Draft
WWB Text Generation with LoRA
category: WWB
PR changes WWB
#2723
opened Sep 11, 2025 by
likholat
Loading…
Expose get_original_chat_template method in Tokenizer
category: CPP API
Changes in GenAI C++ public headers
category: GHA
CI based on Github actions
category: Python API
Python API for GenAI
category: tokenizers
Tokenizer class or submodule update
#2722
opened Sep 11, 2025 by
mzegla
Loading…
Previous Next
ProTip!
Exclude everything labeled
bug
with -label:bug.