Fix MiniCPM-o-2_6 #2847

Wovchena · 2025-10-16T07:45:14Z

Description

WWB uses _OVMiniCPMVForCausalLM.preprocess_inputs() for optimum-intel and --hf. That function fails to apply chat_template for MiniCPM-o-2_6 and MiniCPM-V-2_6. A fix is required for optimum-intel. It will take time to validate both of the models with that fix. This PR addresses other identified issues. The original implementation applies chat_template: https://huggingface.co/openbmb/MiniCPM-o-2_6/blob/main/modeling_minicpmo.py#L959 but it's not used in WWB.
WWB results for --weights-format fp16 and unpublished fix for optimum-intel chat_template:

--hf vs optimum-intel : 0.986911
--hf vs --genai: 0.935212
optimum-intel vs --genai 0.942425

Docs: https://wovchena.github.io/openvino.genai-public/docs/supported-models/#visual-language-models-vlms

That should be enough for this release but follow up PRs are needed. The ideal solution includes:

Fix optimum-intel chat_template application
Request WWB not to use optimum-intel with --hf
Align --hf and optimum-intel (likely to be never completed because the results are close: 0.986911)
Extend GenAI pre-commit tests
Copy image resize implementation from PIL to GenAI
Check if it's possible to align sin() and cos() with numpy with hardcoding commit sin() and cos() values

CVS-170987

Checklist:

Tests have been updated or added to cover the new code
This patch fully addresses the ticket.
I have made corresponding changes to the documentation

CVS-170987

Copilot

Pull Request Overview

This pull request fixes a bug in the WWB implementation where _OVMiniCPMVForCausalLM.preprocess_inputs() failed to apply chat templates for MiniCPM-o-2_6 and MiniCPM-V-2_6 models. The fix changes the image tag format from (<image>./</image>)\n to <image>./</image>\n across documentation and implementation files.

Key Changes

Updated image tag format for MiniCPM models from (<image>./</image>)\n to <image>./</image>\n
Added support for new MiniCPM-o-2_6 model variant
Fixed related algorithm implementations in the MiniCPM vision encoder

Reviewed Changes

Copilot reviewed 12 out of 12 changed files in this pull request and generated 3 comments.

Show a summary per file

File	Description
tests/python_tests/test_vlm_pipeline.py	Updated test to use correct image tag format without parentheses
src/python/py_vlm_pipeline.cpp	Updated documentation strings to show correct image tag format for both MiniCPM variants
src/python/openvino_genai/py_openvino_genai.pyi	Updated type stub documentation to reflect correct image tag format
src/cpp/src/visual_language/minicpm/classes.hpp	Updated resample method signature to use single target size with padding parameter
src/cpp/src/visual_language/minicpm/classes.cpp	Refactored implementation with improved algorithms and removed parentheses from image tag
src/cpp/src/visual_language/inputs_embedder.hpp	Updated comment to reflect correct image tag format
src/cpp/src/debug_utils.hpp	Added support for boolean numpy array type
src/cpp/include/openvino/genai/visual_language/pipeline.hpp	Updated API documentation with correct image tag format
site/docs/use-cases/image-processing/_sections/_usage_options/index.mdx	Updated documentation to separate MiniCPM-o-2_6 and MiniCPM-V-2_6 with correct tags
site/docs/supported-models/index.mdx	Added notes about MiniCPM-o compatibility issues
site/docs/supported-models/_components/vlm-models-table/models.ts	Added MiniCPM-o-2_6 model to supported models table
samples/cpp/visual_language_chat/README.md	Removed specific model reference to make documentation more generic

_{Tip: Customize your code reviews with copilot-instructions.md. Create the file or learn how to get started.}

src/cpp/src/visual_language/minicpm/classes.cpp

Wovchena added 4 commits October 16, 2025 08:59

Fix MiniCPM-o-2_6

0ace320

CVS-170987

fp32 weights and kv cache, sdpa 0.942925

1ace7b6

Clean up

6acda9c

docs

0ace78a

Copilot AI review requested due to automatic review settings October 16, 2025 07:45

fix link

1ace772

Copilot AI reviewed Oct 16, 2025

View reviewed changes

src/cpp/src/visual_language/minicpm/classes.cpp Show resolved Hide resolved

src/cpp/src/visual_language/minicpm/classes.cpp Show resolved Hide resolved

src/cpp/src/visual_language/minicpm/classes.cpp Show resolved Hide resolved

Wovchena changed the title ~~Fix mini cpm o 2 6 cvs 170987~~ Fix MiniCPM-o-2_6 Oct 16, 2025

doc

2acea3d

Wovchena added the Code Freeze label Oct 16, 2025

Wovchena added this to the 2025.4 milestone Oct 16, 2025

Wovchena requested a review from yatarkan October 16, 2025 10:40

Wovchena enabled auto-merge October 16, 2025 11:51

Wovchena mentioned this pull request Oct 16, 2025

Add MiniCPM-o-2_6 #2479

Closed

yatarkan approved these changes Oct 16, 2025

View reviewed changes

Wovchena added this pull request to the merge queue Oct 16, 2025

github-merge-queue bot removed this pull request from the merge queue due to failed status checks Oct 16, 2025

Wovchena added this pull request to the merge queue Oct 17, 2025

Merged via the queue into openvinotoolkit:master with commit 939001f Oct 17, 2025
138 of 142 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix MiniCPM-o-2_6 #2847

Fix MiniCPM-o-2_6 #2847

Uh oh!

Wovchena commented Oct 16, 2025 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Fix MiniCPM-o-2_6 #2847

Fix MiniCPM-o-2_6 #2847

Uh oh!

Conversation

Wovchena commented Oct 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Checklist:

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Key Changes

Reviewed Changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Wovchena commented Oct 16, 2025 •

edited

Loading