Conversation

@CUHKSZzxy CUHKSZzxy commented Sep 23, 2025

Motivation

The main branch was tested with the script below, which uses large images whose prompt tokens exceed the max_prefill_token_num limit, triggering the is_long_context condition in lmdeploy. As a result, the response is repetitive and meaningless.
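In rough terms, the condition is the following check. This is an illustrative sketch only, not lmdeploy's actual implementation; the function and argument names are placeholders.

```python
# Illustrative sketch (placeholder names, not lmdeploy's code): a request is
# treated as long context when its prompt, including the visual tokens produced
# for the images, does not fit into a single prefill chunk.
def exceeds_prefill_budget(num_prompt_tokens: int, max_prefill_token_num: int) -> bool:
    return num_prompt_tokens > max_prefill_token_num
```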

Testing script:

```python
import os
from lmdeploy import pipeline, PytorchEngineConfig
from lmdeploy.vl import load_image

if __name__ == '__main__':
    os.environ['CUDA_VISIBLE_DEVICES'] = '4'

    # configurations
    tp = 1
    backend_config = PytorchEngineConfig(
        tp=tp,
        # max_prefill_token_num=10240
    )

    # init pipeline
    model_path = 'InternVL3_5-8B-Flash'
    pipe = pipeline(
        model_path,
        backend_config=backend_config,
        log_level='INFO'
    )

    # inference
    messages = [
        dict(role='user', content=[
            dict(type='text', text='<IMAGE_TOKEN>\n<IMAGE_TOKEN>\nDescribe the two images in detail.'),
            dict(type='image_url', image_url=dict(url='img1.jpeg')),
            dict(type='image_url', image_url=dict(url='img2.jpeg')),
        ])
    ]

    response = pipe(messages)
    print(response)
```
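For comparison, enlarging the prefill budget (the commented-out max_prefill_token_num above) keeps the same request below the limit so the long-context path is not taken. The value 10240 mirrors the commented-out line and is only an assumption about what is large enough for these two images.

```python
from lmdeploy import pipeline, PytorchEngineConfig

# Comparison configuration sketch: with a larger prefill budget the two images
# fit into a single prefill pass, so the is_long_context condition no longer
# triggers. The required value depends on the images' visual token count.
backend_config = PytorchEngineConfig(
    tp=1,
    max_prefill_token_num=10240,
)
pipe = pipeline('InternVL3_5-8B-Flash', backend_config=backend_config, log_level='INFO')
```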

Performance & Accuracy

Tested with VLMEvalKit on the BLINK dataset.

| Model | Flash Mode | Time | Acc |
| --- | --- | --- | --- |
| InternVL3.5-8B | / | ~3m06s | 52.66 |
| InternVL3.5-8B-Flash | false | ~3m06s | 52.91 |
| InternVL3.5-8B-Flash | true | ~2m51s | 51.44 |

@CUHKSZzxy CUHKSZzxy changed the title from "fix multi-image long context acc" to "fix internvl flash long context acc" on Sep 23, 2025

@grimoire grimoire left a comment

LGTM

@lvhan028

Could you add the evaluation result of InternVL3.5-241B?

@CUHKSZzxy CUHKSZzxy commented Sep 24, 2025

> Could you add the evaluation result of InternVL3.5-241B?

Tested with VLMEvalKit on the BLINK dataset.

| Model | Flash Mode | Time | Acc |
| --- | --- | --- | --- |
| InternVL3.5-241B-A28B | / | / | 61.4 |
| InternVL3.5-241B-A28B-Flash | false | ~7m12s | 61.3 |
| InternVL3.5-241B-A28B-Flash | true | ~7m02s | 59.7 |

The InternVL3.5-241B-A28B result is taken from Table 5 of the official InternVL3.5 report: https://arxiv.org/pdf/2508.18265

(Screenshot of Table 5 from the InternVL3.5 report.)

@lvhan028 lvhan028 merged commit d18ab56 into InternLM:main Sep 24, 2025
5 of 6 checks passed
@CUHKSZzxy CUHKSZzxy deleted the fix-internvl-flash branch September 25, 2025 03:07