Conversation

penfever
Contributor

@penfever penfever commented Aug 22, 2025

Description

This PR refactors and stabilizes the inference test suite, extends coverage to more engines (VLLM and LlamaCPP), makes VLM GPU inference tests more efficient by using smaller models, and standardizes the testing format for inference engines via an abstract base class.

Each inference engine now inherits 7+ standardized test methods covering basic inference, batch processing, file I/O, parameter validation, deterministic generation, and error handling scenarios. Abstract methods allow each engine to specify performance thresholds, model configurations, and hardware requirements while maintaining consistent test behavior.

Improved GPU VRAM cache release triggering, allowing longer test suites to run without CUDA out-of-memory (OOM) errors.

Related issues

Fixes # (issue)

Before submitting

  • This PR only changes documentation. (You can ignore the following checks in that case)
  • Did you read the contributor guideline Pull Request guidelines?
  • Did you link the issue(s) related to this PR in the section above?
  • Did you add / update tests where needed?

Reviewers

At least one review from a member of oumi-ai/oumi-staff is required.

@penfever penfever marked this pull request as ready for review August 25, 2025 23:10
@penfever penfever requested review from a team, oelachqar and wizeng23 and removed request for a team August 25, 2025 23:11
@penfever penfever changed the title [WIP] Penfever/inference tests Standardized Inference Engine Integration Testing Aug 26, 2025