LLM pipeline implementation #1040
base: master
Conversation
…re pipeline cannot handle an input size larger than the max prefill size
MLCommons CLA bot: All contributors have signed the MLCommons CLA ✍️ ✅
…lemented performance benchmark for LLM pipeline
…y input and issue_query only handles output tokens
namespace mobile {

// A method to be called by the backend as soon as the first token is generated (only for token-based benchmarks)
static void FirstTokenCallback(void* context) {
What's the use of `context`?
The context is the set of arguments that gets passed to LoadGen; these are created by the driver and sent to the backend. The backend only needs to pass them on to the callback without reading or modifying them.
@freedomtan to check it.
No description provided.