Conversation

@prashantgupta24 (Collaborator) commented on Oct 1, 2025

Description

Revert to the 3-layer micro model.

Needs a fix from @ckadner (#499) where the HF models cache also contains the revision, so that we can cache the right model.
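For context: huggingface_hub already separates revisions on disk, giving each one its own snapshots/<commit> folder inside the cache, which is what makes a revision-aware CI cache key possible. A minimal sketch of a revision-pinned download (the SHA below is a placeholder, not taken from this PR):

```python
from huggingface_hub import snapshot_download

# Placeholder pin: the commit SHA of the 3-layer revision (not from this PR).
MICRO_3L_REVISION = "<commit-sha-of-3-layer-revision>"

# Each revision lands in its own snapshots/<commit> folder inside the HF cache,
# so a CI cache keyed on the revision can restore exactly this snapshot.
path = snapshot_download(
    repo_id="ibm-ai-platform/micro-g3.3-8b-instruct-1b",
    revision=MICRO_3L_REVISION,
)
print(path)  # .../models--ibm-ai-platform--micro-g3.3-8b-instruct-1b/snapshots/<commit>
```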

@prashantgupta24 marked this pull request as draft on October 1, 2025 at 18:43
@prashantgupta24 (Collaborator, Author) commented:

bot:test


  - name: "Download HF models"
-   if: ( steps.changed-src-files.outputs.any_changed == 'true' && steps.cache_restore.outputs.cache-hit != 'true' )
+   if: ( steps.changed-src-files.outputs.any_changed == 'true' && steps.cache_restore.outputs.cache-hit == 'true' )
@prashantgupta24 (Collaborator, Author) commented:

Temporary change to get GH to download the older revision.
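For reference: the flipped condition forces the download step to run even when the (stale) cache is restored. The local-cache equivalent of that check, sketched with huggingface_hub (the helper name is illustrative, not from this PR):

```python
from huggingface_hub import snapshot_download
from huggingface_hub.errors import LocalEntryNotFoundError

def revision_is_cached(repo_id: str, revision: str) -> bool:
    """Return True if the pinned revision already has a complete local snapshot."""
    try:
        # local_files_only=True never touches the network; it raises if the
        # requested revision is not fully present in the local HF cache.
        snapshot_download(repo_id=repo_id, revision=revision, local_files_only=True)
        return True
    except LocalEntryNotFoundError:
        return False
```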

@prashantgupta24 (Collaborator, Author) commented:

FYI the bot test seems to be fine with the 3-layer model!

@ckadner mentioned this pull request on Oct 2, 2025
@ckadner (Collaborator) left a comment:

If it is just tiny granite that we need to change revisions for (occasionally), then your quick fix here is probably sufficient -- after merging your PR and deleting the old cache blobs


  - name: "Download HF models"
-   if: ( steps.changed-src-files.outputs.any_changed == 'true' && steps.cache_restore.outputs.cache-hit != 'true' )
+   if: ( steps.changed-src-files.outputs.any_changed == 'true' && steps.cache_restore.outputs.cache-hit == 'true' )
@ckadner (Collaborator) commented:

This means you will have both revisions:

  • 4-layer model from the cache
  • 3-layer model downloaded with revision

making all the tests work. If you bypass the cache, I reckon some tests will break, like in PR #499.
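To see both snapshots side by side, the cache can be inspected with huggingface_hub's scan_cache_dir; a small illustrative sketch, not part of this PR:

```python
from huggingface_hub import scan_cache_dir

# List every cached revision of the tiny granite model; with the flipped
# condition above, both the old 4-layer snapshot and the pinned 3-layer
# snapshot should show up here.
for repo in scan_cache_dir().repos:
    if repo.repo_id == "ibm-ai-platform/micro-g3.3-8b-instruct-1b":
        for rev in repo.revisions:
            print(rev.commit_hash, sorted(rev.refs), f"{rev.size_on_disk} bytes")
```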

@prashantgupta24 (Collaborator, Author) commented:

Once the 3-layer model is verified, I don't think we need the 4-layer one anymore, so we should be using only one revision of the model in all places.
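One way to enforce a single revision everywhere would be one pinned constant that every loader goes through; a hypothetical sketch (file location, helper name, and SHA are all illustrative, not from this PR):

```python
# tests/conftest.py (hypothetical location)
from transformers import AutoModelForCausalLM

MICRO_REPO = "ibm-ai-platform/micro-g3.3-8b-instruct-1b"
MICRO_REVISION = "<commit-sha-of-3-layer-revision>"  # single source of truth

def load_micro_model():
    # Every test goes through this helper, so no call site can fall back
    # to the repo's moving `main` revision.
    return AutoModelForCausalLM.from_pretrained(MICRO_REPO, revision=MICRO_REVISION)
```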

@prashantgupta24 (Collaborator, Author) commented:

Ideally the fix would be that we delete the old cache, populate it with the 3-layer revision, and then merge this PR, right?
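The local HF-cache analogue of that delete-and-repopulate procedure could look like the sketch below (the pinned SHA is a placeholder; the GitHub Actions cache entry itself would still have to be deleted separately):

```python
from huggingface_hub import scan_cache_dir, snapshot_download

REPO = "ibm-ai-platform/micro-g3.3-8b-instruct-1b"
KEEP = "<commit-sha-of-3-layer-revision>"  # placeholder pin

cache = scan_cache_dir()
stale = [
    rev.commit_hash
    for repo in cache.repos if repo.repo_id == REPO
    for rev in repo.revisions if rev.commit_hash != KEEP
]
if stale:
    # Build and execute a deletion plan for every revision except the pin.
    cache.delete_revisions(*stale).execute()

# Repopulate with only the 3-layer revision before saving a fresh CI cache.
snapshot_download(repo_id=REPO, revision=KEEP)
```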

@ckadner (Collaborator) commented on Oct 2, 2025:

I think it needs more code/test changes to make sure we don't use the latest/main revision of the tiny granite model anymore/anywhere.

Kicking off a test run here that is not also still using the old cached model: #502

@ckadner (Collaborator) commented on Oct 2, 2025:

Yup:

    E   huggingface_hub.errors.LocalEntryNotFoundError: Cannot find an appropriate cached snapshot folder for the specified revision on the local disk and outgoing traffic has been disabled. To enable repo look-ups and downloads online, pass 'local_files_only=False' as input.

    head_call_error = OfflineModeIsEnabled("Cannot access file since 'local_files_only=True' as been set. (repo_id: ibm-ai-platform/micro-g3.3-8b-instruct-1b, repo_type: model, revision: main, filename: config.json)") force_download = False, local_files_only = True

Also, we need to not use the same revision key to download the FP8 model 🙄

    ValueError: Unrecognized model in ibm-ai-platform/micro-g3.3-8b-instruct-1b-FP8
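Since commit SHAs are per repository, the 1b repo's revision pin cannot be reused for the FP8 repo; each repo would need its own pin, roughly as sketched below (the SHAs are placeholders):

```python
from huggingface_hub import snapshot_download

# Commit SHAs are per-repository, so each repo needs its own pin; reusing the
# 1b repo's SHA for the FP8 repo cannot resolve.
PINNED_REVISIONS = {
    "ibm-ai-platform/micro-g3.3-8b-instruct-1b": "<sha-for-1b-repo>",
    "ibm-ai-platform/micro-g3.3-8b-instruct-1b-FP8": "<sha-for-fp8-repo>",
}

for repo_id, revision in PINNED_REVISIONS.items():
    snapshot_download(repo_id=repo_id, revision=revision)
```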

