Workaround for Assertion error when embedding with bge-m3 in lazy mode #2093

slokesha · 2025-10-28T04:02:51Z

Added Workaround for Assertion error when embedding with bge-m3 in lazy mode.

For RoBERTa models, the position_ids are recalculated using the function create_position_ids_from_input_ids_hpu().
However, due to a known issue on HPU where modifying an already allocated tensor in-place can lead to invalid or corrupted values, this workaround precomputes position_ids on the CPU using the corresponding input_ids.
The computed position_ids are then transferred to the HPU within hpu_model_runner to ensure correctness and avoid in-place modification issues.

Signed-off-by: slokesha <[email protected]>

Workaround for Assertion error when embedding with bge-m3 in lazy mode

0dc8cac

Signed-off-by: slokesha <[email protected]>

slokesha requested review from PatrykWo, afierka-intel, jikunshang, kzawora-intel, madamczyk-intel, mgawarkiewicz-intel, michalkuligowski, mswiniarsk, vivekgoe and xuechendi as code owners October 28, 2025 04:02

Precommit Fix

f34ca0d

Signed-off-by: slokesha <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Workaround for Assertion error when embedding with bge-m3 in lazy mode #2093

Workaround for Assertion error when embedding with bge-m3 in lazy mode #2093

slokesha commented Oct 28, 2025 •

edited by github-actions bot

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Workaround for Assertion error when embedding with bge-m3 in lazy mode #2093

Are you sure you want to change the base?

Workaround for Assertion error when embedding with bge-m3 in lazy mode #2093

Conversation

slokesha commented Oct 28, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

slokesha commented Oct 28, 2025 •

edited by github-actions bot

Loading