fix: LazyMemoryExec should produce independent streams per execute() by viirya · Pull Request #21565 · apache/datafusion

viirya · 2026-04-12T06:17:48Z

Which issue does this PR close?

Closes #.

Rationale for this change

LazyMemoryExec::execute() shares the same generator instance across multiple calls via Arc::clone, so a second call to execute(0) continues from where the first left off instead of starting from the beginning. This is inconsistent with other ExecutionPlan implementations, where each execute() call produces an independent stream. The bug was discovered while writing e2e tests for NestedLoopJoinExec memory-limited execution (#21448): the OOM fallback path re-executes the left child plan and received incomplete results.

What changes are included in this PR?

LazyMemoryExec::execute() was sharing the same generator instance (via Arc::clone) across multiple calls, causing streams to share mutable state. This meant a second call to execute(0) would continue from where the first call left off, instead of starting from the beginning.

Fix by calling reset_state() on the generator to create a fresh instance for each execute() call, matching the expected ExecutionPlan semantics that each execute() produces an independent stream.
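
To illustrate the difference between the shared and the reset behavior, here is a minimal, dependency-free sketch. It is not the DataFusion code: `Counter`, `Plan`, `Stream`, `execute_shared`, and `execute_independent` are hypothetical names invented for this example, and a `Mutex` around a simple counter stands in for the real generator. Only the idea (clone the `Arc` versus build a fresh instance through `reset_state()`) is taken from the description above.

```rust
use std::sync::{Arc, Mutex};

/// Hypothetical stand-in for a lazy generator: yields 0, 1, ..., limit - 1.
struct Counter {
    next: u32,
    limit: u32,
}

impl Counter {
    fn new(limit: u32) -> Self {
        Counter { next: 0, limit }
    }

    /// Produce the next value, or None once the generator is exhausted.
    fn next_value(&mut self) -> Option<u32> {
        if self.next < self.limit {
            let value = self.next;
            self.next += 1;
            Some(value)
        } else {
            None
        }
    }

    /// Build a fresh generator with the same configuration, rewound to the start.
    fn reset_state(&self) -> Arc<Mutex<Counter>> {
        Arc::new(Mutex::new(Counter::new(self.limit)))
    }
}

/// Hypothetical stand-in for a stream: a handle to some generator.
struct Stream {
    generator: Arc<Mutex<Counter>>,
}

impl Stream {
    /// Pull up to `n` values from the underlying generator.
    fn take(&mut self, n: usize) -> Vec<u32> {
        let mut generator = self.generator.lock().unwrap();
        let mut out = Vec::new();
        for _ in 0..n {
            match generator.next_value() {
                Some(value) => out.push(value),
                None => break,
            }
        }
        out
    }
}

/// Hypothetical stand-in for the plan node that owns the generator.
struct Plan {
    generator: Arc<Mutex<Counter>>,
}

impl Plan {
    fn new(limit: u32) -> Self {
        Plan { generator: Arc::new(Mutex::new(Counter::new(limit))) }
    }

    /// Problematic variant: every stream points at the same mutable generator.
    fn execute_shared(&self) -> Stream {
        Stream { generator: Arc::clone(&self.generator) }
    }

    /// Fixed variant: every stream gets its own freshly reset generator.
    fn execute_independent(&self) -> Stream {
        let fresh = self.generator.lock().unwrap().reset_state();
        Stream { generator: fresh }
    }
}
```

In this sketch, if a first stream from `execute_shared()` takes two values from a plan with a limit of 4, a second stream from `execute_shared()` only sees the remaining two. With `execute_independent()`, the second stream starts again from the first value regardless of how far the first stream has advanced.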

Are these changes tested?

Yes, with a unit test.

Are there any user-facing changes?

No

LazyMemoryExec::execute() was sharing the same generator instance (via Arc::clone) across multiple calls, causing streams to share mutable state. This meant a second call to execute(0) would continue from where the first call left off, instead of starting from the beginning.

Fix by calling reset_state() on the generator to create a fresh instance for each execute() call, matching the expected ExecutionPlan semantics that each execute() produces an independent stream.

Co-authored-by: Isaac

2010YOUY01

LGTM, thank you!

viirya · 2026-04-14T02:38:48Z

Thanks @2010YOUY01

github-actions bot added the physical-plan Changes to the physical-plan crate label Apr 12, 2026

viirya mentioned this pull request Apr 12, 2026

feat: Add memory-limited execution for NestedLoopJoinExec #21448

Merged

2010YOUY01 approved these changes Apr 14, 2026

viirya added this pull request to the merge queue Apr 14, 2026

Merged via the queue into apache:main with commit f1c643a Apr 14, 2026
36 checks passed

viirya deleted the fix-lazy-memory-exec-shared-state branch April 14, 2026 02:40

viirya merged 1 commit into apache:main from viirya:fix-lazy-memory-exec-shared-state
