Skip to content

Conversation

@mesejo
Copy link
Contributor

@mesejo mesejo commented Dec 10, 2025

Which issue does this PR close?

Closes #1305.

What changes are included in this PR?

Use of coalesce to get the non-null key values in the keys in the join, similar to what is done in pyarrow.

Are there any user-facing changes?

The parameter name has changed from drop_duplicate_keys to coalesce_duplicate_keys but drop_duplicate_keys is not an a released version, so this is a new addition.

@mesejo mesejo force-pushed the fix/coalesce_mutual_keys branch from 7f9369d to d0ecc60 Compare December 10, 2025 22:42
@mesejo mesejo force-pushed the fix/coalesce_mutual_keys branch from d0ecc60 to 9f5fa20 Compare December 17, 2025 12:25
@mesejo mesejo marked this pull request as ready for review December 17, 2025 12:26
@timsaucer
Copy link
Member

I updated the description because I don't think this is a breaking change since the drop_duplicate_keys wasn't released.

Copy link
Member

@timsaucer timsaucer left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Excellent addition. Thank you!

@timsaucer timsaucer merged commit 474e9e6 into apache:main Jan 5, 2026
17 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Full join on dataframe with only index yields dropped rows

2 participants