CNDB-15485: Fix PrimaryKeyWithSource comparisons #2027

pkolaczk · 2025-09-30T11:20:43Z

We should not take the quick path and compare rowIds
between keys with no clustering, because rowIds may
differ for the keys within the same partition.

…sult set

We should not take the quick path and compare rowIds between keys with no clustering, because rowIds may differ for the keys within the same partition.

github-actions · 2025-09-30T11:20:59Z

eolivelli · 2025-09-30T11:29:13Z

src/java/org/apache/cassandra/index/sai/disk/PrimaryKeyWithSource.java

+        // This optimisation is valid only when both primary keys have clustering components.
+        // We must not compare by rowId when the clustering is empty, because two keys
+        // from the same partition may have different rowIds, yet they should be considered equal in that case.
+        if (o instanceof PrimaryKeyWithSource && !hasEmptyClustering() && !o.hasEmptyClustering())


do we have unit tests about this class PrimaryKeyWithSource ?

if we have no tests I think that we should add them, so that we can seal this behavior and prevent regressions

Good idea to add them, yes, I'll do

sonarqubecloud · 2025-09-30T12:01:07Z

Quality Gate passed

Issues
0 New issues
0 Accepted issues

Measures
0 Security Hotspots
85.2% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube Cloud

cassci-bot · 2025-09-30T12:05:04Z

❌ Build ds-cassandra-pr-gate/PR-2027 rejected by Butler

5 regressions found
See build details here

Found 5 new test failures

Test	Explanation	Runs
o.a.c.db.counters.CounterLockManagerTest.testInterruptedExceptionCachedCounterLockManager (compression)	<span title="The test is NOT RUN on upstream but has a SINGLE FAILURE on feature:
• Feature branch: a SINGLE FAILURE in 1 builds (1 to 1)
• Upstream branch: NOT RUN at all in 0 builds (na to na)">NEW	🔴	0 / 0
o.a.c.index.sai.cql.VectorSiftSmallTest.testSiftSmall (compression)	<span title="The test is NOT RUN on upstream but has a SINGLE FAILURE on feature:
• Feature branch: a SINGLE FAILURE in 1 builds (1 to 1)
• Upstream branch: NOT RUN at all in 0 builds (na to na)">NEW	🔴	0 / 0
o.a.c.index.sai.cql.datamodels.QueryCellDeletionsWithCompoundKeyWithStaticsTest.testCellDeletions[aa] (compression)	<span title="The test is NOT RUN on upstream but has a SINGLE FAILURE on feature:
• Feature branch: a SINGLE FAILURE in 1 builds (1 to 1)
• Upstream branch: NOT RUN at all in 0 builds (na to na)">NEW	🔴	0 / 0
o.a.c.index.sai.cql.datamodels.TinySegmentQueryCellDeletionsWithCompoundKeyWithStaticsTest.testCellDeletions[aa] (compression)	<span title="The test is NOT RUN on upstream but has a SINGLE FAILURE on feature:
• Feature branch: a SINGLE FAILURE in 1 builds (1 to 1)
• Upstream branch: NOT RUN at all in 0 builds (na to na)">NEW	🔴	0 / 0
o.a.c.metrics.TrieMemtableMetricsTest.testContentionMetrics (compression)	<span title="The test is NOT RUN on upstream but has a SINGLE FAILURE on feature:
• Feature branch: a SINGLE FAILURE in 1 builds (1 to 1)
• Upstream branch: NOT RUN at all in 0 builds (na to na)">NEW	🔴	0 / 0

No known test failures found

pkolaczk · 2025-09-30T13:29:34Z

Putting this on hold, as apparently there are some parts of the code that rely on ordering of PrimaryKeys by their rowId and this breaks some queries involving static rows.

michaeljmarshall · 2025-09-30T14:18:27Z

src/java/org/apache/cassandra/index/sai/disk/PrimaryKeyWithSource.java

+        // This optimisation is valid only when both primary keys have clustering components.
+        // We must not compare by rowId when the clustering is empty, because two keys
+        // from the same partition may have different rowIds, yet they should be considered equal in that case.
+        if (o instanceof PrimaryKeyWithSource && !hasEmptyClustering())


We need a more nuanced implementation. The point of PrimaryKeyWithSource is to avoid calling loadDeferred(), but this does just that. I think we should consider moving PrimaryKeyWithSource logic into individual PrimaryKeyMap implementations, and then we can do checks on the schema when we load the map, not on each key. For example, we would take the slow path if there is a static column and the fast path otherwise. This also makes it easier to differentiate between partition aware and row aware keys.

michaeljmarshall and others added 2 commits September 29, 2025 14:55

CNDB-15485: Fix ResultRetriever key comparison to prevent dupes in re…

ada025c

…sult set

CNDB-15485: Fix PrimaryKeyWithSource comparisons

0f2f4ee

We should not take the quick path and compare rowIds between keys with no clustering, because rowIds may differ for the keys within the same partition.

pkolaczk mentioned this pull request Sep 30, 2025

CNDB-15485: Fix ResultRetriever key comparison to prevent dupes in result set #2023

Merged

eolivelli reviewed Sep 30, 2025

View reviewed changes

adelapena approved these changes Sep 30, 2025

View reviewed changes

pkolaczk marked this pull request as draft September 30, 2025 13:28

michaeljmarshall reviewed Sep 30, 2025

View reviewed changes

Base automatically changed from cndb-15485 to cndb-main-release-202505 September 30, 2025 20:10

pkolaczk mentioned this pull request Oct 7, 2025

CNDB-15570 Fix issues with duplicated result sets in SAI #2037

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

CNDB-15485: Fix PrimaryKeyWithSource comparisons #2027

CNDB-15485: Fix PrimaryKeyWithSource comparisons #2027

Uh oh!

pkolaczk commented Sep 30, 2025

Uh oh!

github-actions bot commented Sep 30, 2025

Uh oh!

eolivelli Sep 30, 2025

Uh oh!

pkolaczk Sep 30, 2025

Uh oh!

sonarqubecloud bot commented Sep 30, 2025

Uh oh!

cassci-bot commented Sep 30, 2025

Uh oh!

pkolaczk commented Sep 30, 2025

Uh oh!

michaeljmarshall Sep 30, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

CNDB-15485: Fix PrimaryKeyWithSource comparisons #2027

Are you sure you want to change the base?

CNDB-15485: Fix PrimaryKeyWithSource comparisons #2027

Uh oh!

Conversation

pkolaczk commented Sep 30, 2025

Uh oh!

github-actions bot commented Sep 30, 2025

Checklist before you submit for review

Uh oh!

eolivelli Sep 30, 2025

Choose a reason for hiding this comment

Uh oh!

pkolaczk Sep 30, 2025

Choose a reason for hiding this comment

Uh oh!

sonarqubecloud bot commented Sep 30, 2025

Quality Gate passed

Uh oh!

cassci-bot commented Sep 30, 2025

❌ Build ds-cassandra-pr-gate/PR-2027 rejected by Butler

Found 5 new test failures

No known test failures found

Uh oh!

pkolaczk commented Sep 30, 2025

Uh oh!

michaeljmarshall Sep 30, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants