Skip to content

Conversation

@szymon-miezal
Copy link

@szymon-miezal szymon-miezal commented Mar 3, 2025

What is the issue

Node crashes during node replacements result in hibernated nodes that cannot join the cluster anymore due to lack of SYN messages from seeds.

What does this PR fix and why was it fixed

Port DB-1482 which allows to use a jmx endpoint on a seed to bring the hibernated node back to the gossip candidate list.

Tested via: datastax/cassandra-dtest#75.

https://jenkins-stargazer.aws.dsinternal.org/job/ds-cassandra-pr-gate/view/change-requests/job/PR-1617/3/testReport/dtest-offheap-bti.replace_address_test/TestReplaceAddress/tests_stage_2___dtests___dtest_offheap_bti___dtest_offheap_bti_14___test_revive_endpoint/

@github-actions
Copy link

github-actions bot commented Mar 3, 2025

Checklist before you submit for review

  • Make sure there is a PR in the CNDB project updating the Converged Cassandra version
  • Use NoSpamLogger for log lines that may appear frequently in the logs
  • Verify test results on Butler
  • Test coverage for new/modified code is > 80%
  • Proper code formatting
  • Proper title for each commit staring with the project-issue number, like CNDB-1234
  • Each commit has a meaningful description
  • Each commit is not very long and contains related changes
  • Renames, moves and reformatting are in distinct commits

@szymon-miezal szymon-miezal changed the title [WIP] HCD-73 HCD-73: Add a jmx endpoint to change the node state in gossip Mar 6, 2025
@bereng
Copy link
Collaborator

bereng commented Mar 6, 2025

LGTM and I don't think we need a CNDB PR here as it's just adding some JMX methods. +1

@sonarqubecloud
Copy link

sonarqubecloud bot commented Mar 6, 2025

Quality Gate Failed Quality Gate failed

Failed conditions
10.7% Coverage on New Code (required ≥ 80%)

See analysis details on SonarQube Cloud

@cassci-bot
Copy link

✔️ Build ds-cassandra-pr-gate/PR-1617 approved by Butler


Approved by Butler
See build details here

@szymon-miezal
Copy link
Author

It seems that Sonar Qube complains about the lack of coverage as it fails to process the addition of a dtest 😞.

@szymon-miezal szymon-miezal merged commit a14cd51 into main Mar 6, 2025
467 of 476 checks passed
@szymon-miezal szymon-miezal deleted the HCD-73 branch March 6, 2025 11:38
szymon-miezal added a commit that referenced this pull request Mar 6, 2025
### What is the issue
Node crashes during node replacements result in hibernated nodes that
cannot join the cluster anymore due to a lack of SYN messages from seeds.

### What does this PR fix and why was it fixed
Port DB-1482, which allows the use a jmx endpoint on a seed to bring the
hibernated node back to the gossiping candidate list.

Tested via: datastax/cassandra-dtest#75.
djatnieks pushed a commit that referenced this pull request Mar 11, 2025
Node crashes during node replacements result in hibernated nodes that
cannot join the cluster anymore due to a lack of SYN messages from seeds.

Port DB-1482, which allows the use a jmx endpoint on a seed to bring the
hibernated node back to the gossiping candidate list.

Tested via: datastax/cassandra-dtest#75.
djatnieks pushed a commit that referenced this pull request May 18, 2025
Node crashes during node replacements result in hibernated nodes that
cannot join the cluster anymore due to a lack of SYN messages from seeds.

Port DB-1482, which allows the use a jmx endpoint on a seed to bring the
hibernated node back to the gossiping candidate list.

Tested via: datastax/cassandra-dtest#75.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants