Skip to content

Conversation

@MeredithAnya
Copy link
Member

Same schema as in #338 but changes the naming convention from search-issues to generic-events. While search issues is the naming convention for the storage in ClickHouse, the other topics for search issues are named using generic-events, so changing for consistency.

@MeredithAnya MeredithAnya requested review from a team as code owners November 4, 2024 18:17
@MeredithAnya MeredithAnya merged commit 55bca62 into main Nov 4, 2024
16 checks passed
@MeredithAnya MeredithAnya deleted the meredith/generic-events-deletions branch November 4, 2024 18:24
MeredithAnya added a commit to getsentry/snuba that referenced this pull request Nov 4, 2024
Updating for consistency with naming conventions, will need
getsentry/sentry-kafka-schemas#347 before we can
merge this PR (and update it with the right version)
MeredithAnya added a commit to getsentry/snuba that referenced this pull request Nov 8, 2024
**context**:
The work for bulk deleting in snuba has so far included the following:
* Added the kafka schemas -
getsentry/sentry-kafka-schemas#347
* Adding the new endpoint including producing to the topic -
#6440
* Created the topic in production for all env -
getsentry/ops#12711

**what's left**:
Now that we have the topics created we can finish up the consumer side.
- [ ] Add the consumer logic to snuba (This PR)
- [ ] Add the consumer deployment to S4S region in the ops repository
- [ ] Set up datadog alerts/metrics and other observability

**this PR**:
It's a bit of a larger PR but it can be reviewed in a couple sections:
* The main consumer logic and strategy
* This has the logic to create the strategy factory for the consumer and
composes the strategy steps. The `strategy.py` file has the details for
actually executing the delete query
* The `batching.py` file - I have an [arroyo
PR](getsentry/arroyo#390) that makes this file
obsolete but in the meantime I don't think it needs to block this PR
since it will be easy to remove after
* The formatters are going to be the only code that someone will need to
write in the future when deploying the deletions consumer for a
different storage. How one formats the conditions for the `DELETE` query
is up to that logic.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants