Enhancement for feedback scrape type #161
Conversation
> ### Risks and Mitigation
>
> What are the risks of this proposal and how do we mitigate them? Think broadly. For
I think one of the risks we have to consider is that there might be too many watch events, and there could be too many update calls to the ManifestWork.
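One common way to bound the update calls is to coalesce bursts of watch events behind a dirty flag and flush at most once per interval. The sketch below is illustrative only (the `Coalescer` type and its methods are not part of the agent), assuming a periodic flush loop stands in for the actual status-patch path:

```go
package main

import (
	"fmt"
	"sync"
	"time"
)

// Coalescer collapses bursts of watch events into at most one
// flush per tick, bounding update calls to the ManifestWork.
// All names here are illustrative, not from the actual controller.
type Coalescer struct {
	mu      sync.Mutex
	dirty   bool
	flushes int
}

// Notify records that a watch event arrived; the flush loop will
// pick it up on the next tick.
func (c *Coalescer) Notify() {
	c.mu.Lock()
	c.dirty = true
	c.mu.Unlock()
}

// FlushLoop runs n ticks of the given interval, performing one
// "status update" per tick only if events arrived since the last one.
func (c *Coalescer) FlushLoop(interval time.Duration, n int) {
	for i := 0; i < n; i++ {
		time.Sleep(interval)
		c.mu.Lock()
		if c.dirty {
			c.dirty = false
			c.flushes++ // in the agent this would be one PATCH call
		}
		c.mu.Unlock()
	}
}

func main() {
	c := &Coalescer{}
	// 100 rapid watch events...
	for i := 0; i < 100; i++ {
		c.Notify()
	}
	// ...collapse into a single flush on the next tick.
	c.FlushLoop(10*time.Millisecond, 2)
	fmt.Println(c.flushes)
}
```

The trade-off is latency: intermediate states inside one interval are not individually patched, only the latest observed state.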
Added some more details to this section on how we can limit the updates to the ManifestWork object.
I still need to think through a few more implementation details (e.g. how MWRS can verify whether the resource is Ready so it can stop the informer from watching).
It might also be worth having some metrics defined, e.g. the number of started informers.
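A started-informers gauge could look like the following sketch. In practice the agent would likely use a Prometheus gauge; `expvar` is used here only to keep the example dependency-free, and the metric name is a made-up placeholder:

```go
package main

import (
	"expvar"
	"fmt"
)

// Gauge tracking currently-running dynamic informers. The metric
// name is illustrative, not an existing agent metric.
var startedInformers = expvar.NewInt("work_agent_started_informers")

func informerStarted() { startedInformers.Add(1) }
func informerStopped() { startedInformers.Add(-1) }

func main() {
	informerStarted()
	informerStarted()
	informerStopped()
	fmt.Println(startedInformers.Value())
}
```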
> In the ManifestWork Controller (pkg/work/spoke/controllers/statuscontroller/availablestatus_controller.go)
> introduce a watch-based path alongside the existing poll loop. When syncing the ManifestWork, register an informer for the resource if `feedbackScrapeType` is WATCH.
>
> When there is a change seen on the WATCH type, patch the status conditions for that resource.
There are some technical details we need to consider. For instance, a per-resource watch might be too heavy; we might want a per-resource-type informer maintained in an informer pool. When a ManifestWork wants to watch a certain resource, the agent registers the resource with the pool, incrementing a reference count for that resource type. When a resource is removed from the ManifestWork, or deleted, it should be unregistered from the pool.
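A minimal sketch of such a reference-counted pool, assuming hypothetical start/stop hooks in place of wiring up a real dynamic informer (none of these names are the actual agent API):

```go
package main

import (
	"fmt"
	"sync"
)

// InformerPool maintains one shared informer per resource type,
// reference-counted by the ManifestWorks that need it.
// All names are illustrative, not the actual agent API.
type InformerPool struct {
	mu    sync.Mutex
	refs  map[string]int    // key: resource type, e.g. "apps/v1/deployments"
	stop  map[string]func() // stop hook per running informer
	start func(resourceType string) func()
}

func NewInformerPool(start func(string) func()) *InformerPool {
	return &InformerPool{
		refs:  map[string]int{},
		stop:  map[string]func(){},
		start: start,
	}
}

// Register increments the reference count for a resource type,
// starting a shared informer on first use.
func (p *InformerPool) Register(rt string) {
	p.mu.Lock()
	defer p.mu.Unlock()
	p.refs[rt]++
	if p.refs[rt] == 1 {
		p.stop[rt] = p.start(rt)
	}
}

// Unregister decrements the count and stops the informer when the
// last ManifestWork referencing the type goes away.
func (p *InformerPool) Unregister(rt string) {
	p.mu.Lock()
	defer p.mu.Unlock()
	if p.refs[rt] == 0 {
		return
	}
	p.refs[rt]--
	if p.refs[rt] == 0 {
		p.stop[rt]()
		delete(p.stop, rt)
		delete(p.refs, rt)
	}
}

func main() {
	running := map[string]bool{}
	pool := NewInformerPool(func(rt string) func() {
		running[rt] = true
		return func() { delete(running, rt) }
	})

	pool.Register("apps/v1/deployments")   // first mw starts the informer
	pool.Register("apps/v1/deployments")   // second mw shares it
	pool.Unregister("apps/v1/deployments") // one reference remains
	fmt.Println(running["apps/v1/deployments"])
	pool.Unregister("apps/v1/deployments") // last reference gone
	fmt.Println(running["apps/v1/deployments"])
}
```

This mirrors how a shared informer factory avoids duplicate watches, but adds explicit teardown once nothing references a type.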
So instead of creating a dynamic informer per resource (with name, namespace, etc.), are you suggesting we create an informer that is generalized to the resource type?
Can you share whether there is existing functionality in informer factories that supports the behavior you mention of registering with the pool and incrementing a reference count?
I do not think there is; I can build some demo code for that. I think a dynamic informer per resource/name/namespace might be too many; a dynamic informer per resource/namespace is probably good enough. Just something that needs to be considered when we implement it.
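The granularity question boils down to how the pool key is scoped. A tiny sketch, assuming a made-up key format, showing the per-resource/namespace middle ground (finer than one cluster-wide informer per type, coarser than one per named object):

```go
package main

import "fmt"

// poolKey scopes a shared informer per resource type and namespace.
// The key format is purely illustrative.
func poolKey(group, version, resource, namespace string) string {
	return fmt.Sprintf("%s/%s/%s|%s", group, version, resource, namespace)
}

func main() {
	// Two Deployments in the same namespace share one informer key...
	a := poolKey("apps", "v1", "deployments", "team-a")
	b := poolKey("apps", "v1", "deployments", "team-a")
	// ...while the same type in another namespace gets its own.
	c := poolKey("apps", "v1", "deployments", "team-b")
	fmt.Println(a == b, a == c)
}
```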
[APPROVALNOTIFIER] This PR is APPROVED. This pull-request has been approved by: annelaucg, qiujian16. The full list of commands accepted by this bot can be found here. The pull request process is described here.
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing
> In the ManifestWork Controller (pkg/work/spoke/controllers/statuscontroller/availablestatus_controller.go)
> introduce a watch-based path alongside the existing poll loop.
>
> When syncing the ManifestWork, register an informer for the resource if `feedbackScrapeType` is WATCH. If the `feedbackScrapeType` is no longer WATCH, unregister the informer. If the ManifestWork is in a Ready state (Progressing == False and Degraded == False / NotSet), unregister the informer. This will prevent a long-running WATCH from triggering too many ManifestWork changes. If there is a new rollout and the status changes to Progressing, register the informer again.
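The register/unregister decision described above can be sketched as a pure predicate over the scrape type and status conditions. This is a simplified illustration, not the actual controller code; the function name, condition handling, and treatment of Unknown status are all assumptions:

```go
package main

import "fmt"

// Condition mirrors the shape of a status condition; only the
// fields needed for this sketch are included.
type Condition struct {
	Type   string
	Status string // "True", "False", or "Unknown"
}

// shouldWatch reports whether an informer should stay registered
// for a manifest: WATCH must be requested, and the work must not
// yet be Ready (per the proposal, Ready means Progressing == False
// and Degraded == False or not set). Illustrative only.
func shouldWatch(scrapeType string, conds []Condition) bool {
	if scrapeType != "WATCH" {
		return false
	}
	for _, c := range conds {
		if (c.Type == "Progressing" || c.Type == "Degraded") && c.Status == "True" {
			return true // still rolling out or unhealthy: keep watching
		}
	}
	return false // Ready: unregister to avoid long-running watch churn
}

func main() {
	rollingOut := []Condition{{Type: "Progressing", Status: "True"}}
	ready := []Condition{{Type: "Progressing", Status: "False"}}
	fmt.Println(shouldWatch("WATCH", rollingOut)) // keep registered
	fmt.Println(shouldWatch("WATCH", ready))      // unregister
	fmt.Println(shouldWatch("JSONPaths", rollingOut))
}
```

Evaluating this predicate on every ManifestWork sync gives the re-register-on-new-rollout behavior for free: a new rollout flips Progressing back to True, which flips the predicate back to true.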
Could you explain more details about the register/unregister informer part? I have a concern that in some cases, even with this watch informer, we might still not observe the whole status change process of the resource, since it may take several seconds for the informer cache to sync when starting the informer. For example, if the replica number changes from 1->2->3 in 5 seconds, and waiting for the deployment informer cache to sync takes 6 seconds, the result we see will still be 1->3.
Are you thinking of the situation on startup, or in general while the informer is running? My expectation is that the informer should see every change (which I think is the guarantee of the informer) and that the ManifestWork feedbackStatus will get updated with that change. And when the ManifestWork statusFeedback updates, we will propagate this upwards to the MWRS controller.
From my understanding, which step above will be missed because it takes several seconds for the informer cache to sync?
The informer will be registered when a new rollout is started with a new MWRS.spec and observedGeneration for the ManifestWork. The informer will never get unregistered unless the feedbackScrapeType is removed from that resource. However, once the MWRS is marked as Ready, the informer will stop sending add/update/delete updates to the statusFeedback, to prevent triggering a resync at the MWRS controller level.