MTN: improve risk control tests #761

allglc · 2025-09-24T09:52:06Z

No description provided.

codecov-commenter · 2025-09-24T14:57:12Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 100.00%. Comparing base (26cf8bc) to head (8e35373).
⚠️ Report is 7 commits behind head on master.

Additional details and impacted files

@@            Coverage Diff             @@
##            master      #761    +/-   ##
==========================================
  Coverage   100.00%   100.00%            
==========================================
  Files           55        56     +1     
  Lines         6093      6325   +232     
  Branches       351       360     +9     
==========================================
+ Hits          6093      6325   +232

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

Valentin-Laurent

Looks pretty good! Review for the commit "add dummy constants". Haven't checked the following ones yet.
To push the constant thing further, we could have created a dummy_risk, but that's probably a bit overkill.

Valentin-Laurent · 2025-09-24T15:30:54Z

mapie/tests/risk_control/test_binary_classification_control.py

        "best_predict_param_choice, expected",
        [[precision, 0.5], [recall, 0.7]]
    )
    def test_correct_param_out_of_two(self, best_predict_param_choice, expected):


Regarding this test, maybe we can stick to numerical values everywhere (for param and target)? (the dummy_param and dummy_param_2 were misnamed). My point: here we actually care about the param value. In other words, if we change the dummy param/target values, the test may break.

Valentin-Laurent · 2025-09-24T15:36:19Z

mapie/tests/risk_control/test_binary_classification_control.py

        )

-        dummy_param = 0.5
        y_calibrate = np.array([1, 0])


We may use a dummy_y here

Valentin-Laurent · 2025-09-24T15:37:51Z

mapie/tests/risk_control/test_binary_classification_control.py

        dummy_predictions = np.array([[True, False]])
        valid_params_index = [0]
-        controller.valid_predict_params = np.array([dummy_param])
+        controller.valid_predict_params = dummy_param


Unsure about using dummy_param here, as it is not obvious that it is a singleton (and that's the point of the test)

Valentin-Laurent · 2025-09-24T15:39:08Z

mapie/tests/risk_control/test_binary_classification_control.py


+dummy_param = np.array([0.5])
+dummy_target = 0.9
+dummy_X_test = [[0]]


Could be renamed dummy_X for a more generic use?

Valentin-Laurent · 2025-09-24T15:41:03Z

mapie/tests/risk_control/test_binary_classification_control.py

        result = bcc_deterministic._get_predictions_per_param(
            X=[],
-            params=np.array([0.5])
+            params=dummy_param


Unsure about using dummy_param here, as it is not obvious that it is a singleton (and that's the point of the test). Maybe we can create a single_param constant.

Copilot

Pull Request Overview

This PR improves the organization and test coverage of risk control tests by separating binary classification control tests into their own dedicated test file and consolidating imports in the precision-recall control test file.

Move binary classification control tests from the precision-recall test file to a dedicated test file
Remove unused imports from the precision-recall control test file
Refactor test code to use common dummy variables and fixtures for better maintainability

Reviewed Changes

Copilot reviewed 2 out of 3 changed files in this pull request and generated 5 comments.

File	Description
`mapie/tests/risk_control/test_precision_recall_control.py`	Removes binary classification control tests and cleans up unused imports, keeping only precision-recall specific tests
`mapie/tests/risk_control/test_binary_classification_control.py`	New file containing all binary classification control tests with improved organization and shared dummy variables

_{Tip: Customize your code reviews with copilot-instructions.md. Create the file or learn how to get started.}

Copilot · 2025-09-25T07:57:13Z

mapie/tests/risk_control/test_binary_classification_control.py

+        y_calibrate = np.array([1, 0])
+        dummy_predictions = np.array([[True, False]])
+        valid_params_index = [0]
+        controller.valid_predict_params = dummy_param


The assignment is inconsistent with the expected array type. In the original code (line 808 in the diff), this was assigned as np.array([dummy_param]) but now it's assigned as just dummy_param. This will cause issues when the controller expects an array but receives a scalar value.

Copilot · 2025-09-25T07:57:13Z

mapie/tests/risk_control/test_binary_classification_control.py

+        valid_params_index = [0, 1]
+
+        controller.valid_predict_params = np.array(
+            [dummy_param, dummy_param + 0.2])


The array construction is incorrect. dummy_param is defined as np.array([0.5]) (line 20), so dummy_param + 0.2 results in np.array([0.7]). The expected values in the parametrized test are 0.5 and 0.7 (scalars), but this creates arrays [0.5] and [0.7]. This should be [dummy_param[0], dummy_param[0] + 0.2] or define dummy_param as a scalar.

Suggested change

[dummy_param, dummy_param + 0.2])

[dummy_param[0], dummy_param[0] + 0.2])

Copilot · 2025-09-25T07:57:13Z

mapie/tests/risk_control/test_binary_classification_control.py

+        predictions_per_param = np.array(
+            [[False, False]])  # precision undefined
+        valid_params_index = [0]
+        controller.valid_predict_params = dummy_param


Same issue as in line 218. The assignment should be np.array([dummy_param]) to maintain consistency with the expected array type, or dummy_param should be defined as a scalar if that's the intended usage pattern.

Copilot · 2025-09-25T07:57:14Z

mapie/tests/risk_control/test_binary_classification_control.py

+            valid_params_index=valid_params_index
+        )
+
+        assert controller.best_predict_param == dummy_param


This assertion will fail because dummy_param is an array [0.5] but best_predict_param is expected to be a scalar 0.5. The assertion should compare with dummy_param[0] or dummy_param should be defined as a scalar.

Copilot · 2025-09-25T07:57:14Z

mapie/tests/risk_control/test_binary_classification_control.py

+            predictions_per_param=predictions_per_param,
+            valid_params_index=valid_params_index
+        )
+        assert controller.best_predict_param == dummy_param


Same issue as in line 226. The assertion compares an array with what should be a scalar value. Should be dummy_param[0] or redefine dummy_param as a scalar.

Suggested change

assert controller.best_predict_param == dummy_param

assert controller.best_predict_param == dummy_param[0]

allglc added 6 commits September 23, 2025 17:28

move tests to folder

42b6a73

split risk control tests into binary_classif and precision_recall

47199e5

dummy BCC fixture, and dummy predict function

dbc0692

add dummy constants

1f6807b

reorganize: dummies and constants on top

64585e9

fix coverage issue: dummy_predict never called

4524c69

Valentin-Laurent reviewed Sep 24, 2025

View reviewed changes

Valentin-Laurent requested a review from Copilot September 25, 2025 07:55

Copilot AI reviewed Sep 25, 2025

View reviewed changes

minor modifs

8e35373

allglc marked this pull request as ready for review September 25, 2025 09:48

Valentin-Laurent approved these changes Sep 25, 2025

View reviewed changes

allglc merged commit 2b19896 into master Sep 25, 2025
6 checks passed

allglc deleted the improve-risk-control-tests branch September 25, 2025 14:49

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

MTN: improve risk control tests #761

MTN: improve risk control tests #761

allglc commented Sep 24, 2025

Uh oh!

codecov-commenter commented Sep 24, 2025 •

edited

Loading

Uh oh!

Valentin-Laurent left a comment

Uh oh!

Valentin-Laurent Sep 24, 2025

Uh oh!

Valentin-Laurent Sep 24, 2025

Uh oh!

Valentin-Laurent Sep 24, 2025

Uh oh!

Valentin-Laurent Sep 24, 2025

Uh oh!

Valentin-Laurent Sep 24, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Sep 25, 2025

Uh oh!

Copilot AI Sep 25, 2025

Uh oh!

Copilot AI Sep 25, 2025

Uh oh!

Copilot AI Sep 25, 2025

Uh oh!

Copilot AI Sep 25, 2025

Uh oh!

Uh oh!

Uh oh!

	[dummy_param, dummy_param + 0.2])
	[dummy_param[0], dummy_param[0] + 0.2])

	assert controller.best_predict_param == dummy_param
	assert controller.best_predict_param == dummy_param[0]

MTN: improve risk control tests #761

MTN: improve risk control tests #761

Conversation

allglc commented Sep 24, 2025

Uh oh!

codecov-commenter commented Sep 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Valentin-Laurent left a comment

Choose a reason for hiding this comment

Uh oh!

Valentin-Laurent Sep 24, 2025

Choose a reason for hiding this comment

Uh oh!

Valentin-Laurent Sep 24, 2025

Choose a reason for hiding this comment

Uh oh!

Valentin-Laurent Sep 24, 2025

Choose a reason for hiding this comment

Uh oh!

Valentin-Laurent Sep 24, 2025

Choose a reason for hiding this comment

Uh oh!

Valentin-Laurent Sep 24, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Copilot AI Sep 25, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Sep 25, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Sep 25, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Sep 25, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Sep 25, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

codecov-commenter commented Sep 24, 2025 •

edited

Loading