fix: extract_boxed_answer returns full text when no \boxed{} found #1028

Open
chopratejas wants to merge 1 commit into PrimeIntellect-ai:main from chopratejas:fix/extract-boxed-fallback

Conversation

@chopratejas (Contributor) commented Mar 17, 2026

Problem

extract_boxed_answer() returns the entire input text when no \boxed{} tag is found. This text is then passed to math_verify, which extracts any number it can find — allowing a model to get correct-answer credit by mentioning the answer anywhere in its output without using \boxed{}.

Training impact

We ran rewardprobe simulate on the GSM8K MathRubric. The strategy scoreboard shows:

correct_lazy         ████████████████████ 1.00  ← just the answer, no reasoning
shortcut             ████████████████████ 1.00  ← skips computation  
perfect              █████████████░░░░░░░ 0.67  ← full reasoning + boxed answer

A model trained against this will learn to skip reasoning entirely — because correct_lazy (just outputting the number) scores higher than perfect (showing work in \boxed{}).

Fix

Add a strict parameter to extract_boxed_answer (default: False).

  • strict=False (default): returns full text on no match — backwards compatible, existing callers unaffected
  • strict=True: returns "" on no match — used by MathRubric to enforce \boxed{} format
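The patched helper could look like the following, a minimal sketch, assuming the `data_utils.py` signature described above (the real implementation may differ, e.g. in how it matches braces):

```python
def extract_boxed_answer(text: str, strict: bool = False) -> str:
    """Return the contents of the last \\boxed{...} in `text`.

    Hypothetical sketch of the patched helper. With no match, returns
    the full text when strict=False (legacy passthrough) or "" when
    strict=True (format enforcement for MathRubric).
    """
    start = text.rfind("\\boxed{")
    if start == -1:
        return "" if strict else text
    # Walk to the matching closing brace, tracking nesting so answers
    # like \boxed{\frac{1}{2}} are captured whole.
    depth = 0
    for i in range(start + 6, len(text)):
        if text[i] == "{":
            depth += 1
        elif text[i] == "}":
            depth -= 1
            if depth == 0:
                return text[start + 7 : i]
    return "" if strict else text  # unbalanced braces: same fallback
```

The brace-depth loop matters because math answers routinely contain nested braces; a naive regex up to the first `}` would truncate `\frac{1}{2}`.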

Changes

  1. data_utils.py: Add strict parameter with safe default
  2. math_rubric.py: MathRubric uses strict=True via functools.partial
  3. test_math_rubric.py: Tests updated to use \boxed{} format (reflecting correct behavior)
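The wiring in step 2 can be sketched as follows; the function body here is an illustrative stub, and only the `functools.partial` binding reflects the PR:

```python
from functools import partial

def extract_boxed_answer(text: str, strict: bool = False) -> str:
    """Stub with the patched signature; the real body lives in data_utils.py."""
    if "\\boxed{" not in text:
        return "" if strict else text
    return text[text.rfind("\\boxed{") + 7 : text.rfind("}")]

# MathRubric binds strict=True once, so every parse it performs enforces
# the format, while all other callers keep the lenient default.
parse_boxed_strict = partial(extract_boxed_answer, strict=True)
```

Binding via `partial` keeps the default signature untouched, which is what makes the change backwards compatible for callers like `rlm_env.py`.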

Addresses Cursor bot feedback

  • "Sub-LLM responses silently dropped in RLM environment": Fixed. strict defaults to False, so rlm_env.py and all other callers get the same passthrough behavior as before.
  • "Change breaks existing valid-answer tests": Fixed. Tests now use \boxed{} completions, which is the correct format MathRubric should require.

Found using rewardprobe, a pre-training QA tool for reward functions.

Problem:
When a completion contains no \boxed{} tag, extract_boxed_answer returns
the entire input text. This is passed to math_verify, which matches any
number in the text — allowing a model to get correct-answer credit by
mentioning the answer anywhere without using \boxed{}.

During RL training, this means a model can skip the \boxed{} format
entirely and still score 1.0 by embedding the correct number in its
reasoning text. The strategy scoreboard from rewardprobe shows the
impact: "correct_lazy" (just outputting the answer) scores 1.0, while
"perfect" (full reasoning + boxed answer) scores only 0.67.

Fix:
Add a `strict` parameter to extract_boxed_answer (default: False).
When strict=True, returns "" on no match instead of the full text.
MathRubric now uses strict=True via functools.partial.

This is backwards compatible:
- extract_boxed_answer(text) still returns text (default strict=False)
- Only MathRubric's parser uses strict=True
- Other callers (rlm_env.py, etc.) are unaffected
- Tests updated to use \boxed{} format in completions

Found using rewardprobe (https://github.com/chopratejas/rewardprobe).
@mikasenghaas (Member) left a comment

i agree that strict behavior here is prob desirable but will have to verify that this doesn't create problems in other consumers of this function

@chopratejas force-pushed the fix/extract-boxed-fallback branch from 9fbc510 to 4d0b28d on March 18, 2026 at 23:14
@cursor (bot) left a comment


Cursor Bugbot has reviewed your changes and found 2 potential issues.


{"completion": "\\frac{1}{2}", "answer": "0.5"},
{"completion": "\\boxed{1}", "answer": "1"},
{"completion": "\\boxed{x + 1}", "answer": "1 + x"},
{"completion": "\\boxed{\\frac{1}{2}}", "answer": "0.5"},

Timeout test breaks with strict boxed extraction

High Severity

The test_timeout test sets completion = "1" * int(1e5) without wrapping it in \boxed{}. Now that MathRubric defaults to strict=True, extract_boxed_answer returns "" for this completion, causing correct_answer to immediately return 0.0 via the if response == "" early exit. The assertion at line 124–125 expects 1.0 when timeout_seconds == 10, but will always get 0.0, so this test will fail.
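Assuming the test shape the bot quotes, a minimal repair is to keep the 1e5-character payload but wrap it in `\boxed{}` so strict extraction still succeeds (the extractor below is an illustrative stand-in, not the repository's implementation):

```python
# Hypothetical fix for test_timeout: the stress completion keeps its
# 1e5 characters but is wrapped in \boxed{} so the strict parser
# returns the digits instead of "".
digits = "1" * int(1e5)
completion = "\\boxed{" + digits + "}"

def extract_boxed_answer(text: str, strict: bool = True) -> str:
    # Minimal strict extractor used only for this sanity check.
    start = text.rfind("\\boxed{")
    if start == -1:
        return "" if strict else text
    end = text.find("}", start)
    return text[start + 7 : end] if end != -1 else ("" if strict else text)
```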


{"completion": "\\frac{1}{2}", "answer": "0.5"},
{"completion": "\\boxed{1}", "answer": "1"},
{"completion": "\\boxed{x + 1}", "answer": "1 + x"},
{"completion": "\\boxed{\\frac{1}{2}}", "answer": "0.5"},

Invalid answer tests no longer test wrong math

Medium Severity

The test_score_invalid_answers completions ("1" and "\\frac{1}{3}") lack \boxed{} wrapping. With the new strict extraction in MathRubric, these return "" and score 0.0 due to format rejection, not because of wrong math. These tests pass for the wrong reason and no longer validate that incorrect math answers are scored as 0.0. Tests like \boxed{1} vs answer "2" are needed to actually test wrong-answer scoring.
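Concretely, the suggested cases could look like this (hypothetical data, mirroring the dict shape shown above): well-formed `\boxed{}` completions whose value disagrees with the reference answer, so the test exercises wrong-math scoring rather than format rejection.

```python
# Each completion is correctly formatted but mathematically wrong,
# so a 0.0 score can only come from answer comparison, not from the
# strict parser rejecting the format.
wrong_math_cases = [
    {"completion": "\\boxed{1}", "answer": "2"},
    {"completion": "\\boxed{\\frac{1}{3}}", "answer": "0.5"},
]
```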

