feat: add huber loss #4684
Conversation
📝 Walkthrough
This pull request integrates a conditional Huber loss computation into multiple loss modules across different frameworks. A new function, `custom_huber_loss`, is added to each framework's energy loss module and is applied to the energy, force, virial, and atomic-energy terms when enabled.
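For orientation, here is a minimal, framework-agnostic sketch of such a Huber helper (NumPy-based; the actual per-framework implementations in this PR may differ in reduction and validation):

```python
import numpy as np


def custom_huber_loss(predictions, targets, delta=1.0):
    """Piecewise Huber loss: quadratic within ``delta``, linear beyond it (mean-reduced)."""
    error = targets - predictions
    abs_error = np.abs(error)
    quadratic = 0.5 * error**2
    linear = delta * (abs_error - 0.5 * delta)
    return np.mean(np.where(abs_error <= delta, quadratic, linear))


# Example: small errors are penalized quadratically, large ones only linearly.
pred = np.array([0.0, 0.0, 0.0])
target = np.array([0.005, 0.02, 1.0])
print(custom_huber_loss(pred, target, delta=0.01))
```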
Actionable comments posted: 0
🔭 Outside diff range comments (1)
source/tests/universal/dpmodel/loss/test_loss.py (1)
57-102: 💡 Verification agent · 🧩 Analysis chain
Consider extending Huber loss to Energy Spin loss
Since `LossParamEnergy` now supports Huber loss via the `use_huber` parameter, consider whether this functionality should also be added to `LossParamEnergySpin`. If Huber loss is generally beneficial for energy-related losses, it would make sense to implement it consistently across similar loss functions.
🏁 Script executed:
```bash
#!/bin/bash
# Check if Huber loss is implemented in the EnergySpinLoss class

# Search for EnergySpinLoss class implementation
rg "class EnergySpinLoss" -A 5

# Check if there's a spin-specific loss implementation that might need Huber loss
rg "def custom_huber_loss" -A 5
```
Length of output: 2124
Action: Consider Extending Huber Loss Support for Energy Spin Loss
After verifying the codebase, it appears that while custom Huber loss is implemented for various energy loss functions (as seen in files like `deepmd/tf/loss/ener.py`, `deepmd/pd/loss/ener.py`, and `deepmd/dpmodel/loss/ener.py`), the `EnergySpinLoss` class in `deepmd/pt/loss/ener_spin.py` does not incorporate Huber loss functionality. Given that `LossParamEnergy` now includes support for Huber loss via the `use_huber` parameter, it may be beneficial to align the Energy Spin loss implementation for consistency and potentially improved robustness.
Points for Consideration:
- Parameter Addition: Introduce a `use_huber` (or similar) parameter in `EnergySpinLoss` to enable Huber loss behavior.
- Functionality Parity: Leverage the existing `custom_huber_loss` implementations as a guideline for this extension.
- Test Updates: Update the tests in `source/tests/universal/dpmodel/loss/test_loss.py` to cover scenarios with and without Huber loss when using the Energy Spin loss variant.
🧹 Nitpick comments (12)
source/tests/universal/dpmodel/loss/test_loss.py (1)
17-17: Consider adding `huber_delta` parameter to tests
While you've added the `use_huber` parameter, I notice there's no corresponding `huber_delta` parameter in the test configuration, which was mentioned in other files. Consider adding this parameter to fully test the Huber loss functionality with different delta values, which controls the transition point between L1 and L2 loss.
```diff
-def LossParamEnergy(
-    starter_learning_rate=1.0,
-    pref_e=1.0,
-    pref_f=1.0,
-    pref_v=1.0,
-    pref_ae=1.0,
-    use_huber=False,
-):
+def LossParamEnergy(
+    starter_learning_rate=1.0,
+    pref_e=1.0,
+    pref_f=1.0,
+    pref_v=1.0,
+    pref_ae=1.0,
+    use_huber=False,
+    huber_delta=1.0,
+):
```
Then update the input_dict:
"start_pref_ae": pref_ae, "limit_pref_ae": pref_ae / 2, "use_huber": use_huber, + "huber_delta": huber_delta,And possibly add it to parameterization:
"pref_v": (1.0, 0.0), "pref_ae": (1.0, 0.0), "use_huber": (False, True), + "huber_delta": (1.0, 0.5),Also applies to: 36-36, 49-49
deepmd/pt/loss/ener.py (2)
26-32: Consider validating the `delta` parameter in the new `custom_huber_loss`.
While the implementation is standard Huber logic, it would be safer to raise an error if `delta < 0`. For instance:
```diff
 def custom_huber_loss(predictions, targets, delta=1.0):
+    if delta < 0:
+        raise ValueError("The Huber delta should be non-negative.")
     error = targets - predictions
     ...
```
214-221: Duplicate Huber vs. L2 logic across multiple loss branches.
Repeated `if not self.use_huber … else …` structures for energy, force, virial, and atom energy may be condensed into a helper function to reduce duplication. This could simplify maintenance and ensure consistency.
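One possible shape for such a helper in the PyTorch module (a sketch only; `_term_loss` and the exact reductions are illustrative, not code from the PR):

```python
import torch
import torch.nn.functional as F


def custom_huber_loss(predictions, targets, delta=1.0):
    # piecewise Huber loss with mean reduction, as described in this PR
    error = targets - predictions
    abs_error = torch.abs(error)
    quadratic = 0.5 * error**2
    linear = delta * (abs_error - 0.5 * delta)
    return torch.mean(torch.where(abs_error <= delta, quadratic, linear))


def _term_loss(pred, label, use_huber, huber_delta):
    """Select between plain L2 and Huber loss for a single loss term (illustrative helper)."""
    if not use_huber:
        return F.mse_loss(pred, label, reduction="mean")
    return custom_huber_loss(pred, label, delta=huber_delta)
```

Each of the energy, force, virial, and atom-energy branches could then call `_term_loss(...)` once, scaled by its prefactor, instead of repeating the conditional.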
deepmd/dpmodel/loss/ener.py (2)
20-27: Consider enforcing non-negative `delta` in `custom_huber_loss`.
The logic is correct, but raising an error when `delta` is negative would prevent anomalous results:
```diff
 def custom_huber_loss(predictions, targets, delta=1.0):
+    if delta < 0:
+        raise ValueError("The Huber delta should be non-negative.")
     xp = array_api_compat.array_namespace(predictions, targets)
     ...
```
171-179: Introducing Huber-based energy loss.
Consolidating L2 and Huber under a single condition is correct, though you could factor out the repeated logic to reduce duplication.
deepmd/pd/loss/ener.py (3)
56-57: Consider reviewing default Huber parameters
Setting `use_huber=False` by default is fine, but a delta of `0.01` might be quite small for many scenarios. You may wish to confirm whether this default threshold is broadly applicable or should be adjusted to a more typical value (e.g., 1.0).
103-109: Improve the parameter documentation
These added docstrings are helpful. You might consider including a usage example or linking to references explaining Huber loss advantages in training to guide new users.
146-150: Runtime check is clear
Raising a `RuntimeError` when Huber loss is combined with force-based configurations not yet supported is a good defensive measure. Consider adding a note in the docstring clarifying that Huber loss is unavailable for those terms.
deepmd/tf/loss/ener.py (4)
76-84: Documentation for Huber parameters
Clear explanation of how Huber loss transitions from L2 to L1. Including a brief usage example could further clarify for new users.
138-143: Restriction on combining Huber loss with certain force features
The explicit runtime error is a good safeguard. If future implementation is planned, consider explaining the rationale or timeline in a comment or docstring.
357-357: Naming consideration for the summary
`l2_loss_summary` might be misleading if Huber loss is used. Consider a more general label like `loss_summary`.
386-386: Storing and returning total loss
Assigning `self.l2_l = loss` continues the older naming. You might rename it for clarity, e.g. `self.total_loss`, since the loss is no longer purely L2 when Huber loss is enabled.
Also applies to: 388-388
📜 Review details
Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro
📒 Files selected for processing (7)
- deepmd/dpmodel/loss/ener.py (11 hunks)
- deepmd/pd/loss/ener.py (11 hunks)
- deepmd/pt/loss/ener.py (11 hunks)
- deepmd/tf/loss/ener.py (9 hunks)
- deepmd/utils/argcheck.py (2 hunks)
- source/tests/consistent/loss/test_ener.py (3 hunks)
- source/tests/universal/dpmodel/loss/test_loss.py (3 hunks)
🧰 Additional context used
🧬 Code Definitions (2)
source/tests/consistent/loss/test_ener.py (1)
source/tests/consistent/loss/common.py (1)
LossTest(4-5)
deepmd/pt/loss/ener.py (4)
deepmd/dpmodel/loss/ener.py (1)
custom_huber_loss (20-27)
deepmd/pd/loss/ener.py (1)
custom_huber_loss (26-32)
deepmd/tf/loss/ener.py (1)
custom_huber_loss (28-34)
source/tests/consistent/loss/test_ener.py (1)
data (64-78)
⏰ Context from checks skipped due to timeout of 90000ms (29)
- GitHub Check: Test Python (6, 3.12)
- GitHub Check: Test Python (6, 3.9)
- GitHub Check: Test Python (5, 3.12)
- GitHub Check: Test Python (5, 3.9)
- GitHub Check: Test Python (4, 3.12)
- GitHub Check: Test Python (4, 3.9)
- GitHub Check: Build wheels for cp310-manylinux_aarch64
- GitHub Check: Test Python (3, 3.12)
- GitHub Check: Build wheels for cp311-win_amd64
- GitHub Check: Test Python (3, 3.9)
- GitHub Check: Build wheels for cp311-macosx_arm64
- GitHub Check: Test Python (2, 3.12)
- GitHub Check: Build C++ (clang, clang)
- GitHub Check: Build wheels for cp311-macosx_x86_64
- GitHub Check: Test Python (2, 3.9)
- GitHub Check: Build C++ (rocm, rocm)
- GitHub Check: Build wheels for cp311-manylinux_x86_64
- GitHub Check: Test Python (1, 3.12)
- GitHub Check: Analyze (python)
- GitHub Check: Build C++ (cuda120, cuda)
- GitHub Check: Build wheels for cp311-manylinux_x86_64
- GitHub Check: Build C library (2.14, >=2.5.0rc0,<2.15, libdeepmd_c_cu11.tar.gz)
- GitHub Check: Test Python (1, 3.9)
- GitHub Check: Test C++ (false)
- GitHub Check: Build C++ (cuda, cuda)
- GitHub Check: Build C library (2.18, libdeepmd_c.tar.gz)
- GitHub Check: Test C++ (true)
- GitHub Check: Analyze (c-cpp)
- GitHub Check: Build C++ (cpu, cpu)
🔇 Additional comments (39)
source/tests/universal/dpmodel/loss/test_loss.py (3)
17-17: LGTM! New parameter for Huber loss.
The addition of the `use_huber` parameter with a default value of `False` ensures backward compatibility while enabling the new Huber loss functionality.
36-36: Good implementation of the parameter in the input dictionary.
The `use_huber` parameter is correctly passed through to the output dictionary, making it available for the loss calculations.
49-49: LGTM! Proper test parameterization for the new feature.
The test parameterization correctly includes both `False` and `True` values for `use_huber`, ensuring test coverage for both standard and Huber loss calculations.
deepmd/utils/argcheck.py (2)
2449-2457: Well-documented Huber loss parameters.
The documentation clearly explains the purpose and behavior of the Huber loss function, including how it transitions between L2 and L1 loss based on the threshold delta.
2571-2584: Good implementation of Huber loss parameters.
The implementation of `use_huber` and `huber_delta` parameters is clean and follows the existing pattern in the codebase. The default values (`False` for `use_huber` and `0.01` for `huber_delta`) provide sensible defaults for this functionality.
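A sketch of what that pattern typically looks like with `dargs` Argument declarations (the doc strings and exact placement in the energy-loss argument list are assumptions, not copied from the PR):

```python
from dargs import Argument

# hypothetical doc strings; the real ones live in deepmd/utils/argcheck.py
doc_use_huber = "Enable Huber loss for energy/force/virial terms."
doc_huber_delta = "Threshold delta at which the loss switches from quadratic to linear."

huber_args = [
    Argument("use_huber", bool, optional=True, default=False, doc=doc_use_huber),
    Argument("huber_delta", float, optional=True, default=0.01, doc=doc_huber_delta),
]
```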
source/tests/consistent/loss/test_ener.py (4)
24-24: Use of parameterized decorator is a good test strategy.
Importing and using `parameterized` here is helpful for systematically testing multiple parameter sets, enhancing test coverage.
59-61: Parameterizing test scenarios for Huber usage.
This approach clearly tests both standard and Huber-enabled scenarios in a single test class, improving reliability.
65-65: Extracting the `use_huber` parameter.
This assignment is clear and consistent with the parameterized approach.
75-77: Zeroing out prefactors when `use_huber` is true.
Setting `start_pref_pf` and `limit_pref_pf` to 0.0 conditionally aligns with the code restriction that Huber loss is not combined with prefactor forces. Consider adding a negative test to confirm the expected behavior if these prefactors were non-zero when Huber is enabled.
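A hedged sketch of such a negative test (the class name `EnergyStdLoss`, the import path, and the exact keyword arguments are assumptions inferred from this review; adjust to the real constructor):

```python
import pytest

from deepmd.pt.loss.ener import EnergyStdLoss  # assumed import path and class name


def test_huber_rejects_prefactor_force():
    # Enabling Huber loss together with a non-zero prefactor-force weight
    # is expected to raise, per the runtime check added in this PR.
    with pytest.raises(RuntimeError):
        EnergyStdLoss(
            starter_learning_rate=1.0,
            start_pref_e=1.0,
            limit_pref_e=1.0,
            start_pref_pf=1.0,
            limit_pref_pf=1.0,
            use_huber=True,
        )
```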
deepmd/pt/loss/ener.py (6)
56-57: New constructor parameters for Huber support.
Defining `use_huber` and `huber_delta` in the signature is a clear extension for conditional Huber loss computation.
102-110: Well-documented new parameters.
The docstring provides a concise explanation about `use_huber` and `huber_delta`, improving clarity for maintainers.
143-151: Runtime error for unsupported Huber configurations.
Raising an exception if combined usage is not implemented prevents undefined or incorrect behavior and is a robust safeguard.
276-285: Same Huber vs. L2 logic repetition.
352-360: Same Huber vs. L2 logic repetition.
381-389: Same Huber vs. L2 logic repetition.
deepmd/dpmodel/loss/ener.py (8)
49-50: Introducing `use_huber` and `huber_delta` in the constructor.
These parameters are consistent with the new Huber loss support.
79-86: Runtime error for unimplemented force modes with Huber.
This check helps avoid silent misbehavior in cases where force-based features conflict with Huber loss.
167-167: Initializing the `loss` aggregator to 0.
A simple and effective way to accumulate terms. Looks good.
183-191: Duplicate Huber vs. L2 logic.
201-209: Duplicate Huber vs. L2 logic.
219-227: Duplicate Huber vs. L2 logic.
263-265: Capturing the final loss result.
Storing intermediate loss values in `self.l2_l` and `self.l2_more` before returning can aid in debugging or logging.
355-355: Version increment ensures backward compatibility.
Updating to version 2 properly reflects the introduction of Huber-related parameters.
deepmd/pd/loss/ener.py (7)
26-32: Validate correctness of the custom Huber loss implementation
The piecewise definition and the mean reduction closely follow the standard Huber loss formula, and the implementation looks accurate.
213-221: Energy term Huber logic
Conditionally using Huber loss for the energy term is consistent with the rest of the design. To ensure correctness, please confirm the ordering of (predictions, targets) across all calls to `custom_huber_loss` is intentional and consistent.
277-284: Force term Huber logic
The usage of Huber loss for forces is analogous to the energy term. Consider validating shapes or dimensional consistency for force arrays in case further expansions (e.g., multi-dimensional or large-batch) are introduced.
363-371: Virial term Huber logic
The conditional Huber application for virial looks correct. No issues identified.
392-400: Atomic energy term Huber logic
Similarly consistent approach for atomic energy. The code is straightforward and matches the style used for the other terms.
498-498: Version bump to 2
This ensures serialization includes the new Huber-related parameters. No further concerns.
534-534: Check version compatibility
You are correctly narrowing the acceptable versions to accommodate the new features. The lack of explicit backward handling for older versions might be acceptable if such usage is unsupported.
deepmd/tf/loss/ener.py (9)
28-34: Implementation of custom Huber loss
This follows the standard piecewise definition correctly. If desired, you could explore using `tf.keras.losses.Huber(delta=...)` to reduce custom code.
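For reference, a minimal sketch of the suggested built-in alternative (delta value and tensors are illustrative; the default Keras reduction averages over the batch, roughly matching the mean reduction of the custom helper):

```python
import tensorflow as tf

huber = tf.keras.losses.Huber(delta=0.01)

targets = tf.constant([1.0, 2.5, -0.3])
predictions = tf.constant([1.1, 2.0, -0.25])
loss = huber(targets, predictions)  # scalar Huber loss averaged over the batch
```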
106-107: New constructor parameters
Having `use_huber=False` and `huber_delta=0.01` as defaults is logically consistent. Ensure these defaults align with typical dataset scales.
293-304: Initial loss setup and energy Huber logic
Setting `loss = 0` first helps maintain consistent accumulation. The conditional Huber branch for energy properly matches the earlier lines. No major concerns.
307-307: Force term Huber logic
Mirrors the approach used for energy. Confirm the shared force shape remains valid when extending to different spin or multi-system cases.
Also applies to: 310-315
320-320: Virial term Huber logic
The usage of Huber loss for virial is parallel to energy/force. No issues noted.
Also applies to: 323-328
333-333: Atomic energy Huber logic
Using Huber for atomic energy matches the approach for energy. This keeps the code design consistent across loss components.
Also applies to: 336-341
351-351: No Huber logic for generalized force
Retaining the standard L2 path is consistent with the earlier explicit error blocking Huber for generalized force use cases.
482-482: Updated serialization version
Version 2 properly signals that the new Huber parameters are included.
520-520: Deserialization version check
This ensures alignment with the updated serialization. Minimal or no backward compatibility is acceptable if older versions are no longer in use.
Actionable comments posted: 2
📜 Review details
Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro
📒 Files selected for processing (2)
- deepmd/pd/loss/ener.py (11 hunks)
- deepmd/tf/loss/ener.py (15 hunks)
🚧 Files skipped from review as they are similar to previous changes (1)
- deepmd/pd/loss/ener.py
⏰ Context from checks skipped due to timeout of 90000ms (29)
- GitHub Check: Test Python (6, 3.12)
- GitHub Check: Test Python (6, 3.9)
- GitHub Check: Test Python (5, 3.12)
- GitHub Check: Test Python (5, 3.9)
- GitHub Check: Test Python (4, 3.12)
- GitHub Check: Test Python (4, 3.9)
- GitHub Check: Build wheels for cp310-manylinux_aarch64
- GitHub Check: Test Python (3, 3.12)
- GitHub Check: Build wheels for cp311-win_amd64
- GitHub Check: Test Python (3, 3.9)
- GitHub Check: Build wheels for cp311-macosx_arm64
- GitHub Check: Test Python (2, 3.12)
- GitHub Check: Build C++ (clang, clang)
- GitHub Check: Build wheels for cp311-macosx_x86_64
- GitHub Check: Test Python (2, 3.9)
- GitHub Check: Build wheels for cp311-manylinux_x86_64
- GitHub Check: Build C++ (rocm, rocm)
- GitHub Check: Test Python (1, 3.12)
- GitHub Check: Analyze (python)
- GitHub Check: Build wheels for cp311-manylinux_x86_64
- GitHub Check: Build C++ (cuda120, cuda)
- GitHub Check: Test Python (1, 3.9)
- GitHub Check: Build C library (2.14, >=2.5.0rc0,<2.15, libdeepmd_c_cu11.tar.gz)
- GitHub Check: Test C++ (false)
- GitHub Check: Build C++ (cuda, cuda)
- GitHub Check: Build C library (2.18, libdeepmd_c.tar.gz)
- GitHub Check: Analyze (c-cpp)
- GitHub Check: Test C++ (true)
- GitHub Check: Build C++ (cpu, cpu)
🔇 Additional comments (12)
deepmd/tf/loss/ener.py (12)
28-34: New Huber loss implementation looks good.
The implementation correctly calculates Huber loss with a smooth transition between L2 loss (quadratic) for small errors and L1 loss (linear) for large errors based on the delta threshold.
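For reference, the piecewise form being described, with error $e$ and threshold $\delta$ (matching the formula quoted in the argument documentation):

```math
L_\delta(e) =
\begin{cases}
  \tfrac{1}{2}\,e^{2}, & |e| \le \delta,\\
  \delta\left(|e| - \tfrac{1}{2}\,\delta\right), & |e| > \delta.
\end{cases}
```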
76-83: Well-documented parameters.
The documentation for the `use_huber` and `huber_delta` parameters is thorough, explaining both what the Huber loss does and how it transitions between L2 and L1 behavior.
106-107: Parameters added with sensible defaults.
The new parameters `use_huber` and `huber_delta` are added with sensible default values (`False` and `0.01` respectively), maintaining backward compatibility.
136-143: Good runtime compatibility check.
The code properly validates that Huber loss is not used with incompatible features (prefactor forces, generalized forces, or relative forces), which prevents potential issues during training.
311-323: Good conditional implementation of Huber loss for energy.
The code correctly applies either L2 or Huber loss based on the `use_huber` flag, properly normalizing the inputs to the Huber loss function.
325-334: Good conditional implementation of Huber loss for forces.
The force loss calculation correctly applies Huber loss when the flag is enabled, with proper reshaping of tensors.
338-347: Good conditional implementation of Huber loss for virial.
The virial loss calculation correctly applies Huber loss when the flag is enabled, with proper normalization and reshaping.
351-360: Good conditional implementation of Huber loss for atomic energies.
The atomic energy loss calculation correctly applies Huber loss when the flag is enabled, with proper reshaping of tensors.
375-375: Appropriate summary name update.
The summary name is updated to reflect that the loss might not be L2 loss when Huber loss is used, ensuring clarity in TensorBoard visualizations.
404-406: Variable name change from l2_loss to loss.
The variable name is appropriately changed from `l2_loss` to `loss` to accurately reflect that it may not be L2 loss when Huber loss is enabled.
500-500: Correct version update and serialization of new parameters.
The version number is appropriately updated from 1 to 2, and the new parameters `use_huber` and `huber_delta` are included in the serialized data.
Also applies to: 517-518
538-538: Version compatibility check updated.
The version compatibility check is updated to handle both version 1 and version 2, ensuring backward compatibility with previously saved models.
Codecov Report
Attention: Patch coverage is
Additional details and impacted files
```diff
@@            Coverage Diff             @@
##            devel    #4684      +/-   ##
==========================================
+ Coverage   84.80%   84.82%   +0.01%
==========================================
  Files         692      692
  Lines       66410    66511     +101
  Branches     3539     3539
==========================================
+ Hits        56320    56418      +98
- Misses       8949     8951       +2
- Partials     1141     1142       +1
```
☔ View full report in Codecov by Sentry.
Actionable comments posted: 0
🧹 Nitpick comments (1)
deepmd/utils/argcheck.py (1)
2449-2456: Documentation formatting needs adjustment for RST compatibility.
The Huber loss documentation looks good in terms of content, but needs formatting adjustments to be consistent with other documentation in the codebase:
- Add a linebreak before each bullet point to match other docstrings in this file
- Ensure proper RST formatting for bullet points
```diff
 doc_use_huber = (
     "Enables Huber loss calculation for energy/force/virial terms with user-defined threshold delta (D). "
-    "The loss function smoothly transitions between L2 and L1 loss: \n\n"
-    "- For absolute prediction errors within D: quadratic loss (0.5 * (error**2)) \n\n"
-    "- For absolute errors exceeding D: linear loss (D * \\|error\\| - 0.5 * D) \n\n"
+    "The loss function smoothly transitions between L2 and L1 loss: \n\n\
+    - For absolute prediction errors within D: quadratic loss (0.5 * (error**2)) \n\n\
+    - For absolute errors exceeding D: linear loss (D * \\|error\\| - 0.5 * D) \n\n\
     "Formula: loss = 0.5 * (error**2) if \\|error\\| <= D else D * (\\|error\\| - 0.5 * D). "
 )
```
📜 Review details
Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro
📒 Files selected for processing (1)
- deepmd/utils/argcheck.py (2 hunks)
⏰ Context from checks skipped due to timeout of 90000ms (29)
- GitHub Check: Test Python (6, 3.12)
- GitHub Check: Test Python (6, 3.9)
- GitHub Check: Test Python (5, 3.12)
- GitHub Check: Test Python (5, 3.9)
- GitHub Check: Test Python (4, 3.12)
- GitHub Check: Test Python (4, 3.9)
- GitHub Check: Build wheels for cp310-manylinux_aarch64
- GitHub Check: Test Python (3, 3.12)
- GitHub Check: Test Python (3, 3.9)
- GitHub Check: Build wheels for cp311-win_amd64
- GitHub Check: Build C++ (clang, clang)
- GitHub Check: Test Python (2, 3.12)
- GitHub Check: Build wheels for cp311-macosx_arm64
- GitHub Check: Build C++ (rocm, rocm)
- GitHub Check: Test Python (2, 3.9)
- GitHub Check: Build C++ (cuda120, cuda)
- GitHub Check: Build wheels for cp311-macosx_x86_64
- GitHub Check: Build wheels for cp311-manylinux_x86_64
- GitHub Check: Build C++ (cuda, cuda)
- GitHub Check: Test Python (1, 3.12)
- GitHub Check: Build C library (2.14, >=2.5.0rc0,<2.15, libdeepmd_c_cu11.tar.gz)
- GitHub Check: Build C library (2.18, libdeepmd_c.tar.gz)
- GitHub Check: Build wheels for cp311-manylinux_x86_64
- GitHub Check: Test Python (1, 3.9)
- GitHub Check: Analyze (python)
- GitHub Check: Test C++ (false)
- GitHub Check: Build C++ (cpu, cpu)
- GitHub Check: Test C++ (true)
- GitHub Check: Analyze (c-cpp)
🔇 Additional comments (1)
deepmd/utils/argcheck.py (1)
2571-2584: The Huber loss parameters are well implemented.
The implementation of the `use_huber` and `huber_delta` parameters looks good. They follow the same pattern as other parameters in the codebase, with appropriate types, defaults, and documentation.
Actionable comments posted: 2
📜 Review details
Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro
📒 Files selected for processing (4)
- deepmd/pd/loss/ener.py (11 hunks)
- deepmd/pt/loss/ener.py (11 hunks)
- deepmd/tf/loss/ener.py (15 hunks)
- deepmd/utils/argcheck.py (2 hunks)
🧰 Additional context used
🧬 Code Definitions (2)
deepmd/pt/loss/ener.py (4)
deepmd/pd/loss/ener.py (1)
custom_huber_loss (26-32)
deepmd/tf/loss/ener.py (1)
custom_huber_loss (28-34)
deepmd/dpmodel/loss/ener.py (1)
custom_huber_loss (20-27)
source/tests/consistent/loss/test_ener.py (1)
data (64-78)
deepmd/pd/loss/ener.py (3)
deepmd/tf/loss/ener.py (1)
custom_huber_loss (28-34)
deepmd/pt/loss/ener.py (1)
custom_huber_loss (26-32)
deepmd/dpmodel/loss/ener.py (1)
custom_huber_loss (20-27)
⏰ Context from checks skipped due to timeout of 90000ms (29)
- GitHub Check: Test Python (6, 3.12)
- GitHub Check: Test Python (6, 3.9)
- GitHub Check: Test Python (5, 3.12)
- GitHub Check: Test Python (5, 3.9)
- GitHub Check: Test Python (4, 3.12)
- GitHub Check: Build wheels for cp310-manylinux_aarch64
- GitHub Check: Test Python (4, 3.9)
- GitHub Check: Build wheels for cp311-win_amd64
- GitHub Check: Test Python (3, 3.12)
- GitHub Check: Build wheels for cp311-macosx_arm64
- GitHub Check: Test Python (3, 3.9)
- GitHub Check: Build C++ (clang, clang)
- GitHub Check: Build wheels for cp311-macosx_x86_64
- GitHub Check: Test Python (2, 3.12)
- GitHub Check: Test Python (2, 3.9)
- GitHub Check: Build C++ (rocm, rocm)
- GitHub Check: Build wheels for cp311-manylinux_x86_64
- GitHub Check: Analyze (python)
- GitHub Check: Test Python (1, 3.12)
- GitHub Check: Build C++ (cuda120, cuda)
- GitHub Check: Test C++ (false)
- GitHub Check: Build C library (2.14, >=2.5.0rc0,<2.15, libdeepmd_c_cu11.tar.gz)
- GitHub Check: Build wheels for cp311-manylinux_x86_64
- GitHub Check: Build C++ (cuda, cuda)
- GitHub Check: Test Python (1, 3.9)
- GitHub Check: Test C++ (true)
- GitHub Check: Build C library (2.18, libdeepmd_c.tar.gz)
- GitHub Check: Analyze (c-cpp)
- GitHub Check: Build C++ (cpu, cpu)
🔇 Additional comments (18)
deepmd/utils/argcheck.py (2)
2449-2456: Documentation formatting looks good with proper linebreaks and escaping
The documentation for the new Huber loss parameters is well-formatted with:
- Clear descriptions of the Huber loss functionality
- Proper linebreaks before list items
- Correctly escaped pipe characters (`|`) to prevent RST from interpreting them as links
This addresses the formatting concerns from the previous review.
2571-2584: Parameter implementation looks good
The implementation of the `use_huber` and `huber_delta` parameters is correctly done with appropriate default values:
- `use_huber` defaults to `False` (disabled by default)
- `huber_delta` defaults to `0.01` (reasonable threshold for Huber loss)
These parameters match the functionality described in the PR objectives for conditional Huber loss computation.
deepmd/pt/loss/ener.py (5)
26-32: Good implementation of the Huber loss function.
The `custom_huber_loss` function correctly implements the Huber loss, providing a smooth transition between L2 loss for small errors and L1 loss for larger errors. This helps with training stability when dealing with outliers.
56-57: Well-documented new parameters for Huber loss.
The new parameters `use_huber` and `huber_delta` are well documented, with clear explanations of what they do and how the Huber loss function works. The documentation clearly explains the threshold behavior and formula.
Also applies to: 102-109
213-221: Consistent implementation of Huber loss across different loss terms.
The implementation of Huber loss is consistent across energy, force, virial, and atomic energy loss terms. Each section correctly checks the `use_huber` flag and applies either standard L2 loss or Huber loss accordingly.
Also applies to: 276-284, 352-360, 381-389
487-487: Proper serialization of new parameters.
The new `use_huber` and `huber_delta` parameters are correctly included in the serialization method, and the version number is incremented from 1 to 2 to reflect these changes.
Also applies to: 504-505
523-523: Ensure backward compatibility in deserialization.
The deserialization method correctly uses `check_version_compatibility` to handle both old (version 1) and new (version 2) serialized data.
deepmd/pd/loss/ener.py (4)
26-32: Good implementation of Huber loss for Paddle.
The `custom_huber_loss` function correctly implements the Huber loss using Paddle's API, providing the same functionality as in other frameworks.
56-57: Good documentation for Huber loss parameters.
The documentation for `use_huber` and `huber_delta` clearly explains the parameters and the threshold behavior of Huber loss, addressing the previous review comment asking for clarification about the delta parameter.
Also applies to: 102-109
213-221: Consistent implementation of conditional Huber loss across all loss components.
The implementation correctly applies Huber loss to energy, force, virial, and atomic energy loss terms when `use_huber` is enabled, maintaining consistency with the PyTorch and TensorFlow implementations.
Also applies to: 276-284, 363-371, 392-400
498-498: Proper versioning and serialization.
The version number is correctly updated, new parameters are added to serialization, and the deserialization method appropriately handles version compatibility.
Also applies to: 515-516, 534-534
deepmd/tf/loss/ener.py (7)
28-34: Well-implemented Huber loss for TensorFlow.
The `custom_huber_loss` function correctly implements the Huber loss using TensorFlow operations, consistent with the implementations in other frameworks.
76-83: Good documentation for Huber loss parameters.
The documentation for `use_huber` and `huber_delta` clearly explains the threshold behavior and formula of the Huber loss function.
Also applies to: 106-107
136-143: Good validation of Huber loss compatibility.
The code correctly checks for incompatible features and raises informative error messages when Huber loss is used with atom prefactors, generalized forces, or relative forces.
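The kind of guard being described might look like the following sketch (the flag names `has_pf`, `has_gf`, and `relative_f` are assumptions based on this review, not the actual attributes):

```python
def check_huber_compatibility(use_huber, has_pf, has_gf, relative_f):
    """Illustrative guard only; the real check lives in the loss constructor."""
    if use_huber and (has_pf or has_gf or relative_f is not None):
        raise RuntimeError(
            "Huber loss is not implemented for prefactor, generalized, or relative forces."
        )
```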
163-164: Fixed potential uninitialized variable warnings.
These additions properly initialize variables that could otherwise be uninitialized, addressing static analysis warnings seen in previous reviews.
Also applies to: 189-190, 201-202, 210-211, 228-229, 238-239, 248-249, 308-309
311-322: Consistent implementation of conditional Huber loss.
The implementation correctly checks the `use_huber` flag and applies either standard L2 loss or Huber loss accordingly for energy, force, virial, and atomic energy loss terms.
Also applies to: 325-333, 338-346, 351-359
500-500: Proper versioning and serialization.
The version number is correctly updated, new parameters are added to serialization, and the deserialization method appropriately handles version compatibility.
Also applies to: 517-518, 538-538
311-311: Improved variable naming for clarity.
Changing from `l2_loss` to `loss` in various places makes the code more accurate since the loss is no longer exclusively L2 when Huber loss is enabled.
Also applies to: 374-375, 404-406