[v2] Start type checking #3176
Conversation
```
# Conflicts:
#	mteb/abstasks/AbsTask.py
#	mteb/models/model_implementations/listconranker.py
#	pyproject.toml
```
Pull Request Overview
This PR introduces initial type checking setup for the MTEB project using mypy as referenced in issue #2714. The changes focus on establishing a basic type checking configuration and addressing fundamental typing issues across the codebase.
- Adds mypy configuration and type stub dependencies for comprehensive type checking
- Updates function signatures to include proper type annotations and return types
- Refactors descriptive statistics type hierarchy to support better type safety
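As an illustration of the second bullet, here is a minimal sketch of the kind of annotation change described (the function and its fields are hypothetical, not taken from the PR's diff):

```python
from __future__ import annotations


# Before: no annotations, so mypy infers `Any` and cannot check callers.
def scores_before(values):
    return {"accuracy": sum(values) / len(values)}


# After: explicit parameter and return types let mypy verify call sites.
def scores_after(values: list[float]) -> dict[str, float]:
    return {"accuracy": sum(values) / len(values)}
```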
Reviewed Changes
Copilot reviewed 32 out of 32 changed files in this pull request and generated 4 comments.
| File | Description |
|---|---|
| pyproject.toml | Adds mypy dependency group and configuration with module overrides |
| mteb/types/statistics.py | Refactors statistics type hierarchy with new base class |
| mteb/create_dataloaders.py | Updates function signatures and replaces `**kwargs` with explicit parameters |
| mteb/abstasks/*.py | Updates abstract task classes to use proper type annotations |
| mteb/_evaluators/*.py | Adds return type annotations to evaluator methods |
| mteb/models/model_implementations/listconranker.py | Removes optional parameter type |
| mteb/types/_encoder_io.py | Adds a `type: ignore` comment for multiple inheritance |
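A "statistics type hierarchy with a base class" (second table row) can be sketched with `TypedDict` inheritance; the class and field names below are illustrative assumptions, not the actual definitions in `mteb/types/statistics.py`:

```python
from typing import TypedDict


# Hypothetical base class: fields shared by all per-task statistics.
class DescriptiveStatistics(TypedDict):
    num_samples: int


# Hypothetical subclass: inherits num_samples, adds text-specific fields.
class TextStatistics(DescriptiveStatistics):
    average_text_length: float


stats: TextStatistics = {"num_samples": 10, "average_text_length": 42.5}
```

With this layout, mypy can check that every statistics dict carries the shared base fields while still typing the task-specific ones.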
```
# Conflicts:
#	mteb/_evaluators/clustering_evaluator.py
#	mteb/abstasks/AbsTaskAnyClustering.py
#	mteb/abstasks/AbsTaskMultilabelClassification.py
```
```diff
     encode_kwargs: dict[str, Any],
     test_cache: np.ndarray | None = None,
-) -> tuple[dict[str, float], Any]:
+) -> tuple[dict[str, float], np.ndarray | None]:
```
a docstring would be good here
Added
```diff
     {include-group = "lint"},
     {include-group = "test"},
 ]
+typing = [
```
Let us add a makefile command as well
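A minimal sketch of such a target (the target name and the paths passed to mypy are assumptions, chosen to match the `typing` dependency group added in this PR):

```makefile
.PHONY: typecheck

# Hypothetical Makefile target: run mypy over the package.
typecheck:
	mypy mteb/
```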
```diff
+[tool.mypy]
+plugins = ['pydantic.mypy']
+
+[[tool.mypy.overrides]]
```
Should we add some comments to these overrides - not quite sure what they do
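For context, a mypy override section scopes settings to specific modules; a common, commented use (the module name here is a made-up example, not from this PR) looks like:

```toml
# Hypothetical example: silence "missing library stubs" errors for
# third-party packages that ship no type information.
[[tool.mypy.overrides]]
module = ["some_untyped_package.*"]
ignore_missing_imports = true
```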
```diff
     task_metadata: TaskMetadata,
     input_column: str | None = None,
-    **dataloader_kwargs: dict[str, Any],
+    batch_size: int = 32,
```
Not `dataloader_kwargs`?
I don't think we need them; I think only `batch_size` is useful for us: https://docs.pytorch.org/docs/stable/data.html#torch.utils.data.DataLoader
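A side note on the removed annotation: `**kwargs: dict[str, Any]` declares that every keyword *value* must be a dict, which is rarely intended; the conventional annotation is `**kwargs: Any`. A small self-contained sketch (`make_loader` is a hypothetical stand-in, not an mteb function):

```python
from typing import Any


# `**kwargs: Any` means each keyword value may be anything;
# `**kwargs: dict[str, Any]` would instead force every value to be a dict,
# so mypy would reject e.g. `num_workers=4`.
def make_loader(batch_size: int = 32, **kwargs: Any) -> dict[str, Any]:
    # Stand-in for forwarding options to a torch DataLoader.
    return {"batch_size": batch_size, **kwargs}
```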
What about num_workers?
I don't think we require it, since we aren't doing anything tricky, but I will test this.
```
# Conflicts:
#	mteb/abstasks/AbsTaskRetrieval.py
#	mteb/abstasks/AbsTaskTextRegression.py
#	mteb/abstasks/aggregated_task.py
```
Ref #2714
The goal is to make `task._evaluate_subset` consistent across tasks and with `Evaluator.__call__`, but classification and regression tasks return 2 items instead of 1.
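The shape mismatch can be illustrated with a minimal sketch (these functions are hypothetical simplifications, not the actual mteb implementations): most evaluations return only a scores dict, while classification/regression-style ones return the scores plus a second item.

```python
# One-item shape: scores only.
def evaluate_clustering() -> dict[str, float]:
    return {"v_measure": 0.5}


# Two-item shape: scores plus extra per-sample output, so a shared
# return annotation for `_evaluate_subset` cannot cover both cases.
def evaluate_classification() -> tuple[dict[str, float], list[float]]:
    return {"accuracy": 0.9}, [1.0, 0.0, 1.0]
```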