
Commit 85890d5

Update training overview docs based on the blogpost reviews
1 parent 946a97d commit 85890d5

File tree: 2 files changed (+9, -8 lines changed)


docs/sentence_transformer/training_overview.md

Lines changed: 8 additions & 7 deletions
@@ -128,14 +128,14 @@ The :class:`SentenceTransformerTrainer` trains and evaluates using :class:`datas
 
 from datasets import Dataset
 
-sentence1_list = []
-sentence2_list = []
+anchors = []
+positives = []
 # Open a file, do preprocessing, filtering, cleaning, etc.
 # and append to the lists
 
 dataset = Dataset.from_dict({
-    "sentence1": sentence1_list,
-    "sentence2": sentence2_list,
+    "anchor": anchors,
+    "positive": positives,
 })
 
 Each key from the dictionary will become a column in the resulting dataset.
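For readers following the change above, here is a minimal runnable sketch of the renamed dataset construction; the example sentences are invented for illustration and are not part of the commit.

```python
from datasets import Dataset

# Hypothetical (anchor, positive) pairs; in practice these would be read from
# your own files after preprocessing, filtering, and cleaning.
anchors = ["How do I bake bread?", "Where is the Eiffel Tower?"]
positives = [
    "Mix flour, water, and yeast, then bake the dough.",
    "The Eiffel Tower stands in Paris.",
]

dataset = Dataset.from_dict({
    "anchor": anchors,
    "positive": positives,
})
print(dataset.column_names)  # ['anchor', 'positive']
```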
@@ -276,9 +276,10 @@ args = SentenceTransformerTrainingArguments(
 
 ## Evaluator
 
-```eval_rst
-Several evaluators exist that can help with evaluation before, during, and after training:
+You can provide the [`SentenceTransformerTrainer`](https://sbert.net/docs/package_reference/sentence_transformer/SentenceTransformer.html#sentence_transformers.SentenceTransformer) with an `eval_dataset` to get the evaluation loss during training, but it may be useful to get more concrete metrics during training, too. For this, you can use evaluators to assess the model's performance with useful metrics before, during, or after training. You can use both an `eval_dataset` and an evaluator, one or the other, or neither. They evaluate based on the `eval_strategy` and `eval_steps` [Training Arguments](#training-arguments).
 
+Here are the implemented Evaluators that come with Sentence Transformers:
+```eval_rst
 ======================================================================== ===========================================================================================================================
 Evaluator                                                                Required Data
 ======================================================================== ===========================================================================================================================
@@ -292,7 +293,7 @@ Evaluator                                                                Requir
 :class:`~sentence_transformers.evaluation.TripletEvaluator`              (anchor, positive, negative) pairs.
 ======================================================================== ===========================================================================================================================
 
-Additionally, :class:`~sentence_transformers.evaluation.SequentialEvaluator` should be used to combine multiple evaluators into one Evaluator that can be passed to the :class:`~sentence_transformers.trainer.SentenceTransformerTrainer`. When the evaluator is run depends on the ``eval_strategy`` and ``eval_steps`` `Training Arguments <#training-arguments>`_.
+Additionally, :class:`~sentence_transformers.evaluation.SequentialEvaluator` should be used to combine multiple evaluators into one Evaluator that can be passed to the :class:`~sentence_transformers.trainer.SentenceTransformerTrainer`.
 
 Sometimes you don't have the required evaluation data to prepare one of these evaluators on your own, but you still want to track how well the model performs on some common benchmarks. In that case, you can use these evaluators with data from Hugging Face.
 
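To make the updated Evaluator paragraph concrete, here is a hedged sketch of combining an `eval_dataset` (for the evaluation loss) with a `TripletEvaluator` (for concrete metrics) in the `SentenceTransformerTrainer`; the base model name and the toy data are assumptions for illustration, not taken from the commit.

```python
from datasets import Dataset
from sentence_transformers import SentenceTransformer, SentenceTransformerTrainer
from sentence_transformers.evaluation import TripletEvaluator
from sentence_transformers.losses import MultipleNegativesRankingLoss

# Assumed base model and made-up data, purely for illustration.
model = SentenceTransformer("microsoft/mpnet-base")

train_dataset = Dataset.from_dict({
    "anchor": ["How do I bake bread?"],
    "positive": ["Mix flour, water, and yeast, then bake the dough."],
    "negative": ["The Eiffel Tower stands in Paris."],
})
eval_dataset = Dataset.from_dict({
    "anchor": ["Where is the Eiffel Tower?"],
    "positive": ["The Eiffel Tower stands in Paris."],
    "negative": ["Bread is baked in an oven."],
})

# Evaluator that reports concrete metrics (triplet accuracy) on the dev data.
dev_evaluator = TripletEvaluator(
    anchors=eval_dataset["anchor"],
    positives=eval_dataset["positive"],
    negatives=eval_dataset["negative"],
    name="dev",
)

trainer = SentenceTransformerTrainer(
    model=model,
    train_dataset=train_dataset,
    eval_dataset=eval_dataset,                 # yields an evaluation loss
    loss=MultipleNegativesRankingLoss(model),
    evaluator=dev_evaluator,                   # yields triplet-accuracy metrics
)
trainer.train()
```

When both are given, the evaluation loss and the evaluator metrics are computed on the schedule set by `eval_strategy` and `eval_steps`, as the updated paragraph describes.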

sentence_transformers/losses/CachedMultipleNegativesRankingLoss.py

Lines changed: 1 addition & 1 deletion
@@ -95,7 +95,7 @@ def __init__(
                 the slower the training will be. It's recommended to set it as high as your GPU memory allows. The default
                 value is 32.
             show_progress_bar: If True, a progress bar for the mini-batches is shown during training. The default is False.
-
+
         References:
             - Efficient Natural Language Response Suggestion for Smart Reply, Section 4.4: https://arxiv.org/pdf/1705.00652.pdf
            - Scaling Deep Contrastive Learning Batch Size under Memory Limited Setup: https://arxiv.org/pdf/2101.06983.pdf
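As context for the touched docstring, here is a minimal sketch of constructing `CachedMultipleNegativesRankingLoss` with the two documented parameters; the base model name is an assumption for illustration.

```python
from sentence_transformers import SentenceTransformer
from sentence_transformers.losses import CachedMultipleNegativesRankingLoss

# Assumed base model; any SentenceTransformer model works here.
model = SentenceTransformer("microsoft/mpnet-base")

# The cached loss processes large logical batches in mini-batches, so
# mini_batch_size bounds GPU memory without changing the effective batch size.
loss = CachedMultipleNegativesRankingLoss(
    model,
    mini_batch_size=32,       # default; set as high as GPU memory allows
    show_progress_bar=False,  # default; set True to show a per-mini-batch progress bar
)
```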
