# Lab 3: Create, enable and manage customizations using OCI Console
## Introduction
When the Live Transcribe service is used as is, it may not provide perfect transcriptions for domain-specific words, acronyms and proper nouns.
Speech Customizations can be enabled when using the Live Transcribe service to improve the transcription accuracy in such cases.
In this session, we will help users get familiar with customizations and how to create and manage them using the OCI Console.

***Estimated Lab Time***: 30 minutes
### Objectives
## Task 1: Navigate to Speech Overview Page
Log into the OCI Cloud Console. Using the Burger Menu in the top left corner, open the Analytics and AI menu, and then select Speech under AI Services.

![](images/analytics-ai-menu.png)
This will navigate you to the Speech overview page.
On the left, you can access the various features of the OCI Speech service: Transcription Jobs, Live Transcribe, Customizations, and Text to Speech.
Under Documentation, you can find helpful links relevant to the OCI Speech service.

![](images/speech-overview-page.png)
1. Let's create an ObjectStorageDataset from the console. For EBITDA, let's create a JSON file like so:

```
<copy>
{
    "datasetType": "ENTITY_LIST",
    "entityList": [
        {
            "entityType": "financial",
            "entities": [
                {
                    "entityValue": "EBITDA"
                }
            ]
        }
    ]
}
</copy>
```
2. Upload this JSON file to an Object Storage bucket.
5. If you want to provide an audio file as the pronunciation, first upload the audio file to Object Storage. You can provide multiple audio files for the same entity. The JSON file can look like:

```
<copy>
{
    "datasetType": "ENTITY_LIST",
    "entityList": [
    ]
}
</copy>
```
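Since a malformed dataset file is easy to upload by accident, it can help to sanity-check the JSON locally before putting it in the bucket. The sketch below is illustrative only (plain Python, not part of the OCI SDK); the shape it checks is inferred from the example datasets in this lab.

```python
import json

def validate_entity_list_dataset(path):
    """Lightweight local sanity check for an ENTITY_LIST dataset file.

    The shape checked here is inferred from the examples in this lab:
    a top-level "datasetType" of "ENTITY_LIST" and an "entityList" of
    groups, each with an "entityType" and a list of "entities" that
    each carry an "entityValue" (pronunciations are optional).
    """
    with open(path) as f:
        dataset = json.load(f)  # raises ValueError if the JSON is malformed

    assert dataset.get("datasetType") == "ENTITY_LIST", "datasetType must be ENTITY_LIST"
    for group in dataset.get("entityList", []):
        assert "entityType" in group, "each entity list needs an entityType"
        for entity in group.get("entities", []):
            assert "entityValue" in entity, "each entity needs an entityValue"
    return dataset
```

Running a check like this before the upload step catches malformed JSON early, instead of discovering it only after the customization has been created.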
## Task 4: Create and enable a customization with custom pronunciations and reference examples
Sometimes, even when a customization with custom pronunciations is enabled, Live Transcribe may still produce mis-transcriptions. For example, let's say you have an organization that's abbreviated as "DSCRD" and is pronounced as "discord". Even if you create and enable a customization for these entities, the transcription might still not contain the right entity every time.
Say you created a customization for DSCRD using the below dataset:

```
<copy>
{
    "datasetType": "ENTITY_LIST",
    "entityList": [
        {
            "entityType": "organizations",
            "entities": [
                {
                    "entityValue": "DSCRD",
                    "pronunciations": [
                        {
                            "soundsLike": "discord"
                        }
                    ]
                }
            ]
        }
    ]
}
</copy>
```
When enabling this customization with the Live Transcribe service, there might still be instances where the transcription does not have DSCRD. For example, in the screenshot below the transcript says "What is the return on equity for discard".
![](images/dscrd.png)

Here is where Reference Examples come in.

You can provide simple examples of sentences where you expect to see entities of a given type. For example, since DSCRD has an entity type of "organizations", here are some sample reference examples for the "organizations" entity type:

* organization called `<organizations>`
* return on equity for `<organizations>`
* work for `<organizations>`
* statement for `<organizations>`
1. You can add these examples to the JSON file from before and upload the JSON file to Object Storage:
```
<copy>
{
    "datasetType": "ENTITY_LIST",
    "entityList": [
        {
            "entityType": "organizations",
            "entities": [
                {
                    "entityValue": "DSCRD",
                    "pronunciations": [
                        {
                            "soundsLike": "discord"
                        }
                    ]
                }
            ]
        }
    ],
    "referenceExamples": [
        "organization called <organizations>",
        "return on equity for <organizations>",
        "work for <organizations>",
        "statement for <organizations>"
    ]
}
</copy>
```
2. Use this dataset when creating the customization. The Customizations service will create two customizations: one for the reference examples (we call this the Main Customization) and one for the entity list that you provide (we call this the Slot Customization). The Slot Customization, i.e. the customization created for the entity list, will have "`--<entity-type>`" in its display name.
![](images/customizations-dscrd-refexamples.png)

If you open the details of the Main Customization, which in this case has the display name "dsrcd-customization", you will see that it has an Entities section that shows which Slot Customization it refers to for the "organizations" entity type.

![](images/main-customization-entities.png)
If you open the details of the Slot Customization, which in this case has the display name "dsrcd-customization--organizations", you will see that it does not have an entities section.
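Because the "`--<entity-type>`" suffix is the only difference in the display names, you can tell Main and Slot Customizations apart when scanning a list of customizations. The helper below is a hypothetical sketch (not an OCI SDK call), assuming only the naming convention described above.

```python
def split_customization_name(display_name: str):
    """Split a customization display name into (base_name, entity_type).

    Slot Customizations carry a "--<entity-type>" suffix (for example,
    "dsrcd-customization--organizations"); Main Customizations do not,
    so entity_type comes back as None for them.
    """
    base, sep, entity_type = display_name.rpartition("--")
    if not sep:  # no "--" suffix: this is a Main Customization
        return display_name, None
    return base, entity_type
```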
3. It is important to enable the Main Customization when making the Live Transcribe call. Think of the Main Customization as encapsulating both the reference examples and the entity lists. Enabling the Slot Customization means that Live Transcribe will use just the entity list and not the reference examples. The Live Transcribe output with the Main Customization enabled looks like:

![](images/live-transcribe-dscrd-refexamples.png)

We see that "DSCRD" appears as expected for the first 5 sentences. The goal of reference examples is to add more context for the custom entities that you have defined.
Note that the transcript may not have the custom entity for utterances that are NOT included in any of the reference examples. For example, in the above screenshot, the last line "The discord app is amazing" could very well have been "The DSCRD app is amazing". If you want to increase the chances of the model transcribing it as "The DSCRD app is amazing", you can add the reference example "`<organizations>` app" into the dataset.
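If you maintain the dataset file programmatically, extending the reference examples is a small change to the JSON before re-uploading. A minimal sketch, assuming the dataset has already been loaded into a Python dict (the helper name is illustrative):

```python
def add_reference_example(dataset: dict, example: str) -> dict:
    """Append a reference example such as "<organizations> app" to a
    dataset dict, creating the "referenceExamples" array if missing."""
    examples = dataset.setdefault("referenceExamples", [])
    if example not in examples:  # avoid duplicate entries
        examples.append(example)
    return dataset
```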
## Task 5: Create and enable a customization with multiple entity lists, custom pronunciations and reference examples
One of the best use-cases for Live Transcribe is in the healthcare domain for doctor-patient conversations. Let's say you are a hospital using OCI Live Transcribe and you have the following requirements.
- You have 2 patients named Daniel and Sorabh, and you want Live Transcribe to accurately transcribe both names. As a bonus requirement, if the doctor utters "Dan" or "Danny", Live Transcribe should still transcribe it as Daniel.
When used as is, the Live Transcribe produces an output like so:

![](images/live-transcribe-medical-no-customization.png)

Let's use Speech Customizations to make this transcription better.

1. Create a customization using the below dataset. This dataset has two entity lists: one for the "names" entity type and one for the "medical" entity type. Reference examples have been added for both entity types. Note that you can have multiple entity types in a single reference example; for instance, "hi `<names>`, have you been taking `<medical>`".

```
<copy>
{
    "datasetType": "ENTITY_LIST",
    "entityList": [
        {
            "entityType": "names",
            "entities": [
                {
                    "entityValue": "Daniel",
                    "pronunciations": [
                        {
                            "soundsLike": "Danny"
                        },
                        {
                            "soundsLike": "Dan"
                        }
                    ]
                },
                {
                    "entityValue": "Sorabh",
                    "pronunciations": [
                        {
                            "soundsLike": "Saurabh"
                        },
                        {
                            "soundsLike": "so rub"
                        },
                        {
                            "soundsLike": "so raab"
                        }
                    ]
                }
            ]
        },
        {
            "entityType": "medical",
            "entities": [
                {
                    "entityValue": "PeriCare",
                    "pronunciations": [
                        {
                            "soundsLike": "Perry care"
                        },
                        {
                            "soundsLike": "Paris care"
                        }
                    ]
                },
                {
                    "entityValue": "procapil",
                    "pronunciations": [
                        {
                            "soundsLike": "pro sepil"
                        },
                        {
                            "soundsLike": "pro capill"
                        }
                    ]
                },
                {
                    "entityValue": "EpiCeram",
                    "pronunciations": [
                        {
                            "soundsLike": "epic serum"
                        },
                        {
                            "soundsLike": "a PC rum"
                        }
                    ]
                }
            ]
        }
    ],
    "referenceExamples": [
        "hi <names>",
        "hello <names>",
        "good morning <names>",
        "good afternoon <names>",
        "tell me <names>",
        "prescribe <medical>",
        "take <medical>",
        "apply <medical>"
    ]
}
</copy>
```
2. This dataset would create 3 customizations: one Main Customization and two Slot Customizations.
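The count follows directly from the dataset's structure: one Main Customization for the dataset as a whole, plus one "`--<entity-type>`" Slot Customization per entity type. A small illustrative sketch (the base display name is a placeholder you choose, not something the service prescribes):

```python
import json

def expected_customizations(dataset_json: str, base_name: str):
    """Return the display names we expect a dataset to produce: the Main
    Customization plus one "--<entity-type>" Slot Customization per
    entity type in the entity list."""
    dataset = json.loads(dataset_json)
    entity_types = [group["entityType"] for group in dataset.get("entityList", [])]
    return [base_name] + [f"{base_name}--{etype}" for etype in entity_types]
```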