Ingest Client - Multichannel support added #2186

lenisha · 2023-12-19T06:16:44Z

Purpose

Add MultiChannel Support for audio files in batch transcription

Pull Request Type

What kind of change does this Pull Request introduce?
Adds support for configuring the multichannel transcription feature.

[ ] Bugfix
[X] Feature
[ ] Code style update (formatting, local variables)
[ ] Refactoring (no functional changes, no api changes)
[ ] Documentation content changes
[ ] Other... Please describe:

#2185

zhouwangzw · 2023-12-19T08:03:14Z

samples/ingestion/ingestion-client/Setup/ArmTemplateBatch.json

            }
        },
+        "Channels": {
+            "defaultValue": "1",


The default value for transcription is [0, 1]; that is, if no channels are specified, both channel 0 and channel 1 of a stereo file are transcribed.

lenisha · 2023-12-20

That is not what happens. If we do not pass Channels in the transcription request properties, the request fails with an Invalid Data error. This was tested on multichannel audio files. The property has to be set explicitly:

"channels": [ 0, 1 ],

zhouwangzw · 2023-12-19T08:03:50Z

samples/ingestion/ingestion-client/Setup/ArmTemplateBatch.json

+        "TextAnalyticsEndpoint": {
+            "defaultValue": "",
            "type": "String",
-            "allowedValues": [


So we support TextAnalytics in any region now?

lenisha · 2023-12-20

Please refer to the latest release of the ARM template in this repo. Yes, the endpoint setting is supported, specifically to allow private endpoints, which are not regional. I am not sure why the latest tag has not been merged into master.

https://github.com/Azure-Samples/cognitive-services-speech-sdk/blob/ingestion-v2.0.11/samples/ingestion/ingestion-client/Setup/ArmTemplateBatch.json

zhouwangzw · 2023-12-19T08:06:20Z

...gestion/ingestion-client/StartTranscriptionByTimer/StartTranscriptionEnvironmentVariables.cs


        public static readonly string StartTranscriptionServiceBusConnectionString = Environment.GetEnvironmentVariable(nameof(StartTranscriptionServiceBusConnectionString), EnvironmentVariableTarget.Process);
+
+        public static readonly int[] Channels = int.TryParse(Environment.GetEnvironmentVariable(nameof(Channels), EnvironmentVariableTarget.Process), out int result) && result == 1 ? Constants.Channels : new int[] { result };


Reading channels from environment variables looks weird. Do we ask customers to change the environment variable value for each audio file to be transcribed? What happens if multiple audio files are submitted with different channel settings?

lenisha · 2023-12-20

It is a global setting for the Azure Function and applies to all audio files.
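To make the behaviour of the `Channels` expression in this diff concrete, here is a minimal sketch that applies the same expression to a plain string instead of an environment variable. The class `ChannelsSketch`, the method `ResolveChannels`, and the local `DefaultChannels` array (standing in for `Constants.Channels`) are hypothetical names introduced only for illustration.

```csharp
using System;

public static class ChannelsSketch
{
    // Stand-in for Constants.Channels from the diff (assumption: same contents, { 0, 1 }).
    private static readonly int[] DefaultChannels = new int[] { 0, 1 };

    // Hypothetical helper: the same expression as in the diff, but reading from a
    // string argument rather than Environment.GetEnvironmentVariable.
    public static int[] ResolveChannels(string rawValue)
    {
        // && binds tighter than ?:, so the condition is
        // (the value parsed successfully AND the parsed value is 1).
        return int.TryParse(rawValue, out int result) && result == 1
            ? DefaultChannels
            : new int[] { result };
    }
}
```

Under this expression, `ResolveChannels("1")` yields both channels `{ 0, 1 }`, `ResolveChannels("0")` yields `{ 0 }`, and an unset or non-numeric value (for example `null`) also yields `{ 0 }`, because `int.TryParse` sets `result` to 0 when parsing fails.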

zhouwangzw · 2023-12-19T08:06:37Z

...s/ingestion/ingestion-client/Connector/Serializable/Transcription/TranscriptionDefinition.cs

            string locale,
            IEnumerable<string> contentUrls,
-            Dictionary<string, string> properties,
+            Dictionary<string, object> properties,


This looks weird. Why do we need to change this to "object"?

lenisha · 2023-12-20

Properties in a transcription request are not always strings. For example:

"properties": {
    "diarizationEnabled": false,
    "wordLevelTimestampsEnabled": true,
    "channels": [ 0, 1 ],
    "punctuationMode": "DictatedAndAutomatic",
    "profanityFilterMode": "Masked",
    "languageIdentification": {
        "candidateLocales": [ "en-US", "de-DE", "es-ES" ]
    }
},

https://learn.microsoft.com/en-us/azure/ai-services/speech-service/batch-transcription-create?pivots=rest-api#create-a-transcription-job
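As a rough illustration of why the value type matters, the sketch below serializes a property bag with mixed value types. It uses `System.Text.Json` as an assumption (the sample itself may use a different serializer), and `PropertiesSketch` and `BuildPropertiesJson` are hypothetical names that do not appear in the PR.

```csharp
using System;
using System.Collections.Generic;
using System.Text.Json;

public static class PropertiesSketch
{
    // Hypothetical helper: builds a subset of the request properties quoted above
    // and serializes them to a JSON string.
    public static string BuildPropertiesJson()
    {
        var properties = new Dictionary<string, object>
        {
            ["diarizationEnabled"] = false,                 // JSON boolean
            ["wordLevelTimestampsEnabled"] = true,          // JSON boolean
            ["channels"] = new[] { 0, 1 },                  // JSON array of numbers
            ["punctuationMode"] = "DictatedAndAutomatic",   // JSON string
        };

        // Values declared as object are serialized according to their runtime type.
        return JsonSerializer.Serialize(properties);
    }
}
```

With `Dictionary<string, string>`, the `channels` array and the boolean flags could only be carried as quoted strings. With `object` values they serialize as a JSON array and JSON booleans, which matches the shape of the request shown in the linked documentation.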

zhouwangzw · 2023-12-19T09:16:36Z

samples/ingestion/ingestion-client/Connector/Constants.cs


        public const string SummarizationSupportedLocalePrefix = "en";
+
+        public static readonly int[] Channels = new int[] { 0, 1 };


I am not sure I understand the purpose of this PR. Multichannel support should be enabled by default in batch transcription. A customer needs to specify a channel number in the request properties only to transcribe a specific channel, for example channel 1 but not channel 0 of a stereo file.

lenisha · 2023-12-20

Please test the default settings. The service always returns "Invalid Data" for multichannel audio files, so multichannel transcription is not enabled by default.

lenisha added 3 commits December 18, 2023 23:52

add channels setting for non multi-channel audio

457e223

add template update

e357d95

format template

f24542c
