Azure AI Content Understanding Samples (.NET 8 Console App)

Welcome! Content Understanding is a solution that analyzes and comprehends various media content, such as documents, images, audio, and video, transforming it into structured, organized, and searchable data.

The samples in this repository use the latest preview API version by default: 2025-05-01-preview.
This repo will provide more samples for new functionalities in Preview.2 2025-05-01-preview soon.
As of 2025/05, the 2025-05-01-preview API is only available in the regions documented in Content Understanding region and language support.

👉 If you are looking for Python samples, check out this repo.

Quick Start

You can run this sample in GitHub Codespaces or on your local machine.

Option 1: GitHub Codespaces

Run this repo virtually in a cloud-based development environment.

Steps:

Click the button above to create a new Codespace.
Select the main branch, your preferred region, and a 2-core machine.
When the Codespace is ready, VS Code will automatically build the Dev Container.
Follow the instructions in Configure Azure AI service resource.
Use the integrated terminal to run the project:

cd {ProjectName}
dotnet build
cd bin/Debug/net8.0/
dotnet {ProjectName}.dll

For an overview of the available projects in {ProjectName}, refer to the Features section. We recommend starting with ContentExtraction as your first project.

Option 2: Run Locally

To run the project locally, select one of the setup options below.
For the smoothest experience, we recommend Option 2.1, which provides a hassle-free environment setup.

Option 2.1: Windows / macOS / Linux (using VS Code + Dev Container)

Prerequisites

Docker
Install Docker Desktop (available for Windows, macOS, and Linux).
Docker is used to manage and run the container environment.
- Start Docker and ensure it is running in the background.
Visual Studio Code
Download and install Visual Studio Code.
Dev Containers Extension
In the VS Code extension marketplace, install the extension named Dev Containers.
(This extension was previously called "Remote - Containers" but has since been renamed and integrated into Dev Containers.)

Steps

Clone the repo:

git clone https://github.com/Azure-Samples/azure-ai-content-understanding-dotnet.git
cd azure-ai-content-understanding-dotnet

Launch VS Code and open the folder:

code .

Press F1, then select Dev Containers: Reopen in Container.
Wait for the setup to complete. Follow the instructions in Configure Azure AI service resource, then use the integrated terminal in Visual Studio Code to run:

cd {ProjectName}
dotnet build
cd bin/Debug/net8.0/
dotnet {ProjectName}.dll

For an overview of the available projects in {ProjectName}, refer to the Features section. We recommend starting with ContentExtraction as your first project.

Option 2.2: Windows (using Visual Studio 2022 or later)

Prerequisites

Clone the repo using Git or from Visual Studio:

git clone https://github.com/Azure-Samples/azure-ai-content-understanding-dotnet

Open the .sln solution file in Visual Studio 2022+.
Ensure the target framework is set to .NET 8.
Follow the instructions in Configure Azure AI service resource.
We recommend setting ContentExtraction as your Startup Project. For an overview of the available projects, refer to the Features section.
Press F5 or click Start to run the console app.

Configure Azure AI Service Resource

(Option 1) Use `azd` commands to automatically create temporary resources to run the sample

Ensure you have permission to grant roles under your subscription.
Login to Azure:

azd auth login

If this does not work, try:

azd auth login --use-device-code

and follow the on-screen instructions.

Set up the environment, following prompts to choose the location:

azd up

(Option 2) Manually create resources and set environment variables

Create an Azure AI Services resource.
In the Azure portal, navigate to Access Control (IAM) in your resource, and grant yourself the role Cognitive Services User.
- This step is required even if you are the owner of the resource.
Fill the Endpoint field in appsettings.json with the endpoint from your Azure AI Services instance in the Azure portal.
The SubscriptionKey field in appsettings.json is optional. It is only needed if you choose not to authenticate using azd auth login or az login, and instead prefer key-based authentication.
If using the recommended identity-based authentication, ensure you are signed in by running:

azd auth login

This ensures your Azure identity is properly authenticated for accessing secured resources.

Features

Azure AI Content Understanding is a new Generative AI-based Azure AI service, designed to process and ingest content of any type (documents, images, audio, and video) into a user-defined output format. Content Understanding offers a streamlined process to reason over large amounts of unstructured data, accelerating time-to-value by generating output that can be integrated into automation and analytical workflows.

Project	Key Source File	Description
ContentExtraction	ContentExtractionService.cs	In this sample we will show content understanding API can help you get semantic information from your file. For example OCR with table in document, audio transcription, and face analysis in video.
FieldExtraction	FieldExtractionService.cs	In this sample we will show how to create an analyzer to extract fields in your file. For example invoice amount in the document, how many people in an image, names mentioned in an audio, or summary of a video. You can customize the fields by creating your own analyzer template.
FieldExtractionProMode	FieldExtractionProModeSerivce.cs	In this sample we will demonstrate how to use Pro mode in Azure AI Content Understanding to enhance your analyzer with multiple inputs and optional reference data. Pro mode is designed for advanced use cases, particularly those requiring multi-step reasoning, and complex decision-making (for instance, identifying inconsistencies, drawing inferences, and making sophisticated decisions).
Classifier	ClassifierService.cs	This sample will demo how to (1) create a classifier to categorize documents, (2) create a custom analyzer to extract specific fields, and (3) combine classifier and analyzers to classify, optionally split, and analyze documents in a flexible processing pipeline.
ConversationalFieldExtraction	ConversationalFieldExtraction.cs	Shows how to efficiently evaluate conversational audio data previously transcribed with Content Understanding or Azure AI Speech. Enables re-analysis of data cost-effectively. Based on the FieldExtraction sample.
AnalyzerTraining	AnalyzerTrainingService.cs	If you want to futher boost the performance for field extraction, we can do training when you provide few labeled samples to the API. Note: This feature is available to document scenario now.
Management	ManagementService.cs	This sample will demo how to create a minimal analyzer, list all the analyzers in your resource, and delete the analyzer you don't need.
BuildPersonDirectory	BuildPersonDirectoryService.cs	This sample will demo how to enroll people’s faces from images and build a Person Directory.

Sample Console Output

Here is an example of the console output from the ContentExtraction project.

$ dotnet ContentExtraction.dll
Please enter a number to run sample: 
[1] - Extract Document Content
[2] - Extract Audio Content
[3] - Extract Video Content
[4] - Extract Video Content With Face 
1
Document Content Extraction Sample is running...
Use prebuilt-documentAnalyzer to extract document content from the file: ./data/invoice.pdf

===== Document Extraction has been saved to the following output file path =====

./outputs/content_extraction/AnalyzeDocumentAsync_20250714034618.json

===== The markdown output contains layout information, which is very useful for Retrieval-Augmented Generation (RAG) scenarios. You can paste the markdown into a viewer such as Visual Studio Code and preview the layout structure. =====
CONTOSO LTD.


# INVOICE

Contoso Headquarters
123 456th St
New York, NY, 10001

INVOICE: INV-100

INVOICE DATE: 11/15/2019

DUE DATE: 12/15/2019

CUSTOMER NAME: MICROSOFT CORPORATION

SERVICE PERIOD: 10/14/2019 - 11/14/2019

CUSTOMER ID: CID-12345

<<< Truncated for brevity >>>

Additional Resources

Notes

Trademarks - This project may contain trademarks or logos for projects, products, or services. Authorized use of Microsoft trademarks or logos is subject to and must follow Microsoft's Trademark & Brand Guidelines. Use of Microsoft trademarks or logos in modified versions of this project must not cause confusion or imply Microsoft sponsorship. Any use of third-party trademarks or logos is subject to those third-party’s policies.
Data Collection - The software may collect information about you and your use of the software and send it to Microsoft. Microsoft may use this information to provide services and improve our products and services. You may turn off the telemetry as described in the repository. There are also some features in the software that may enable you and Microsoft to collect data from users of your applications. If you use these features, you must comply with applicable law, including providing appropriate notices to users of your applications together with a copy of Microsoft’s privacy statement. Our privacy statement is located at https://go.microsoft.com/fwlink/?LinkID=824704. You can learn more about data collection and use in the help documentation and our privacy statement. Your use of the software operates as your consent to these practices.

Name		Name	Last commit message	Last commit date
Latest commit History 91 Commits
.devcontainer		.devcontainer
.github/workflows		.github/workflows
AnalyzerTraining		AnalyzerTraining
AzureAiContentUnderstanding.Tests		AzureAiContentUnderstanding.Tests
BuildPersonDirectory		BuildPersonDirectory
Classifier		Classifier
ContentExtraction		ContentExtraction
ContentUnderstanding.Common		ContentUnderstanding.Common
ConversationalFieldExtraction		ConversationalFieldExtraction
FieldExtraction		FieldExtraction
FieldExtractionProMode		FieldExtractionProMode
Management		Management
docs		docs
infra		infra
.gitignore		.gitignore
AzureAiContentUnderstandingDotNet.sln		AzureAiContentUnderstandingDotNet.sln
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
SECURITY.md		SECURITY.md
azure.yaml		azure.yaml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Azure AI Content Understanding Samples (.NET 8 Console App)

Quick Start

Option 1: GitHub Codespaces

Steps:

Option 2: Run Locally

Option 2.1: Windows / macOS / Linux (using VS Code + Dev Container)

Prerequisites

Steps

Option 2.2: Windows (using Visual Studio 2022 or later)

Prerequisites

Configure Azure AI Service Resource

(Option 1) Use `azd` commands to automatically create temporary resources to run the sample

(Option 2) Manually create resources and set environment variables

Features

Sample Console Output

More Samples Using Azure Content Understanding

Additional Resources

Notes

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 4

Uh oh!

Languages

License

Azure-Samples/azure-ai-content-understanding-dotnet

Folders and files

Latest commit

History

Repository files navigation

Azure AI Content Understanding Samples (.NET 8 Console App)

Quick Start

Option 1: GitHub Codespaces

Steps:

Option 2: Run Locally

Option 2.1: Windows / macOS / Linux (using VS Code + Dev Container)

Prerequisites

Steps

Option 2.2: Windows (using Visual Studio 2022 or later)

Prerequisites

Configure Azure AI Service Resource

(Option 1) Use azd commands to automatically create temporary resources to run the sample

(Option 2) Manually create resources and set environment variables

Features

Sample Console Output

More Samples Using Azure Content Understanding

Additional Resources

Notes

About

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 4

Uh oh!

Languages

(Option 1) Use `azd` commands to automatically create temporary resources to run the sample

Packages