Skip to content

Add language hint for Omnilingual ASR CTC on Android & iOS #2812

@anita-smith1

Description

@anita-smith1

Many thanks for adding Omnilingual ASR CTC support to sherpa-onnx.

I’m using the sherpa-onnx-omnilingual-asr-1600-languages-300M-ctc-int8-2025-11-12 model on Android (type 44 in getOfflineModelConfig). The Android bindings expose:

data class OfflineOmnilingualAsrCtcModelConfig(
    var model: String = "",
)

In many mobile use cases, however, the target language is already known (e.g. “always English” or a fixed eng_Latn / cmn_Hans etc.). It would be very useful to be able to pass a language hint from Android to bias or restrict decoding to that language.

Feature request

Add an optional language field to the Android config, for example:

data class OfflineOmnilingualAsrCtcModelConfig(
    var model: String = "",
    var language: String = "", // optional language hint, e.g. "eng_Latn"
)

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions