Add support for OpenAI text-to-speech `instructions` parameter #1011

faisal-alvi · 2025-09-18T14:24:41Z

Description of the Change

This PR adds support for OpenAI's instructions parameter in the text-to-speech feature, allowing users to control voice characteristics using natural language instructions.

Changes made:

- Added a new "Voice instructions" textarea field to the OpenAI Text-to-Speech settings
- Updated default settings to include the `instructions` parameter
- Modified the API request to include `instructions` when provided
- Added a `get_instructions()` method with filter support
- Added two new filters: `classifai_openai_text_to_speech_instructions` and `classifai_openai_text_to_speech_request_body`
- Users can now control voice characteristics with natural language (e.g., "Speak in a calm, professional tone")
- Provides more flexibility in audio generation
- Maintains backward compatibility (instructions are optional)
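
For reviewers, the "include `instructions` only when provided" behaviour could look roughly like the sketch below. It is not the plugin's actual code: the function name `build_tts_request_body()`, the settings array keys, and the fallback defaults are hypothetical. Only the field names `model`, `voice`, `speed`, `input`, and `instructions` are taken from OpenAI's speech endpoint.

```php
<?php
/**
 * Hypothetical helper: builds the body for a text-to-speech request.
 * Function name, setting keys, and defaults are assumptions for illustration.
 */
function build_tts_request_body( array $settings, string $text ): array {
	$body = array(
		'model' => $settings['model'] ?? 'gpt-4o-mini-tts',
		'voice' => $settings['voice'] ?? 'alloy',
		'speed' => (float) ( $settings['speed'] ?? 1.0 ),
		'input' => $text,
	);

	// Only send instructions when the user actually provided some,
	// so sites without the setting keep sending the same request as before.
	$instructions = trim( (string) ( $settings['instructions'] ?? '' ) );
	if ( '' !== $instructions ) {
		$body['instructions'] = $instructions;
	}

	return $body;
}

$with    = build_tts_request_body( array( 'instructions' => ' Speak in a calm, professional tone ' ), 'Hello' );
$without = build_tts_request_body( array(), 'Hello' );
```

Leaving the key out entirely, rather than sending an empty string, is what keeps the change backward compatible in this sketch.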

Approaches considered:

- Initially considered making this a universal setting across all TTS providers, but research showed that each provider uses a different approach (OpenAI uses natural language, Amazon Polly uses SSML, Azure uses SSML tags, ElevenLabs uses technical parameters)
- Decided to keep this OpenAI-specific to maintain a clean separation of concerns

Closes #998, Closes #997

How to test the Change

Build the frontend assets:
```
npm run build
```
Access the settings:
- Go to WordPress Admin > Tools > ClassifAI > Language Processing > Text to Speech > Settings
- Select "OpenAI Text to Speech" as provider
- Verify the new "Voice instructions" field appears below "Audio speed"
Test the functionality:
- Enter test instructions like "Say Welcome before starting", "Speak in a calm, professional tone" or "Use a more energetic delivery"
- Save settings
- Create/edit a post and generate text-to-speech audio
- Verify the audio reflects the voice characteristics specified in instructions

Test filters:

```
add_filter( 'classifai_openai_text_to_speech_instructions', function( $instructions ) {
    return $instructions . ' Speak with enthusiasm.';
} );
```
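
The second filter, `classifai_openai_text_to_speech_request_body`, can be exercised in a similar way. The sketch below assumes the filter passes the request body as an associative array in its first argument. The PR description does not state the signature, so check the source before relying on it. The callback name `my_tts_request_body()` is hypothetical.

```php
<?php
/**
 * Hypothetical callback: assumes the filter's first argument is the
 * request body as an associative array (not confirmed by the PR text).
 */
function my_tts_request_body( $body ) {
	if ( is_array( $body ) && empty( $body['instructions'] ) ) {
		// Fall back to a default instruction when none was configured.
		$body['instructions'] = 'Speak in a calm, professional tone.';
	}
	return $body;
}

// Register only inside WordPress, so the file can also be loaded standalone.
if ( function_exists( 'add_filter' ) ) {
	add_filter( 'classifai_openai_text_to_speech_request_body', 'my_tts_request_body' );
}
```

After adding the callback, generate audio for a post with the "Voice instructions" field left empty and confirm the default instruction is reflected in the result.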

Changelog Entry

Added - Support for OpenAI text-to-speech instructions parameter to control voice characteristics

Credits

Props @swissky @dkotter @faisalalvi

Checklist:

- I agree to follow this project's Code of Conduct.
- I have updated the documentation accordingly.
- I have added Critical Flows, Test Cases, and/or End-to-End Tests to cover my change.
- All new and existing tests pass.


/home/runner/work/classifai/classifai/src/js/settings/components/provider-settings/openai-text-to-speech.js
- Error: 209:29 error Replace `·onChange(·{·instructions:·value·}·)·` with `⏎↹↹↹↹↹↹onChange(·{·instructions:·value·}·)⏎↹↹↹↹↹` prettier/prettier
- Error: 212:20 error Use ellipsis character (…) in place of three dots @wordpress/i18n-ellipsis

faisal-alvi added 2 commits September 17, 2025 23:11

Add voice instructions feature to OpenAI Text-to-Speech settings

bd81aae

Add instructions field and rendering method for voice control in OpenAI Text-to-Speech settings

b2fb9be

faisal-alvi self-assigned this Sep 18, 2025

github-actions bot added this to the Future Release milestone Sep 18, 2025

faisal-alvi added 2 commits September 18, 2025 22:57

update version in doc comment

ea07ea2

faisal-alvi marked this pull request as ready for review September 19, 2025 14:30

faisal-alvi requested review from dkotter, jeffpaul and a team as code owners September 19, 2025 14:30

faisal-alvi removed request for a team and jeffpaul September 19, 2025 14:30

github-actions bot added the needs:code-review This requires code review. label Sep 19, 2025
