Product roadmap

OpenVox roadmap.

What shipped, what is next, and how OpenVox is evolving.

Active milestone

1.6.0

Now shipping

Conversations, multi-speaker scripts, and a more polished voice workflow

Conversations

Script Import

Saved Voice Management

Voice Workflow Polish

Release spotlight

1.6.0 adds Conversations for multi-speaker scripts with up to 4 voices, direct `.txt` and `.pdf` import, saved voice renaming, and broader Voice Design improvements.

  • Introducing Conversations for multi-speaker scripts with up to 4 voices for interviews, skits, dialogue, and character scenes.
  • Import `.txt` and `.pdf` scripts directly into Conversations.
  • Voice Clone now warns when reference audio is longer than the recommended 10 to 20 seconds.
  • Rename existing saved voices in both Voice Clone and Voice Design.
  • Better layout, fuller space usage, and improved control styling across Local API and Voice Cloning.
  • Voice Design templates are now available for all supported languages.
  • Added the missing Italian language in Voice Design.

Direction

From voice app to local voice infrastructure.

Future roadmap

Next up

Planned

Watch Folder Automation

A watch folder that auto-converts any text file or document dropped into it using your preset voice and model, with no UI interaction needed.
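As a rough illustration of the watch-folder idea, the sketch below polls a directory and reports files that have not been seen before. It is not OpenVox's implementation: the `find_new_files` helper, the `WATCHED_SUFFIXES` set, and the choice of polling are all assumptions made for this example.

```python
from pathlib import Path

# Assumption: only these extensions are treated as convertible documents.
WATCHED_SUFFIXES = {".txt", ".pdf"}


def find_new_files(folder: Path, seen: set[Path]) -> list[Path]:
    """Return watched files in `folder` not yet in `seen`, and mark them as seen."""
    new_files = sorted(
        path
        for path in folder.iterdir()
        if path.is_file()
        and path.suffix.lower() in WATCHED_SUFFIXES
        and path not in seen
    )
    seen.update(new_files)
    return new_files
```

A caller would run `find_new_files` on a timer and hand each returned path to whatever conversion step it uses; the conversion itself is deliberately left out here.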

Planned

SRT / Subtitle Export

Native `.srt` export so generated speech can drop straight into video editing timelines without hours of manual subtitle work in post-production.
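For readers unfamiliar with the format, an `.srt` file is a sequence of numbered cues, each with a `HH:MM:SS,mmm --> HH:MM:SS,mmm` time range followed by the subtitle text. The sketch below shows how timed segments could be serialized that way. The `format_timestamp` and `to_srt` helpers and the `(start, end, text)` tuple layout are assumptions for illustration, not OpenVox's export code.

```python
def format_timestamp(seconds: float) -> str:
    """Convert seconds to the SRT timestamp form HH:MM:SS,mmm."""
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def to_srt(cues: list[tuple[float, float, str]]) -> str:
    """Serialize (start_seconds, end_seconds, text) cues as SRT text."""
    blocks = []
    for index, (start, end, text) in enumerate(cues, start=1):
        blocks.append(
            f"{index}\n{format_timestamp(start)} --> {format_timestamp(end)}\n{text}\n"
        )
    # Cues are separated by a blank line.
    return "\n".join(blocks)
```

Calling `to_srt([(0.0, 1.25, "Hello"), (1.25, 2.0, "World")])` yields two numbered cues that a video editor can import as a subtitle track.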


Release history

The path to 1.6.0

Recent releases focused on language expansion, audiobook workflows, model management, and reliability.

1.5.0

Local API
  • Added OpenVox Local API support for AI agents and external tools.
  • Enhanced Voice Templates so gender tags stay in sync with the voice design input.
  • Added PDF import in AI Speech single mode for faster text-to-audio workflows.
  • Improved word replacements under Pre Processing so they apply more reliably.

1.4.1

Stability
  • Fixed reversed typing in Batch mode on older Macs.
  • Fixed microphone permission detection and recording access for voice cloning.

1.4.0

OmniVoice
  • Added OmniVoice support for more natural, expressive, and context-aware speech.
  • Expanded language coverage to 600+ languages with stronger multilingual performance.
  • Added new library voices for 25 more languages using the OmniVoice model.

1.3.0

Audiobooks
  • Added speed control support to more TTS models beyond Kokoro.
  • Improved the AI Audiobook experience across import, chapter handling, and export.
  • Added EPUB import for AI Audiobook.
  • Large ebook imports now show a loading animation, so long imports no longer look like the app has stalled.
  • Merge and export now include processing feedback for long audiobook exports.
  • Individual chapters can now be deleted after ebook import.
  • Exported chapter filenames are cleaner and easier to sort.
  • Fixed batch paragraph editor text direction behavior for RTL languages on older macOS versions.
  • The Daily Power badge is now hidden when Pro is active.

1.2.3

Batch mode
  • Added support for custom model storage paths in Settings, including moving existing model files to a new location.
  • Batch Mode now supports importing text from `.txt`, `.rtf`, and `.csv` files.
  • Added a downloadable CSV template and faster cleanup with the Clear All button.

1.2.2

Fixes
  • Fixed a launch crash affecting users with M1 and M1 Max Macs.
  • Model downloads now persist across updates, so models no longer need to be downloaded again after updating.
  • Paste now always inserts plain text, avoiding unwanted formatting from browsers and docs.
  • Fixed post-processing failures affecting silence removal and normalization.

1.2.1

Voice changer
  • Fixed an issue where AI Voice Changer could fail for certain audio clip lengths.
  • Improved AI Voice Changer processing for longer audio clips.

1.2

Voice design
  • Added Qwen3 TTS support with high-quality reference voice cloning across 10 languages.
  • Added Voice Design so you can describe a voice in natural language and generate a brand new reusable voice locally on your Mac.
  • Improved output generation times across the app.
  • Added a speech recognition helper for Voice Cloning that can detect the spoken reference script from uploaded samples.
  • Improved Model Manager download and setup flows.
  • Improved model memory management.

1.1

Core fixes
  • Fixed a Voice Conversion issue where MP3 inputs could fail during processing with a format error.
  • Fixed an issue where Manage Models opened the Kokoro model page instead of the main Model Manager page.
  • Fixed Resource Manager status so the currently loaded model is shown correctly instead of displaying "no model loaded" when a model is active.