NEURAL VOICES
Audio Synthesis & Cloning
Overview of Text-to-Speech (TTS), Voice Changer, Noise Reduction cleaner, and Custom Voice Cloning settings.
Text-to-Speech (TTS)
Enter your text and synthesis variables (Stability and Similarity controls). The model outputs highly realistic voice streams.
- Stability (0.0 - 1.0): Adjusts prosody variation. Lower values make vocal delivery more dynamic.
- Similarity Boost (0.0 - 1.0): Determines how closely the clone parameters adhere to the original training asset.
Custom Voice Cloning
Clone specific vocal profiles by uploading 1-2 minutes of clean audio datasets. Ensure minimal background echoes.
Language Restrictions
Custom voices are optimized for exactly 13 languages (EN, ZH, JA, DE, FR, ES, KO, AR, RU, NL, IT, PL, PT). Attempting to use cloned voices on unsupported languages may result in degradation or gibberish.
Changer & Cleaner Tools
Transform source audio tracks into target voices while preserving pitch and timing, or eliminate ambient sound artifacts using neural cleaning algorithms.