Voice cloning

Voice cloning recreates the voices of the original speakers, so the dub stays closer to the identity of the source video.

How it works in VoiceCheap

When voice cloning is enabled, VoiceCheap isolates the speaker voice first, then generates translated speech that tries to preserve the character of the original performance. The quality depends heavily on:

source voice cleanliness
the selected voice isolation method
whether the source contains noise, music, or overlapping voices

Main controls

Stability

Stability controls how steady the generated voice stays from one generation to the next.

lower values allow a wider emotional range
very low values can sound rushed or inconsistent
very high values can sound flat or monotone

Similarity

Similarity controls how closely the generated voice follows the original voice print.

higher values keep the output closer to the original speaker
if the source audio is noisy, very high similarity can also pull in unwanted artifacts

Speaker boost

Speaker boost pushes the model a bit harder toward the original speaker identity.

it is usually subtle
it increases compute and latency
it can be useful when you want the cloned result to stay closer to the source
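The trade-offs between the three controls can be summarized in a small sketch. This is illustrative only and is not a VoiceCheap API: the `CloneSettings` class, the `review_settings` function, the 0.0 to 1.0 scale, the default values, and the warning thresholds are all assumptions made for the example.

```python
from dataclasses import dataclass


@dataclass
class CloneSettings:
    """Hypothetical container for the three controls (not a VoiceCheap API)."""
    stability: float = 0.5       # assumed 0.0-1.0 scale
    similarity: float = 0.75     # assumed 0.0-1.0 scale
    speaker_boost: bool = False


def review_settings(settings: CloneSettings, noisy_source: bool = False) -> list[str]:
    """Return warnings that mirror the trade-offs described above."""
    for name in ("stability", "similarity"):
        value = getattr(settings, name)
        if not 0.0 <= value <= 1.0:
            raise ValueError(f"{name} must be between 0.0 and 1.0, got {value}")

    warnings = []
    # Thresholds below are illustrative guesses, not documented limits.
    if settings.stability < 0.2:
        warnings.append("very low stability: may sound rushed or inconsistent")
    if settings.stability > 0.9:
        warnings.append("very high stability: may sound flat or monotone")
    if noisy_source and settings.similarity > 0.9:
        warnings.append("very high similarity on noisy audio: may pull in artifacts")
    if settings.speaker_boost:
        warnings.append("speaker boost: expect more compute and latency")
    return warnings


if __name__ == "__main__":
    risky = CloneSettings(stability=0.1, similarity=0.95, speaker_boost=True)
    for message in review_settings(risky, noisy_source=True):
        print(message)
```

Mid-range values produce no warnings in this sketch, which matches the general advice to start from moderate settings and adjust after listening to a preview.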

Preview and regenerate

VoiceCheap lets you preview voice examples inside the customization flow. This is the best way to validate cloning quality before you commit to a full translated run.

A good workflow:

choose Studio or Realistic
preview the speaker samples
adjust stability, similarity, or boost
regenerate previews if needed
launch the translated version once the preview sounds right

When to avoid cloning

A Voice Library voice or a custom voice can be a better choice when:

the original audio is noisy
multiple speakers overlap often
the source has very little clean speech
you want one consistent narrator voice across many projects
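The checklist above amounts to a simple rule: if any one condition holds, cloning is probably not the best choice. A minimal sketch of that rule follows; the function name, parameter names, and returned labels are hypothetical and do not correspond to any VoiceCheap setting.

```python
def suggest_voice_source(
    noisy_audio: bool = False,
    frequent_overlap: bool = False,
    little_clean_speech: bool = False,
    want_shared_narrator: bool = False,
) -> str:
    """Suggest a voice source based on the checklist above (illustrative only)."""
    # Any single condition is enough to prefer a non-cloned voice.
    if noisy_audio or frequent_overlap or little_clean_speech or want_shared_narrator:
        return "voice library or custom voice"
    return "voice cloning"


if __name__ == "__main__":
    print(suggest_voice_source())                  # clean single-speaker source
    print(suggest_voice_source(noisy_audio=True))  # noisy source
```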

Known accent limitations (currently being improved)

For some languages, cloned output can be less stable in accent quality. This is more common for:

Vietnamese
Thai
Tagalog (Filipino)
some lower-resource or rarer language pairs

In these cases, pronunciation and accent may not sound fully native yet. If accent quality is critical, prefer a voice from the Voice Library for now while we continue improving this behavior.

Another known behavior: if you clone an English speaker, the translated output can still carry English-accent traits. For example, French output can sometimes sound closer to a Canadian-style accent.

To reduce this risk, it is usually better to choose a Voice Library voice in the target language. We are actively working on this.
