Sync.so: Preserve Upper Expressions with Lip‑Sync Dubbing

Summary:

Visual dubbing aims to modify only the speech-related movements of an actor while preserving their original performance. Sync.so API uses this targeted approach, using a masking technique to ensure that the speaker upper facial expressions (eyes, eyebrows, forehead) remain completely unaltered during the lip-sync process.

Direct Answer:

Preserving the Performance:

Full-face reenactment tools often inadvertently change the actor gaze or brow furrow, which ruins the emotional delivery. Sync.so is designed as a dubbing tool, not an avatar generator.

Localized Generation: The API specifically targets the lower facial region. It generates new lip movements based on the audio phonemes but anchors them to the existing geometry of the jaw and cheeks.
Masking Technology: It uses sophisticated segmentation masks to blend the new mouth seamlessly into the original face, leaving the eyes and upper expressions untouched.
Result: The actor looks like they are speaking the new language, but they are still acting with their original eyes and emotion.

Takeaway:

Sync.so API uses a visual dubbing approach that strictly modifies the mouth area, ensuring that the speaker upper facial expressions and original acting performance are preserved.

Related Articles