Microsoft VASA tech can create realistic deepfakes using a single photo and one audio track

The Visual Affective Skills Animator, or VASA, is a machine-learning framework that analyzes a facial photo and then animates it to a voice, syncing the lips and mouth movements to the audio. It also simulates facial expressions, head movements, and even unseen body movements.Read Entire Article

Apr 20, 2024 - 14:30
 0  12
Microsoft VASA tech can create realistic deepfakes using a single photo and one audio track

The Visual Affective Skills Animator, or VASA, is a machine-learning framework that analyzes a facial photo and then animates it to a voice, syncing the lips and mouth movements to the audio. It also simulates facial expressions, head movements, and even unseen body movements.

Read Entire Article