Presentation
Model See Model Do: Speech-Driven Facial Animation with Style Control
SessionAvatars
DescriptionModelSeeModelDo presents a speech-driven 3D facial animation method using a latent diffusion model conditioned on a reference clip to capture nuanced performance styles. A novel "style basis" mechanism extracts key poses to guide generation, achieving expressive, temporally coherent animations with accurate lip-sync and strong stylistic fidelity across diverse speech inputs.

Event Type
Technical Paper
TimeWednesday, 13 August 202510:45am - 10:55am PDT
LocationWest Building, Rooms 220-222
ACM Digital Library
Journal Papers' PDFs
Conference Papers' PDFs
Conference Papers' PDFs
Session Time & Location
Sunday, 10 August 20256:00pm - 8:45pm PDTWest Building, Ballroom AB
Wednesday, 13 August 202510:45am - 12:15pm PDTWest Building, Rooms 220-222
Livestreamed
Recorded
