☆ 4.7 Article

Synthesizing Obama: Learning Lip Sync from Audio

ACM TRANSACTIONS ON GRAPHICS (2017)

Journal

ACM TRANSACTIONS ON GRAPHICS

Volume 36, Issue 4, Pages -

Publisher

ASSOC COMPUTING MACHINERY

DOI: 10.1145/3072959.3073640

Keywords

Audio; Face Synthesis; LSTM; RNN; Big Data; Videos; Audiovisual Speech; Uncanny Valley; Lip Sync

Category

Computer Science, Software Engineering

Funding

Samsung
Google
Intel
University of Washington Animation Research Labs

Abstract

Given audio of President Barack Obama, we synthesize a high quality video of him speaking with accurate lip sync, composited into a target video clip. Trained on many hours of his weekly address footage, a recurrent neural network learns the mapping from raw audio features to mouth shapes. Given the mouth shape at each time instant, we synthesize high quality mouth texture, and composite it with proper 3D pose matching to change what he appears to be saying in a target video to match the input audio track. Our approach produces photorealistic results.
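
As a rough illustration of the audio-to-mouth-shape step described in the abstract, the following is a minimal sketch of a single-layer LSTM forward pass in plain Python. It is not the authors' implementation: the class name TinyLSTM, the feature dimensions, and the random untrained weights are all assumptions chosen for illustration. The actual network architecture, audio features, and mouth-shape representation are those given in the paper itself.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def matvec(W, v):
    """Multiply a matrix (list of rows) by a vector (list of floats)."""
    return [sum(w * x for w, x in zip(row, v)) for row in W]


def rand_matrix(rows, cols, rng, scale=0.1):
    return [[rng.uniform(-scale, scale) for _ in range(cols)] for _ in range(rows)]


class TinyLSTM:
    """Hypothetical single-layer LSTM with a linear readout.

    Weights are random and untrained; this only shows the data flow from
    per-frame audio features to per-frame mouth-shape coefficients.
    """

    def __init__(self, n_in, n_hidden, n_out, seed=0):
        rng = random.Random(seed)
        self.n_hidden = n_hidden
        # One weight matrix per gate (input, forget, output, candidate),
        # each acting on the concatenation [x_t, h_{t-1}].
        self.W = {g: rand_matrix(n_hidden, n_in + n_hidden, rng) for g in "ifoc"}
        self.b = {g: [0.0] * n_hidden for g in "ifoc"}
        self.W_out = rand_matrix(n_out, n_hidden, rng)

    def forward(self, frames):
        """Map a sequence of audio feature vectors to mouth-shape vectors."""
        h = [0.0] * self.n_hidden  # hidden state
        c = [0.0] * self.n_hidden  # cell state
        outputs = []
        for x in frames:
            z = list(x) + h
            pre = {
                g: [a + b for a, b in zip(matvec(self.W[g], z), self.b[g])]
                for g in "ifoc"
            }
            i = [sigmoid(v) for v in pre["i"]]
            f = [sigmoid(v) for v in pre["f"]]
            o = [sigmoid(v) for v in pre["o"]]
            cand = [math.tanh(v) for v in pre["c"]]
            # Standard LSTM state update.
            c = [fv * cv + iv * gv for fv, cv, iv, gv in zip(f, c, i, cand)]
            h = [ov * math.tanh(cv) for ov, cv in zip(o, c)]
            outputs.append(matvec(self.W_out, h))
        return outputs


# Illustrative sizes only: 13 audio features per frame, 4 mouth-shape coefficients.
model = TinyLSTM(n_in=13, n_hidden=8, n_out=4, seed=0)
audio_frames = [[0.01 * (t + k) for k in range(13)] for t in range(5)]
mouth_shapes = model.forward(audio_frames)
```

Here forward returns one mouth-shape vector per input audio frame. In a real system the weights would be learned from paired audio and video, and the predicted mouth shapes would then drive the texture synthesis and compositing stages that the abstract mentions.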

Primary Rating

4.7

Not enough ratings
