4.8 Article

Joint Gaze Correction and Face Beautification for Conference Video Using Dual Sparsity Prior

期刊

IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS
卷 66, 期 12, 页码 9601-9611

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TIE.2018.2889616

关键词

Face beautification; gaze correction; sparse representation; video conferencing.

资金

  1. Major State Basic Research Development Program of China (973 Program) [2015CB351804]
  2. National Science Foundation of China [61672193, 61502122, 61620106005]
  3. Fundamental Research Funds for the Central Universities [HIT NSRIF 2015067]
  4. Tsinghua University Initiative Scientific Research Program

向作者/读者索取更多资源

Gaze mismatch is a common problem in video conferencing, where the viewpoint captured by a camera (usually located above or below a display monitor) is not aligned with the gaze direction of the human subject, who typically looks at his counterpart in the center of the screen. This means that the two parties cannot converse eye-to-eye, hampering the visual communication experience. A recent popular approach to the gaze mismatch problem is to synthesize a gaze-corrected face image as viewed from the screen center via depth-image-based rendering (DIBR), assuming texture and depth maps are both available at the camera-captured viewpoint. Due to self-occlusion, however, there will be missing pixels in the DIBR-synthesized view image that require satisfactory filling. In this paper, we propose to jointly solve the hole-filling problem and the face beautification problem (subtle modifications of facial components and contour to enhance attractiveness of the rendered face) using dual sparsity prior. Specifically, prior to the start of a video conference session, we first train two dictionaries separately offline using two large data sets: one with general face images, and the other with beautiful human faces, which means faces with high beauty scores. During the actual conference session, we solve the hole-filling and facial components beautification problems simultaneously by seeking two code vectors-one is sparse in the first dictionary and explains the available DIBR-synthesized pixels, the other is sparse in the second dictionary and matches well with the first vector in terms of feature space distance. This ensures an acceptable level of recognizability of the conference subject, while increasing proximity to beautiful facial features to improve attractiveness. Experimental results show naturally rendered human faces with noticeably improved attractiveness.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据