4.8 Article

Joint Gaze Correction and Face Beautification for Conference Video Using Dual Sparsity Prior

Journal

IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS
Volume 66, Issue 12, Pages 9601-9611

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TIE.2018.2889616

Keywords

Face beautification; gaze correction; sparse representation; video conferencing.

Funding

  1. Major State Basic Research Development Program of China (973 Program) [2015CB351804]
  2. National Science Foundation of China [61672193, 61502122, 61620106005]
  3. Fundamental Research Funds for the Central Universities [HIT NSRIF 2015067]
  4. Tsinghua University Initiative Scientific Research Program

Ask authors/readers for more resources

Gaze mismatch is a common problem in video conferencing, where the viewpoint captured by a camera (usually located above or below a display monitor) is not aligned with the gaze direction of the human subject, who typically looks at his counterpart in the center of the screen. This means that the two parties cannot converse eye-to-eye, hampering the visual communication experience. A recent popular approach to the gaze mismatch problem is to synthesize a gaze-corrected face image as viewed from the screen center via depth-image-based rendering (DIBR), assuming texture and depth maps are both available at the camera-captured viewpoint. Due to self-occlusion, however, there will be missing pixels in the DIBR-synthesized view image that require satisfactory filling. In this paper, we propose to jointly solve the hole-filling problem and the face beautification problem (subtle modifications of facial components and contour to enhance attractiveness of the rendered face) using dual sparsity prior. Specifically, prior to the start of a video conference session, we first train two dictionaries separately offline using two large data sets: one with general face images, and the other with beautiful human faces, which means faces with high beauty scores. During the actual conference session, we solve the hole-filling and facial components beautification problems simultaneously by seeking two code vectors-one is sparse in the first dictionary and explains the available DIBR-synthesized pixels, the other is sparse in the second dictionary and matches well with the first vector in terms of feature space distance. This ensures an acceptable level of recognizability of the conference subject, while increasing proximity to beautiful facial features to improve attractiveness. Experimental results show naturally rendered human faces with noticeably improved attractiveness.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available