4.6 Article

Cascade Regression-Based Face Frontalization for Dynamic Facial Expression Analysis

Journal

COGNITIVE COMPUTATION
Volume 14, Issue 5, Pages 1571-1584

Publisher

SPRINGER
DOI: 10.1007/s12559-021-09843-8

Keywords

Face frontalization; Facial expression recognition; Cascade regression; Facial analysis; Person-independent

Funding

  1. Engineering and Physical Sciences Research Council [EP/N025849/1]
  2. EPSRC [EP/N025849/1] Funding Source: UKRI

Ask authors/readers for more resources

Facial expression recognition has made significant progress, but recognizing expressions with occlusions and large head-poses remains challenging. This paper proposes a cascade regression-based face frontalization method, which improves the performance of static and dynamic facial expression recognition by predicting frontal shape and reconstructing facial texture.
Facial expression recognition has seen rapid development in recent years due to its wide range of applications such as human-computer interaction, health care, and social robots. Although significant progress has been made in this field, it is still challenging to recognize facial expressions with occlusions and large head-poses. To address these issues, this paper presents a cascade regression-based face frontalization (CRFF) method, which aims to immediately reconstruct a clean, frontal and expression-aware face given an in-the-wild facial image. In the first stage, a frontal facial shape is predicted by developing a cascade regression model to learn the pairwise spatial relation between non-frontal face-shape and its frontal counterpart. Unlike most existing shape prediction methods that used single-step regression, the cascade model is a multi-step regressor that gradually aligns non-frontal shape to its frontal view. We employ several different regressors and make a ensemble decision to boost prediction performance. For facial texture reconstruction, active appearance model instantiation is employed to warp the input face to the predicted frontal shape and generate a clean face. To remove occlusions, we train this generative model on manually selected clean-face sets, which ensures generating a clean face as output regardless of whether the input face involves occlusions or not. Unlike the existing face reconstruction methods that are computational expensive, the proposed method works in real time, so it is suitable for dynamic analysis of facial expression. The experimental validation shows that the ensembling cascade model has improved frontal shape prediction accuracy for an average of 5% and the proposed method has achieved superior performance on both static and dynamic recognition of facial expressions over the state-of-the-art approaches. The experimental results demonstrate that the proposed method has achieved expression-preserving frontalization, de-occlusion and has improved performance of facial expression recognition.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available