Eigen-PEP for Video Face Recognition

Li, Haoxiang; Hua, Gang; Shen, Xiaohui; Lin, Zhe; Brandt, Jonathan

doi:10.1007/978-3-319-16811-1_2

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 9005)

Included in the following conference series:

Asian Conference on Computer Vision

Abstract

To effectively solve the problem of large-scale video face recognition, we argue for a comprehensive, compact, and yet flexible representation of a face subject. It should comprehensively integrate the visual information from all relevant video frames of the subject in a compact form. It should also be flexible enough to be updated incrementally, incorporating new observations or retiring obsolete ones. In search of such a representation, we present Eigen-PEP, which builds on the recent success of the probabilistic elastic part (PEP) model. It first integrates the information from relevant video sources by part-based average pooling through the PEP model, which produces an intermediate high-dimensional, part-based, and pose-invariant representation. We then compress the intermediate representation through principal component analysis and keep only a small number of principal eigen-dimensions (as few as 100). We evaluate the Eigen-PEP representation for both video-based face verification and identification, on the YouTube Faces Dataset and a new Celebrity-1000 video face dataset, respectively. On YouTube Faces, we further improve the state-of-the-art recognition accuracy. On Celebrity-1000, we lead the competing baselines by a significant margin while offering a scalable solution that is linear with respect to the number of subjects.
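The pooling-then-PCA pipeline described in the abstract can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the authors' implementation: it assumes that per-frame, per-part descriptors have already been extracted (the PEP model's part selection is not reproduced here), it uses random data in place of real features, and the names `pool_parts`, `fit_pca`, `project`, and `RunningPool` are hypothetical.

```python
import numpy as np


def pool_parts(frame_descriptors):
    """Average each part's descriptor over all frames, then concatenate.

    frame_descriptors: array of shape (num_frames, num_parts, dim).
    Returns a vector of length num_parts * dim.
    """
    pooled = frame_descriptors.mean(axis=0)  # (num_parts, dim)
    return pooled.reshape(-1)


def fit_pca(X, num_components):
    """Fit PCA on the rows of X; return the mean and top principal directions."""
    mean = X.mean(axis=0)
    # Rows of vt are the principal directions of the centered data.
    _, _, vt = np.linalg.svd(X - mean, full_matrices=False)
    return mean, vt[:num_components]


def project(x, mean, components):
    """Project one vector (or a matrix of row vectors) onto the PCA subspace."""
    return (x - mean) @ components.T


class RunningPool:
    """Keeps a running sum so frames can be added or retired incrementally."""

    def __init__(self, num_parts, dim):
        self.total = np.zeros((num_parts, dim))
        self.count = 0

    def add(self, frame):
        self.total += frame
        self.count += 1

    def retire(self, frame):
        self.total -= frame
        self.count -= 1

    def vector(self):
        return (self.total / self.count).reshape(-1)


# Toy usage with random stand-in descriptors (assumed sizes, not from the paper):
# 20 subjects, 8 frames each, 5 parts, 16-dimensional part descriptors.
rng = np.random.default_rng(0)
videos = rng.normal(size=(20, 8, 5, 16))
pooled = np.stack([pool_parts(v) for v in videos])   # shape (20, 80)
mean, components = fit_pca(pooled, num_components=10)
codes = project(pooled, mean, components)            # shape (20, 10)
```

Because the pooled vector is a plain average, keeping a running sum and a frame count is enough to add or retire an observation without revisiting the other frames, which is one simple way to read the flexibility requirement stated above. The compressed code for a subject is then obtained by projecting the updated pooled vector with the already-fitted PCA.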

Notes

1. http://www.cs.colostate.edu/~vision/pasc/ijcb2014/
2. http://www.lv-nus.org/facedb/
3. We thank the authors for sharing their results.

Acknowledgement

Research reported in this publication was partly supported by the National Institute of Nursing Research of the National Institutes of Health under Award Number R01NR015371. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health. This work is also partly supported by US National Science Foundation Grant IIS 1350763, China National Natural Science Foundation Grant 61228303, GH's start-up funds from Stevens Institute of Technology, a Google Research Faculty Award, a gift grant from Microsoft Research, and a gift grant from NEC Labs America.

Author information

Authors and Affiliations

Stevens Institute of Technology, Hoboken, USA
Haoxiang Li & Gang Hua
Adobe Systems Inc., San Jose, USA
Xiaohui Shen, Zhe Lin & Jonathan Brandt

Corresponding author

Correspondence to Haoxiang Li.

Editor information

Editors and Affiliations

Technische Universität München, Garching, Bayern, Germany
Daniel Cremers
University of Adelaide, Adelaide, South Australia, Australia
Ian Reid
Keio University, Yokohama, Kanagawa, Japan
Hideo Saito
University of California at Merced, Merced, California, USA
Ming-Hsuan Yang

About this paper

Cite this paper

Li, H., Hua, G., Shen, X., Lin, Z., Brandt, J. (2015). Eigen-PEP for Video Face Recognition. In: Cremers, D., Reid, I., Saito, H., Yang, MH. (eds) Computer Vision -- ACCV 2014. ACCV 2014. Lecture Notes in Computer Science, vol 9005. Springer, Cham. https://doi.org/10.1007/978-3-319-16811-1_2

DOI: https://doi.org/10.1007/978-3-319-16811-1_2
Published: 16 April 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-16810-4
Online ISBN: 978-3-319-16811-1
eBook Packages: Computer Science, Computer Science (R0)
