Gender recognition from facial images: two or three dimensions?

Wenhao Zhang; Melvyn L. Smith; Lyndon N. Smith; Abdul Farooq

doi:10.1364/JOSAA.33.000333

Journal of the Optical Society of America A
Vol. 33,
Issue 3,
pp. 333-344
(2016)
•https://doi.org/10.1364/JOSAA.33.000333

Gender recognition from facial images: two or three dimensions?

Wenhao Zhang, Melvyn L. Smith, Lyndon N. Smith, and Abdul Farooq

Not Accessible

Your library or personal account may give you access

Get PDF
Email
Share
Get Citation
Copy Citation Text
Wenhao Zhang, Melvyn L. Smith, Lyndon N. Smith, and Abdul Farooq, "Gender recognition from facial images: two or three dimensions?," J. Opt. Soc. Am. A 33, 333-344 (2016)

Export Citation
- BibTex
- Endnote (RIS)
- HTML
- Plain Text
Citation alert
Save article

Abstract

This paper seeks to compare encoded features from both two-dimensional (2D) and three-dimensional (3D) face images in order to achieve automatic gender recognition with high accuracy and robustness. The Fisher vector encoding method is employed to produce 2D, 3D, and fused features with escalated discriminative power. For 3D face analysis, a two-source photometric stereo (PS) method is introduced that enables 3D surface reconstructions with accurate details as well as desirable efficiency. Moreover, a $2 D + 3 D$ imaging device, taking the two-source PS method as its core, has been developed that can simultaneously gather color images for 2D evaluations and PS images for 3D analysis. This system inherits the superior reconstruction accuracy from the standard (three or more light) PS method but simplifies the reconstruction algorithm as well as the hardware design by only requiring two light sources. It also offers great potential for facilitating human computer interaction by being accurate, cheap, efficient, and nonintrusive. Ten types of low-level 2D and 3D features have been experimented with and encoded for Fisher vector gender recognition. Evaluations of the Fisher vector encoding method have been performed on the FERET database, Color FERET database, LFW database, and FRGCv2 database, yielding 97.7%, 98.0%, 92.5%, and 96.7% accuracy, respectively. In addition, the comparison of 2D and 3D features has been drawn from a self-collected dataset, which is constructed with the aid of the $2 D + 3 D$ imaging device in a series of data capture experiments. With a variety of experiments and evaluations, it can be proved that the Fisher vector encoding method outperforms most state-of-the-art gender recognition methods. It has also been observed that 3D features reconstructed by the two-source PS method are able to further boost the Fisher vector gender recognition performance, i.e., up to a 6% increase on the self-collected database.

Full Article | PDF Article

More Like This

Eye center localization and gaze gesture recognition for human–computer interaction

Wenhao Zhang, Melvyn L. Smith, Lyndon N. Smith, and Abdul Farooq
J. Opt. Soc. Am. A 33(3) 314-325 (2016)

Discriminant analysis for recognition of human face images

Kamran Etemad and Rama Chellappa
J. Opt. Soc. Am. A 14(8) 1724-1733 (1997)

Illumination invariant recognition and 3D reconstruction of faces using desktop optics

Ajmal Mian
Opt. Express 19(8) 7491-7506 (2011)

Previous Article Next Article

Cited By

You do not have subscription access to this journal. Cited by links are available to subscribers only. You may subscribe either as an Optica member, or as an authorized user of your institution.

Contact your librarian or system administrator
or
Login to access Optica Member Subscription

Figures (11)

You do not have subscription access to this journal. Figure files are available to subscribers only. You may subscribe either as an Optica member, or as an authorized user of your institution.

Contact your librarian or system administrator
or
Login to access Optica Member Subscription

Tables (1)

You do not have subscription access to this journal. Article tables are available to subscribers only. You may subscribe either as an Optica member, or as an authorized user of your institution.

Contact your librarian or system administrator
or
Login to access Optica Member Subscription

Equations (22)

You do not have subscription access to this journal. Equations are available to subscribers only. You may subscribe either as an Optica member, or as an authorized user of your institution.

Contact your librarian or system administrator
or
Login to access Optica Member Subscription

Method	Description	Database	Validation Method	Accuracy	Limitation
The proposed method	FV encoding	FERET fa 1762	5-CV	97.7%	Slow at training stage
		FERET fa 1762	50%/50%	96.9%
		FERET fa 1762	50%/50%*	97.9%
		FERET fa+fb 900 (u)	5-CV	96.1%
		FERET fa+fb 2400	50%/50%	98.3%
		FERET fa+fb 2400	50%/50%*	99.5%
		Color FERET 700	2-CV*	98.0%
		LFW all (u)	5-CV	92.5%
		FRGCv2 depthall 466 subjects (u)	5-CV	96.7%
[15]	Classifier fusion	FERET fa+fb 900 (u)	5-CV	92.9%	6 classifiers needed
[17]	RBF-SVM	FERET thumbnail 1855	5-CV*	96.6%	*
[18]	Refined LBP	LFW 7443 selected	5-CV	94.8%	Manual data selection
[19]	LBP, wavelet transform	FERET fa+fb 2400	50%/50%*	99.3%	Manual data selection
[20]	Facial strips	FERET fa 1763	Not specified	98.8%	Slow and need alignment
[22]	CNN	FERET fa 1762	5-CV*	97.2%	*
[23]	CNN	FERET fa 1762	5-CV*	96.4%	*
[11]	Geometric facial features	Indian face	Trained with 40 subjects	95.6%	Database of small size
[26]	2DPCA	Color FERET 700	2-CV*	98.4%	*
[26]	Gabor space	LFW all	2-CV*	89.1%
[27]	2DPCA	Color FERET	2-CV*	98.2%	*
[27]	Gabor space	LFW all	2-CV*	88.3%
[28]	LBP, shape index	FRGCv2 depth all 466 subjects	5-CV*	93.7%	*
[29]	Random Forest votes	FRGCv2 depth all 466 subjects	Leave-one-out CV	97.2%	/

Abstract

Cited By

Figures (11)

Tables (1)

Equations (22)

Journal of the Optical Society of America A