Vec2Face: Scaling Face Dataset Generation with Loosely Constrained Vectors

University of Notre Dame The Australian National University Indiana University South Bend

Abstract

This paper studies how to synthesize face images of non-existent persons, to create a dataset that allows effective training of face recognition models. Two important goals are (1) the ability to generate a large number of distinct identities (inter-class separation) with (2) a wide variation in appearance of each identity (intra-class variation). However, existing works 1) are typically limited in how many well-separated identities can be generated and 2) either neglect or use a separate editing model for attribute augmentation.

We propose Vec2Face, a holistic model that uses only a sampled vector as input and can flexibly generate and control face images and their attributes. Composed of a feature masked autoencoder and a decoder, Vec2Face is supervised by face image reconstruction and can be conveniently used in inference. Using vectors with low similarity among themselves as inputs, Vec2Face generates well-separated identities. Randomly perturbing an input identity vector within a small range allows Vec2Face to generate faces of the same identity with robust variation in face attributes. It is also possible to generate images with designated attributes by adjusting vector values with a gradient descent method.

Vec2Face has efficiently synthesized as many as 200K identities with 10 million total images, whereas 60K is the largest number of identities created in the previous works. Face recognition models trained with the generated HSFace datasets, from 10k to 200k identities, achieve state-of-the-art accuracy, from 92\% to 93.22\%, on five real-world test sets. For the first time, our model created using a synthetic training set achieves higher accuracy than the model created using a same-scale training set of real face images (on the CALFW test set).

Quantitative Results

Comparisons of three approaches (Diffusion model^†, 3D rendering^☆, and GAN^♦) on five test sets.
Methods	Venue	# images	LFW	CFP-FP	CPLFW	AgeDB	CALFW	Avg.
IDiff-Face^†	ICCV23	0.5M	98.00	85.47	80.45	86.43	90.65	88.20
DCFace^†	CVPR23	0.5M	98.55	85.33	82.62	89.70	91.60	89.56
Arc2Face^†	-	0.5M	98.81	91.87	85.16	90.18	92.63	91.73

DigiFace^☆	WACV23	1M	95.40	87.40	78.87	76.97	78.62	83.45

SynFace^♦	ICCV21	0.5M	91.93	75.03	70.43	61.63	74.73	74.75
SFace^♦	IJCB22	0.6M	91.87	73.86	73.20	71.68	77.93	77.71
IDnet^♦	CVPR23	0.5M	92.58	75.40	74.25	63.88	79.90	79.13
ExFaceGAN^♦	IJCB23	0.5M	93.50	73.84	71.60	77.37	83.40	81.62
SFace2^♦	T-BIOM24	0.6M	95.60	77.11	74.60	77.37	83.40	81.62
Langevin-Disco^♦	-	0.6M	96.60	73.89	74.77	80.70	87.77	82.75
HSFace10K^♦	-	0.5M	98.87	88.97	85.47	93.12	93.57	92.00

CASIA-WebFace (Real)		0.49M	99.42	96.56	89.73	94.08	93.32	94.62

Dataset scaling effect on face recognition accuracy.
Datasets	# images	LFW	CFP-FP	CPLFW	AgeDB	CALFW	Avg.
HSFace10k	0.5M	98.87	88.97	85.47	93.12	93.57	92.00
HSFace20k	1M	98.87	89.87	86.13	93.85	93.65	92.47
HSFace100k	5M	99.25	90.36	86.75	94.38	94.12	92.97
HSFace200K	10M	99.23	90.81	87.30	94.22	94.52	93.22
HSFace300K	15M	99.30	91.54	87.70	94.45	94.58	93.52

CASIA-WebFace (Real)	0.49M	99.38	96.91	89.78	94.50	93.35	94.79
CASIA-WebFace + HSFace10K	0.99M	99.58	97.06	90.58	95.62	94.67	95.50

Additional intra-class separability (Hadrian and Eclipse) and similar-looking (SLLFW and DoppelVer) test
Datasets	Hadrian	Eclipse	SLLFW	DoppelVer
HSFace10k	69.47	64.55	92.87	86.91
HSFace20k	75.22	67.55	94.37	88.90
HSFace100k	80.00	70.35	95.58	90.38
HSFace200K	79.85	71.12	95.70	89.86
HSFace300K	81.55	71.35	95.95	90.49

CASIA-WebFace (Real)	77.82	71.12	96.95	95.11

Qualitative Results

Comparison between available synthetic datasets

SynFace

SFace

DigiFace

iDiff-Face

DCFace

HSFace10K

Feature Interpolation Results

@article{wu2024vec2face, title={Vec2Face: Scaling Face Dataset Generation with Loosely Constrained Vectors}, author={Wu, Haiyu and Singh, Jaskirat and Tian, Sicong and Zheng, Liang and Bowyer, Kevin W.}, journal={arXiv preprint arXiv:2409.02979}, year={2024} }

Vec2Face: Scaling Face Dataset Generation with Loosely Constrained Vectors

Vec2Face generates unlimited number of well-separated identities from feature space and can boost any face task up (e.g., face recognition).

Abstract

Method Overview

Quantitative Results

Qualitative Results

Comparison between available synthetic datasets

Feature Interpolation Results

BibTeX

Acknowledgements