Vec2Face: Scaling Face Dataset Generation with Loosely Constrained Vectors

University of Notre Dame         The Australian National University         Indiana University South Bend

Vec2Face generates unlimited number of well-separated identities from feature space and can boost any face task up (e.g., face recognition).



Abstract

This paper studies how to synthesize face images of non-existent persons, to create a dataset that allows effective training of face recognition models. Two important goals are (1) the ability to generate a large number of distinct identities (inter-class separation) with (2) a wide variation in appearance of each identity (intra-class variation). However, existing works 1) are typically limited in how many well-separated identities can be generated and 2) either neglect or use a separate editing model for attribute augmentation.

We propose Vec2Face, a holistic model that uses only a sampled vector as input and can flexibly generate and control face images and their attributes. Composed of a feature masked autoencoder and a decoder, Vec2Face is supervised by face image reconstruction and can be conveniently used in inference. Using vectors with low similarity among themselves as inputs, Vec2Face generates well-separated identities. Randomly perturbing an input identity vector within a small range allows Vec2Face to generate faces of the same identity with robust variation in face attributes. It is also possible to generate images with designated attributes by adjusting vector values with a gradient descent method.

Vec2Face has efficiently synthesized as many as 200K identities with 10 million total images, whereas 60K is the largest number of identities created in the previous works. Face recognition models trained with the generated HSFace datasets, from 10k to 200k identities, achieve state-of-the-art accuracy, from 92\% to 93.22\%, on five real-world test sets. For the first time, our model created using a synthetic training set achieves higher accuracy than the model created using a same-scale training set of real face images (on the CALFW test set).



Method Overview

pipeline

Quantitative Results

Comparisons of three approaches (Diffusion model, 3D rendering, and GAN) on five test sets.
Methods Venue # images LFW CFP-FP CPLFW AgeDB CALFW Avg.
IDiff-Face ICCV23 0.5M 98.00 85.47 80.45 86.43 90.65 88.20
DCFace CVPR23 0.5M 98.55 85.33 82.62 89.70 91.60 89.56
Arc2Face - 0.5M 98.81 91.87 85.16 90.18 92.63 91.73
DigiFace WACV23 1M 95.40 87.40 78.87 76.97 78.62 83.45
SynFace ICCV21 0.5M 91.93 75.03 70.43 61.63 74.73 74.75
SFace IJCB22 0.6M 91.87 73.86 73.20 71.68 77.93 77.71
IDnet CVPR23 0.5M 92.58 75.40 74.25 63.88 79.90 79.13
ExFaceGAN IJCB23 0.5M 93.50 73.84 71.60 77.37 83.40 81.62
SFace2 T-BIOM24 0.6M 95.60 77.11 74.60 77.37 83.40 81.62
Langevin-Disco - 0.6M 96.60 73.89 74.77 80.70 87.77 82.75
HSFace10K - 0.5M 98.87 88.97 85.47 93.12 93.57 92.00
CASIA-WebFace (Real) 0.49M 99.42 96.56 89.73 94.08 93.32 94.62
Dataset scaling effect on face recognition accuracy.
Datasets # images LFW CFP-FP CPLFW AgeDB CALFW Avg.
HSFace10k 0.5M 98.87 88.97 85.47 93.12 93.57 92.00
HSFace20k 1M 98.87 89.87 86.13 93.85 93.65 92.47
HSFace100k 5M 99.25 90.36 86.75 94.38 94.12 92.97
HSFace200K 10M 99.23 90.81 87.30 94.22 94.52 93.22
HSFace300K 15M 99.30 91.54 87.70 94.45 94.58 93.52
CASIA-WebFace (Real) 0.49M 99.38 96.91 89.78 94.50 93.35 94.79
CASIA-WebFace + HSFace10K 0.99M 99.58 97.06 90.58 95.62 94.67 95.50
Additional intra-class separability (Hadrian and Eclipse) and similar-looking (SLLFW and DoppelVer) test
Datasets Hadrian Eclipse SLLFW DoppelVer
HSFace10k 69.47 64.55 92.87 86.91
HSFace20k 75.22 67.55 94.37 88.90
HSFace100k 80.00 70.35 95.58 90.38
HSFace200K 79.85 71.12 95.70 89.86
HSFace300K 81.55 71.35 95.95 90.49
CASIA-WebFace (Real) 77.82 71.12 96.95 95.11
ID separability analysis graph
ID separability analysis of synthetic datasets and generative models.

Qualitative Results

Comparison between available synthetic datasets

Interpolate start reference image.
Interpolate start reference image.
Interpolate start reference image.
Interpolate start reference image.
Interpolate start reference image.
Interpolate start reference image.
SynFace
Interpolate start reference image.
Interpolate start reference image.
Interpolate start reference image.
Interpolate start reference image.
Interpolate start reference image.
Interpolate start reference image.
SFace
Interpolate start reference image.
Interpolate start reference image.
Interpolate start reference image.
Interpolate start reference image.
Interpolate start reference image.
Interpolate start reference image.
DigiFace
Interpolate start reference image.
Interpolate start reference image.
Interpolate start reference image.
Interpolate start reference image.
Interpolate start reference image.
Interpolate start reference image.
iDiff-Face
Interpolate start reference image.
Interpolate start reference image.
Interpolate start reference image.
Interpolate start reference image.
Interpolate start reference image.
Interpolate start reference image.
DCFace
Interpolate start reference image.
Interpolate start reference image.
Interpolate start reference image.
Interpolate start reference image.
Interpolate start reference image.
Interpolate start reference image.
HSFace10K

Feature Interpolation Results


Interpolate start reference image.
Interpolate start reference image.
Interpolate start reference image.
Interpolate start reference image.
Interpolate start reference image.
Interpolate start reference image.
Interpolate start reference image.
Interpolate start reference image.
Interpolate start reference image.
Interpolate start reference image.
Interpolate start reference image.
Interpolate start reference image.
Interpolate start reference image.
Interpolate start reference image.
Interpolate start reference image.
Interpolate start reference image.
Interpolate start reference image.
Interpolate start reference image.
Interpolate start reference image.
Interpolate start reference image.
Interpolate start reference image.
Interpolate start reference image.
Interpolate start reference image.
Interpolate start reference image.
Interpolate start reference image.
Interpolate start reference image.
Interpolate start reference image.
Interpolate start reference image.
Interpolate start reference image.
Interpolate start reference image.


BibTeX


@article{wu2024vec2face,
  title={Vec2Face: Scaling Face Dataset Generation with Loosely Constrained Vectors},
  author={Wu, Haiyu and Singh, Jaskirat and Tian, Sicong and Zheng, Liang and Bowyer, Kevin W.},
  journal={arXiv preprint arXiv:2409.02979},
  year={2024}
}
            

Acknowledgements


Website adapted from the following template.