TY - GEN
T1 - Ensemble methods for robust 3D face recognition using commodity depth sensors
AU - Schimbinschi, F.
AU - Schomaker, L.
AU - Wiering, M.
PY - 2016
Y1 - 2016
N2 - In this paper we introduce a new dataset and pose invariant sampling method and describe the ensemble methods used for recognizing faces in 3D scenes, captured using commodity depth sensors. We use the 3D SIFT key point detector to take advantage of the similarities between faces, which leads to a set of points of interest based on the curvature of the face. For all key points, features are extracted using a 3D feature descriptor. Then, a variable-sized amount of features are generated per each 3D face image. The first ensemble method we constructed uses a K-nearest neighbors classifier to classify each key point-sampled feature vector as belonging to one of the subjects recorded in our dataset. All votes over all key points are combined. In the second ensemble technique, the key points are clustered with K-means, using the feature vectors and approximated sampling positions relative to the face. This leads to a set of experts that specialize for a specific region. Then a K-nearest neighbors classifier is trained on the examples falling in each expert's specialized region. Finally, for a new 3D face image, votes from all experts are combined in a sum ensemble technique to categorize the 3D face. We also introduce 6 new "real world" datasets with different variances: 3 types of 3D rotations, distance to sensor, expressions, and an all-in-one dataset. The results show very high cross validation accuracies for the same type of variance. In addition, 36 variance specific pair-Tests in which the system is trained on one dataset and tested on a completely different dataset also show encouraging results.
AB - In this paper we introduce a new dataset and pose invariant sampling method and describe the ensemble methods used for recognizing faces in 3D scenes, captured using commodity depth sensors. We use the 3D SIFT key point detector to take advantage of the similarities between faces, which leads to a set of points of interest based on the curvature of the face. For all key points, features are extracted using a 3D feature descriptor. Then, a variable-sized amount of features are generated per each 3D face image. The first ensemble method we constructed uses a K-nearest neighbors classifier to classify each key point-sampled feature vector as belonging to one of the subjects recorded in our dataset. All votes over all key points are combined. In the second ensemble technique, the key points are clustered with K-means, using the feature vectors and approximated sampling positions relative to the face. This leads to a set of experts that specialize for a specific region. Then a K-nearest neighbors classifier is trained on the examples falling in each expert's specialized region. Finally, for a new 3D face image, votes from all experts are combined in a sum ensemble technique to categorize the 3D face. We also introduce 6 new "real world" datasets with different variances: 3 types of 3D rotations, distance to sensor, expressions, and an all-in-one dataset. The results show very high cross validation accuracies for the same type of variance. In addition, 36 variance specific pair-Tests in which the system is trained on one dataset and tested on a completely different dataset also show encouraging results.
KW - face recognition
KW - machine learning
U2 - 10.1109/SSCI.2015.36
DO - 10.1109/SSCI.2015.36
M3 - Conference contribution
SN - 978-1-4799-7560-0
T3 - Proceedings - 2015 IEEE Symposium Series on Computational Intelligence, SSCI 2015
BT - IEEE Symposium Series on Computational Intelligence
PB - IEEE (The Institute of Electrical and Electronics Engineers)
T2 - IEEE Symposium on Computational Intelligence in Biometrics and Identity
Y2 - 7 December 2015 through 10 December 2015
ER -