Large-scale Multi-modal Person Identification in Real Unconstrained Environments

doi:10.48550/arXiv.1912.12134

Large-scale Multi-modal Person Identification in Real Unconstrained Environments

Person identification (P-ID) under real unconstrained noisy environments is a huge challenge. In multiple-feature learning with Deep Convolutional Neural Networks (DCNNs) or Machine Learning method for large-scale person identification in the wild, the key is to design an appropriate strategy for decision layer fusion or feature layer fusion which can enhance discriminative power. It is necessary to extract different types of valid features and establish a reasonable framework to fuse different types of information. In traditional methods, different persons are identified based on single modal features to identify, such as face feature, audio feature, and head feature. These traditional methods cannot realize a highly accurate level of person identification in real unconstrained environments. The study aims to propose a fusion module to fuse multi-modal features for person identification in real unconstrained environments.

Publication:

arXiv e-prints

Pub Date:

December 2019

DOI:

10.48550/arXiv.1912.12134

arXiv:

arXiv:1912.12134

Bibcode:

2019arXiv191212134Y

Keywords:

Computer Science - Computer Vision and Pattern Recognition;
Electrical Engineering and Systems Science - Signal Processing

E-Print:

6 pages, IEEE International Conference on Robotics and Biomimetics 2019

NASA/ADS

Large-scale Multi-modal Person Identification in Real Unconstrained Environments

Abstract