Das, 2007 - Google Patents

Audio visual person authentication by multiple nearest neighbor classifiers

Das, 2007

Document ID: 11589024079034030780
Author: Das A
Publication year: 2007
Publication venue: International Conference on Biometrics

External Links

Cited by

Snippet

We propose a low-complexity audio-visual person authentication framework based on multiple features and multiple nearest-neighbor classifiers, which instead of a single template uses a set of codebooks or collection of templates. Several novel highly …

Continue reading at link.springer.com (other versions)

230000000007 visual effect 0 title description 8

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
- G06K9/00288—Classification, e.g. identification
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
- G06K9/00268—Feature extraction; Face representation
- G06K9/00281—Local features and components; Facial parts ; Occluding parts, e.g. glasses; Geometrical relationships
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6288—Fusion techniques, i.e. combining data from various sources, e.g. sensor fusion
- G06K9/6292—Fusion techniques, i.e. combining data from various sources, e.g. sensor fusion of classification results, e.g. of classification results related to same input data
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
- G06K9/00268—Feature extraction; Face representation
- G06K9/00275—Holistic features and representations, i.e. based on the facial image taken as a whole
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L2015/088—Word spotting
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
- G06K9/00228—Detection; Localisation; Normalisation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/68—Methods or arrangements for recognition using electronic means using sequential comparisons of the image signals with a plurality of references in which the sequence of the image signals or the references is relevant, e.g. addressable memory
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
- G10L15/265—Speech recognisers specially adapted for particular applications

Similar Documents

Publication	Publication Date	Title
KR20010039771A (en)	2001-05-15	Methods and apparatus for audio-visual speaker recognition and utterance verification
Marcel et al.	2010	On the results of the first mobile biometry (MOBIO) face and speaker verification evaluation
Soltane et al.	2010	Face and speech based multi-modal biometric authentication
Dalila et al.	2020	Feature level fusion of face and voice biometrics systems using artificial neural network for personal recognition
Brunet et al.	2013	Speaker recognition for mobile user authentication: An android solution
Chetty et al.	2005	Liveness detection using cross-modal correlations in face-voice person authentication.
Kartik et al.	2008	Multimodal biometric person authentication system using speech and signature features
Das	2007	Audio visual person authentication by multiple nearest neighbor classifiers
Ly-Van et al.	2003	Signature with text-dependent and text-independent speech for robust identity verification
Shen et al.	2010	Secure mobile services by face and speech based personal authentication
Cheng et al.	2005	An efficient approach to multimodal person identity verification by fusing face and voice information
Akrouf et al.	2011	A multi-modal recognition system using face and speech
Chetty	2009	Biometric liveness detection based on cross modal fusion
Kartik et al.	2008	Noise robust multimodal biometric person authentication system using face, speech and signature features
Motlicek et al.	2012	Bi-modal authentication in mobile environments using session variability modelling
Raghavendra et al.	2010	Multimodal person verification system using face and speech
Das et al.	2008	Audio-visual person authentication with multiple Visualized-Speech Features and multiple face profiles
Beritelli et al.	2015	Performance Evaluation of Multimodal Biometric Systems based on Mathematical Models and Probabilistic Neural Networks.
Chen et al.	2005	Audio-visual information fusion for SVM-based biometric verification
Nainan et al.	2016	Performance evaluation of text independent automatic speaker recognition using VQ and GMM
Marcel et al.	2006	Bi-modal face and speech authentication: a biologin demonstration system
Das et al.	2006	Audio-visual biometric recognition by vector quantization
Amrutha et al.	2020	Multi-level Speaker Authentication: An Overview and Implementation
Poulose Jacob et al.	2011	A prototype for a multimodal biometric security system based on face and audio signatures
Das et al.	2008	Multi-feature audio-visual person recognition