Das, 2007 - Google Patents
Audio visual person authentication by multiple nearest neighbor classifiersDas, 2007
- Document ID
- 11589024079034030780
- Author
- Das A
- Publication year
- Publication venue
- International Conference on Biometrics
External Links
Snippet
We propose a low-complexity audio-visual person authentication framework based on multiple features and multiple nearest-neighbor classifiers, which instead of a single template uses a set of codebooks or collection of templates. Several novel highly …
- 230000000007 visual effect 0 title description 8
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
- G06K9/00288—Classification, e.g. identification
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
- G06K9/00268—Feature extraction; Face representation
- G06K9/00281—Local features and components; Facial parts ; Occluding parts, e.g. glasses; Geometrical relationships
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6288—Fusion techniques, i.e. combining data from various sources, e.g. sensor fusion
- G06K9/6292—Fusion techniques, i.e. combining data from various sources, e.g. sensor fusion of classification results, e.g. of classification results related to same input data
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
- G06K9/00268—Feature extraction; Face representation
- G06K9/00275—Holistic features and representations, i.e. based on the facial image taken as a whole
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L2015/088—Word spotting
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
- G06K9/00228—Detection; Localisation; Normalisation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/68—Methods or arrangements for recognition using electronic means using sequential comparisons of the image signals with a plurality of references in which the sequence of the image signals or the references is relevant, e.g. addressable memory
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
- G10L15/265—Speech recognisers specially adapted for particular applications
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| KR20010039771A (en) | Methods and apparatus for audio-visual speaker recognition and utterance verification | |
| Marcel et al. | On the results of the first mobile biometry (MOBIO) face and speaker verification evaluation | |
| Soltane et al. | Face and speech based multi-modal biometric authentication | |
| Dalila et al. | Feature level fusion of face and voice biometrics systems using artificial neural network for personal recognition | |
| Brunet et al. | Speaker recognition for mobile user authentication: An android solution | |
| Chetty et al. | Liveness detection using cross-modal correlations in face-voice person authentication. | |
| Kartik et al. | Multimodal biometric person authentication system using speech and signature features | |
| Das | Audio visual person authentication by multiple nearest neighbor classifiers | |
| Ly-Van et al. | Signature with text-dependent and text-independent speech for robust identity verification | |
| Shen et al. | Secure mobile services by face and speech based personal authentication | |
| Cheng et al. | An efficient approach to multimodal person identity verification by fusing face and voice information | |
| Akrouf et al. | A multi-modal recognition system using face and speech | |
| Chetty | Biometric liveness detection based on cross modal fusion | |
| Kartik et al. | Noise robust multimodal biometric person authentication system using face, speech and signature features | |
| Motlicek et al. | Bi-modal authentication in mobile environments using session variability modelling | |
| Raghavendra et al. | Multimodal person verification system using face and speech | |
| Das et al. | Audio-visual person authentication with multiple Visualized-Speech Features and multiple face profiles | |
| Beritelli et al. | Performance Evaluation of Multimodal Biometric Systems based on Mathematical Models and Probabilistic Neural Networks. | |
| Chen et al. | Audio-visual information fusion for SVM-based biometric verification | |
| Nainan et al. | Performance evaluation of text independent automatic speaker recognition using VQ and GMM | |
| Marcel et al. | Bi-modal face and speech authentication: a biologin demonstration system | |
| Das et al. | Audio-visual biometric recognition by vector quantization | |
| Amrutha et al. | Multi-level Speaker Authentication: An Overview and Implementation | |
| Poulose Jacob et al. | A prototype for a multimodal biometric security system based on face and audio signatures | |
| Das et al. | Multi-feature audio-visual person recognition |