Tavanaei et al., 2017 - Google Patents
A spiking network that learns to extract spike signatures from speech signalsTavanaei et al., 2017
View PDF- Document ID
- 14789101822045010460
- Author
- Tavanaei A
- Maida A
- Publication year
- Publication venue
- Neurocomputing
External Links
Snippet
Spiking neural networks (SNNs) with adaptive synapses reflect core properties of biological neural networks. Speech recognition, as an application involving audio coding and dynamic learning, provides a good test problem to study SNN functionality. We present a simple …
- 230000001537 neural 0 abstract description 41
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
- G06K9/6232—Extracting features by transforming the feature space, e.g. multidimensional scaling; Mappings, e.g. subspace methods
- G06K9/6247—Extracting features by transforming the feature space, e.g. multidimensional scaling; Mappings, e.g. subspace methods based on an approximation criterion, e.g. principal component analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/06—Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/08—Learning methods
- G06N3/082—Learning methods modifying the architecture, e.g. adding or deleting nodes or connections, pruning
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/04—Architectures, e.g. interconnection topology
- G06N3/049—Temporal neural nets, e.g. delay elements, oscillating neurons, pulsed inputs
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6267—Classification techniques
- G06K9/6268—Classification techniques relating to the classification paradigm, e.g. parametric or non-parametric approaches
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/26—Recognition of special voice characteristics, e.g. for use in lie detectors; Recognition of animal voices
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/04—Training, enrolment or model building
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Tavanaei et al. | A spiking network that learns to extract spike signatures from speech signals | |
| Wu et al. | A spiking neural network framework for robust sound classification | |
| Wysoski et al. | Evolving spiking neural networks for audiovisual information processing | |
| Zhang et al. | A digital liquid state machine with biologically inspired learning and its application to speech recognition | |
| Wu et al. | A biologically plausible speech recognition framework based on spiking neural networks | |
| Verma et al. | Modified convolutional neural network architecture analysis for facial emotion recognition | |
| Mini et al. | EEG based direct speech BCI system using a fusion of SMRT and MFCC/LPCC features with ANN classifier | |
| Goodman et al. | Spatiotemporal pattern recognition via liquid state machines | |
| Amiriparian et al. | Bag-of-deep-features: Noise-robust deep feature representations for audio analysis | |
| Jin et al. | AP-STDP: A novel self-organizing mechanism for efficient reservoir computing | |
| Xiao et al. | Spike-based encoding and learning of spectrum features for robust sound recognition | |
| Chrol-Cannon et al. | Learning structure of sensory inputs with synaptic plasticity leads to interference | |
| Vlasov et al. | Spoken digits classification based on spiking neural networks with memristor-based STDP | |
| Singh | Deep bi-directional LSTM network with CNN features for human emotion recognition in audio-video signals | |
| Du et al. | Speech emotion recognition based on spiking neural network and convolutional neural network | |
| Xu et al. | Event-driven spectrotemporal feature extraction and classification using a silicon cochlea model | |
| CN120015063B (en) | Speech emotion recognition method based on spiking neural network and convolutional neural network | |
| Martínez et al. | Bioinspired sparse spectro-temporal representation of speech for robust classification | |
| Shashanka | Latent variable framework for modeling and separating single-channel acoustic sources | |
| Ghani et al. | Neuro-inspired speech recognition based on reservoir computing | |
| Sharan | Speech emotion recognition using gammatone cepstral coefficients and deep learning features | |
| Muscar et al. | Deep Learning-Based Sound Classification Algorithms for Enhanced Service Robots Audio Capabilities | |
| Uysal et al. | Spike-based feature extraction for noise robust speech recognition using phase synchrony coding | |
| Prawira et al. | Emotion classification using fast fourier transform and recurrent neural networks | |
| Rohan et al. | Emotion recognition through speech signal using python |