[go: up one dir, main page]

Tavanaei et al., 2017 - Google Patents

A spiking network that learns to extract spike signatures from speech signals

Tavanaei et al., 2017

View PDF
Document ID
14789101822045010460
Author
Tavanaei A
Maida A
Publication year
Publication venue
Neurocomputing

External Links

Snippet

Spiking neural networks (SNNs) with adaptive synapses reflect core properties of biological neural networks. Speech recognition, as an application involving audio coding and dynamic learning, provides a good test problem to study SNN functionality. We present a simple …
Continue reading at arxiv.org (PDF) (other versions)

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/62Methods or arrangements for recognition using electronic means
    • G06K9/6217Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
    • G06K9/6232Extracting features by transforming the feature space, e.g. multidimensional scaling; Mappings, e.g. subspace methods
    • G06K9/6247Extracting features by transforming the feature space, e.g. multidimensional scaling; Mappings, e.g. subspace methods based on an approximation criterion, e.g. principal component analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computer systems based on biological models
    • G06N3/02Computer systems based on biological models using neural network models
    • G06N3/06Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computer systems based on biological models
    • G06N3/02Computer systems based on biological models using neural network models
    • G06N3/08Learning methods
    • G06N3/082Learning methods modifying the architecture, e.g. adding or deleting nodes or connections, pruning
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computer systems based on biological models
    • G06N3/02Computer systems based on biological models using neural network models
    • G06N3/04Architectures, e.g. interconnection topology
    • G06N3/049Temporal neural nets, e.g. delay elements, oscillating neurons, pulsed inputs
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/62Methods or arrangements for recognition using electronic means
    • G06K9/6267Classification techniques
    • G06K9/6268Classification techniques relating to the classification paradigm, e.g. parametric or non-parametric approaches
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/36Image preprocessing, i.e. processing the image information without deciding about the identity of the image
    • G06K9/46Extraction of features or characteristics of the image
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification
    • G10L17/26Recognition of special voice characteristics, e.g. for use in lie detectors; Recognition of animal voices
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/02Feature extraction for speech recognition; Selection of recognition unit
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification
    • G10L17/04Training, enrolment or model building
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search

Similar Documents

Publication Publication Date Title
Tavanaei et al. A spiking network that learns to extract spike signatures from speech signals
Wu et al. A spiking neural network framework for robust sound classification
Wysoski et al. Evolving spiking neural networks for audiovisual information processing
Zhang et al. A digital liquid state machine with biologically inspired learning and its application to speech recognition
Wu et al. A biologically plausible speech recognition framework based on spiking neural networks
Verma et al. Modified convolutional neural network architecture analysis for facial emotion recognition
Mini et al. EEG based direct speech BCI system using a fusion of SMRT and MFCC/LPCC features with ANN classifier
Goodman et al. Spatiotemporal pattern recognition via liquid state machines
Amiriparian et al. Bag-of-deep-features: Noise-robust deep feature representations for audio analysis
Jin et al. AP-STDP: A novel self-organizing mechanism for efficient reservoir computing
Xiao et al. Spike-based encoding and learning of spectrum features for robust sound recognition
Chrol-Cannon et al. Learning structure of sensory inputs with synaptic plasticity leads to interference
Vlasov et al. Spoken digits classification based on spiking neural networks with memristor-based STDP
Singh Deep bi-directional LSTM network with CNN features for human emotion recognition in audio-video signals
Du et al. Speech emotion recognition based on spiking neural network and convolutional neural network
Xu et al. Event-driven spectrotemporal feature extraction and classification using a silicon cochlea model
CN120015063B (en) Speech emotion recognition method based on spiking neural network and convolutional neural network
Martínez et al. Bioinspired sparse spectro-temporal representation of speech for robust classification
Shashanka Latent variable framework for modeling and separating single-channel acoustic sources
Ghani et al. Neuro-inspired speech recognition based on reservoir computing
Sharan Speech emotion recognition using gammatone cepstral coefficients and deep learning features
Muscar et al. Deep Learning-Based Sound Classification Algorithms for Enhanced Service Robots Audio Capabilities
Uysal et al. Spike-based feature extraction for noise robust speech recognition using phase synchrony coding
Prawira et al. Emotion classification using fast fourier transform and recurrent neural networks
Rohan et al. Emotion recognition through speech signal using python