Tavanaei et al., 2017 - Google Patents

A spiking network that learns to extract spike signatures from speech signals

Tavanaei et al., 2017

Document ID: 14789101822045010460
Author: Tavanaei A; Maida A
Publication year: 2017
Publication venue: Neurocomputing

External Links

Cited by

Snippet

Spiking neural networks (SNNs) with adaptive synapses reflect core properties of biological neural networks. Speech recognition, as an application involving audio coding and dynamic learning, provides a good test problem to study SNN functionality. We present a simple …

Continue reading at arxiv.org (PDF) (other versions)

230000001537 neural 0 abstract description 41

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
- G06K9/6232—Extracting features by transforming the feature space, e.g. multidimensional scaling; Mappings, e.g. subspace methods
- G06K9/6247—Extracting features by transforming the feature space, e.g. multidimensional scaling; Mappings, e.g. subspace methods based on an approximation criterion, e.g. principal component analysis
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/06—Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/08—Learning methods
- G06N3/082—Learning methods modifying the architecture, e.g. adding or deleting nodes or connections, pruning
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/04—Architectures, e.g. interconnection topology
- G06N3/049—Temporal neural nets, e.g. delay elements, oscillating neurons, pulsed inputs
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6267—Classification techniques
- G06K9/6268—Classification techniques relating to the classification paradigm, e.g. parametric or non-parametric approaches
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/26—Recognition of special voice characteristics, e.g. for use in lie detectors; Recognition of animal voices
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/04—Training, enrolment or model building
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search

Similar Documents

Publication	Publication Date	Title
Tavanaei et al.	2017	A spiking network that learns to extract spike signatures from speech signals
Wu et al.	2018	A spiking neural network framework for robust sound classification
Wysoski et al.	2010	Evolving spiking neural networks for audiovisual information processing
Zhang et al.	2015	A digital liquid state machine with biologically inspired learning and its application to speech recognition
Wu et al.	2018	A biologically plausible speech recognition framework based on spiking neural networks
Verma et al.	2019	Modified convolutional neural network architecture analysis for facial emotion recognition
Mini et al.	2021	EEG based direct speech BCI system using a fusion of SMRT and MFCC/LPCC features with ANN classifier
Goodman et al.	2006	Spatiotemporal pattern recognition via liquid state machines
Amiriparian et al.	2018	Bag-of-deep-features: Noise-robust deep feature representations for audio analysis
Jin et al.	2016	AP-STDP: A novel self-organizing mechanism for efficient reservoir computing
Xiao et al.	2018	Spike-based encoding and learning of spectrum features for robust sound recognition
Chrol-Cannon et al.	2015	Learning structure of sensory inputs with synaptic plasticity leads to interference
Vlasov et al.	2022	Spoken digits classification based on spiking neural networks with memristor-based STDP
Singh	2022	Deep bi-directional LSTM network with CNN features for human emotion recognition in audio-video signals
Du et al.	2025	Speech emotion recognition based on spiking neural network and convolutional neural network
Xu et al.	2023	Event-driven spectrotemporal feature extraction and classification using a silicon cochlea model
CN120015063B (en)	2025-10-17	Speech emotion recognition method based on spiking neural network and convolutional neural network
Martínez et al.	2012	Bioinspired sparse spectro-temporal representation of speech for robust classification
Shashanka	2007	Latent variable framework for modeling and separating single-channel acoustic sources
Ghani et al.	2010	Neuro-inspired speech recognition based on reservoir computing
Sharan	2023	Speech emotion recognition using gammatone cepstral coefficients and deep learning features
Muscar et al.	2024	Deep Learning-Based Sound Classification Algorithms for Enhanced Service Robots Audio Capabilities
Uysal et al.	2007	Spike-based feature extraction for noise robust speech recognition using phase synchrony coding
Prawira et al.	2021	Emotion classification using fast fourier transform and recurrent neural networks
Rohan et al.	2020	Emotion recognition through speech signal using python