[go: up one dir, main page]

WO2003048711A3 - System de detection de parole dans un signal audio en environnement bruite - Google Patents

System de detection de parole dans un signal audio en environnement bruite Download PDF

Info

Publication number
WO2003048711A3
WO2003048711A3 PCT/FR2002/003910 FR0203910W WO03048711A3 WO 2003048711 A3 WO2003048711 A3 WO 2003048711A3 FR 0203910 W FR0203910 W FR 0203910W WO 03048711 A3 WO03048711 A3 WO 03048711A3
Authority
WO
WIPO (PCT)
Prior art keywords
audio signal
speech detection
detection system
noisy surrounding
information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/FR2002/003910
Other languages
English (en)
Other versions
WO2003048711A2 (fr
Inventor
Arnaud Martin
Laurent Mauuary
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Orange SA
Original Assignee
France Telecom SA
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by France Telecom SA filed Critical France Telecom SA
Priority to AU2002352339A priority Critical patent/AU2002352339A1/en
Priority to US10/497,874 priority patent/US7359856B2/en
Priority to EP02788059A priority patent/EP1451548A2/fr
Publication of WO2003048711A2 publication Critical patent/WO2003048711A2/fr
Publication of WO2003048711A3 publication Critical patent/WO2003048711A3/fr
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/90Pitch determination of speech signals
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/93Discriminating between voiced and unvoiced parts of speech signals

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Telephonic Communication Services (AREA)
  • Fittings On The Vehicle Exterior For Carrying Loads, And Devices For Holding Or Mounting Articles (AREA)

Abstract

Un procédé de détection de parole dans un signal audio comporte une étape d'obtention d'une information d'énergie du signal audio, cette information d'énergie étant utilisée pour détecter de la parole dans le signal audio. Selon l'invention ce procédé comporte en outre une étape d'obtention d'une information de voisement du signal audio, cette information de voisement étant utilisée conjointement à l'information d'énergie pour la détection de parole dans le signal audio.
PCT/FR2002/003910 2001-12-05 2002-11-15 System de detection de parole dans un signal audio en environnement bruite Ceased WO2003048711A2 (fr)

Priority Applications (3)

Application Number Priority Date Filing Date Title
AU2002352339A AU2002352339A1 (en) 2001-12-05 2002-11-15 Speech detection system in an audio signal in noisy surrounding
US10/497,874 US7359856B2 (en) 2001-12-05 2002-11-15 Speech detection system in an audio signal in noisy surrounding
EP02788059A EP1451548A2 (fr) 2001-12-05 2002-11-15 System de detection de parole dans un signal audio en environnement bruite

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
FR01/15685 2001-12-05
FR0115685A FR2833103B1 (fr) 2001-12-05 2001-12-05 Systeme de detection de parole dans le bruit

Publications (2)

Publication Number Publication Date
WO2003048711A2 WO2003048711A2 (fr) 2003-06-12
WO2003048711A3 true WO2003048711A3 (fr) 2004-02-12

Family

ID=8870113

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/FR2002/003910 Ceased WO2003048711A2 (fr) 2001-12-05 2002-11-15 System de detection de parole dans un signal audio en environnement bruite

Country Status (5)

Country Link
US (1) US7359856B2 (fr)
EP (1) EP1451548A2 (fr)
AU (1) AU2002352339A1 (fr)
FR (1) FR2833103B1 (fr)
WO (1) WO2003048711A2 (fr)

Families Citing this family (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
FR2856506B1 (fr) * 2003-06-23 2005-12-02 France Telecom Procede et dispositif de detection de parole dans un signal audio
FR2864319A1 (fr) * 2005-01-19 2005-06-24 France Telecom Procede et dispositif de detection de parole dans un signal audio
CN1815550A (zh) * 2005-02-01 2006-08-09 松下电器产业株式会社 可识别环境中的语音与非语音的方法及系统
US8175877B2 (en) * 2005-02-02 2012-05-08 At&T Intellectual Property Ii, L.P. Method and apparatus for predicting word accuracy in automatic speech recognition systems
GB2450886B (en) * 2007-07-10 2009-12-16 Motorola Inc Voice activity detector and a method of operation
KR100930039B1 (ko) * 2007-12-18 2009-12-07 한국전자통신연구원 음성 인식기의 성능 평가 장치 및 그 방법
US8380497B2 (en) * 2008-10-15 2013-02-19 Qualcomm Incorporated Methods and apparatus for noise estimation
WO2010070839A1 (fr) * 2008-12-17 2010-06-24 日本電気株式会社 Dispositif et programme de détection sonore et procédé de réglage de paramètre
CA2778342C (fr) * 2009-10-19 2017-08-22 Martin Sehlstedt Procede et estimateur de fond pour detection d'activite vocale
US9165567B2 (en) * 2010-04-22 2015-10-20 Qualcomm Incorporated Systems, methods, and apparatus for speech feature detection
CN102237081B (zh) * 2010-04-30 2013-04-24 国际商业机器公司 语音韵律评估方法与系统
US8898058B2 (en) 2010-10-25 2014-11-25 Qualcomm Incorporated Systems, methods, and apparatus for voice activity detection
JP5747562B2 (ja) * 2010-10-28 2015-07-15 ヤマハ株式会社 音響処理装置
US20150281853A1 (en) * 2011-07-11 2015-10-01 SoundFest, Inc. Systems and methods for enhancing targeted audibility
KR20140147587A (ko) * 2013-06-20 2014-12-30 한국전자통신연구원 Wfst를 이용한 음성 끝점 검출 장치 및 방법
US9905225B2 (en) * 2013-12-26 2018-02-27 Panasonic Intellectual Property Management Co., Ltd. Voice recognition processing device, voice recognition processing method, and display device
CA2956531C (fr) * 2014-07-29 2020-03-24 Telefonaktiebolaget Lm Ericsson (Publ) Estimation d'un bruit de fond dans des signaux audio
CN111739515B (zh) * 2019-09-18 2023-08-04 北京京东尚科信息技术有限公司 语音识别方法、设备、电子设备和服务器、相关系统
KR20210089347A (ko) * 2020-01-08 2021-07-16 엘지전자 주식회사 음성 인식 장치 및 음성데이터를 학습하는 방법
CN111599377B (zh) * 2020-04-03 2023-03-31 厦门快商通科技股份有限公司 基于音频识别的设备状态检测方法、系统及移动终端
CN111554314B (zh) * 2020-05-15 2024-08-16 腾讯科技(深圳)有限公司 噪声检测方法、装置、终端及存储介质
CN116295799A (zh) * 2021-12-20 2023-06-23 武汉市聚芯微电子有限责任公司 用于检测信号突变的方法和装置及电子设备
CN115602152B (zh) * 2022-12-14 2023-02-28 成都启英泰伦科技有限公司 一种基于多阶段注意力网络的语音增强方法

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5276765A (en) * 1988-03-11 1994-01-04 British Telecommunications Public Limited Company Voice activity detection

Family Cites Families (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4696039A (en) * 1983-10-13 1987-09-22 Texas Instruments Incorporated Speech analysis/synthesis system with silence suppression
US5579431A (en) * 1992-10-05 1996-11-26 Panasonic Technologies, Inc. Speech detection in presence of noise by determining variance over time of frequency band limited energy
US5598466A (en) * 1995-08-28 1997-01-28 Intel Corporation Voice activity detector for half-duplex audio communication system
JPH0990974A (ja) * 1995-09-25 1997-04-04 Nippon Telegr & Teleph Corp <Ntt> 信号処理方法
US5819217A (en) * 1995-12-21 1998-10-06 Nynex Science & Technology, Inc. Method and system for differentiating between speech and noise
US5890109A (en) * 1996-03-28 1999-03-30 Intel Corporation Re-initializing adaptive parameters for encoding audio signals
US6023674A (en) * 1998-01-23 2000-02-08 Telefonaktiebolaget L M Ericsson Non-parametric voice activity detection
US6122531A (en) * 1998-07-31 2000-09-19 Motorola, Inc. Method for selectively including leading fricative sounds in a portable communication device operated in a speakerphone mode
US6327564B1 (en) * 1999-03-05 2001-12-04 Matsushita Electric Corporation Of America Speech detection using stochastic confidence measures on the frequency spectrum
US6775649B1 (en) * 1999-09-01 2004-08-10 Texas Instruments Incorporated Concealment of frame erasures for speech transmission and storage system and method

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5276765A (en) * 1988-03-11 1994-01-04 British Telecommunications Public Limited Company Voice activity detection

Non-Patent Citations (5)

* Cited by examiner, † Cited by third party
Title
MARTIN A ET AL: "Robust speech/non-speech detection using LDA applied to MFCC", 2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING. PROCEEDINGS (CAT. NO.01CH37221), 2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING. PROCEEDINGS, SALT LAKE CITY, UT, USA, 7-11 MAY 2001, 2001, Piscataway, NJ, USA, IEEE, USA, pages 237 - 240 vol.1, XP002245514, ISBN: 0-7803-7041-4 *
MARTIN P: "COMPARISON OF PITCH DETECTION BY CEPSTRUM AND SPECTRAL COMB ANALYSIS", INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH & SIGNAL PROCESSING. ICASSP. PARIS, MAY 3 - 5, 1982, NEW YORK, IEEE, US, vol. 1 CONF. 7, 3 May 1982 (1982-05-03), pages 180 - 183, XP002906644 *
MORENO-BILBAO A ET AL: "PITCH DETECTOR IN SPEECH SIGNALS CORRUPTED BY NOISE", SIGNAL PROCESSING THEORIES AND APPLICATIONS. BARCELONA, SEPT. 18 - 21, 1990, PROCEEDINGS OF THE EUROPEAN SIGNAL PROCESSING CONFERENCE, AMSTERDAM, ELSEVIER, NL, vol. 2 CONF. 5, 18 September 1990 (1990-09-18), pages 1163 - 1166, XP000365761 *
RAMANA RAO G V ET AL: "Word boundary detection using pitch variations", PROCEEDINGS ICSLP 96. FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING (CAT. NO.96TH8206), PROCEEDING OF FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING. ICSLP '96, PHILADELPHIA, PA, USA, 3-6 OCT. 1996, 1996, New York, NY, USA, IEEE, USA, pages 813 - 816 vol.2, XP002245515, ISBN: 0-7803-3555-4 *
See also references of EP1451548A2 *

Also Published As

Publication number Publication date
EP1451548A2 (fr) 2004-09-01
US7359856B2 (en) 2008-04-15
US20050143978A1 (en) 2005-06-30
AU2002352339A1 (en) 2003-06-17
WO2003048711A2 (fr) 2003-06-12
AU2002352339A8 (en) 2003-06-17
FR2833103B1 (fr) 2004-07-09
FR2833103A1 (fr) 2003-06-06

Similar Documents

Publication Publication Date Title
WO2003048711A3 (fr) System de detection de parole dans un signal audio en environnement bruite
AU7339000A (en) A system, method, and article of manufacture for detecting emotion in voice signals through analysis of a plurality of voice signal parameters
AU2001245272A1 (en) System and method for referencing object instances and invoking methods on thoseobject instances from within speech recognition grammar
WO2001020965A3 (fr) Procede de determination d&#39;une situation d&#39;environnement acoustique momentanee, utilisation de ce procede, et prothese auditive
WO2003015464A3 (fr) Traitement de signaux audio directionnel par banc de filtres surechantillonnes
WO1998014116A3 (fr) Systeme de phonopneumographie
ATE441175T1 (de) Verteiltes spracherkennungsverfahren
AU2003225928A1 (en) Method for robust voice recognition by analyzing redundant features of source signal
AU2001284588A1 (en) Multi-channel signal encoding and decoding
WO2003038804A3 (fr) Detection d&#39;intervention non voulue
WO2005081686A3 (fr) Systeme sonar et procede associe
WO2002052542A3 (fr) Procede et dispositif d&#39;analyse d&#39;un signal sonore issu d&#39;une source sonore
AU7750700A (en) Method and apparatus for the provision of information signals based upon speech recognition
AU2003280474A1 (en) Multi-phoneme streamer and knowledge representation speech recognition system and method
AU2002322102A1 (en) Systems and methods for sensing an acoustic signal using microelectromechanical systems technology
EP1647972A3 (fr) Amélioration de l&#39;intelligibilité des signaux audio contenant de la voix
ATE381237T1 (de) Verfahren zum betrieb eines hörgerätes sowie hörgerät
AU2002232795A1 (en) Perceptual audio signal compression system and method
WO2002007481A3 (fr) Convertisseur stereo multicanaux de derivation d&#39;un signal centrale stereo d&#39;ambiophonie et/ou audio
WO1998001956A3 (fr) Systeme servant a supprimer le bruit d&#39;un micro
AU1888100A (en) System and method for relatively noise robust speech recognition
AU2003269418A1 (en) Method for operating a speech recognition system
AU2002364174A1 (en) System and method for speech recognition and transcription
EP1220203A3 (fr) Procédé et dispositif de détermination de facteurs d&#39;échelles pour un codeur de signal audio
EP1722598A3 (fr) Dispositif audio pour la production de son à effet spatial

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ OM PH PL PT RO RU SD SE SG SI SK SL TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR IE IT LU MC NL PT SE SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
REEP Request for entry into the european phase

Ref document number: 2002788059

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 2002788059

Country of ref document: EP

WWP Wipo information: published in national office

Ref document number: 2002788059

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 10497874

Country of ref document: US

NENP Non-entry into the national phase

Ref country code: JP

WWW Wipo information: withdrawn in national office

Ref document number: JP