WO2003048711A3 - Speech detection system in an audio signal in noisy surrounding
- Publication number
- WO2003048711A3 (application PCT/FR2002/003910)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- audio signal
- speech detection
- detection system
- noisy surrounding
- information
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Classifications
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
- G10L25/90—Pitch determination of speech signals
- G10L25/93—Discriminating between voiced and unvoiced parts of speech signals
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Telephonic Communication Services (AREA)
- Fittings On The Vehicle Exterior For Carrying Loads, And Devices For Holding Or Mounting Articles (AREA)
Abstract
Priority Applications (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| AU2002352339A AU2002352339A1 (en) | 2001-12-05 | 2002-11-15 | Speech detection system in an audio signal in noisy surrounding |
| US10/497,874 US7359856B2 (en) | 2001-12-05 | 2002-11-15 | Speech detection system in an audio signal in noisy surrounding |
| EP02788059A EP1451548A2 (fr) | 2001-12-05 | 2002-11-15 | Speech detection system in an audio signal in noisy surrounding |
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| FR01/15685 | 2001-12-05 | | |
| FR0115685A FR2833103B1 (fr) | 2001-12-05 | 2001-12-05 | System for detecting speech in noise |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| WO2003048711A2 WO2003048711A2 (fr) | 2003-06-12 |
| WO2003048711A3 WO2003048711A3 (fr) | 2004-02-12 |
Family
ID=8870113
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| PCT/FR2002/003910 Ceased WO2003048711A2 (fr) | 2001-12-05 | 2002-11-15 | Speech detection system in an audio signal in noisy surrounding |
Country Status (5)
| Country | Link |
|---|---|
| US (1) | US7359856B2 (fr) |
| EP (1) | EP1451548A2 (fr) |
| AU (1) | AU2002352339A1 (fr) |
| FR (1) | FR2833103B1 (fr) |
| WO (1) | WO2003048711A2 (fr) |
Families Citing this family (23)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| FR2856506B1 (fr) * | 2003-06-23 | 2005-12-02 | France Telecom | Method and device for detecting speech in an audio signal |
| FR2864319A1 (fr) * | 2005-01-19 | 2005-06-24 | France Telecom | Method and device for detecting speech in an audio signal |
| CN1815550A (zh) * | 2005-02-01 | 2006-08-09 | 松下电器产业株式会社 | Method and system for identifying speech and non-speech in an environment |
| US8175877B2 (en) * | 2005-02-02 | 2012-05-08 | At&T Intellectual Property Ii, L.P. | Method and apparatus for predicting word accuracy in automatic speech recognition systems |
| GB2450886B (en) * | 2007-07-10 | 2009-12-16 | Motorola Inc | Voice activity detector and a method of operation |
| KR100930039B1 (ko) * | 2007-12-18 | 2009-12-07 | 한국전자통신연구원 | Apparatus and method for evaluating the performance of a speech recognizer |
| US8380497B2 (en) * | 2008-10-15 | 2013-02-19 | Qualcomm Incorporated | Methods and apparatus for noise estimation |
| WO2010070839A1 (fr) * | 2008-12-17 | 2010-06-24 | 日本電気株式会社 | Sound detection device and program, and parameter adjustment method |
| CA2778342C (fr) * | 2009-10-19 | 2017-08-22 | Martin Sehlstedt | Method and background estimator for voice activity detection |
| US9165567B2 (en) * | 2010-04-22 | 2015-10-20 | Qualcomm Incorporated | Systems, methods, and apparatus for speech feature detection |
| CN102237081B (zh) * | 2010-04-30 | 2013-04-24 | 国际商业机器公司 | Speech prosody evaluation method and system |
| US8898058B2 (en) | 2010-10-25 | 2014-11-25 | Qualcomm Incorporated | Systems, methods, and apparatus for voice activity detection |
| JP5747562B2 (ja) * | 2010-10-28 | 2015-07-15 | ヤマハ株式会社 | Acoustic processing device |
| US20150281853A1 (en) * | 2011-07-11 | 2015-10-01 | SoundFest, Inc. | Systems and methods for enhancing targeted audibility |
| KR20140147587A (ko) * | 2013-06-20 | 2014-12-30 | 한국전자통신연구원 | Apparatus and method for detecting speech endpoints using WFST |
| US9905225B2 (en) * | 2013-12-26 | 2018-02-27 | Panasonic Intellectual Property Management Co., Ltd. | Voice recognition processing device, voice recognition processing method, and display device |
| CA2956531C (fr) * | 2014-07-29 | 2020-03-24 | Telefonaktiebolaget Lm Ericsson (Publ) | Estimation of background noise in audio signals |
| CN111739515B (zh) * | 2019-09-18 | 2023-08-04 | 北京京东尚科信息技术有限公司 | Speech recognition method, device, electronic device, server, and related system |
| KR20210089347A (ko) * | 2020-01-08 | 2021-07-16 | 엘지전자 주식회사 | Speech recognition device and method for learning speech data |
| CN111599377B (zh) * | 2020-04-03 | 2023-03-31 | 厦门快商通科技股份有限公司 | Device state detection method and system based on audio recognition, and mobile terminal |
| CN111554314B (zh) * | 2020-05-15 | 2024-08-16 | 腾讯科技(深圳)有限公司 | Noise detection method, apparatus, terminal, and storage medium |
| CN116295799A (zh) * | 2021-12-20 | 2023-06-23 | 武汉市聚芯微电子有限责任公司 | Method and apparatus for detecting abrupt signal changes, and electronic device |
| CN115602152B (zh) * | 2022-12-14 | 2023-02-28 | 成都启英泰伦科技有限公司 | Speech enhancement method based on a multi-stage attention network |
Citations (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US5276765A (en) * | 1988-03-11 | 1994-01-04 | British Telecommunications Public Limited Company | Voice activity detection |
Family Cites Families (10)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US4696039A (en) * | 1983-10-13 | 1987-09-22 | Texas Instruments Incorporated | Speech analysis/synthesis system with silence suppression |
| US5579431A (en) * | 1992-10-05 | 1996-11-26 | Panasonic Technologies, Inc. | Speech detection in presence of noise by determining variance over time of frequency band limited energy |
| US5598466A (en) * | 1995-08-28 | 1997-01-28 | Intel Corporation | Voice activity detector for half-duplex audio communication system |
| JPH0990974A (ja) * | 1995-09-25 | 1997-04-04 | Nippon Telegr & Teleph Corp <Ntt> | Signal processing method |
| US5819217A (en) * | 1995-12-21 | 1998-10-06 | Nynex Science & Technology, Inc. | Method and system for differentiating between speech and noise |
| US5890109A (en) * | 1996-03-28 | 1999-03-30 | Intel Corporation | Re-initializing adaptive parameters for encoding audio signals |
| US6023674A (en) * | 1998-01-23 | 2000-02-08 | Telefonaktiebolaget L M Ericsson | Non-parametric voice activity detection |
| US6122531A (en) * | 1998-07-31 | 2000-09-19 | Motorola, Inc. | Method for selectively including leading fricative sounds in a portable communication device operated in a speakerphone mode |
| US6327564B1 (en) * | 1999-03-05 | 2001-12-04 | Matsushita Electric Corporation Of America | Speech detection using stochastic confidence measures on the frequency spectrum |
| US6775649B1 (en) * | 1999-09-01 | 2004-08-10 | Texas Instruments Incorporated | Concealment of frame erasures for speech transmission and storage system and method |
- 2001
  - 2001-12-05 FR FR0115685A (FR2833103B1), not active: Expired - Fee Related
- 2002
  - 2002-11-15 AU AU2002352339A (AU2002352339A1), not active: Abandoned
  - 2002-11-15 WO PCT/FR2002/003910 (WO2003048711A2), not active: Ceased
  - 2002-11-15 EP EP02788059A (EP1451548A2), not active: Withdrawn
  - 2002-11-15 US US10/497,874 (US7359856B2), not active: Expired - Fee Related
Non-Patent Citations (5)
| Title |
|---|
| MARTIN A ET AL: "Robust speech/non-speech detection using LDA applied to MFCC", 2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING. PROCEEDINGS (CAT. NO.01CH37221), 2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING. PROCEEDINGS, SALT LAKE CITY, UT, USA, 7-11 MAY 2001, 2001, Piscataway, NJ, USA, IEEE, USA, pages 237 - 240 vol.1, XP002245514, ISBN: 0-7803-7041-4 * |
| MARTIN P: "COMPARISON OF PITCH DETECTION BY CEPSTRUM AND SPECTRAL COMB ANALYSIS", INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH & SIGNAL PROCESSING. ICASSP. PARIS, MAY 3 - 5, 1982, NEW YORK, IEEE, US, vol. 1 CONF. 7, 3 May 1982 (1982-05-03), pages 180 - 183, XP002906644 * |
| MORENO-BILBAO A ET AL: "PITCH DETECTOR IN SPEECH SIGNALS CORRUPTED BY NOISE", SIGNAL PROCESSING THEORIES AND APPLICATIONS. BARCELONA, SEPT. 18 - 21, 1990, PROCEEDINGS OF THE EUROPEAN SIGNAL PROCESSING CONFERENCE, AMSTERDAM, ELSEVIER, NL, vol. 2 CONF. 5, 18 September 1990 (1990-09-18), pages 1163 - 1166, XP000365761 * |
| RAMANA RAO G V ET AL: "Word boundary detection using pitch variations", PROCEEDINGS ICSLP 96. FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING (CAT. NO.96TH8206), PROCEEDING OF FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING. ICSLP '96, PHILADELPHIA, PA, USA, 3-6 OCT. 1996, 1996, New York, NY, USA, IEEE, USA, pages 813 - 816 vol.2, XP002245515, ISBN: 0-7803-3555-4 * |
| See also references of EP1451548A2 * |
Also Published As
| Publication number | Publication date |
|---|---|
| EP1451548A2 (fr) | 2004-09-01 |
| US7359856B2 (en) | 2008-04-15 |
| US20050143978A1 (en) | 2005-06-30 |
| AU2002352339A1 (en) | 2003-06-17 |
| WO2003048711A2 (fr) | 2003-06-12 |
| AU2002352339A8 (en) | 2003-06-17 |
| FR2833103B1 (fr) | 2004-07-09 |
| FR2833103A1 (fr) | 2003-06-06 |
Similar Documents
| Publication | Title | Publication Date |
|---|---|---|
| WO2003048711A3 (fr) | Speech detection system in an audio signal in noisy surrounding | |
| AU7339000A (en) | A system, method, and article of manufacture for detecting emotion in voice signals through analysis of a plurality of voice signal parameters | |
| AU2001245272A1 (en) | System and method for referencing object instances and invoking methods on thoseobject instances from within speech recognition grammar | |
| WO2001020965A3 (fr) | Method for determining a momentary acoustic environment situation, use of the method, and hearing aid | |
| WO2003015464A3 (fr) | Directional audio signal processing using an oversampled filterbank | |
| WO1998014116A3 (fr) | Phonopneumography system | |
| ATE441175T1 (de) | Distributed speech recognition method | |
| AU2003225928A1 (en) | Method for robust voice recognition by analyzing redundant features of source signal | |
| AU2001284588A1 (en) | Multi-channel signal encoding and decoding | |
| WO2003038804A3 (fr) | Detection of unwanted intervention | |
| WO2005081686A3 (fr) | Sonar system and associated method | |
| WO2002052542A3 (fr) | Method and device for analysing a sound signal from a sound source | |
| AU7750700A (en) | Method and apparatus for the provision of information signals based upon speech recognition | |
| AU2003280474A1 (en) | Multi-phoneme streamer and knowledge representation speech recognition system and method | |
| AU2002322102A1 (en) | Systems and methods for sensing an acoustic signal using microelectromechanical systems technology | |
| EP1647972A3 (fr) | Improving the intelligibility of audio signals containing speech | |
| ATE381237T1 (de) | Method for operating a hearing aid, and hearing aid | |
| AU2002232795A1 (en) | Perceptual audio signal compression system and method | |
| WO2002007481A3 (fr) | Multi-channel stereo converter for deriving a stereo surround and/or audio centre signal | |
| WO1998001956A3 (fr) | System for suppressing microphone noise | |
| AU1888100A (en) | System and method for relatively noise robust speech recognition | |
| AU2003269418A1 (en) | Method for operating a speech recognition system | |
| AU2002364174A1 (en) | System and method for speech recognition and transcription | |
| EP1220203A3 (fr) | Method and device for determining scale factors for an audio signal coder | |
| EP1722598A3 (fr) | Audio device for producing sound with a spatial effect | |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | AK | Designated states | Kind code of ref document: A2. Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ OM PH PL PT RO RU SD SE SG SI SK SL TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW |
| | AL | Designated countries for regional patents | Kind code of ref document: A2. Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR IE IT LU MC NL PT SE SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG |
| | 121 | EP: the EPO has been informed by WIPO that EP was designated in this application | |
| | REEP | Request for entry into the European phase | Ref document number: 2002788059. Country of ref document: EP |
| | WWE | WIPO information: entry into national phase | Ref document number: 2002788059. Country of ref document: EP |
| | WWP | WIPO information: published in national office | Ref document number: 2002788059. Country of ref document: EP |
| | WWE | WIPO information: entry into national phase | Ref document number: 10497874. Country of ref document: US |
| | NENP | Non-entry into the national phase | Ref country code: JP |
| | WWW | WIPO information: withdrawn in national office | Ref document number: JP |