[go: up one dir, main page]

WO2009069662A1 - 音声検出システム、音声検出方法および音声検出プログラム - Google Patents

音声検出システム、音声検出方法および音声検出プログラム Download PDF

Info

Publication number
WO2009069662A1
WO2009069662A1 PCT/JP2008/071459 JP2008071459W WO2009069662A1 WO 2009069662 A1 WO2009069662 A1 WO 2009069662A1 JP 2008071459 W JP2008071459 W JP 2008071459W WO 2009069662 A1 WO2009069662 A1 WO 2009069662A1
Authority
WO
WIPO (PCT)
Prior art keywords
voice
section
nonvoice
feature value
frame
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/JP2008/071459
Other languages
English (en)
French (fr)
Inventor
Takayuki Arakawa
Masanori Tsujikawa
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
NEC Corp
Original Assignee
NEC Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by NEC Corp filed Critical NEC Corp
Priority to JP2009543830A priority Critical patent/JP5446874B2/ja
Priority to US12/744,671 priority patent/US8694308B2/en
Publication of WO2009069662A1 publication Critical patent/WO2009069662A1/ja
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/93Discriminating between voiced and unvoiced parts of speech signals

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Telephone Function (AREA)
  • Time-Division Multiplex Systems (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)

Abstract

本発明は 雑音環境に頑健な音声検出システム、音声検出プログラムを提供する。 フレーム単位に切り出された入力信号から特徴量を算出する特徴量算出部2と、フレーム単位に算出された特徴量から音声区間・非音声区間を仮判定する仮音声・非音声判定部3と、音声区間継続長閾値あるいは非音声区間継続長閾値を、フレーム毎に求められた特徴量と、特徴量に関する閾値との比を用いて決定し、前記決定された音声区間継続長閾値と非音声区間継続長閾値を用いて音声区間・非音声区間を再判定する音声・非音声判定部6を備え、音声区間継続長閾値と非音声区間継続長閾値をフレーム毎に求められる特徴量と特徴量に対する閾値とを用いて決定することで、フレーム毎に求まる特徴量に信頼の置けるときには整形ルールの縛りを弱くし、逆にフレーム毎に求まる特徴量に信頼の置けないときには整形ルールの縛りを強くすることで、雑音環境に依存せずに音声検出を行う。
PCT/JP2008/071459 2007-11-27 2008-11-26 音声検出システム、音声検出方法および音声検出プログラム Ceased WO2009069662A1 (ja)

Priority Applications (2)

Application Number Priority Date Filing Date Title
JP2009543830A JP5446874B2 (ja) 2007-11-27 2008-11-26 音声検出システム、音声検出方法および音声検出プログラム
US12/744,671 US8694308B2 (en) 2007-11-27 2008-11-26 System, method and program for voice detection

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2007-305966 2007-11-27
JP2007305966 2007-11-27

Publications (1)

Publication Number Publication Date
WO2009069662A1 true WO2009069662A1 (ja) 2009-06-04

Family

ID=40678555

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2008/071459 Ceased WO2009069662A1 (ja) 2007-11-27 2008-11-26 音声検出システム、音声検出方法および音声検出プログラム

Country Status (3)

Country Link
US (1) US8694308B2 (ja)
JP (1) JP5446874B2 (ja)
WO (1) WO2009069662A1 (ja)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2011070972A1 (ja) * 2009-12-10 2011-06-16 日本電気株式会社 音声認識システム、音声認識方法および音声認識プログラム
JP2013508744A (ja) * 2009-10-19 2013-03-07 テレフオンアクチーボラゲット エル エム エリクソン(パブル) 音声区間検出器及び方法
JP2013545133A (ja) * 2010-10-29 2013-12-19 安徽科大訊飛信息科技股▲分▼有限公司 録音の終了点自動検出のための方法及びシステム
JP2018045193A (ja) * 2016-09-16 2018-03-22 株式会社リコー 通信端末、音声変換方法、及びプログラム

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102456343A (zh) * 2010-10-29 2012-05-16 安徽科大讯飞信息科技股份有限公司 录音结束点检测方法及系统
TWI474317B (zh) * 2012-07-06 2015-02-21 Realtek Semiconductor Corp 訊號處理裝置以及訊號處理方法
KR102446392B1 (ko) * 2015-09-23 2022-09-23 삼성전자주식회사 음성 인식이 가능한 전자 장치 및 방법
CN114360587A (zh) * 2021-12-27 2022-04-15 北京百度网讯科技有限公司 识别音频的方法、装置、设备、介质及产品
US20230402057A1 (en) * 2022-06-14 2023-12-14 Himax Technologies Limited Voice activity detection system

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH10207491A (ja) * 1997-01-23 1998-08-07 Toshiba Corp 背景音/音声分類方法、有声/無声分類方法および背景音復号方法
WO2001039175A1 (en) * 1999-11-24 2001-05-31 Fujitsu Limited Method and apparatus for voice detection
JP2008151840A (ja) * 2006-12-14 2008-07-03 Nippon Telegr & Teleph Corp <Ntt> 仮音声区間決定装置、方法、プログラム及びその記録媒体、音声区間決定装置

Family Cites Families (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3349180A (en) * 1964-05-07 1967-10-24 Bell Telephone Labor Inc Extrapolation of vocoder control signals
US3420955A (en) * 1965-11-19 1969-01-07 Bell Telephone Labor Inc Automatic peak selector
US3916105A (en) * 1972-12-04 1975-10-28 Ibm Pitch peak detection using linear prediction
ATE15563T1 (de) * 1981-09-24 1985-09-15 Gretag Ag Verfahren und vorrichtung zur redundanzvermindernden digitalen sprachverarbeitung.
US4509186A (en) * 1981-12-31 1985-04-02 Matsushita Electric Works, Ltd. Method and apparatus for speech message recognition
IT1229725B (it) * 1989-05-15 1991-09-07 Face Standard Ind Metodo e disposizione strutturale per la differenziazione tra elementi sonori e sordi del parlato
JP3277398B2 (ja) * 1992-04-15 2002-04-22 ソニー株式会社 有声音判別方法
EP1569200A1 (en) * 2004-02-26 2005-08-31 Sony International (Europe) GmbH Identification of the presence of speech in digital audio data
JP4798601B2 (ja) 2004-12-28 2011-10-19 株式会社国際電気通信基礎技術研究所 音声区間検出装置および音声区間検出プログラム
CN101292283B (zh) * 2005-10-20 2012-08-08 日本电气株式会社 声音判别系统及声音判别方法
JP4714129B2 (ja) 2006-11-29 2011-06-29 日本電信電話株式会社 音声/非音声判定補正装置、音声/非音声判定補正方法、音声/非音声判定補正プログラムおよびこれを記録した記録媒体、音声ミキシング装置、音声ミキシング方法、音声ミキシングプログラムおよびこれを記録した記録媒体

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH10207491A (ja) * 1997-01-23 1998-08-07 Toshiba Corp 背景音/音声分類方法、有声/無声分類方法および背景音復号方法
WO2001039175A1 (en) * 1999-11-24 2001-05-31 Fujitsu Limited Method and apparatus for voice detection
JP2008151840A (ja) * 2006-12-14 2008-07-03 Nippon Telegr & Teleph Corp <Ntt> 仮音声区間決定装置、方法、プログラム及びその記録媒体、音声区間決定装置

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2013508744A (ja) * 2009-10-19 2013-03-07 テレフオンアクチーボラゲット エル エム エリクソン(パブル) 音声区間検出器及び方法
US9773511B2 (en) 2009-10-19 2017-09-26 Telefonaktiebolaget Lm Ericsson (Publ) Detector and method for voice activity detection
US9990938B2 (en) 2009-10-19 2018-06-05 Telefonaktiebolaget Lm Ericsson (Publ) Detector and method for voice activity detection
US11361784B2 (en) 2009-10-19 2022-06-14 Telefonaktiebolaget Lm Ericsson (Publ) Detector and method for voice activity detection
WO2011070972A1 (ja) * 2009-12-10 2011-06-16 日本電気株式会社 音声認識システム、音声認識方法および音声認識プログラム
JPWO2011070972A1 (ja) * 2009-12-10 2013-04-22 日本電気株式会社 音声認識システム、音声認識方法および音声認識プログラム
US9002709B2 (en) 2009-12-10 2015-04-07 Nec Corporation Voice recognition system and voice recognition method
JP2013545133A (ja) * 2010-10-29 2013-12-19 安徽科大訊飛信息科技股▲分▼有限公司 録音の終了点自動検出のための方法及びシステム
US9330667B2 (en) 2010-10-29 2016-05-03 Iflytek Co., Ltd. Method and system for endpoint automatic detection of audio record
JP2018045193A (ja) * 2016-09-16 2018-03-22 株式会社リコー 通信端末、音声変換方法、及びプログラム

Also Published As

Publication number Publication date
US20100268532A1 (en) 2010-10-21
JPWO2009069662A1 (ja) 2011-04-14
US8694308B2 (en) 2014-04-08
JP5446874B2 (ja) 2014-03-19

Similar Documents

Publication Publication Date Title
WO2009069662A1 (ja) 音声検出システム、音声検出方法および音声検出プログラム
EP4379711A3 (en) Method and apparatus for adaptively detecting a voice activity in an input audio signal
WO2006019556A3 (en) Low-complexity music detection algorithm and system
CA2699316A1 (en) Apparatus and method for calculating bandwidth extension data using a spectral tilt controlled framing
KR101437830B1 (ko) 음성 구간 검출 방법 및 장치
WO2002056297A8 (en) Adaptive-block-length audio coder
IL154397A0 (en) Voice enhancement system
IL194430A0 (en) Audio gain control using specific-loudness-based auditory event detection
JP3255584B2 (ja) 有音検知装置および方法
WO2008143226A1 (ja) コネクタ嵌合状態判定装置、コネクタ嵌合状態判定システム及びコネクタ嵌合状態判定方法
EP1256487A3 (en) System, method, and program for detecting approach to object
WO2009142453A3 (ko) 복수의 접촉 입력을 감지하는 방법 및 장치
WO2006104576A3 (en) Adaptive voice mode extension for a voice activity detector
WO2008082793A3 (en) A method and noise suppression circuit incorporating a plurality of noise suppression techniques
WO2006121180A3 (en) Voice activity detection apparatus and method
AU2002367237A1 (en) Method, apparatus, and program for evolving algorithms for detecting
WO2007070622A3 (en) Detecting and rejecting annoying documents
WO2008091874A3 (en) Method and device for acute sound detection and reproduction
WO2008149559A1 (ja) 脈波検出装置、機器制御装置および脈波検出方法
WO2009144655A8 (en) Method and system for determining a threshold for spike detection of electrophysiological signals
ATE513280T1 (de) Flash-erkennung
WO2010062845A3 (en) System and method for detecting low tire pressure on a machine
WO2010104995A3 (en) Noise error amplitude reduction
CN104464722A (zh) 基于时域和频域的语音活性检测方法和设备
WO2008150762A3 (en) Method and apparatus for real-time pulse parameter estimation

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 08855299

Country of ref document: EP

Kind code of ref document: A1

WWE Wipo information: entry into national phase

Ref document number: 12744671

Country of ref document: US

WWE Wipo information: entry into national phase

Ref document number: 2009543830

Country of ref document: JP

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 08855299

Country of ref document: EP

Kind code of ref document: A1