[go: up one dir, main page]

WO2008126254A1 - 話者認識装置、音響モデル更新方法及び音響モデル更新処理プログラム - Google Patents

話者認識装置、音響モデル更新方法及び音響モデル更新処理プログラム Download PDF

Info

Publication number
WO2008126254A1
WO2008126254A1 PCT/JP2007/057113 JP2007057113W WO2008126254A1 WO 2008126254 A1 WO2008126254 A1 WO 2008126254A1 JP 2007057113 W JP2007057113 W JP 2007057113W WO 2008126254 A1 WO2008126254 A1 WO 2008126254A1
Authority
WO
WIPO (PCT)
Prior art keywords
speaker
adaptive
model
acoustic model
model update
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/JP2007/057113
Other languages
English (en)
French (fr)
Inventor
Soichi Toyama
Ikuo Fujita
Yukio Kamoshida
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Pioneer Corp
Original Assignee
Pioneer Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Pioneer Corp filed Critical Pioneer Corp
Priority to PCT/JP2007/057113 priority Critical patent/WO2008126254A1/ja
Priority to JP2009508804A priority patent/JP4847581B2/ja
Publication of WO2008126254A1 publication Critical patent/WO2008126254A1/ja
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification techniques
    • G10L17/04Training, enrolment or model building

Landscapes

  • Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Testing And Monitoring For Control Systems (AREA)

Abstract

 時間の経過とともに変化していく話者本人の発話音声の特徴に対応して、精度良く話者を認識することができる話者認識装置、音響モデル更新方法及び音響モデル更新処理プログラムを提供する。  発話した話者が当該適応話者モデルに対応する登録話者であると判定された場合には、適応話者モデルを更新する。このとき、算出された音声特徴量を適応音声特徴量記憶部11に記憶させ、適応音声特徴量記憶部11に記憶された音声特徴量のうち、現時点から過去に遡ってK個の音声特徴量で初期話者モデルを適応処理を行うことによって、新たな適応話者モデルを作成し、この新たな適応話者モデルを登録話者モデル記憶部9に記憶させ、登録話者モデル記憶部9に記憶された新たな適応話者モデルを用いて、発話した話者が当該適応話者モデルに対応する登録話者であるか否かを判定する。
PCT/JP2007/057113 2007-03-30 2007-03-30 話者認識装置、音響モデル更新方法及び音響モデル更新処理プログラム Ceased WO2008126254A1 (ja)

Priority Applications (2)

Application Number Priority Date Filing Date Title
PCT/JP2007/057113 WO2008126254A1 (ja) 2007-03-30 2007-03-30 話者認識装置、音響モデル更新方法及び音響モデル更新処理プログラム
JP2009508804A JP4847581B2 (ja) 2007-03-30 2007-03-30 話者認識装置、音響モデル更新方法及び音響モデル更新処理プログラム

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/JP2007/057113 WO2008126254A1 (ja) 2007-03-30 2007-03-30 話者認識装置、音響モデル更新方法及び音響モデル更新処理プログラム

Publications (1)

Publication Number Publication Date
WO2008126254A1 true WO2008126254A1 (ja) 2008-10-23

Family

ID=39863434

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2007/057113 Ceased WO2008126254A1 (ja) 2007-03-30 2007-03-30 話者認識装置、音響モデル更新方法及び音響モデル更新処理プログラム

Country Status (2)

Country Link
JP (1) JP4847581B2 (ja)
WO (1) WO2008126254A1 (ja)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20160055839A (ko) * 2013-09-16 2016-05-18 퀄컴 인코포레이티드 애플리케이션들에 대한 액세스를 제어하기 위한 방법 및 장치
CN109155128A (zh) * 2016-05-20 2019-01-04 三菱电机株式会社 声学模型学习装置、声学模型学习方法、语音识别装置和语音识别方法
CN114387635A (zh) * 2020-10-20 2022-04-22 杭州海康威视数字技术股份有限公司 更新生物特征库的方法、装置及电子设备
EP4082007A4 (en) * 2020-06-15 2023-02-01 Samsung Electronics Co., Ltd. Electronic apparatus and controlling method thereof

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109147770B (zh) 2017-06-16 2023-07-28 阿里巴巴集团控股有限公司 声音识别特征的优化、动态注册方法、客户端和服务器

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2001063596A2 (en) * 2000-02-25 2001-08-30 Speechworks International, Inc. Automatically retraining a speech recognition system
JP2001249681A (ja) * 1999-12-28 2001-09-14 Sony Corp モデル適応装置およびモデル適応方法、記録媒体、並びにパターン認識装置
JP2002196786A (ja) * 2000-12-26 2002-07-12 Mitsubishi Electric Corp 音声認識装置
JP2003076390A (ja) * 2001-08-31 2003-03-14 Fujitsu Ltd 話者認証システム及び方法
JP2007057714A (ja) * 2005-08-23 2007-03-08 Nec Corp 話者識別器更新データを生成する装置、方法、プログラムおよび話者識別器を更新する装置、方法、プログラム

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2001249681A (ja) * 1999-12-28 2001-09-14 Sony Corp モデル適応装置およびモデル適応方法、記録媒体、並びにパターン認識装置
WO2001063596A2 (en) * 2000-02-25 2001-08-30 Speechworks International, Inc. Automatically retraining a speech recognition system
JP2002196786A (ja) * 2000-12-26 2002-07-12 Mitsubishi Electric Corp 音声認識装置
JP2003076390A (ja) * 2001-08-31 2003-03-14 Fujitsu Ltd 話者認証システム及び方法
JP2007057714A (ja) * 2005-08-23 2007-03-08 Nec Corp 話者識別器更新データを生成する装置、方法、プログラムおよび話者識別器を更新する装置、方法、プログラム

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20160055839A (ko) * 2013-09-16 2016-05-18 퀄컴 인코포레이티드 애플리케이션들에 대한 액세스를 제어하기 위한 방법 및 장치
JP2016538658A (ja) * 2013-09-16 2016-12-08 クゥアルコム・インコーポレイテッドQualcomm Incorporated アプリケーションへのアクセスを制御するための方法および装置
KR101868711B1 (ko) * 2013-09-16 2018-06-18 퀄컴 인코포레이티드 애플리케이션들에 대한 액세스를 제어하기 위한 방법 및 장치
CN109155128A (zh) * 2016-05-20 2019-01-04 三菱电机株式会社 声学模型学习装置、声学模型学习方法、语音识别装置和语音识别方法
CN109155128B (zh) * 2016-05-20 2022-12-27 三菱电机株式会社 声学模型学习装置、声学模型学习方法、语音识别装置和语音识别方法
EP4082007A4 (en) * 2020-06-15 2023-02-01 Samsung Electronics Co., Ltd. Electronic apparatus and controlling method thereof
US11664033B2 (en) 2020-06-15 2023-05-30 Samsung Electronics Co., Ltd. Electronic apparatus and controlling method thereof
CN114387635A (zh) * 2020-10-20 2022-04-22 杭州海康威视数字技术股份有限公司 更新生物特征库的方法、装置及电子设备
WO2022083653A1 (zh) * 2020-10-20 2022-04-28 杭州海康威视数字技术股份有限公司 更新生物特征库的方法、装置及电子设备

Also Published As

Publication number Publication date
JPWO2008126254A1 (ja) 2010-07-22
JP4847581B2 (ja) 2011-12-28

Similar Documents

Publication Publication Date Title
WO2008117626A1 (ja) 話者選択装置、話者適応モデル作成装置、話者選択方法、話者選択用プログラムおよび話者適応モデル作成プログラム
WO2020117639A3 (en) Text independent speaker recognition
WO2008108232A1 (ja) 音声認識装置、音声認識方法及び音声認識プログラム
WO2008118195A3 (en) System and method for a cooperative conversational voice user interface
WO2006069381A3 (en) Turn-taking confidence
WO2008047339A3 (en) Method and apparatus for large population speaker identification in telephone interactions
WO2012177646A3 (en) Speech recognition using context-aware recognition models
WO2013066409A8 (en) System, method and program for customized voice communication
WO2008114448A1 (ja) 音声認識システム、音声認識プログラムおよび音声認識方法
ATE536611T1 (de) Kommunikationsgerät mit lautsprecherunabhängiger spracherkennung
TW200601263A (en) Apparatus and method for synthesized audible response to an utterance in speaker-independent voice recognition
EP1933301A3 (en) Speech recognition method and system with intelligent speaker identification and adaptation
WO2012036424A3 (en) Method and apparatus for performing microphone beamforming
WO2004100638A3 (en) Source-dependent text-to-speech system
ATE453183T1 (de) Verfahren zum anpassen eines neuronalen netzwerks einer automatischen spracherkennungseinrichtung
WO2011084998A3 (en) Word-level correction of speech input
JP2009527798A5 (ja)
WO2012134997A3 (en) Non-scorable response filters for speech scoring systems
EP2211561A3 (en) Speech signal processing apparatus with microphone signal selection
WO2012064408A3 (en) Method for tone/intonation recognition using auditory attention cues
WO2012134877A3 (en) Computer-implemented systems and methods evaluating prosodic features of speech
EP2590424A3 (en) Electronic apparatus and method for controlling thereof
EP1696421A3 (en) Learning in automatic speech recognition
EP1475777A3 (en) Keyword recognition apparatus and method, program for keyword recognition, including keyword and non-keyword model adaptation
EP4318463A3 (en) Multi-modal input on an electronic device

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 07740549

Country of ref document: EP

Kind code of ref document: A1

DPE1 Request for preliminary examination filed after expiration of 19th month from priority date (pct application filed from 20040101)
ENP Entry into the national phase

Ref document number: 2009508804

Country of ref document: JP

Kind code of ref document: A

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 07740549

Country of ref document: EP

Kind code of ref document: A1

DPE1 Request for preliminary examination filed after expiration of 19th month from priority date (pct application filed from 20040101)