[go: up one dir, main page]

WO2009057739A1 - 話者選択装置、話者適応モデル作成装置、話者選択方法および話者選択用プログラム - Google Patents

話者選択装置、話者適応モデル作成装置、話者選択方法および話者選択用プログラム Download PDF

Info

Publication number
WO2009057739A1
WO2009057739A1 PCT/JP2008/069853 JP2008069853W WO2009057739A1 WO 2009057739 A1 WO2009057739 A1 WO 2009057739A1 JP 2008069853 W JP2008069853 W JP 2008069853W WO 2009057739 A1 WO2009057739 A1 WO 2009057739A1
Authority
WO
WIPO (PCT)
Prior art keywords
speaker
speaker selection
speakers
selection
model making
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/JP2008/069853
Other languages
English (en)
French (fr)
Inventor
Masahiro Tani
Yoshifumi Onishi
Tadashi Emori
Takafumi Koshinaka
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
NEC Corp
Original Assignee
NEC Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by NEC Corp filed Critical NEC Corp
Priority to JP2009539120A priority Critical patent/JP5626558B2/ja
Publication of WO2009057739A1 publication Critical patent/WO2009057739A1/ja
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/065Adaptation
    • G10L15/07Adaptation to the speaker
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification techniques

Landscapes

  • Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Artificial Intelligence (AREA)
  • Computational Linguistics (AREA)
  • Circuit For Audible Band Transducer (AREA)

Abstract

 適応モデルの精度劣化を抑制することのできる話者選択装置を提供する。話者選択装置は、入力された発声話者の音声信号より抽出された特徴量とあらかじめ記憶されている複数の話者の話者モデルを用いて、話者空間における発声話者を中心とする複数の話者の分布の密度を算出する話者分布密度算出手段と、話者の分布の密度を用いて選択する話者の数を算出する選択話者数算出手段とを備える。
PCT/JP2008/069853 2007-10-31 2008-10-31 話者選択装置、話者適応モデル作成装置、話者選択方法および話者選択用プログラム Ceased WO2009057739A1 (ja)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP2009539120A JP5626558B2 (ja) 2007-10-31 2008-10-31 話者選択装置、話者適応モデル作成装置、話者選択方法および話者選択用プログラム

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2007-283767 2007-10-31
JP2007283767 2007-10-31

Publications (1)

Publication Number Publication Date
WO2009057739A1 true WO2009057739A1 (ja) 2009-05-07

Family

ID=40591119

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2008/069853 Ceased WO2009057739A1 (ja) 2007-10-31 2008-10-31 話者選択装置、話者適応モデル作成装置、話者選択方法および話者選択用プログラム

Country Status (2)

Country Link
JP (1) JP5626558B2 (ja)
WO (1) WO2009057739A1 (ja)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117953851A (zh) * 2022-10-20 2024-04-30 戴尔产品有限公司 文本转语音的方法、设备和计算机程序产品

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH11143486A (ja) * 1997-11-10 1999-05-28 Fuji Xerox Co Ltd 話者適応装置および方法
JP2002149185A (ja) * 2000-09-27 2002-05-24 Koninkl Philips Electronics Nv 複数の学習用話者を表現する固有空間の決定方法
WO2005034086A1 (ja) * 2003-10-03 2005-04-14 Asahi Kasei Kabushiki Kaisha データ処理装置及びデータ処理装置制御プログラム
JP3756879B2 (ja) * 2001-12-20 2006-03-15 松下電器産業株式会社 音響モデルを作成する方法、音響モデルを作成する装置、音響モデルを作成するためのコンピュータプログラム
WO2008117626A1 (ja) * 2007-03-27 2008-10-02 Nec Corporation 話者選択装置、話者適応モデル作成装置、話者選択方法、話者選択用プログラムおよび話者適応モデル作成プログラム

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH11143486A (ja) * 1997-11-10 1999-05-28 Fuji Xerox Co Ltd 話者適応装置および方法
JP2002149185A (ja) * 2000-09-27 2002-05-24 Koninkl Philips Electronics Nv 複数の学習用話者を表現する固有空間の決定方法
JP3756879B2 (ja) * 2001-12-20 2006-03-15 松下電器産業株式会社 音響モデルを作成する方法、音響モデルを作成する装置、音響モデルを作成するためのコンピュータプログラム
WO2005034086A1 (ja) * 2003-10-03 2005-04-14 Asahi Kasei Kabushiki Kaisha データ処理装置及びデータ処理装置制御プログラム
WO2008117626A1 (ja) * 2007-03-27 2008-10-02 Nec Corporation 話者選択装置、話者適応モデル作成装置、話者選択方法、話者選択用プログラムおよび話者適応モデル作成プログラム

Also Published As

Publication number Publication date
JPWO2009057739A1 (ja) 2011-03-10
JP5626558B2 (ja) 2014-11-19

Similar Documents

Publication Publication Date Title
WO2008117626A1 (ja) 話者選択装置、話者適応モデル作成装置、話者選択方法、話者選択用プログラムおよび話者適応モデル作成プログラム
WO2008047339A3 (en) Method and apparatus for large population speaker identification in telephone interactions
MX338524B (es) Aparato y metodo para posicionar microfonos basado en la densidad de potencia espacial.
WO2015009586A3 (en) Performing an operation relative to tabular data based upon voice input
WO2020098828A3 (en) System and method for personalized speaker verification
WO2016139670A8 (en) System and method for generating accurate speech transcription from natural speech audio signals
ATE484927T1 (de) Verfahren zur automatischen entzerrung eines tonsystems
WO2007044370A3 (en) System and method for tailoring music to an activity based on an activity goal
IN2014CN03504A (ja)
ATE470323T1 (de) Verfahren und system zum entzerren eines lautsprechers in einem raum
DE602005003643D1 (de) Verfahren zur Beschleunigung des Trainings eines akustischen Echokompensators in einem Vollduplexaudiokonferenzsystem durch akustische Strahlbildung
WO2010038075A3 (en) Apparatus and method for reproducing a sound field with a loudspeaker array controlled via a control volume
WO2011002731A3 (en) Music instruction system
WO2012134997A3 (en) Non-scorable response filters for speech scoring systems
WO2007095277A3 (en) Communication device having speaker independent speech recognition
DE602006018795D1 (de) Kompensation der variabilität zwischen sitzungen zur automatischen extraktion von informationen aus sprache
MX2011007762A (es) Aparato, metodo y programa de computadora para obtener un parametro que describe una variacion de una caracteristica de señal de una señal.
WO2014131054A3 (en) Dynamic audio perspective change during video playback
EP2573768A3 (en) Reverberation suppression device, reverberation suppression method, and computer-readable storage medium storing a reverberation suppression program
DE102007032272B8 (de) Verfahren zur Simulation einer Kopfhörerwiedergabe von Audiosignalen durch mehrere fokussierte Schallquellen
JP2016071029A5 (ja)
JP2011085641A5 (ja)
DE602004023134D1 (de) Spracherkennungsverfahren und -system, das an die eigenschaften von nichtmuttersprachlern angepasst ist
WO2012128798A3 (en) Simulator and method for simulating an acoustic field of an acoustic waveguide
EP2277170A4 (en) METHODS AND SYSTEMS FOR SIMPLIFYING COPY-GLUE OF TRANSCRIPTIONS GENERATED FROM TEXT-BASED TEXT SPEECH BASED ON DICTATION

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 08843694

Country of ref document: EP

Kind code of ref document: A1

DPE1 Request for preliminary examination filed after expiration of 19th month from priority date (pct application filed from 20040101)
WWE Wipo information: entry into national phase

Ref document number: 2009539120

Country of ref document: JP

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 08843694

Country of ref document: EP

Kind code of ref document: A1