WO2009057739A1 - Speaker selection device, speaker adaptation model creation device, speaker selection method, and speaker selection program - Google Patents
- Publication number
- WO2009057739A1 (application PCT/JP2008/069853)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- speaker
- speaker selection
- speakers
- selection
- model making
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
Landscapes
- Engineering & Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Artificial Intelligence (AREA)
- Computational Linguistics (AREA)
- Circuit For Audible Band Transducer (AREA)
Abstract
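Provided is a speaker selection device capable of suppressing degradation in the accuracy of an adapted model. The speaker selection device comprises: speaker distribution density calculation means that uses features extracted from the input speech signal of an uttering speaker, together with pre-stored speaker models of a plurality of speakers, to calculate the density of the distribution of those speakers around the uttering speaker in speaker space; and selected-speaker-count calculation means that uses this distribution density to calculate the number of speakers to select.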
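The two means named in the abstract (a density calculation around the uttering speaker, then a speaker-count calculation driven by that density) can be illustrated with a minimal sketch. The abstract gives no formulas, so everything concrete here is an assumption made for illustration: each speaker model is represented as a point in a Euclidean "speaker space", density is taken as the fraction of stored speakers within a fixed `radius`, and the count grows linearly with density between the hypothetical bounds `n_min` and `n_max`. The function names are likewise invented and do not come from the patent.

```python
import math

def local_density(target, speakers, radius):
    """Fraction of stored speakers within `radius` of the target point.

    Assumption: speakers are fixed-length vectors and distance is Euclidean.
    """
    if not speakers:
        return 0.0
    near = sum(1 for s in speakers if math.dist(target, s) <= radius)
    return near / len(speakers)

def num_speakers_to_select(density, n_min, n_max):
    """Map a density in [0, 1] linearly onto [n_min, n_max] (hypothetical rule)."""
    return n_min + round((n_max - n_min) * density)

def select_speakers(target, speakers, radius, n_min=1, n_max=10):
    """Return indices of the selected speakers, nearest first."""
    density = local_density(target, speakers, radius)
    n = min(num_speakers_to_select(density, n_min, n_max), len(speakers))
    ranked = sorted(range(len(speakers)),
                    key=lambda i: math.dist(target, speakers[i]))
    return ranked[:n]

# Three stored speakers cluster near the origin; one lies far away.
stored = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.2), (5.0, 5.0)]
dense_pick = select_speakers((0.0, 0.0), stored, radius=1.0, n_min=1, n_max=5)
sparse_pick = select_speakers((10.0, 10.0), stored, radius=1.0, n_min=1, n_max=5)
```

In this sketch a target in a sparse region receives fewer selected speakers, so that distant, dissimilar speakers are not pulled into the adaptation set. That direction is one plausible reading of how a density-dependent count could limit accuracy degradation; the abstract does not confirm it.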
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2009539120A, JP5626558B2 (ja) | 2007-10-31 | 2008-10-31 | Speaker selection device, speaker adaptation model creation device, speaker selection method, and speaker selection program |
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2007-283767 | 2007-10-31 | | |
| JP2007283767 | 2007-10-31 | | |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2009057739A1 (ja) | 2009-05-07 |
Family
ID=40591119
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| PCT/JP2008/069853 (Ceased), WO2009057739A1 (ja) | 2007-10-31 | 2008-10-31 | Speaker selection device, speaker adaptation model creation device, speaker selection method, and speaker selection program |
Country Status (2)
| Country | Link |
|---|---|
| JP (1) | JP5626558B2 (ja) |
| WO (1) | WO2009057739A1 (ja) |
Families Citing this family (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN117953851A (zh) * | 2022-10-20 | 2024-04-30 | Dell Products L.P. (戴尔产品有限公司) | Method, device, and computer program product for text-to-speech |
Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
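| JPH11143486A (ja) * | 1997-11-10 | 1999-05-28 | Fuji Xerox Co Ltd | Speaker adaptation device and method |
| JP2002149185A (ja) * | 2000-09-27 | 2002-05-24 | Koninkl Philips Electronics Nv | Method for determining an eigenspace representing a plurality of training speakers |
| WO2005034086A1 (ja) * | 2003-10-03 | 2005-04-14 | Asahi Kasei Kabushiki Kaisha | Data processing device and data processing device control program |
| JP3756879B2 (ja) * | 2001-12-20 | 2006-03-15 | Matsushita Electric Industrial Co., Ltd. | Method for creating an acoustic model, device for creating an acoustic model, and computer program for creating an acoustic model |
| WO2008117626A1 (ja) * | 2007-03-27 | 2008-10-02 | Nec Corporation | Speaker selection device, speaker adaptation model creation device, speaker selection method, speaker selection program, and speaker adaptation model creation program |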
2008
- 2008-10-31: WO application PCT/JP2008/069853 (WO2009057739A1), status: Ceased
- 2008-10-31: JP application JP2009539120A (JP5626558B2), status: Active
Patent Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
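| JPH11143486A (ja) * | 1997-11-10 | 1999-05-28 | Fuji Xerox Co Ltd | Speaker adaptation device and method |
| JP2002149185A (ja) * | 2000-09-27 | 2002-05-24 | Koninkl Philips Electronics Nv | Method for determining an eigenspace representing a plurality of training speakers |
| JP3756879B2 (ja) * | 2001-12-20 | 2006-03-15 | Matsushita Electric Industrial Co., Ltd. | Method for creating an acoustic model, device for creating an acoustic model, and computer program for creating an acoustic model |
| WO2005034086A1 (ja) * | 2003-10-03 | 2005-04-14 | Asahi Kasei Kabushiki Kaisha | Data processing device and data processing device control program |
| WO2008117626A1 (ja) * | 2007-03-27 | 2008-10-02 | Nec Corporation | Speaker selection device, speaker adaptation model creation device, speaker selection method, speaker selection program, and speaker adaptation model creation program |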
Also Published As
| Publication number | Publication date |
|---|---|
| JPWO2009057739A1 (ja) | 2011-03-10 |
| JP5626558B2 (ja) | 2014-11-19 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| WO2008117626A1 (ja) | | Speaker selection device, speaker adaptation model creation device, speaker selection method, speaker selection program, and speaker adaptation model creation program |
| WO2008047339A3 (en) | | Method and apparatus for large population speaker identification in telephone interactions |
| MX338524B (es) | | Apparatus and method for microphone positioning based on spatial power density |
| WO2015009586A3 (en) | | Performing an operation relative to tabular data based upon voice input |
| WO2020098828A3 (en) | | System and method for personalized speaker verification |
| WO2016139670A8 (en) | | System and method for generating accurate speech transcription from natural speech audio signals |
| ATE484927T1 (de) | | Method for automatic equalization of a sound system |
| WO2007044370A3 (en) | | System and method for tailoring music to an activity based on an activity goal |
| IN2014CN03504A (ja) | | |
| ATE470323T1 (de) | | Method and system for equalizing a loudspeaker in a room |
| DE602005003643D1 (de) | | Method for accelerating the training of an acoustic echo canceller in a full-duplex audio conferencing system by acoustic beamforming |
| WO2010038075A3 (en) | | Apparatus and method for reproducing a sound field with a loudspeaker array controlled via a control volume |
| WO2011002731A3 (en) | | Music instruction system |
| WO2012134997A3 (en) | | Non-scorable response filters for speech scoring systems |
| WO2007095277A3 (en) | | Communication device having speaker independent speech recognition |
| DE602006018795D1 (de) | | Compensation of inter-session variability for automatic extraction of information from speech |
| MX2011007762A (es) | | Apparatus, method, and computer program for obtaining a parameter describing a variation of a signal characteristic of a signal |
| WO2014131054A3 (en) | | Dynamic audio perspective change during video playback |
| EP2573768A3 (en) | | Reverberation suppression device, reverberation suppression method, and computer-readable storage medium storing a reverberation suppression program |
| DE102007032272B8 (de) | | Method for simulating headphone reproduction of audio signals using multiple focused sound sources |
| JP2016071029A5 (ja) | | |
| JP2011085641A5 (ja) | | |
| DE602004023134D1 (de) | | Speech recognition method and system adapted to the characteristics of non-native speakers |
| WO2012128798A3 (en) | | Simulator and method for simulating an acoustic field of an acoustic waveguide |
| EP2277170A4 (en) | | METHODS AND SYSTEMS FOR SIMPLIFYING COPY-GLUE OF TRANSCRIPTIONS GENERATED FROM TEXT-BASED TEXT SPEECH BASED ON DICTATION |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | 121 | EP: the EPO has been informed by WIPO that EP was designated in this application | Ref document number: 08843694; Country of ref document: EP; Kind code of ref document: A1 |
| | DPE1 | Request for preliminary examination filed after expiration of 19th month from priority date (PCT application filed from 20040101) | |
| | WWE | WIPO information: entry into national phase | Ref document number: 2009539120; Country of ref document: JP |
| | NENP | Non-entry into the national phase | Ref country code: DE |
| | 122 | EP: PCT application non-entry in European phase | Ref document number: 08843694; Country of ref document: EP; Kind code of ref document: A1 |