[go: up one dir, main page]

PL3579227T3 - Sposób i urządzenie do aktywacji mową oraz urządzenie elektroniczne - Google Patents

Sposób i urządzenie do aktywacji mową oraz urządzenie elektroniczne

Info

Publication number
PL3579227T3
PL3579227T3 PL18823086T PL18823086T PL3579227T3 PL 3579227 T3 PL3579227 T3 PL 3579227T3 PL 18823086 T PL18823086 T PL 18823086T PL 18823086 T PL18823086 T PL 18823086T PL 3579227 T3 PL3579227 T3 PL 3579227T3
Authority
PL
Poland
Prior art keywords
electronic device
speech activation
speech
activation
electronic
Prior art date
Application number
PL18823086T
Other languages
English (en)
Inventor
Zhiming Wang
Jun Zhou
Xiaolong Li
Original Assignee
Advanced New Technologies Co., Ltd.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Advanced New Technologies Co., Ltd. filed Critical Advanced New Technologies Co., Ltd.
Publication of PL3579227T3 publication Critical patent/PL3579227T3/pl

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/063Training
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/0499Feedforward networks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/09Supervised learning
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/096Transfer learning
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/02Feature extraction for speech recognition; Selection of recognition unit
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/16Speech classification or search using artificial neural networks
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/02Feature extraction for speech recognition; Selection of recognition unit
    • G10L2015/025Phonemes, fenemes or fenones being the recognition units
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/063Training
    • G10L2015/0631Creating reference templates; Clustering
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L2015/088Word spotting
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/223Execution procedure of a spoken command

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Multimedia (AREA)
  • Acoustics & Sound (AREA)
  • Human Computer Interaction (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Artificial Intelligence (AREA)
  • Theoretical Computer Science (AREA)
  • Evolutionary Computation (AREA)
  • Biophysics (AREA)
  • General Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Biomedical Technology (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Machine Translation (AREA)
  • User Interface Of Digital Computer (AREA)
  • Telephonic Communication Services (AREA)
  • Electric Clocks (AREA)
PL18823086T 2017-06-29 2018-06-26 Sposób i urządzenie do aktywacji mową oraz urządzenie elektroniczne PL3579227T3 (pl)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
CN201710514348.6A CN107358951A (zh) 2017-06-29 2017-06-29 一种语音唤醒方法、装置以及电子设备
PCT/CN2018/092899 WO2019001428A1 (zh) 2017-06-29 2018-06-26 一种语音唤醒方法、装置以及电子设备
EP18823086.6A EP3579227B1 (en) 2017-06-29 2018-06-26 Voice wake-up method and device and electronic device

Publications (1)

Publication Number Publication Date
PL3579227T3 true PL3579227T3 (pl) 2021-10-18

Family

ID=60274110

Family Applications (1)

Application Number Title Priority Date Filing Date
PL18823086T PL3579227T3 (pl) 2017-06-29 2018-06-26 Sposób i urządzenie do aktywacji mową oraz urządzenie elektroniczne

Country Status (11)

Country Link
US (2) US20200013390A1 (pl)
EP (1) EP3579227B1 (pl)
JP (1) JP6877558B2 (pl)
KR (1) KR102181836B1 (pl)
CN (1) CN107358951A (pl)
ES (1) ES2878137T3 (pl)
PH (1) PH12019501674A1 (pl)
PL (1) PL3579227T3 (pl)
SG (1) SG11201906576WA (pl)
TW (1) TWI692751B (pl)
WO (1) WO2019001428A1 (pl)

Families Citing this family (43)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107358951A (zh) * 2017-06-29 2017-11-17 阿里巴巴集团控股有限公司 一种语音唤醒方法、装置以及电子设备
CN108320733B (zh) * 2017-12-18 2022-01-04 上海科大讯飞信息科技有限公司 语音数据处理方法及装置、存储介质、电子设备
CN108182937B (zh) * 2018-01-17 2021-04-13 出门问问创新科技有限公司 关键词识别方法、装置、设备及存储介质
US11488002B2 (en) * 2018-02-15 2022-11-01 Atlazo, Inc. Binary neural network accelerator engine methods and systems
CN108597523B (zh) * 2018-03-23 2019-05-17 平安科技(深圳)有限公司 说话人认证方法、服务器及计算机可读存储介质
WO2019222996A1 (en) * 2018-05-25 2019-11-28 Beijing Didi Infinity Technology And Development Co., Ltd. Systems and methods for voice recognition
CN110619871B (zh) * 2018-06-20 2023-06-30 阿里巴巴集团控股有限公司 语音唤醒检测方法、装置、设备以及存储介质
US11257481B2 (en) 2018-10-24 2022-02-22 Tencent America LLC Multi-task training architecture and strategy for attention-based speech recognition system
CN111276138B (zh) * 2018-12-05 2023-07-18 北京嘀嘀无限科技发展有限公司 一种语音唤醒系统中处理语音信号的方法及装置
CN109886386B (zh) * 2019-01-30 2020-10-27 北京声智科技有限公司 唤醒模型的确定方法及装置
CN109872713A (zh) * 2019-03-05 2019-06-11 深圳市友杰智新科技有限公司 一种语音唤醒方法及装置
CN110310628B (zh) 2019-06-27 2022-05-20 百度在线网络技术(北京)有限公司 唤醒模型的优化方法、装置、设备及存储介质
US11081102B2 (en) * 2019-08-16 2021-08-03 Ponddy Education Inc. Systems and methods for comprehensive Chinese speech scoring and diagnosis
JP7098587B2 (ja) * 2019-08-29 2022-07-11 株式会社東芝 情報処理装置、キーワード検出装置、情報処理方法およびプログラム
CN110634468B (zh) * 2019-09-11 2022-04-15 中国联合网络通信集团有限公司 语音唤醒方法、装置、设备及计算机可读存储介质
CN110648668A (zh) * 2019-09-24 2020-01-03 上海依图信息技术有限公司 关键词检测装置和方法
CN110648659B (zh) * 2019-09-24 2022-07-01 上海依图信息技术有限公司 基于多任务模型的语音识别与关键词检测装置和方法
CN110970016B (zh) * 2019-10-28 2022-08-19 苏宁云计算有限公司 一种唤醒模型生成方法、智能终端唤醒方法及装置
CN110853629A (zh) * 2019-11-21 2020-02-28 中科智云科技有限公司 一种基于深度学习的语音识别数字的方法
CN110992929A (zh) * 2019-11-26 2020-04-10 苏宁云计算有限公司 一种基于神经网络的语音关键词检测方法、装置及系统
US11341954B2 (en) * 2019-12-17 2022-05-24 Google Llc Training keyword spotters
JP7438744B2 (ja) * 2019-12-18 2024-02-27 株式会社東芝 情報処理装置、情報処理方法、およびプログラム
CN111640426A (zh) * 2020-06-10 2020-09-08 北京百度网讯科技有限公司 用于输出信息的方法和装置
CN111883121A (zh) * 2020-07-20 2020-11-03 北京声智科技有限公司 唤醒方法、装置及电子设备
CN112233655B (zh) * 2020-09-28 2024-07-16 上海声瀚信息科技有限公司 一种提高语音命令词识别性能的神经网络训练方法
CN112669818B (zh) * 2020-12-08 2022-12-02 北京地平线机器人技术研发有限公司 语音唤醒方法及装置、可读存储介质、电子设备
CN112733272A (zh) * 2021-01-13 2021-04-30 南昌航空大学 一种解决带软时间窗的车辆路径问题的方法
CN112882760A (zh) * 2021-02-22 2021-06-01 北京声智科技有限公司 一种智能设备的唤醒方法、装置及设备
US12236939B2 (en) * 2021-03-12 2025-02-25 Samsung Electronics Co., Ltd. Method of generating a trigger word detection model, and an apparatus for the same
CN113113007A (zh) * 2021-03-30 2021-07-13 北京金山云网络技术有限公司 语音数据的处理方法和装置、电子设备和存储介质
US11967322B2 (en) 2021-05-06 2024-04-23 Samsung Electronics Co., Ltd. Server for identifying false wakeup and method for controlling the same
KR102599480B1 (ko) * 2021-05-18 2023-11-08 부산대학교 산학협력단 키워드 음성인식을 위한 자동 학습 시스템 및 방법
CN113160823B (zh) * 2021-05-26 2024-05-17 中国工商银行股份有限公司 基于脉冲神经网络的语音唤醒方法、装置及电子设备
CN113744734A (zh) * 2021-08-30 2021-12-03 青岛海尔科技有限公司 一种语音唤醒方法、装置、电子设备及存储介质
KR20230068087A (ko) * 2021-11-10 2023-05-17 삼성전자주식회사 전자 장치 및 그 제어 방법
CN113990296B (zh) * 2021-12-24 2022-05-27 深圳市友杰智新科技有限公司 语音声学模型的训练方法、后处理方法和相关设备
CN114333798A (zh) * 2022-01-04 2022-04-12 厦门快商通科技股份有限公司 语音识别唤醒的方法、装置、终端设备及计算机可读介质
CN114373461A (zh) * 2022-01-21 2022-04-19 贝壳找房网(北京)信息技术有限公司 门的控制方法和装置、电子设备和存储介质
CN115223555B (zh) * 2022-06-09 2025-11-25 中国科学技术大学 语音唤醒方法、声学模型的训练方法及相关装置
CN115171736B (zh) * 2022-07-13 2025-04-04 成都市联洲国际技术有限公司 语音活性检测模型的生成方法、处理器与电子设备
US20240119925A1 (en) * 2022-10-10 2024-04-11 Samsung Electronics Co., Ltd. System and method for post-asr false wake-up suppression
CN115862604B (zh) * 2022-11-24 2024-02-20 镁佳(北京)科技有限公司 语音唤醒模型训练及语音唤醒方法、装置及计算机设备
WO2025047998A1 (ko) * 2023-08-29 2025-03-06 주식회사 엔씨소프트 지정된 텍스트에 대응하는 음성 신호를 식별하기 위한 전자 장치, 방법, 및 컴퓨터 판독 가능 저장 매체

Family Cites Families (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH05128286A (ja) * 1991-11-05 1993-05-25 Ricoh Co Ltd ニユーラルネツトワークによるキーワードスポツテイング方式
JP2007179239A (ja) * 2005-12-27 2007-07-12 Kenwood Corp スケジュール管理装置及びプログラム
US9117449B2 (en) * 2012-04-26 2015-08-25 Nuance Communications, Inc. Embedded system for construction of small footprint speech recognition with user-definable constraints
US9177547B2 (en) * 2013-06-25 2015-11-03 The Johns Hopkins University System and method for processing speech to identify keywords or other information
CN104378723A (zh) * 2013-08-16 2015-02-25 上海耐普微电子有限公司 具有语音唤醒功能的麦克风
US9715660B2 (en) * 2013-11-04 2017-07-25 Google Inc. Transfer learning for deep neural network based hotword detection
US9443522B2 (en) * 2013-11-18 2016-09-13 Beijing Lenovo Software Ltd. Voice recognition method, voice controlling method, information processing method, and electronic apparatus
CN105096935B (zh) * 2014-05-06 2019-08-09 阿里巴巴集团控股有限公司 一种语音输入方法、装置和系统
US10783900B2 (en) * 2014-10-03 2020-09-22 Google Llc Convolutional, long short-term memory, fully connected deep neural networks
CN106463112B (zh) * 2015-04-10 2020-12-08 华为技术有限公司 语音识别方法、语音唤醒装置、语音识别装置及终端
CN106297774B (zh) * 2015-05-29 2019-07-09 中国科学院声学研究所 一种神经网络声学模型的分布式并行训练方法及系统
TWI639153B (zh) * 2015-11-03 2018-10-21 絡達科技股份有限公司 電子裝置及其透過語音辨識喚醒的方法
JP6679898B2 (ja) * 2015-11-24 2020-04-15 富士通株式会社 キーワード検出装置、キーワード検出方法及びキーワード検出用コンピュータプログラム
US10755698B2 (en) * 2015-12-07 2020-08-25 University Of Florida Research Foundation, Inc. Pulse-based automatic speech recognition
CN106887227A (zh) * 2015-12-16 2017-06-23 芋头科技(杭州)有限公司 一种语音唤醒方法及系统
CN105632486B (zh) * 2015-12-23 2019-12-17 北京奇虎科技有限公司 一种智能硬件的语音唤醒方法和装置
US10229672B1 (en) * 2015-12-31 2019-03-12 Google Llc Training acoustic models using connectionist temporal classification
CN105931633A (zh) * 2016-05-30 2016-09-07 深圳市鼎盛智能科技有限公司 语音识别的方法及系统
CN106098059B (zh) * 2016-06-23 2019-06-18 上海交通大学 可定制语音唤醒方法及系统
CN106611597B (zh) * 2016-12-02 2019-11-08 百度在线网络技术(北京)有限公司 基于人工智能的语音唤醒方法和装置
CN106782536B (zh) * 2016-12-26 2020-02-28 北京云知声信息技术有限公司 一种语音唤醒方法及装置
CN107221326B (zh) * 2017-05-16 2021-05-28 百度在线网络技术(北京)有限公司 基于人工智能的语音唤醒方法、装置和计算机设备
CN107358951A (zh) * 2017-06-29 2017-11-17 阿里巴巴集团控股有限公司 一种语音唤醒方法、装置以及电子设备

Also Published As

Publication number Publication date
US20200013390A1 (en) 2020-01-09
ES2878137T3 (es) 2021-11-18
JP2020517977A (ja) 2020-06-18
US20200168207A1 (en) 2020-05-28
EP3579227A4 (en) 2020-02-26
JP6877558B2 (ja) 2021-05-26
KR102181836B1 (ko) 2020-11-25
PH12019501674A1 (en) 2020-06-01
EP3579227A1 (en) 2019-12-11
WO2019001428A1 (zh) 2019-01-03
KR20190134594A (ko) 2019-12-04
TW201905897A (zh) 2019-02-01
EP3579227B1 (en) 2021-06-09
TWI692751B (zh) 2020-05-01
CN107358951A (zh) 2017-11-17
US10748524B2 (en) 2020-08-18
SG11201906576WA (en) 2019-08-27

Similar Documents

Publication Publication Date Title
PL3579227T3 (pl) Sposób i urządzenie do aktywacji mową oraz urządzenie elektroniczne
EP3664513A4 (en) POSITIONING PROCESS AND APPARATUS
EP3579150A4 (en) OPERATING APPARATUS AND METHOD
EP3627397A4 (en) TREATMENT PROCESS AND APPARATUS
EP3681239A4 (en) RANDOM ACCESS PROCESS AND DEVICE
PL3825814T3 (pl) Urządzenie elektroniczne i sposób jego łączności
EP3697162A4 (en) RANDOM ACCESS PROCESS AND DEVICE
EP3681237A4 (en) RANDOM ACCESS PROCESS, AND APPARATUS
EP3644314A4 (en) SOUND PROCESSING METHOD AND DEVICE
EP3849254A4 (en) POSITIONING PROCESS AND APPARATUS
EP3457788A4 (en) METHOD AND TERMINAL DEVICE
EP3576036A4 (en) SERVICE EXECUTION DEVICE AND METHOD
EP3616535A4 (en) AEROSOL GENERATION PROCESS AND APPARATUS
EP3668257A4 (en) SESSION PROCESSING PROCESS AND DEVICE
EP3500947A4 (en) LANGUAGE TRANSLATION DEVICE AND METHOD
EP3644663A4 (en) POSITIONING PROCESS AND APPARATUS
EP3564710A4 (en) POSITIONING METHOD AND APPARATUS
EP3589065A4 (en) SWITCHING METHOD AND APPARATUS
EP3657086A4 (en) APPARATUS CONTROL METHOD AND DEVICE
EP3565219A4 (en) SERVICE EXECUTION METHOD AND DEVICE
EP3654568A4 (en) SYNCHRONIZATION METHOD AND APPARATUS
EP3806502A4 (en) POSITIONING PROCESS AND APPARATUS
TWI800491B (zh) 塗布裝置及塗布方法
EP3721749A4 (en) DRAWING DEVICE AND DRAWING PROCESS
EP3675543A4 (en) DEVICE AND APPARATUS CONTROL PROCESS