CN111816205B - Intelligent aircraft type recognition method based on aircraft audio - Google Patents
Intelligent aircraft type recognition method based on aircraft audio
- Publication number
- CN111816205B (application CN202010657182.5A)
- Authority
- CN
- China
- Prior art keywords
- aircraft
- audio
- model
- mel
- airplane
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/24—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being the cepstrum
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02T—CLIMATE CHANGE MITIGATION TECHNOLOGIES RELATED TO TRANSPORTATION
- Y02T90/00—Enabling technologies or technologies with a potential or indirect contribution to GHG emissions mitigation
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Stereophonic System (AREA)
- Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
Claims (1)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202010657182.5A CN111816205B (zh) | 2020-07-09 | 2020-07-09 | Intelligent aircraft type recognition method based on aircraft audio |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202010657182.5A CN111816205B (zh) | 2020-07-09 | 2020-07-09 | Intelligent aircraft type recognition method based on aircraft audio |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN111816205A (zh) | 2020-10-23 |
| CN111816205B (zh) | 2023-06-20 |
Family
ID=72842330
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202010657182.5A (Active) CN111816205B (zh) | 2020-07-09 | 2020-07-09 | Intelligent aircraft type recognition method based on aircraft audio |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN111816205B (zh) |
Families Citing this family (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
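| CN112734709A (zh) * | 2020-12-31 | 2021-04-30 | Shanxi Sanyouhe Smart Information Technology Co., Ltd. | Melanoma detection method based on attention mechanism and transfer learning |
| CN112992121B (zh) * | 2021-03-01 | 2022-07-12 | Delu Power Technology (Chengdu) Co., Ltd. | Speech enhancement method based on attention residual learning |
| CN114999529B (zh) * | 2022-08-05 | 2022-11-01 | Civil Aviation University of China | Aircraft type classification method for airport aviation noise |
| CN116310770A (zh) * | 2023-02-08 | 2023-06-23 | Shanghai Marine Electronic Equipment Research Institute (726th Research Institute of China State Shipbuilding Corporation Limited) | Underwater acoustic target recognition method and system based on Mel cepstrum and attention residual network |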
Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
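| WO2018107810A1 (zh) * | 2016-12-15 | 2018-06-21 | Ping An Technology (Shenzhen) Co., Ltd. | Voiceprint recognition method, apparatus, electronic device and medium |
| WO2019023877A1 (zh) * | 2017-07-31 | 2019-02-07 | Shenzhen H&T Smart Home Technology Co., Ltd. | Specific sound recognition method, device and storage medium |
| CN109817246A (zh) * | 2019-02-27 | 2019-05-28 | Ping An Technology (Shenzhen) Co., Ltd. | Training method for emotion recognition model, emotion recognition method, apparatus, device and storage medium |
| CN110265035A (zh) * | 2019-04-25 | 2019-09-20 | Wuhan Dashengji Technology Co., Ltd. | Speaker recognition method based on deep learning |
| CN110782878A (zh) * | 2019-10-10 | 2020-02-11 | Tianjin University | Multi-scale audio scene recognition method based on attention mechanism |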
Family Cites Families (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| SG140445A1 (en) * | 2003-07-28 | 2008-03-28 | Sony Corp | Method and apparatus for automatically recognizing audio data |
Non-Patent Citations (1)
| Title |
|---|
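| Environmental sound classification method based on Mel-frequency cepstral coefficients, deep convolution and Bagging; Wang Tianrui; Bao Qianyue; Qin Pinle; Journal of Computer Applications (No. 12); full text * |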
Also Published As
| Publication number | Publication date |
|---|---|
| CN111816205A (zh) | 2020-10-23 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
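| CN111816205B (zh) | | Intelligent aircraft type recognition method based on aircraft audio |
| CN114298183B (zh) | | Intelligent flight maneuver recognition method |
| Chen et al. | | Self-supervised vision transformer-based few-shot learning for facial expression recognition |
| CN107016681B (zh) | | Brain MRI tumor segmentation method based on fully convolutional network |
| CN107609525B (zh) | | Remote sensing image object detection method using a convolutional neural network constructed with a pruning strategy |
| CN107481717B (zh) | | Acoustic model training method and system |
| CN112487949B (zh) | | Learner behavior recognition method based on multimodal data fusion |
| CN107506822A (zh) | | Deep neural network method based on spatial fusion pooling |
| CN110879989A (zh) | | ADS-B signal target recognition method based on a few-shot machine learning model |
| CN112465199B (zh) | | Airspace situation assessment system |
| CN110399850A (zh) | | Continuous sign language recognition method based on deep neural network |
| CN113053366A (zh) | | Consistency check method for air traffic control speech readback based on multimodal fusion |
| CN114067155B (zh) | | Image classification method, apparatus, product and storage medium based on meta-learning |
| CN113111786B (zh) | | Underwater target recognition method based on a graph convolutional network trained with few samples |
| CN108399395A (zh) | | Combined speech and face identity authentication method based on end-to-end deep neural network |
| CN109492750B (zh) | | Zero-shot image classification method based on convolutional neural network and factor space |
| CN118968258B (zh) | | Image generation method using a generative adversarial network based on a mixture-of-experts model |
| CN109859771B (zh) | | Acoustic scene clustering method jointly optimizing deep transformed features and the clustering process |
| CN109447092B (zh) | | Ice lead extraction method based on sea ice scene classification |
| CN105631469A (zh) | | Bird image recognition method using multi-layer sparse coding features |
| CN113850373A (zh) | | Class-based filter pruning method |
| CN112784487A (zh) | | Flight maneuver recognition method and apparatus |
| CN119295917A (zh) | | Lightweight method for aircraft target detection in SAR images |
| Tsai et al. | | Enhancing the identification accuracy of deep learning object detection using natural language processing |
| CN114898775A (zh) | | Speech emotion recognition method and system based on cross-layer cross fusion |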
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | PB01 | Publication | |
| | SE01 | Entry into force of request for substantive examination | |
| | CB03 | Change of inventor or designer information | Inventors after: Wang Weijie; Ye Ruida; Ren Yuan; He Liang; Yu Haoyuan; Fan Yahong; Zhang Keming; Zhang Xianwei. Inventors before: Wang Weijie; Ye Ruida; Ren Yuan; He Liang; Fan Yahong; Zhang Keming; Zhang Xianwei. |
| | GR01 | Patent grant | |