[go: up one dir, main page]

CN116796826A - Pulsar search network training method, pulsar search device and pulsar search equipment - Google Patents

Pulsar search network training method, pulsar search device and pulsar search equipment Download PDF

Info

Publication number
CN116796826A
CN116796826A CN202210226661.0A CN202210226661A CN116796826A CN 116796826 A CN116796826 A CN 116796826A CN 202210226661 A CN202210226661 A CN 202210226661A CN 116796826 A CN116796826 A CN 116796826A
Authority
CN
China
Prior art keywords
pulsar
search
network
phase distribution
distribution image
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202210226661.0A
Other languages
Chinese (zh)
Other versions
CN116796826B (en
Inventor
程小峰
王亚彪
丁玫菲
谢鸣
游善平
甘振业
李昱希
罗泽坤
孙众毅
黄飞跃
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tencent Technology Shenzhen Co Ltd
Original Assignee
Tencent Technology Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tencent Technology Shenzhen Co Ltd filed Critical Tencent Technology Shenzhen Co Ltd
Priority to CN202210226661.0A priority Critical patent/CN116796826B/en
Publication of CN116796826A publication Critical patent/CN116796826A/en
Application granted granted Critical
Publication of CN116796826B publication Critical patent/CN116796826B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/049Temporal neural networks, e.g. delay elements, oscillating neurons or pulsed inputs
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/084Backpropagation, e.g. using gradient descent

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Health & Medical Sciences (AREA)
  • Computing Systems (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Molecular Biology (AREA)
  • Artificial Intelligence (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Image Analysis (AREA)

Abstract

本发明提供了一种脉冲星搜索网络训练方法,包括:基于训练样本集合和所述候选体的脉冲轮廓图像,对脉冲星搜索网络的主干网络进行第一训练,确定主干网络的初始网络参数;基于训练样本集合和所述和脉冲星标签值,对脉冲星搜索网络的主干网络进行第二训练,得到脉冲星搜索网络的主干网络的目标参数,由此,能够在减少训练数据总量和无需重进行监督训练的前提下,稳定提高脉冲星搜索网络训练训练的准确率,减轻脉冲星搜索网络的过拟合,增强脉冲星搜索网络模型的泛化能力。本发明还提供了脉冲星搜索方法、装置、电子设备、计算机程序产品以及存储介质。本发明实施例可应用于云技术、人工智能、智慧交通、辅助驾驶等各种场景。

The invention provides a pulsar search network training method, which includes: based on a training sample set and the pulse profile image of the candidate body, performing first training on the backbone network of the pulsar search network, and determining the initial network parameters of the backbone network; Based on the training sample set and the above-mentioned and pulsar label values, the backbone network of the pulsar search network is secondly trained to obtain the target parameters of the backbone network of the pulsar search network. This can reduce the total amount of training data and eliminate the need for Under the premise of re-implementing supervised training, the accuracy of pulsar search network training can be steadily improved, the over-fitting of the pulsar search network can be reduced, and the generalization ability of the pulsar search network model can be enhanced. The present invention also provides a pulsar search method, device, electronic equipment, computer program product and storage medium. Embodiments of the present invention can be applied to various scenarios such as cloud technology, artificial intelligence, smart transportation, and assisted driving.

Description

脉冲星搜索网络训练方法、脉冲星搜索方法、装置、设备Pulsar search network training method, pulsar search method, device and equipment

技术领域Technical field

本发明涉及人工智能领域的图像处理技术,尤其涉及脉冲星搜索网络训练方法、脉冲星搜索方法、装置、电子设备、计算机程序产品以及存储介质。The present invention relates to image processing technology in the field of artificial intelligence, and in particular to pulsar search network training methods, pulsar search methods, devices, electronic equipment, computer program products and storage media.

背景技术Background technique

基于深度学习所进行的各类别识别,一直以来都是各应用场景下解决大量数据分的重要工具。例如,在图像、自然语言处理等应用场景中,对大量数据所实现的大规模分类和识别,以此来快速准确的获得相关的分类预测结果,加速所在应用场景的功能实现。Category recognition based on deep learning has always been an important tool for solving large amounts of data in various application scenarios. For example, in application scenarios such as images and natural language processing, large-scale classification and recognition of large amounts of data can be achieved to quickly and accurately obtain relevant classification prediction results and accelerate the implementation of functions in the application scenarios.

在对图像所进行的分类预测中,根据所部署的应用场景不同,可以执行不同的任务,在进行脉冲星搜索时,由于脉冲星搜索任务中通常会产生海量的候选体文件,例如FAST会将会有PB量级的年增量,预计将产生千万量级的候选体,故而在脉冲星候选体搜寻中,搜索模型的准确度和搜索模型的推理速度都将至关重要。In the classification and prediction of images, different tasks can be performed according to the deployed application scenarios. When performing pulsar searches, a large number of candidate files are usually generated in pulsar search tasks. For example, FAST will There will be annual increments of the order of PB, and it is expected that tens of millions of candidates will be generated. Therefore, in the search for pulsar candidates, the accuracy of the search model and the inference speed of the search model will be crucial.

发明内容Contents of the invention

有鉴于此,本发明实施例提供一种脉冲星搜索网络训练方法、脉冲星搜索方法装置、电子设备、计算机程序产品以及存储介质,能够实现通过对脉冲星搜索网络的训练,能够在减少训练数据总量和无需重进行监督训练的前提下,稳定提高脉冲星搜索网络训练训练的准确率,减轻脉冲星搜索网络的过拟合,增强脉冲星搜索网络模型的泛化能力,便于将及时所训练的脉冲星搜索网络模型部署于网络终端中,实现脉冲星搜索网络模型的大规模应用。In view of this, embodiments of the present invention provide a pulsar search network training method, pulsar search method device, electronic equipment, computer program products and storage media, which can achieve training of the pulsar search network and reduce training data. Under the premise of total amount and no need to re-supervised training, it can steadily improve the accuracy of pulsar search network training, reduce the over-fitting of pulsar search network, enhance the generalization ability of pulsar search network model, and facilitate the timely training of pulsar search network. The pulsar search network model is deployed in network terminals to realize large-scale application of the pulsar search network model.

本发明实施例的技术方案是这样实现的:The technical solution of the embodiment of the present invention is implemented as follows:

本发明实施例提供了一种脉冲星搜索网络训练方法,包括:Embodiments of the present invention provide a pulsar search network training method, which includes:

获取脉冲星搜索候选体信号对应的时间相位分布图像和频率相位分布图像;Obtain the time phase distribution image and frequency phase distribution image corresponding to the pulsar search candidate signal;

对所述时间相位分布图像和频率相位分布图像进行组合,形成所述脉冲星搜索网络的训练样本集合;The time phase distribution image and the frequency phase distribution image are combined to form a training sample set for the pulsar search network;

获取所述获取脉冲星搜索候选体信号对应的候选体的脉冲轮廓图像和脉冲星标签值;Obtain the pulse profile image and pulsar label value of the candidate body corresponding to the obtained pulsar search candidate body signal;

基于所述训练样本集合和所述候选体的脉冲轮廓图像,对所述脉冲星搜索网络的主干网络进行第一训练,确定所述主干网络的初始网络参数;Based on the training sample set and the pulse profile image of the candidate body, perform a first training on the backbone network of the pulsar search network, and determine the initial network parameters of the backbone network;

基于所述训练样本集合和所述和脉冲星标签值,对所述脉冲星搜索网络的主干网络进行第二训练,以实现调整所述脉冲星搜索网络的主干网络的初始参数,得到所述脉冲星搜索网络的主干网络的目标参数。Based on the training sample set and the sum pulsar label value, perform a second training on the backbone network of the pulsar search network to adjust the initial parameters of the backbone network of the pulsar search network to obtain the pulsar search network. Target parameters of the backbone network of the star search network.

本发明实施例还提供了一种脉冲星搜索方法,包括:Embodiments of the present invention also provide a pulsar search method, including:

接收无标注的脉冲星搜索候选体信号;Receive unlabeled pulsar search candidate signals;

获取与所述脉冲星搜索候选体信号对应的时间相位分布图像和频率相位分布图像;Obtaining a time phase distribution image and a frequency phase distribution image corresponding to the pulsar search candidate signal;

通过脉冲星搜索网络的主干网络,对所述时间相位分布图像和频率相位分布图像进行处理,确定图像处理结果;Process the time phase distribution image and frequency phase distribution image through the backbone network of the pulsar search network to determine the image processing results;

通过所述脉冲星搜索网络的分类任务全连接层网络,对所述图像处理结果进行处理,得到所述脉冲星搜索候选体信号的预测结果。Through the classification task fully connected layer network of the pulsar search network, the image processing results are processed to obtain the prediction results of the pulsar search candidate signals.

本发明实施例还提供了一种脉冲星搜索网络训练装置,所述装置包括:An embodiment of the present invention also provides a pulsar search network training device, which includes:

信息传输模块,用于获取脉冲星搜索候选体信号对应的时间相位分布图像和频率相位分布图像;The information transmission module is used to obtain the time phase distribution image and frequency phase distribution image corresponding to the pulsar search candidate signal;

信息处理模块,用于对所述时间相位分布图像和频率相位分布图像进行组合,形成所述脉冲星搜索网络的训练样本集合;An information processing module, configured to combine the time phase distribution image and the frequency phase distribution image to form a training sample set for the pulsar search network;

所述信息处理模块,用于获取所述获取脉冲星搜索候选体信号对应的候选体的脉冲轮廓图像和脉冲星标签值;The information processing module is used to obtain the pulse profile image and pulsar label value of the candidate corresponding to the obtained pulsar search candidate signal;

所述信息处理模块,用于基于所述训练样本集合和所述候选体的脉冲轮廓图像,对所述脉冲星搜索网络的主干网络进行第一训练,确定所述主干网络的初始网络参数;The information processing module is configured to perform first training on the backbone network of the pulsar search network based on the training sample set and the pulse profile image of the candidate body, and determine the initial network parameters of the backbone network;

所述信息处理模块,用于基于所述训练样本集合和所述和脉冲星标签值,对所述脉冲星搜索网络的主干网络进行第二训练,以实现调整所述脉冲星搜索网络的主干网络的初始参数,得到所述脉冲星搜索网络的主干网络的目标参数。The information processing module is configured to perform a second training on the backbone network of the pulsar search network based on the training sample set and the sum pulsar label value, so as to adjust the backbone network of the pulsar search network. The initial parameters are used to obtain the target parameters of the backbone network of the pulsar search network.

上述方案中,In the above scheme,

所述信息处理模块,用于确定所述时间相位分布图像和频率相位分布图像的噪声值,对所述确定所述时间相位分布图像和频率相位分布图像分别进行除噪处理;The information processing module is used to determine the noise value of the time phase distribution image and the frequency phase distribution image, and perform denoising processing on the determined time phase distribution image and frequency phase distribution image respectively;

所述信息处理模块,用于对经过除噪处理的所述时间相位分布图像和频率相位分布图像分别进行标准化处理,得到符合单通道灰度图像标准的时间相位分布图像和频率相位分布图像;The information processing module is used to standardize the time phase distribution image and frequency phase distribution image that have been denoised, respectively, to obtain a time phase distribution image and a frequency phase distribution image that comply with single-channel grayscale image standards;

所述信息处理模块,用于将符合单通道灰度图像标准的时间相位分布图像和频率相位分布图像进行组合,形成所述脉冲星搜索网络的训练样本集合。The information processing module is used to combine time phase distribution images and frequency phase distribution images that comply with single-channel grayscale image standards to form a training sample set for the pulsar search network.

上述方案中,In the above scheme,

所述信息处理模块,用于根据所述脉冲星搜索候选体信号所对应的目标区域的位置,确定与所述脉冲星搜索网络的使用环境相匹配的动态噪声阈值;The information processing module is configured to determine a dynamic noise threshold that matches the usage environment of the pulsar search network based on the position of the target area corresponding to the pulsar search candidate signal;

所述信息处理模块,用于根据所述动态噪声阈值对所述时间相位分布图像和所述频率相位分布图像进行降噪处理,以形成与所述动态噪声阈值相匹配的时间相位分布图像和频率相位分布图像。The information processing module is configured to perform noise reduction processing on the time phase distribution image and the frequency phase distribution image according to the dynamic noise threshold to form a time phase distribution image and frequency that match the dynamic noise threshold. Phase distribution image.

上述方案中,In the above scheme,

所述信息处理模块,用于根据所述脉冲星搜索候选体信号所对应的射电望远镜类型,确定与所述脉冲星搜索网络的使用环境相匹配的固定噪声阈值;The information processing module is configured to determine a fixed noise threshold that matches the usage environment of the pulsar search network according to the radio telescope type corresponding to the pulsar search candidate signal;

所述信息处理模块,用于根据所述固定噪声阈值对所述时间相位分布图像和所述频率相位分布图像进行降噪处理,以形成与所述固定噪声阈值相匹配的时间相位分布图像和频率相位分布图像。The information processing module is configured to perform noise reduction processing on the time phase distribution image and the frequency phase distribution image according to the fixed noise threshold to form a time phase distribution image and frequency that match the fixed noise threshold. Phase distribution image.

上述方案中,In the above scheme,

所述信息处理模块,用于获取所述脉冲星搜索候选体信号对应的候选体的原始脉冲轮廓图像;The information processing module is used to obtain the original pulse profile image of the candidate body corresponding to the pulsar search candidate body signal;

所述信息处理模块,用于对所述的原始脉冲轮廓图像的序列值进行标准化处理,得到所述原始脉冲轮廓图像标准序列值;The information processing module is used to standardize the sequence values of the original pulse profile image to obtain the standard sequence values of the original pulse profile image;

所述信息处理模块,用于对所述原始脉冲轮廓图像标准序列值进行调整,得到所述脉冲星搜索候选体信号对应的脉冲轮廓图像,以实现与所述脉冲星搜索网络的精确度相匹配。The information processing module is used to adjust the standard sequence value of the original pulse profile image to obtain the pulse profile image corresponding to the pulsar search candidate signal, so as to match the accuracy of the pulsar search network. .

上述方案中,In the above scheme,

所述信息处理模块,用于基于所述候选体的脉冲轮廓图像,确定所述候选体的脉冲轮廓图像的标签值;The information processing module is configured to determine the label value of the pulse profile image of the candidate body based on the pulse profile image of the candidate body;

所述信息处理模块,用于通过所述主干网络对应的预训练全连接层网络,基于所述脉冲轮廓图像的标签值,确定所述第一训练对应的第一损失函数;The information processing module is configured to determine the first loss function corresponding to the first training based on the label value of the pulse profile image through the pre-trained fully connected layer network corresponding to the backbone network;

所述信息处理模块,用于将所述训练样本集合中的训练样本,带入所述第一损失函数;The information processing module is used to bring the training samples in the training sample set into the first loss function;

所述信息处理模块,用于当所述第一损失函数满足第一收敛条件时,确定所述主干网络的初始网络参数。The information processing module is configured to determine initial network parameters of the backbone network when the first loss function satisfies the first convergence condition.

上述方案中,In the above scheme,

所述信息处理模块,用于通过所述主干网络对应的分类任务全连接层网络,基于所述脉冲星标签值,确定所述第二训练对应的第二损失函数;The information processing module is configured to determine the second loss function corresponding to the second training based on the pulsar label value through the fully connected layer network of the classification task corresponding to the backbone network;

所述信息处理模块,用于将所述训练样本集合中的训练样本,带入所述第二损失函数;The information processing module is used to bring the training samples in the training sample set into the second loss function;

所述信息处理模块,用于当所述第二损失函数满足第二收敛条件时,确定所述主干网络的更新参数;The information processing module is configured to determine the update parameters of the backbone network when the second loss function satisfies the second convergence condition;

所述信息处理模块,用于基于所述主干网络的更新参数,调整所述脉冲星搜索网络的主干网络的初始参数,得到所述脉冲星搜索网络的主干网络的目标参数。The information processing module is configured to adjust the initial parameters of the backbone network of the pulsar search network based on the updated parameters of the backbone network to obtain target parameters of the backbone network of the pulsar search network.

本发明实施例还提供了一种脉冲星搜索装置,包括:An embodiment of the present invention also provides a pulsar search device, including:

信号传输模块,用于接收无标注的脉冲星搜索候选体信号;The signal transmission module is used to receive unlabeled pulsar search candidate signals;

信号处理模块,用于获取与所述脉冲星搜索候选体信号对应的时间相位分布图像和频率相位分布图像;A signal processing module, used to obtain the time phase distribution image and frequency phase distribution image corresponding to the pulsar search candidate signal;

所述信号处理模块,用于通过脉冲星搜索网络的主干网络,对所述时间相位分布图像和频率相位分布图像进行处理,确定图像处理结果;The signal processing module is used to process the time phase distribution image and frequency phase distribution image through the backbone network of the pulsar search network, and determine the image processing results;

所述信号处理模块,用于通过所述脉冲星搜索网络的分类任务全连接层网络,对所述图像处理结果进行处理,得到所述脉冲星搜索候选体信号的预测结果。The signal processing module is used to process the image processing results through the fully connected layer network of the classification task of the pulsar search network to obtain the prediction result of the pulsar search candidate signal.

本发明实施例还提供了一种电子设备,所述电子设备包括:An embodiment of the present invention also provides an electronic device. The electronic device includes:

存储器,用于存储可执行指令;Memory, used to store executable instructions;

处理器,用于运行所述存储器存储的可执行指令时,实现前述的脉冲星搜索网络训练方法。The processor is configured to implement the aforementioned pulsar search network training method when running executable instructions stored in the memory.

本发明实施例还提供了一种计算机可读存储介质,存储有可执行指令,其特征在于,所述可执行指令被处理器执行时实现前述的脉冲星搜索网络训练方法。An embodiment of the present invention also provides a computer-readable storage medium storing executable instructions, which is characterized in that when the executable instructions are executed by a processor, the aforementioned pulsar search network training method is implemented.

本发明实施例具有以下有益效果:The embodiments of the present invention have the following beneficial effects:

本发明通过获取脉冲星搜索候选体信号对应的时间相位分布图像和频率相位分布图像;对所述时间相位分布图像和频率相位分布图像进行组合,形成所述脉冲星搜索网络的训练样本集合;获取所述获取脉冲星搜索候选体信号对应的候选体的脉冲轮廓图像和脉冲星标签值;基于所述训练样本集合和所述候选体的脉冲轮廓图像,对所述脉冲星搜索网络的主干网络进行第一训练,确定所述主干网络的初始网络参数;基于所述训练样本集合和所述和脉冲星标签值,对所述脉冲星搜索网络的主干网络进行第二训练,以实现调整所述脉冲星搜索网络的主干网络的初始参数,得到所述脉冲星搜索网络的主干网络的目标参数,由此,能够实现通过对脉冲星搜索网络的训练,能够在减少训练数据总量和无需重进行监督训练的前提下,稳定提高脉冲星搜索网络训练训练的准确率,减轻脉冲星搜索网络的过拟合,增强脉冲星搜索网络模型的泛化能力,便于将及时所训练的脉冲星搜索网络模型部署于网络终端中,实现脉冲星搜索网络模型的大规模应用。The present invention obtains the time phase distribution image and frequency phase distribution image corresponding to the pulsar search candidate signal; combines the time phase distribution image and the frequency phase distribution image to form a training sample set of the pulsar search network; obtains Obtaining the pulse profile image and pulsar label value of the candidate body corresponding to the pulsar search candidate body signal; based on the training sample set and the pulse profile image of the candidate body, conducting the backbone network of the pulsar search network The first training is to determine the initial network parameters of the backbone network; based on the training sample set and the sum pulsar label value, the second training is performed on the backbone network of the pulsar search network to adjust the pulse The initial parameters of the backbone network of the star search network are obtained to obtain the target parameters of the backbone network of the pulsar search network. Therefore, it is possible to reduce the total amount of training data and eliminate the need for re-supervision by training the pulsar search network. Under the premise of training, it can steadily improve the accuracy of pulsar search network training, reduce over-fitting of pulsar search network, enhance the generalization ability of pulsar search network model, and facilitate the deployment of timely trained pulsar search network model. In network terminals, large-scale application of the pulsar search network model is realized.

附图说明Description of the drawings

图1是本发明实施例提供的脉冲星搜索网络训练方法的使用环境示意图;Figure 1 is a schematic diagram of the usage environment of the pulsar search network training method provided by the embodiment of the present invention;

图2为本发明实施例提供的电子设备的组成结构示意图;Figure 2 is a schematic structural diagram of an electronic device provided by an embodiment of the present invention;

图3为本发明实施例提供的脉冲星搜索网络训练方法一个可选的流程示意图;Figure 3 is an optional flow diagram of a pulsar search network training method provided by an embodiment of the present invention;

图4A为本发明实施例中脉冲星搜索网络的样本集合示意图;Figure 4A is a schematic diagram of the sample set of the pulsar search network in the embodiment of the present invention;

图4B为本发明实施例中脉冲星搜索网络的样本集合示意图;Figure 4B is a schematic diagram of the sample set of the pulsar search network in the embodiment of the present invention;

图5为本发明实施例提供的脉冲星搜索网络训练方法一个可选的流程示意图;Figure 5 is an optional flow diagram of the pulsar search network training method provided by the embodiment of the present invention;

图6为本发明实施例中脉冲星搜索网络的网络结构示意图;Figure 6 is a schematic network structure diagram of the pulsar search network in the embodiment of the present invention;

图7为本发明实施例提供的脉冲星搜索方法一个可选的流程示意图;Figure 7 is an optional flow diagram of the pulsar search method provided by the embodiment of the present invention;

图8为本发明实施例中脉冲星搜索候选体信号示意图。Figure 8 is a schematic diagram of pulsar search candidate signals in an embodiment of the present invention.

具体实施方式Detailed ways

为了使本发明的目的、技术方案和优点更加清楚,下面将结合附图对本发明作进一步地详细描述,所描述的实施例不应视为对本发明的限制,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其它实施例,都属于本发明保护的范围。In order to make the purpose, technical solutions and advantages of the present invention clearer, the present invention will be described in further detail below in conjunction with the accompanying drawings. The described embodiments should not be regarded as limiting the present invention. Those of ordinary skill in the art will not make any All other embodiments obtained under the premise of creative work belong to the scope of protection of the present invention.

在以下的描述中,涉及到“一些实施例”,其描述了所有可能实施例的子集,但是可以理解,“一些实施例”可以是所有可能实施例的相同子集或不同子集,并且可以在不冲突的情况下相互结合。In the following description, references to "some embodiments" describe a subset of all possible embodiments, but it is understood that "some embodiments" may be the same subset or a different subset of all possible embodiments, and Can be combined with each other without conflict.

对本发明实施例进行进一步详细说明之前,对本发明实施例中涉及的名词和术语进行说明,本发明实施例中涉及的名词和术语适用于如下的解释。Before further describing the embodiments of the present invention in detail, nouns and terms involved in the embodiments of the present invention will be described. The nouns and terms involved in the embodiments of the present invention are suitable for the following explanations.

1)响应于,用于表示所执行的操作所依赖的条件或者状态,当满足所依赖的条件或状态时,所执行的一个或多个操作可以是实时的,也可以具有设定的延迟;在没有特别说明的情况下,所执行的多个操作不存在执行先后顺序的限制。1) Response is used to represent the conditions or states on which the performed operations depend. When the dependent conditions or states are met, the one or more operations performed may be in real time or may have a set delay; Unless otherwise specified, there is no restriction on the execution order of the multiple operations performed.

2)下采样处理,对于一个样值序列间隔几个样值取样一次,这样得到新序列就是原序列的下采样,例如:对于一幅图像I尺寸为M*N,对其进行s倍下采样,即得到(M/s)*(N/s)尺寸的得分辨率图像,其中s应该是M和N的公约数2) Downsampling processing, a sample sequence is sampled at intervals of several samples, so that the new sequence obtained is the downsampling of the original sequence. For example: for an image whose I size is M*N, downsample it by s times. , that is, a resolution image of (M/s)*(N/s) size is obtained, where s should be the common divisor of M and N

3)模型训练,对图像数据集进行多分类学习。该模型可采用TensorFlow、torch等深度学习框架进行构建,使用CNN等神经网络层的多层结合组成多分类模型。模型的输入为图像经过openCV等工具读取形成的三通道或原通道矩阵,模型输出为多分类概率,通过softmax等算法最终输出图像类别。在训练时,模型通过交叉熵等目标函数向正确趋势逼近。3) Model training, performing multi-classification learning on image data sets. The model can be constructed using deep learning frameworks such as TensorFlow and torch, and uses multi-layer combinations of neural network layers such as CNN to form a multi-classification model. The input of the model is the three-channel or original channel matrix formed by reading the image through tools such as openCV. The output of the model is multi-classification probability, and the image category is finally output through algorithms such as softmax. During training, the model approaches the correct trend through objective functions such as cross-entropy.

4)带噪识别:基于噪声样本进行图像识别任务的训练,噪声样本包括具有错误类别标注的样本、具有不准确类别标注的样本,例如,图像与类别标签不完全对应,两个类别标签的概念具有部分重叠,图像具有上述两种类别标签的属性,但仅具有一种类别标签。4) Noisy recognition: Training of image recognition tasks based on noise samples. Noisy samples include samples with wrong category labels and samples with inaccurate category labels. For example, images do not completely correspond to category labels, and the concept of two category labels With partial overlap, an image has properties of both of the above class labels, but has only one class label.

5)Contrastive loss:对比损失函数,其可以学习一种映射关系,这种映射关系可以使得在高维空间中,相同类别但距离较远的点,通过函数映射到低维空间后,距离变近,不同类别但距离都较近的点,通过映射后再低维空间变得更远。这样的结果就是,在低维空间,同一种类的点会产生聚类的效果,不同种类的mean会隔开。类似fisher降维,但fisher降维不具有out-of-sample extension的效果,不能对new sample进行作用。5) Contrastive loss: Contrastive loss function, which can learn a mapping relationship, which can make points of the same category but far apart in a high-dimensional space become closer after being mapped to a low-dimensional space through a function , points of different categories but close together become further apart in the low-dimensional space after mapping. The result is that in a low-dimensional space, points of the same type will produce a clustering effect, and means of different types will be separated. Similar to fisher dimensionality reduction, but fisher dimensionality reduction does not have the effect of out-of-sample extension and cannot affect new samples.

6)客户端,终端中实现特定功能的载体,例如移动客户端(APP)是移动终端中特定功能的载体,例如执行用户手势识别的功能的程序。6) Client, a carrier that implements specific functions in the terminal. For example, a mobile client (APP) is a carrier of specific functions in the mobile terminal, such as a program that performs the function of user gesture recognition.

7)Soft max:归一化指数函数,是逻辑函数的一种推广。它能将一个含任意实数的K维向量“压缩”到另一个K维实向量中,使得每一个元素的范围都在[0,1]之间,并且所有元素的和为1。7)Soft max: Normalized exponential function, which is a generalization of the logical function. It can "compress" a K-dimensional vector containing any real number into another K-dimensional real vector, so that each element ranges between [0, 1], and the sum of all elements is 1.

图1为本发明实施例提供的脉冲星搜索网络训练方法的使用场景示意图,参见图1,终端(包括终端10-1和终端10-2)上设置有能够执行不同功能相应客户端其中,服务器200存储射电望远镜400所接收的脉冲信号,客户端为终端(包括终端10-1和终端10-2)通过网络300从相应的服务器200中获取不同脉冲星候选体信号进行浏览,或者获取相应的脉冲星候选体信号的识别结果,终端通过网络300连接服务器200,网络300可以是广域网或者局域网,又或者是二者的组合,使用无线链路实现数据传输,其中,终端(包括终端10-1和终端10-2)通过网络300从相应的服务器200中所获取的相应脉冲星候选体信号既可以相同也可以不相同,例如:终端(包括终端10-1和终端10-2)既可以通过网络300从相应的服务器200中获取与某一已知脉冲星的带标注信息的脉冲星体信号,也可以通过网络300从相应的服务器200中获取无标注信息的脉冲星候选体信号,还可以获取无标注信息的脉冲星候选体信号的最终识别结果(对未知脉冲星)。服务器200中可以保存有不同目标对象各自对应的无标注信息的脉冲星候选体信号和相应的识别结果。在本发明的一些实施例中,服务器200中所保存的无标注信息的脉冲星候选体信号是由射电望远镜400采集得到的。同时服务器200所保存的还可以是已知脉冲星的脉冲信号,例如:(1)HTRU:高时间分别率的宇宙脉冲星巡天(High Time Resolution UniverseSurvey),数据集中含有1196个已知脉冲星和89996个非脉冲星,其中的正样本大多是比较强的脉冲星信号,负样本数远多于正样本属于类别极度不平衡分类任务。(2)PMPS、PMPS-26k、P309:Parkes多波束脉冲巡天(Parkes Multi-beam PulsarSurvey)其中PMPS总共4.25T其中大多为初筛样本,PMPS-26k含有脉冲星样本2000个,射电干扰20000个,其他信号2000个,未标记样本2000个合计26k个。P309含有脉冲星2698个,射电干扰1656个。数据集总体来说含有样本种类较多,正样本中既有信号强样本也有信号较弱样本。以新一代500m口径球面射电望远镜(FAST Five Hundred MeterApertureSpherical Telescope)为例,服务器所存储的数据中含有脉冲星1163颗(可用训练集合837)非脉冲信号14319(可用训练结合998个)个。Figure 1 is a schematic diagram of the usage scenario of the pulsar search network training method provided by the embodiment of the present invention. Referring to Figure 1, terminals (including terminal 10-1 and terminal 10-2) are provided with corresponding clients capable of performing different functions. The server 200 stores the pulse signals received by the radio telescope 400. The client is a terminal (including terminal 10-1 and terminal 10-2) that obtains different pulsar candidate signals from the corresponding server 200 through the network 300 for browsing, or obtains the corresponding For the identification result of the pulsar candidate signal, the terminal connects to the server 200 through the network 300. The network 300 can be a wide area network or a local area network, or a combination of the two, using wireless links to realize data transmission, in which the terminal (including the terminal 10-1 The corresponding pulsar candidate signals obtained from the corresponding server 200 through the network 300 and the terminal 10-2) may be the same or different. For example, the terminals (including the terminal 10-1 and the terminal 10-2) may either The network 300 obtains a pulsar candidate signal with annotation information related to a certain known pulsar from the corresponding server 200. It can also obtain a pulsar candidate signal without annotation information from the corresponding server 200 through the network 300. It can also obtain The final identification result of the pulsar candidate signal without annotation information (for unknown pulsars). The server 200 may store pulsar candidate signals without annotation information corresponding to different target objects and corresponding recognition results. In some embodiments of the present invention, the pulsar candidate signals without annotation information stored in the server 200 are collected by the radio telescope 400 . At the same time, the server 200 may also store pulse signals of known pulsars, for example: (1) HTRU: High Time Resolution Universe Survey, the data set contains 1196 known pulsars and There are 89996 non-pulsars, and most of the positive samples are relatively strong pulsar signals. The number of negative samples is much more than the positive samples, which belongs to the extremely imbalanced classification task. (2) PMPS, PMPS-26k, P309: Parkes Multi-beam PulsarSurvey. PMPS has a total of 4.25T, most of which are preliminary screening samples. PMPS-26k contains 2,000 pulsar samples and 20,000 radio interference samples. There are 2000 other signals and 2000 unlabeled samples, totaling 26k. P309 contains 2698 pulsars and 1656 radio interferences. The data set generally contains a wide variety of samples, and the positive samples include both samples with strong signals and samples with weak signals. Taking the new generation 500m aperture spherical radio telescope (FAST Five Hundred Meter Aperture Spherical Telescope) as an example, the data stored in the server contains 1163 pulsars (837 available training sets) and 14319 non-pulse signals (998 available training sets).

通过本申请所提供的脉冲星搜索网络训练方法所训练的脉冲星搜索网络,可以实现利用无标注脉冲星搜索候选体信号对脉冲星的搜索,当然在使用前,需要首先对脉冲星搜索网络进行训练,具体包括:The pulsar search network trained by the pulsar search network training method provided in this application can realize the search for pulsars using unlabeled pulsar search candidate signals. Of course, before use, the pulsar search network needs to be first Training, specifically including:

获取脉冲星搜索候选体信号对应的时间相位分布图像和频率相位分布图像;对所述时间相位分布图像和频率相位分布图像进行组合,形成所述脉冲星搜索网络的训练样本集合;获取所述获取脉冲星搜索候选体信号对应的候选体的脉冲轮廓图像和脉冲星标签值;基于所述训练样本集合和所述候选体的脉冲轮廓图像,对所述脉冲星搜索网络的主干网络进行第一训练,确定所述主干网络的初始网络参数;基于所述训练样本集合和所述和脉冲星标签值,对所述脉冲星搜索网络的主干网络进行第二训练,以实现调整所述脉冲星搜索网络的主干网络的初始参数,得到所述脉冲星搜索网络的主干网络的目标参数。Obtain the time phase distribution image and frequency phase distribution image corresponding to the pulsar search candidate signal; combine the time phase distribution image and the frequency phase distribution image to form a training sample set of the pulsar search network; obtain the acquisition Pulsar search candidate body's pulse profile image and pulsar label value corresponding to the candidate body signal; based on the training sample set and the pulse profile image of the candidate body, perform first training on the backbone network of the pulsar search network , determine the initial network parameters of the backbone network; based on the training sample set and the sum pulsar label value, perform a second training on the backbone network of the pulsar search network to adjust the pulsar search network The initial parameters of the backbone network are used to obtain the target parameters of the backbone network of the pulsar search network.

为了更好地理解本申请实施例提供的方法,首先对人工智能、人工智能的各个分支,以及本申请实施例提供的方法所涉及的应用领域、云技术和人工智能云服务进行说明。In order to better understand the methods provided by the embodiments of the present application, artificial intelligence, various branches of artificial intelligence, and the application fields, cloud technologies and artificial intelligence cloud services involved in the methods provided by the embodiments of the present application are first described.

人工智能(Artificial Intelligence,AI)是利用数字计算机或者数字计算机控制的机器模拟、延伸和扩展人的智能,感知环境、获取知识并使用知识获得最佳结果的理论、方法、技术及应用系统。换句话说,人工智能是计算机科学的一个综合技术,它企图了解智能的实质,并生产出一种新的能以人类智能相似的方式做出反应的智能机器。人工智能也就是研究各种智能机器的设计原理与实现方法,使机器具有感知、推理与决策的功能。Artificial Intelligence (AI) is a theory, method, technology and application system that uses digital computers or machines controlled by digital computers to simulate, extend and expand human intelligence, perceive the environment, acquire knowledge and use knowledge to obtain the best results. In other words, artificial intelligence is a comprehensive technology of computer science that attempts to understand the essence of intelligence and produce a new intelligent machine that can respond in a similar way to human intelligence. Artificial intelligence is the study of the design principles and implementation methods of various intelligent machines, so that the machines have the functions of perception, reasoning and decision-making.

人工智能技术是一门综合学科,涉及领域广泛,既有硬件层面的技术也有软件层面的技术。人工智能基础技术一般包括如传感器、专用人工智能芯片、云计算、分布式存储、大数据处理技术、操作/交互系统、机电一体化等技术。人工智能软件技术主要包括计算机视觉技术、语音处理技术、自然语言处理技术以及机器学习/深度学习等几大方向。以下对各个方向分别进行说明。Artificial intelligence technology is a comprehensive subject that covers a wide range of fields, including both hardware-level technology and software-level technology. Basic artificial intelligence technologies generally include technologies such as sensors, dedicated artificial intelligence chips, cloud computing, distributed storage, big data processing technology, operation/interaction systems, mechatronics and other technologies. Artificial intelligence software technology mainly includes computer vision technology, speech processing technology, natural language processing technology, and machine learning/deep learning. Each direction is explained below.

自然语言处理(NLP,Nature Language processin)是计算机科学领域与人工智能领域中的一个重要方向。它研究能实现人与计算机之间用自然语言进行有效通信的各种理论和方法。自然语言处理是一门融语言学、计算机科学、数学于一体的科学。因此,这一领域的研究将涉及自然语言,即人们日常使用的语言,所以它与语言学的研究有着密切的联系。自然语言处理技术通常包括文本处理、语义理解、机器翻译、机器人问答、知识图谱等技术。Natural language processing (NLP, Nature Language processing) is an important direction in the fields of computer science and artificial intelligence. It studies various theories and methods that enable effective communication between humans and computers using natural language. Natural language processing is a science that integrates linguistics, computer science, and mathematics. Therefore, research in this field will involve natural language, that is, the language that people use every day, so it is closely related to the study of linguistics. Natural language processing technology usually includes text processing, semantic understanding, machine translation, robot question answering, knowledge graph and other technologies.

机器学习(ML,Machine Learning)是一门多领域交叉学科,涉及概率论、统计学、逼近论、凸分析、算法复杂度理论等多门学科。专门研究计算机怎样模拟或实现人类的学习行为,以获取新的知识或技能,重新组织已有的知识结构使之不断改善自身的性能。机器学习是人工智能的核心,是使计算机具有智能的根本途径,其应用遍及人工智能的各个领域。机器学习和深度学习通常包括人工神经网络、置信网络、强化学习、迁移学习、归纳学习等技术。Machine Learning (ML) is a multi-field interdisciplinary subject involving probability theory, statistics, approximation theory, convex analysis, algorithm complexity theory and other disciplines. It specializes in studying how computers can simulate or implement human learning behavior to acquire new knowledge or skills, and reorganize existing knowledge structures to continuously improve their performance. Machine learning is the core of artificial intelligence and the fundamental way to make computers intelligent. Its applications cover all fields of artificial intelligence. Machine learning and deep learning usually include artificial neural networks, belief networks, reinforcement learning, transfer learning, inductive learning and other technologies.

云技术(Cloud technology)是指在广域网或局域网内将硬件、软件、网络等系列资源统一起来,实现数据的计算、储存、处理和共享的一种托管技术。云技术基于云计算商业模式应用的网络技术、信息技术、整合技术、管理平台技术、应用技术等的总称,可以组成资源池,按需所用,灵活便利。云计算技术将变成重要支撑。技术网络系统的后台服务需要大量的计算、存储资源,如视频网站、图片类网站和更多的门户网站。伴随着互联网行业的高度发展和应用,将来每个物品都有可能存在自己的识别标志,都需要传输到后台系统进行逻辑处理,不同程度级别的数据将会分开处理,各类行业数据皆需要强大的系统后盾支撑,只能通过云计算来实现。Cloud technology refers to a hosting technology that unifies a series of resources such as hardware, software, and networks within a wide area network or local area network to realize data calculation, storage, processing, and sharing. Cloud technology is a general term for network technology, information technology, integration technology, management platform technology, application technology, etc. based on the cloud computing business model. It can form a resource pool and use it on demand, which is flexible and convenient. Cloud computing technology will become an important support. The background services of technical network systems require a large amount of computing and storage resources, such as video websites, picture websites and more portal websites. With the rapid development and application of the Internet industry, in the future each item may have its own identification mark, which needs to be transmitted to the backend system for logical processing. Data at different levels will be processed separately, and all types of industry data need to be powerful. System backing support can only be achieved through cloud computing.

所谓人工智能云服务,一般也被称作是AI即服务(AIaaS,AI as a Service),是目前主流的一种人工智能平台的服务方式,具体来说AIaaS平台会把几类常见的AI服务进行拆分,并在云端提供独立或者打包的服务。这种服务模式类似于开了一个AI主题商城:所有的开发者都可以通过API接口的方式来接入使用平台提供的一种或者是多种人工智能服务,部分资深的开发者还可以使用平台提供的AI框架和AI基础设施来部署和运维自己专属的云人工智能服务。The so-called artificial intelligence cloud service, also generally called AIaaS (AI as a Service), is currently a mainstream artificial intelligence platform service method. Specifically, the AIaaS platform will provide several common AI services. Split it up and provide standalone or packaged services in the cloud. This service model is similar to opening an AI theme mall: all developers can access one or more artificial intelligence services provided by the platform through API interfaces, and some senior developers can also use the platform. Provide AI framework and AI infrastructure to deploy and operate your own exclusive cloud artificial intelligence services.

本申请实施例提供的方案涉及人工智能的自然语言处理、机器学习、人工智能云服务等技术,具体通过如下实施例进行说明。The solutions provided by the embodiments of this application involve artificial intelligence natural language processing, machine learning, artificial intelligence cloud services and other technologies, which are specifically explained through the following embodiments.

将结合本申请实施例提供的终端的示例性应用和实施,说明本申请实施例提供的吗,脉冲星搜索网络训练方法。The pulsar search network training method provided by the embodiment of the present application will be described with reference to the exemplary application and implementation of the terminal provided by the embodiment of the present application.

下面对本发明实施例的电子设备的结构做详细说明,电子设备可以各种形式来实施,如带有内窥镜图像处理功能的专用终端,也可以为带有内窥镜图像处理功能的电子设备或者云服务器,例如前述图1中的服务器200。图2为本发明实施例提供的电子设备的组成结构示意图,可以理解,图2仅仅示出了电子设备的示例性结构而非全部结构,根据需要可以实施图2示出的部分结构或全部结构。The structure of the electronic device according to the embodiment of the present invention is described in detail below. The electronic device can be implemented in various forms, such as a dedicated terminal with an endoscopic image processing function, or an electronic device with an endoscopic image processing function. Or a cloud server, such as the server 200 in Figure 1 mentioned above. Figure 2 is a schematic structural diagram of an electronic device provided by an embodiment of the present invention. It can be understood that Figure 2 only shows an exemplary structure of the electronic device rather than the entire structure. Part or all of the structure shown in Figure 2 can be implemented as needed. .

本发明实施例提供的电子设备包括:至少一个处理器201、存储器202、用户接口203和至少一个网络接口204。电子设备中的各个组件通过总线系统205耦合在一起。可以理解,总线系统205用于实现这些组件之间的连接通信。总线系统205除包括数据总线之外,还包括电源总线、控制总线和状态信号总线。但是为了清楚说明起见,在图2中将各种总线都标为总线系统205。The electronic device provided by the embodiment of the present invention includes: at least one processor 201, a memory 202, a user interface 203, and at least one network interface 204. The various components in the electronic device are coupled together through bus system 205 . It can be understood that the bus system 205 is used to implement connection communication between these components. In addition to the data bus, the bus system 205 also includes a power bus, a control bus and a status signal bus. However, for the sake of clarity, the various buses are labeled as bus system 205 in FIG. 2 .

其中,用户接口203可以包括显示器、键盘、鼠标、轨迹球、点击轮、按键、按钮、触感板或者触摸屏等。其中,本发明实施例中的终端包括但不限于手机、电脑、智能语音交互设备、智能家电、车载终端等。本发明实施例可应用于各种场景,包括但不限于云技术、人工智能、智慧交通、辅助驾驶等,通过不同终端执行本发明所提供的脉冲星搜索网络训练方法时,具体的使用场景本发明不做限制The user interface 203 may include a display, keyboard, mouse, trackball, click wheel, keys, buttons, touch pad or touch screen, etc. Among them, terminals in embodiments of the present invention include but are not limited to mobile phones, computers, intelligent voice interaction devices, smart home appliances, vehicle-mounted terminals, etc. Embodiments of the present invention can be applied to various scenarios, including but not limited to cloud technology, artificial intelligence, smart transportation, assisted driving, etc. When the pulsar search network training method provided by the present invention is executed through different terminals, the specific usage scenarios are as follows: Invention without limits

可以理解,存储器202可以是易失性存储器或非易失性存储器,也可包括易失性和非易失性存储器两者。本发明实施例中的存储器202能够存储数据以支持终端(如10-1)的操作。这些数据的示例包括:用于在终端(如10-1)上操作的任何计算机程序,如操作系统和应用程序。其中,操作系统包含各种系统程序,例如框架层、核心库层、驱动层等,用于实现各种基础业务以及处理基于硬件的任务。应用程序可以包含各种应用程序。It can be understood that the memory 202 can be a volatile memory or a non-volatile memory, and can also include both volatile and non-volatile memories. The memory 202 in the embodiment of the present invention can store data to support the operation of the terminal (such as 10-1). Examples of such data include: any computer programs, such as operating systems and applications, used to operate on a terminal such as 10-1. Among them, the operating system includes various system programs, such as framework layer, core library layer, driver layer, etc., which are used to implement various basic services and process hardware-based tasks. Applications can contain various applications.

在一些实施例中,本发明实施例提供的脉冲星搜索网络训练装置可以采用软硬件结合的方式实现,作为示例,本发明实施例提供的脉冲星搜索网络训练装置可以是采用硬件译码处理器形式的处理器,其被编程以执行本发明实施例提供的脉冲星搜索网络训练方法。例如,硬件译码处理器形式的处理器可以采用一个或多个应用专用集成电路(ASIC,Application Specific Integrated Circuit)、DSP、可编程逻辑器件(PLD,ProgrammableLogic Device)、复杂可编程逻辑器件(CPLD,Complex Programmable Logic Device)、现场可编程门阵列(FPGA,Field-Programmable Gate Array)或其他电子元件。In some embodiments, the pulsar search network training device provided by the embodiment of the present invention can be implemented by combining software and hardware. As an example, the pulsar search network training device provided by the embodiment of the present invention can be implemented by using a hardware decoding processor. A processor in the form of a processor, which is programmed to execute the pulsar search network training method provided by the embodiment of the present invention. For example, a processor in the form of a hardware decoding processor can use one or more Application Specific Integrated Circuits (ASICs), DSPs, Programmable Logic Devices (PLDs), Complex Programmable Logic Devices (CPLDs) , Complex Programmable Logic Device), Field-Programmable Gate Array (FPGA, Field-Programmable Gate Array) or other electronic components.

作为本发明实施例提供的脉冲星搜索网络训练装置采用软硬件结合实施的示例,本发明实施例所提供的脉冲星搜索网络训练装置可以直接体现为由处理器201执行的软件模块组合,软件模块可以位于存储介质中,存储介质位于存储器202,处理器201读取存储器202中软件模块包括的可执行指令,结合必要的硬件(例如,包括处理器201以及连接到总线205的其他组件)完成本发明实施例提供的脉冲星搜索网络训练方法。As an example of the pulsar search network training device provided by the embodiment of the present invention using a combination of software and hardware, the pulsar search network training device provided by the embodiment of the present invention can be directly embodied as a combination of software modules executed by the processor 201. The software modules It can be located in a storage medium, and the storage medium is located in the memory 202. The processor 201 reads the executable instructions included in the software module in the memory 202, and completes the present invention in combination with the necessary hardware (for example, including the processor 201 and other components connected to the bus 205). The pulsar search network training method provided by the embodiment of the invention.

作为示例,处理器201可以是一种集成电路芯片,具有信号的处理能力,例如通用处理器、数字信号处理器(DSP,Digital Signal Processor),或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件等,其中,通用处理器可以是微处理器或者任何常规的处理器等。As an example, the processor 201 may be an integrated circuit chip with signal processing capabilities, such as a general-purpose processor, a digital signal processor (DSP), or other programmable logic devices, discrete gates, or transistor logic devices. , discrete hardware components, etc., wherein the general processor can be a microprocessor or any conventional processor, etc.

作为本发明实施例提供的脉冲星搜索网络训练装置采用硬件实施的示例,本发明实施例所提供的装置可以直接采用硬件译码处理器形式的处理器201来执行完成,例如,被一个或多个应用专用集成电路(ASIC,Application Specific Integrated Circuit)、DSP、可编程逻辑器件(PLD,Programmable Logic Device)、复杂可编程逻辑器件(CPLD,ComplexProgrammable Logic Device)、现场可编程门阵列(FPGA,Field-Programmable GateArray)或其他电子元件执行实现本发明实施例提供的脉冲星搜索网络训练方法。As an example of the pulsar search network training device provided by the embodiment of the present invention being implemented in hardware, the device provided by the embodiment of the present invention can be directly executed by a processor 201 in the form of a hardware decoding processor, for example, by one or more Application Specific Integrated Circuit (ASIC, Application Specific Integrated Circuit), DSP, Programmable Logic Device (PLD, Programmable Logic Device), Complex Programmable Logic Device (CPLD, Complex Programmable Logic Device), Field Programmable Gate Array (FPGA, Field -Programmable GateArray) or other electronic components to implement the pulsar search network training method provided by the embodiment of the present invention.

本发明实施例中的存储器202用于存储各种类型的数据以支持电子设备的操作。这些数据的示例包括:用于在电子设备上操作的任何可执行指令,如可执行指令,实现本发明实施例的从脉冲星搜索网络训练方法的程序可以包含在可执行指令中。The memory 202 in the embodiment of the present invention is used to store various types of data to support the operation of the electronic device. Examples of these data include: any executable instructions for operating on an electronic device, such as executable instructions, and a program that implements the pulsar search network training method of an embodiment of the present invention may be included in the executable instructions.

在另一些实施例中,本发明实施例提供的脉冲星搜索网络训练装置可以采用软件方式实现,图2示出了存储在存储器202中的脉冲星搜索网络训练装置2020,其可以是程序和插件等形式的软件,并包括一系列的模块,作为存储器202中存储的程序的示例,可以包括脉冲星搜索网络训练装置2020,脉冲星搜索网络训练装置2020中包括以下的软件模块:In other embodiments, the pulsar search network training device provided by the embodiment of the present invention can be implemented in software. Figure 2 shows the pulsar search network training device 2020 stored in the memory 202, which can be a program and a plug-in. and other forms of software, and includes a series of modules. As an example of a program stored in the memory 202, it may include a pulsar search network training device 2020. The pulsar search network training device 2020 includes the following software modules:

信息传输模块2081,用于获取脉冲星搜索候选体信号对应的时间相位分布图像和频率相位分布图像。The information transmission module 2081 is used to obtain the time phase distribution image and frequency phase distribution image corresponding to the pulsar search candidate signal.

信息处理模块2082,用于对所述时间相位分布图像和频率相位分布图像进行组合,形成所述脉冲星搜索网络的训练样本集合。The information processing module 2082 is used to combine the time phase distribution image and the frequency phase distribution image to form a training sample set for the pulsar search network.

所述信息处理模块2082,用于获取所述获取脉冲星搜索候选体信号对应的候选体的脉冲轮廓图像和脉冲星标签值。The information processing module 2082 is used to obtain the pulse profile image and pulsar label value of the candidate corresponding to the obtained pulsar search candidate signal.

所述信息处理模块2082,用于基于所述训练样本集合和所述候选体的脉冲轮廓图像,对所述脉冲星搜索网络的主干网络进行第一训练,确定所述主干网络的初始网络参数。The information processing module 2082 is configured to perform first training on the backbone network of the pulsar search network based on the training sample set and the pulse profile image of the candidate body, and determine the initial network parameters of the backbone network.

所述信息处理模块2082,用于基于所述训练样本集合和所述和脉冲星标签值,对所述脉冲星搜索网络的主干网络进行第二训练,以实现调整所述脉冲星搜索网络的主干网络的初始参数,得到所述脉冲星搜索网络的主干网络的目标参数。The information processing module 2082 is configured to perform a second training on the backbone network of the pulsar search network based on the training sample set and the sum pulsar label value, so as to adjust the backbone network of the pulsar search network The initial parameters of the network are used to obtain the target parameters of the backbone network of the pulsar search network.

需要说明的是,在脉冲星搜索网络训练完成后,还可以部署在电子设备(例如寻星服务器或者射电望远镜的数据处理服务器)中,以实现对所接收的脉冲星搜索候选体信号的处理,因此电子设备中还可以包括:It should be noted that after the pulsar search network training is completed, it can also be deployed in electronic equipment (such as a star finding server or a data processing server of a radio telescope) to process the received pulsar search candidate signals. Therefore, electronic equipment can also include:

信号传输模块,用于接收无标注的脉冲星搜索候选体信号。The signal transmission module is used to receive unlabeled pulsar search candidate signals.

信号处理模块,用于获取与所述脉冲星搜索候选体信号对应的时间相位分布图像和频率相位分布图像。A signal processing module, configured to obtain a time phase distribution image and a frequency phase distribution image corresponding to the pulsar search candidate signal.

所述信号处理模块,用于通过脉冲星搜索网络的主干网络,对所述时间相位分布图像和频率相位分布图像进行处理,确定图像处理结果。The signal processing module is used to process the time phase distribution image and frequency phase distribution image through the backbone network of the pulsar search network, and determine the image processing results.

所述信号处理模块,用于通过所述脉冲星搜索网络的分类任务全连接层网络,对所述图像处理结果进行处理,得到所述脉冲星搜索候选体信号的预测结果。The signal processing module is used to process the image processing results through the fully connected layer network of the classification task of the pulsar search network to obtain the prediction result of the pulsar search candidate signal.

根据图2所示的电子设备,在本申请的一个方面中,本申请还提供了一种计算机程序产品或计算机程序,该计算机程序产品或计算机程序包括计算机指令,该计算机指令存储在计算机可读存储介质中。计算机设备的处理器从计算机可读存储介质读取该计算机指令,处理器执行该计算机指令,使得该计算机设备执行本申请所提供的脉冲星搜索网络训练方法的各种可选实现方式中所提供的方法。According to the electronic device shown in Figure 2, in one aspect of the present application, the present application also provides a computer program product or computer program. The computer program product or computer program includes computer instructions, and the computer instructions are stored in a computer-readable file. in the storage medium. The processor of the computer device reads the computer instructions from the computer-readable storage medium, and the processor executes the computer instructions, so that the computer device executes the various optional implementations of the pulsar search network training method provided by this application. Methods.

结合图2示出的脉冲星搜索网络训练装置说明本发明实施例提供的脉冲星搜索网络训练方法,参见图3,图3为本发明实施例提供的脉冲星搜索网络训练方法一个可选的流程示意图,可以理解地,图3所示的步骤可以由运行脉冲星搜索网络训练装置的各种电子设备执行,例如可以是如带有脉冲星搜索功能的专用终端、服务器或者服务器集群。下面针对图3示出的步骤进行说明。The pulsar search network training method provided by the embodiment of the present invention will be described with reference to the pulsar search network training device shown in Figure 2. Refer to Figure 3, which is an optional process of the pulsar search network training method provided by the embodiment of the present invention. Schematic diagram, it can be understood that the steps shown in Figure 3 can be executed by various electronic devices running the pulsar search network training device, for example, it can be a dedicated terminal, server or server cluster with a pulsar search function. The steps shown in Figure 3 will be described below.

步骤301:脉冲星搜索网络训练装置获取脉冲星搜索候选体信号对应的时间相位分布图像和频率相位分布图像。Step 301: The pulsar search network training device obtains the time phase distribution image and frequency phase distribution image corresponding to the pulsar search candidate signal.

步骤302:脉冲星搜索网络训练装置对所述时间相位分布图像和频率相位分布图像进行组合,形成所述脉冲星搜索网络的训练样本集合。Step 302: The pulsar search network training device combines the time phase distribution image and the frequency phase distribution image to form a training sample set of the pulsar search network.

其中,参考图4A,图4A为本发明实施例中脉冲星搜索网络的样本集合示意图,其中,对所述时间相位分布图像和频率相位分布图像进行组合,形成所述脉冲星搜索网络的训练样本集合,可以通过以下方式实现:Referring to Figure 4A, Figure 4A is a schematic diagram of a sample set of the pulsar search network in an embodiment of the present invention, in which the time phase distribution image and the frequency phase distribution image are combined to form a training sample of the pulsar search network Collections can be implemented in the following ways:

确定所述时间相位分布图像和频率相位分布图像的噪声值,对所述确定所述时间相位分布图像和频率相位分布图像分别进行除噪处理;对经过除噪处理的所述时间相位分布图像和频率相位分布图像分别进行标准化处理,得到符合单通道灰度图像标准的时间相位分布图像和频率相位分布图像;将符合单通道灰度图像标准的时间相位分布图像和频率相位分布图像进行组合,形成所述脉冲星搜索网络的训练样本集合。以图1所示的射电望远镜为例,通过接收脉冲星信号得到的fid格式文件经过特征读取后时间-相位分布图像分辨率是256*64,频率-相位分布图像分辨率是96*64;而其他射电望远镜获取的phcx格式文件经过特征抽取后时间-相位分布图像分辨率一般为(16~24)*64,频率-相位分布图像分辨率为16*64,对于上述图像的大小并不相等,此外相位分布图像和频率相位分布图像的特征背景区分也有不同,例如phcx格式文件图像中脉冲特征为黑色,背景为白色,而fid格式文件图像中脉冲特征为白色,背景为黑色,因此需要进行除噪处理和标准化处理,以便于脉冲星搜索网络的训练和使用。Determine the noise values of the time phase distribution image and frequency phase distribution image, perform denoising processing on the determined time phase distribution image and frequency phase distribution image respectively; perform denoising processing on the time phase distribution image and The frequency phase distribution images are standardized separately to obtain a time phase distribution image and a frequency phase distribution image that meet the single-channel grayscale image standard; the time phase distribution image and frequency phase distribution image that meet the single-channel grayscale image standard are combined to form A collection of training samples for the pulsar search network. Taking the radio telescope shown in Figure 1 as an example, after feature reading of the fid format file obtained by receiving pulsar signals, the time-phase distribution image resolution is 256*64, and the frequency-phase distribution image resolution is 96*64; After feature extraction, the phcx format files obtained by other radio telescopes generally have a time-phase distribution image resolution of (16~24)*64 and a frequency-phase distribution image resolution of 16*64. The sizes of the above images are not equal. , in addition, the characteristic background distinction between the phase distribution image and the frequency phase distribution image is also different. For example, in the phcx format file image, the pulse characteristics are black and the background is white, while in the fid format file image, the pulse characteristics are white and the background is black, so it needs to be Denoising and normalizing to facilitate training and use of pulsar search networks.

在本发明的一些实施例中,参考图4B,图4B为本发明实施例中脉冲星搜索网络的样本集合示意图,可以将时间-相位分布图和频率-相位分布图分别进行数据标准化操作,得到对应的64x64的单通道灰度图,然后将两个灰度图在通道维度进行堆叠成2通道2维信号作为主干网络的输入特征向量。在预训练任务和分类任务中,利用主干网络的输出结果,通过不同的全连接层将主干网络的输出调整至可以计算任务损失的输出维度。In some embodiments of the present invention, referring to Figure 4B, which is a schematic diagram of the sample set of the pulsar search network in the embodiment of the present invention, the time-phase distribution diagram and the frequency-phase distribution diagram can be subjected to data standardization operations respectively, to obtain The corresponding 64x64 single-channel grayscale image is then stacked in the channel dimension into a 2-channel 2-dimensional signal as the input feature vector of the backbone network. In pre-training tasks and classification tasks, the output results of the backbone network are used to adjust the output of the backbone network to the output dimension that can calculate the task loss through different fully connected layers.

在本发明的一些实施例中,确定所述时间相位分布图像和频率相位分布图像的噪声值,对所述确定所述时间相位分布图像和频率相位分布图像分别进行除噪处理,可以通过以下方式实现:In some embodiments of the present invention, the noise values of the time phase distribution image and the frequency phase distribution image are determined, and the determined time phase distribution image and the frequency phase distribution image are separately denoised in the following manner. accomplish:

根据所述脉冲星搜索候选体信号所对应的目标区域的位置,确定与所述脉冲星搜索网络的使用环境相匹配的动态噪声阈值;根据所述动态噪声阈值对所述时间相位分布图像和所述频率相位分布图像进行降噪处理,以形成与所述动态噪声阈值相匹配的时间相位分布图像和频率相位分布图像。其中,在低频射电波段(例如350MHz以下)对脉冲星进行探测时,由于辐射频谱在相当低的频率下限之上都呈现为幂律谱,所以预期相对于高频应当具有更高的流量密度,所以更容易探测到,但是由于脉冲星所出现的星系不同,同时,由于星际介质对不同频率无线电波具有不同的折射率(即色散),所以在信号发射地对齐的不同频率的脉冲星脉冲,经历了长距离的传播之后,不同频率成分会相互错开,在时域内非常明显的周期性脉冲轮廓,经过色散效应会变得几乎无法直接辨认和探测。因此,在低频射电波段探测脉冲星需要动态噪声阈值对所述时间相位分布图像和所述频率相位分布图像进行降噪处理,以形成与所述动态噪声阈值相匹配的时间相位分布图像和频率相位分布图像,以保证脉冲星搜索网络的准确性。According to the position of the target area corresponding to the pulsar search candidate signal, a dynamic noise threshold matching the usage environment of the pulsar search network is determined; according to the dynamic noise threshold, the time phase distribution image and the The frequency phase distribution image is subjected to noise reduction processing to form a time phase distribution image and a frequency phase distribution image that match the dynamic noise threshold. Among them, when detecting pulsars in the low-frequency radio band (for example, below 350MHz), since the radiation spectrum appears as a power law spectrum above a very low frequency lower limit, it is expected that there should be a higher flux density relative to high frequency. Therefore, it is easier to detect, but because the galaxies in which pulsars appear are different, and because the interstellar medium has different refractive index (i.e., dispersion) for radio waves of different frequencies, pulsar pulses of different frequencies aligned at the signal emission site, After long-distance propagation, different frequency components will be staggered from each other, and the very obvious periodic pulse profile in the time domain will become almost impossible to directly identify and detect through the dispersion effect. Therefore, detecting pulsars in the low-frequency radio band requires a dynamic noise threshold to denoise the time phase distribution image and the frequency phase distribution image to form a time phase distribution image and frequency phase that match the dynamic noise threshold. Distribute images to ensure the accuracy of pulsar search networks.

在本发明的一些实施例中,确定所述时间相位分布图像和频率相位分布图像的噪声值,对所述确定所述时间相位分布图像和频率相位分布图像分别进行除噪处理,可以通过以下方式实现:In some embodiments of the present invention, the noise values of the time phase distribution image and the frequency phase distribution image are determined, and the determined time phase distribution image and the frequency phase distribution image are separately denoised in the following manner. accomplish:

根据所述脉冲星搜索候选体信号所对应的射电望远镜类型,确定与所述脉冲星搜索网络的使用环境相匹配的固定噪声阈值;根据所述固定噪声阈值对所述时间相位分布图像和所述频率相位分布图像进行降噪处理,以形成与所述固定噪声阈值相匹配的时间相位分布图像和频率相位分布图像。其中,由于,已经被探测到的并且可以用来调校探测算法参数的脉冲星个数是非常有限的,而未知的脉冲星在实际搜索到之前是无法用来作为校正训练样本集合的。因此,对于一台全新的射电望远镜,或者一台未曾探测到脉冲星过的既有的射电望远镜,可以通过根据所述动态噪声阈值对所述时间相位分布图像和所述频率相位分布图像进行降噪处理,以形成与所述动态噪声阈值相匹配的时间相位分布图像和频率相位分布图像,并且确定与射电望远镜相匹配的噪声阈值。According to the type of radio telescope corresponding to the pulsar search candidate signal, a fixed noise threshold matching the usage environment of the pulsar search network is determined; according to the fixed noise threshold, the time phase distribution image and the The frequency phase distribution image undergoes noise reduction processing to form a time phase distribution image and a frequency phase distribution image that match the fixed noise threshold. Among them, because the number of pulsars that have been detected and can be used to adjust the parameters of the detection algorithm is very limited, and unknown pulsars cannot be used as a correction training sample set before they are actually searched. Therefore, for a brand new radio telescope or an existing radio telescope that has never detected a pulsar, the time phase distribution image and the frequency phase distribution image can be reduced according to the dynamic noise threshold. Noise processing is performed to form a time phase distribution image and a frequency phase distribution image that match the dynamic noise threshold, and a noise threshold that matches the radio telescope is determined.

进一步地,当脉冲星搜索网络固定于射电望远镜的终端设备中时,根据固定噪声阈值对所述时间相位分布图像和所述频率相位分布图像进行降噪处理,以形成与固定噪声阈值相匹配的时间相位分布图像和频率相位分布图像,可以继续减少脉冲星搜索网络的训练时间,提升脉冲星搜索网络的训练效率。Further, when the pulsar search network is fixed in the terminal equipment of the radio telescope, the time phase distribution image and the frequency phase distribution image are denoised according to the fixed noise threshold to form a pixel that matches the fixed noise threshold. The time phase distribution image and frequency phase distribution image can continue to reduce the training time of the pulsar search network and improve the training efficiency of the pulsar search network.

步骤303:脉冲星搜索网络训练装置获取所述获取脉冲星搜索候选体信号对应的候选体的脉冲轮廓图像和脉冲星标签值。Step 303: The pulsar search network training device obtains the pulse profile image and pulsar label value of the candidate corresponding to the acquired pulsar search candidate signal.

在本发明的一些实施例中,获取脉冲星搜索候选体信号对应的候选体的脉冲轮廓图像,可以通过以下方式实现:In some embodiments of the present invention, obtaining the pulse profile image of the candidate body corresponding to the pulsar search candidate body signal can be achieved in the following manner:

获取所述脉冲星搜索候选体信号对应的候选体的原始脉冲轮廓图像;对所述的原始脉冲轮廓图像的序列值进行标准化处理,得到所述原始脉冲轮廓图像标准序列值;对所述原始脉冲轮廓图像标准序列值进行调整,得到所述脉冲星搜索候选体信号对应的脉冲轮廓图像,以实现与所述脉冲星搜索网络的精确度相匹配。其中,为了增加预训练任务的难度,可以将原始脉冲轮廓序列进行数据标准化,再将64维序列线性插值至100维序列作为预训练任务的标签值,当训练精度增加时,还可以灵活调整不同维度序列线性插值以适应脉冲星搜索网络的不同需求。Obtain the original pulse profile image of the candidate body corresponding to the pulsar search candidate body signal; perform standardization processing on the sequence value of the original pulse profile image to obtain the standard sequence value of the original pulse profile image; The standard sequence value of the profile image is adjusted to obtain a pulse profile image corresponding to the pulsar search candidate signal, so as to match the accuracy of the pulsar search network. Among them, in order to increase the difficulty of the pre-training task, the original pulse profile sequence can be data standardized, and then the 64-dimensional sequence is linearly interpolated to a 100-dimensional sequence as the label value of the pre-training task. When the training accuracy increases, different parameters can be flexibly adjusted. Dimension sequences are linearly interpolated to adapt to different needs of pulsar search networks.

步骤304:脉冲星搜索网络训练装置基于所述训练样本集合和所述候选体的脉冲轮廓图像,对所述脉冲星搜索网络的主干网络进行第一训练,确定所述主干网络的初始网络参数。Step 304: The pulsar search network training device performs first training on the backbone network of the pulsar search network based on the training sample set and the pulse profile image of the candidate body, and determines the initial network parameters of the backbone network.

步骤305:脉冲星搜索网络训练装置基于所述训练样本集合和所述和脉冲星标签值,对所述脉冲星搜索网络的主干网络进行第二训练,以实现调整所述脉冲星搜索网络的主干网络的初始参数,得到所述脉冲星搜索网络的主干网络的目标参数。Step 305: The pulsar search network training device performs a second training on the backbone network of the pulsar search network based on the training sample set and the sum pulsar label value to adjust the backbone network of the pulsar search network The initial parameters of the network are used to obtain the target parameters of the backbone network of the pulsar search network.

继续结合图2示出的脉冲星搜索网络训练装置说明本发明实施例提供的脉冲星搜索网络训练方法,参见图5,图5为本发明实施例提供的脉冲星搜索网络训练方法一个可选的流程示意图,可以理解地,图5所示的步骤可以由运行脉冲星搜索网络训练装置的各种电子设备执行,例如可以是如带有脉冲星搜索功能的专用终端、服务器或者服务器集群。下面针对图5示出的步骤进行说明。Continuing to describe the pulsar search network training method provided by the embodiment of the present invention with reference to the pulsar search network training device shown in Figure 2, see Figure 5. Figure 5 is an optional version of the pulsar search network training method provided by the embodiment of the present invention. It is understandable that the steps shown in Figure 5 can be executed by various electronic devices running the pulsar search network training device, for example, a dedicated terminal, a server or a server cluster with a pulsar search function. The steps shown in Figure 5 will be described below.

步骤501:脉冲星搜索网络训练装置基于所述候选体的脉冲轮廓图像,确定所述候选体的脉冲轮廓图像的标签值。Step 501: The pulsar search network training device determines the label value of the pulse profile image of the candidate body based on the pulse profile image of the candidate body.

步骤502:脉冲星搜索网络训练装置通过所述主干网络对应的预训练全连接层网络,基于所述脉冲轮廓图像的标签值,确定所述第一训练对应的第一损失函数。Step 502: The pulsar search network training device determines the first loss function corresponding to the first training based on the label value of the pulse profile image through the pre-trained fully connected layer network corresponding to the backbone network.

步骤503:脉冲星搜索网络训练装置将所述训练样本集合中的训练样本,带入所述第一损失函数。Step 503: The pulsar search network training device brings the training samples in the training sample set into the first loss function.

步骤504:脉冲星搜索网络训练装置当所述第一损失函数满足第一收敛条件时,确定所述主干网络的初始网络参数。Step 504: When the first loss function satisfies the first convergence condition, the pulsar search network training device determines the initial network parameters of the backbone network.

参考图6,图6为本发明实施例中脉冲星搜索网络的网络结构示意图,其中,通过步骤501-步骤504的处理,使用候选体信号的时间-相位分布图和频率-相位分布图作为脉冲星搜索网络的输入特征,使用候选体的脉冲轮廓图作为标签值,使用均方根误差作为损失函数计算脉冲星搜索网络的初始网络参数。通过主干网络,可以将时间-相位分布图和频率-相位分布图按列维度求和,再将两个求和后的序列再次求和得到64维的原始的脉冲轮廓序列,其中,图6所示的网络结构中的主干网络可以是ResNet18网络,与主干网络相连接的网络可以包括多个全连接层。值得说明的是,为了平衡图像特征与图像量化编码两者的空间占用,可以将图像特征的维度设置得比图像量化编码的维度更小,否则图像特征的维度过大会导致占用过多存储空间。Referring to Figure 6, Figure 6 is a schematic network structure diagram of a pulsar search network in an embodiment of the present invention, in which, through the processing of steps 501 to 504, the time-phase distribution diagram and frequency-phase distribution diagram of the candidate body signal are used as the pulse For the input features of the star search network, the pulse profile of the candidate body is used as the label value, and the root mean square error is used as the loss function to calculate the initial network parameters of the pulsar search network. Through the backbone network, the time-phase distribution diagram and the frequency-phase distribution diagram can be summed in column dimensions, and then the two summed sequences are summed again to obtain a 64-dimensional original pulse profile sequence, where Figure 6 shows The backbone network in the network structure shown can be a ResNet18 network, and the network connected to the backbone network can include multiple fully connected layers. It is worth noting that in order to balance the space occupied by image features and image quantized coding, the dimension of the image feature can be set smaller than the dimension of the image quantized coding. Otherwise, excessive dimensions of the image feature will occupy too much storage space.

在训练脉冲星搜索网络的过程中,主要涉及到以下步骤。In the process of training the pulsar search network, the following steps are mainly involved.

1)参数初始化。对于初始特征提取网络来说,可以通过开源的图像集(如ImageNet图像集、Open Image图像集等)进行训练,从而完成权重参数的初始化。对于其他的网络,如最大池化网络、嵌入表示网络及量化编码网络来说,可以对权重参数进行随机初始化,例如可以采用方差为0.01、且均值为0的高斯分布来进行初始化。1) Parameter initialization. For the initial feature extraction network, it can be trained through open source image sets (such as ImageNet image set, Open Image image set, etc.) to complete the initialization of weight parameters. For other networks, such as max pooling networks, embedded representation networks and quantized coding networks, the weight parameters can be randomly initialized. For example, a Gaussian distribution with a variance of 0.01 and a mean of 0 can be used for initialization.

2)确定需要学习(即需要更新)的权重参数。这里可以分为两个训练阶段,在第一个训练阶段,对初始特征提取网络、最大池化网络、嵌入表示网络这些网络的权重参数进行更新;在第二个训练阶段,对量化编码网络的权重参数进行更新。2) Determine the weight parameters that need to be learned (that is, need to be updated). This can be divided into two training stages. In the first training stage, the weight parameters of the initial feature extraction network, maximum pooling network, and embedding representation network are updated; in the second training stage, the weight parameters of the quantized encoding network are updated. The weight parameters are updated.

3)确定学习率(Learning Rate)。在本申请实施例中,脉冲星搜索网络中的各个网络均可以采用相同的学习率,例如学习率为0.005。每经过10轮迭代后学习率可以更新为原来的0.1倍。3) Determine the learning rate. In this embodiment of the present application, each network in the pulsar search network can adopt the same learning rate, for example, the learning rate is 0.005. The learning rate can be updated to 0.1 times the original value after every 10 iterations.

4)学习过程(训练过程)。这里,可以对脉冲星搜索网络进行epoch轮迭代,epoch是大于1的整数,可以根据实际应用场景进行设定。在每轮迭代中,处理一次全量样本,例如可以根据Batch Size(可以根据实际应用场景进行设定)对全量的相似图像对进行划分,得到Nb个批次,对于每个批次(Batch)执行以下处理:4) Learning process (training process). Here, the pulsar search network can be iterated in epoch rounds. epoch is an integer greater than 1 and can be set according to the actual application scenario. In each round of iteration, a full amount of samples are processed once. For example, the full amount of similar image pairs can be divided according to the Batch Size (which can be set according to the actual application scenario) to obtain Nb batches. For each batch (Batch), execute The following processing:

①前向传播处理。这里,通过脉冲星搜索网络对Batch内的每个训练图像进行前向传播处理,得到图像特征及图像量化编码。①Forward propagation processing. Here, each training image in the batch is forward propagated through the pulsar search network to obtain image features and image quantization coding.

②损失值计算。这里,特征提取网络及量化编码网络对应不同的损失值。②Loss value calculation. Here, the feature extraction network and the quantization encoding network correspond to different loss values.

③权重参数更新。例如,可以采用随机梯度下降算法或其他梯度下降算法,将损失值进行反向传播处理,并在反向传播的过程中,沿梯度下降方向更新需要训练的网络的权重参数。③Update weight parameters. For example, the stochastic gradient descent algorithm or other gradient descent algorithms can be used to backpropagate the loss value, and during the backpropagation process, the weight parameters of the network to be trained are updated along the gradient descent direction.

需要说明的是,上述实施例中具体结构只是示例性的,还可以根据需要选择其他模型结构,例如,主干网络还可以采用resnet50、inceptionv4、resnet18等,在此本申请不作限定。It should be noted that the specific structures in the above embodiments are only exemplary, and other model structures can be selected as needed. For example, the backbone network can also use resnet50, inceptionv4, resnet18, etc., which is not limited in this application.

通过步骤501-步骤504完成预训练任务后,继续执行步骤505。After completing the pre-training task through steps 501 to 504, continue to step 505.

步骤505:脉冲星搜索网络训练装置通过所述主干网络对应的分类任务全连接层网络,基于所述脉冲星标签值,确定所述第二训练对应的第二损失函数。Step 505: The pulsar search network training device determines the second loss function corresponding to the second training based on the pulsar label value through the fully connected layer network of the classification task corresponding to the backbone network.

步骤506:脉冲星搜索网络训练装置将所述训练样本集合中的训练样本,带入所述第二损失函数。Step 506: The pulsar search network training device brings the training samples in the training sample set into the second loss function.

步骤507:脉冲星搜索网络训练装置当所述第二损失函数满足第二收敛条件时,确定所述主干网络的更新参数。Step 507: When the second loss function satisfies the second convergence condition, the pulsar search network training device determines the update parameters of the backbone network.

步骤508:脉冲星搜索网络训练装置基于所述主干网络的更新参数,调整所述脉冲星搜索网络的主干网络的初始参数,得到所述脉冲星搜索网络的主干网络的目标参数。Step 508: The pulsar search network training device adjusts the initial parameters of the backbone network of the pulsar search network based on the updated parameters of the backbone network, and obtains the target parameters of the backbone network of the pulsar search network.

其中,通过步骤505至步骤508,通过全连接层将特征维度映射到分类维度,再将预测值和脉冲星标签值计算交叉熵损失。为了防止在分类任务优化过程中,预训练后的主干网络出现过拟合现象,可以将主干网络参数学习率为分类任务全连接参数学习率的0.25倍。Among them, through steps 505 to 508, the feature dimension is mapped to the classification dimension through the fully connected layer, and then the cross-entropy loss is calculated between the predicted value and the pulsar label value. In order to prevent the pre-trained backbone network from over-fitting during the optimization process of the classification task, the backbone network parameter learning rate can be set to 0.25 times the learning rate of the fully connected parameters of the classification task.

当脉冲星搜索网络通过前序实施例训练完成之后,可以部署在相应的数据处理设备中,以对获取的脉冲星搜索候选体信号进行预测,参见图7,图7为本发明实施例提供的脉冲星搜索方法一个可选的流程示意图,可以理解地,图7所示的步骤可以由运行脉冲星搜索装置的各种电子设备执行,例如可以是如带有脉冲星搜索功能的专用终端、射电望远镜的数据服务器或者服务器集群。下面针对图7示出的步骤进行说明。After the pulsar search network is trained through the previous embodiment, it can be deployed in the corresponding data processing equipment to predict the obtained pulsar search candidate signals. See Figure 7, which is provided by the embodiment of the present invention. An optional flow diagram of the pulsar search method. Understandably, the steps shown in Figure 7 can be executed by various electronic equipment running the pulsar search device, such as a dedicated terminal with a pulsar search function, a radio Telescope data server or server cluster. The steps shown in Figure 7 will be described below.

步骤701:脉冲星搜索装置接收无标注的脉冲星搜索候选体信号。Step 701: The pulsar search device receives unlabeled pulsar search candidate signals.

步骤702:脉冲星搜索装置获取与所述脉冲星搜索候选体信号对应的时间相位分布图像和频率相位分布图像。Step 702: The pulsar search device acquires the time phase distribution image and frequency phase distribution image corresponding to the pulsar search candidate signal.

其中,参考图8,图8为本发明实施例中脉冲星搜索候选体信号示意图,其中,脉冲星搜索候选体信号对应的时间相位分布图像如(1)所示、频率相位分布图像如(2)所示,当时间相位分布图像和频率相位分布图像中出现非数值(NaN Not a Number)时,可以替换所有的NaN值,例如采用线性插值法补全替换时间相位分布图像和频率相位分布图像中出现非数值,若出现数据缺失情况,还可以进行数据过滤处理,以保证时间相位分布图像和频率相位分布图像中出现非数值的完整性。Referring to Figure 8, Figure 8 is a schematic diagram of the pulsar search candidate signal in an embodiment of the present invention. The time phase distribution image corresponding to the pulsar search candidate signal is as shown in (1), and the frequency phase distribution image is as (2). ), when non-numeric values (NaN Not a Number) appear in the time phase distribution image and frequency phase distribution image, all NaN values can be replaced, for example, the linear interpolation method is used to complete and replace the time phase distribution image and frequency phase distribution image. If non-numeric values appear in the data, if there is missing data, data filtering can also be performed to ensure the integrity of the non-numeric values in the time phase distribution image and frequency phase distribution image.

步骤703:脉冲星搜索装置通过脉冲星搜索网络的主干网络,对所述时间相位分布图像和频率相位分布图像进行处理,确定图像处理结果。Step 703: The pulsar search device processes the time phase distribution image and frequency phase distribution image through the backbone network of the pulsar search network, and determines the image processing result.

参考图8,以新一代500m口径球面射电望远镜(FAST Five Hundred MeterApertureSpherical Telescope)为例,在随着脉冲星搜索的进行,会产生大量的不带标注信息的脉冲星候选体信号。但是脉冲星的不同模态信号之间存在的固有联系,例如,图8中(3)所示的脉冲星的积分轮廓图即为时间-相位分布图对时间进行积分或是频率-相位分布图对频率进行积分得到的。脉冲星搜索候选体信号中不同信号间的固有关系,可以通过经过训练的脉冲星搜索网络进行确定。其中,脉冲星搜索网络的主干网络可以采用resnet50、inceptionv4、resnet18等网络结构,在此本申请不作限定。Referring to Figure 8, taking the new generation 500m aperture spherical radio telescope (FAST Five Hundred Meter Aperture Spherical Telescope) as an example, as the pulsar search proceeds, a large number of pulsar candidate signals without annotation information will be generated. However, there is an inherent connection between the different modal signals of the pulsar. For example, the integrated profile of the pulsar shown in (3) in Figure 8 is the time-phase distribution diagram integrated over time or the frequency-phase distribution diagram. It is obtained by integrating the frequency. The inherent relationships between different signals in the pulsar search candidate signals can be determined through a trained pulsar search network. Among them, the backbone network of the pulsar search network can adopt network structures such as resnet50, inceptionv4, resnet18, etc., which is not limited in this application.

步骤704:脉冲星搜索装置通过所述脉冲星搜索网络的分类任务全连接层网络,对所述图像处理结果进行处理,得到所述脉冲星搜索候选体信号的预测结果。Step 704: The pulsar search device processes the image processing result through the fully connected layer network of the classification task of the pulsar search network to obtain the prediction result of the pulsar search candidate signal.

其中,如图8中国的(4)所示,通过脉冲星搜索网络的分类任务全连接层网络,对所述图像处理结果进行处理得到时间相位分布图像和频率相位分布图像的图像处理结果,可以确定发现新的脉冲星。Among them, as shown in (4) in Figure 8 China, through the fully connected layer network of the classification task of the pulsar search network, the image processing results are processed to obtain the image processing results of the time phase distribution image and the frequency phase distribution image, which can be The discovery of a new pulsar was confirmed.

有益技术效果:Beneficial technical effects:

本发明通过获取脉冲星搜索候选体信号对应的时间相位分布图像和频率相位分布图像;对所述时间相位分布图像和频率相位分布图像进行组合,形成所述脉冲星搜索网络的训练样本集合;获取所述获取脉冲星搜索候选体信号对应的候选体的脉冲轮廓图像和脉冲星标签值;基于所述训练样本集合和所述候选体的脉冲轮廓图像,对所述脉冲星搜索网络的主干网络进行第一训练,确定所述主干网络的初始网络参数;基于所述训练样本集合和所述和脉冲星标签值,对所述脉冲星搜索网络的主干网络进行第二训练,以实现调整所述脉冲星搜索网络的主干网络的初始参数,得到所述脉冲星搜索网络的主干网络的目标参数,由此,能够实现通过对脉冲星搜索网络的训练,能够在减少训练数据总量和无需重进行监督训练的前提下,稳定提高脉冲星搜索网络训练训练的准确率,减轻脉冲星搜索网络的过拟合,增强脉冲星搜索网络模型的泛化能力,便于将及时所训练的脉冲星搜索网络模型部署于网络终端中,实现脉冲星搜索网络模型的大规模应用。The present invention obtains the time phase distribution image and frequency phase distribution image corresponding to the pulsar search candidate signal; combines the time phase distribution image and the frequency phase distribution image to form a training sample set of the pulsar search network; obtains Obtaining the pulse profile image and pulsar label value of the candidate body corresponding to the pulsar search candidate body signal; based on the training sample set and the pulse profile image of the candidate body, conducting the backbone network of the pulsar search network The first training is to determine the initial network parameters of the backbone network; based on the training sample set and the sum pulsar label value, the second training is performed on the backbone network of the pulsar search network to adjust the pulse The initial parameters of the backbone network of the star search network are obtained to obtain the target parameters of the backbone network of the pulsar search network. Therefore, it is possible to reduce the total amount of training data and eliminate the need for re-supervision by training the pulsar search network. Under the premise of training, it can steadily improve the accuracy of pulsar search network training, reduce over-fitting of pulsar search network, enhance the generalization ability of pulsar search network model, and facilitate the deployment of timely trained pulsar search network model. In network terminals, large-scale application of the pulsar search network model is realized.

以上所述,仅为本发明的实施例而已,并非用于限定本发明的保护范围,凡在本发明的精神和原则之内所作的任何修改、等同替换和改进等,均应包含在本发明的保护范围之内。The above are only examples of the present invention and are not used to limit the scope of the present invention. Any modifications, equivalent substitutions and improvements made within the spirit and principles of the present invention shall be included in the present invention. within the scope of protection.

Claims (13)

1. A pulsar search network training method, the method comprising:
acquiring a time phase distribution image and a frequency phase distribution image corresponding to the pulsar search candidate signals;
combining the time phase distribution image and the frequency phase distribution image to form a training sample set of the pulsar search network;
acquiring a pulse contour image and a pulsar label value of a candidate corresponding to the acquired pulsar search candidate signal;
performing first training on a backbone network of the pulsar search network based on the training sample set and the pulse contour image of the candidate, and determining initial network parameters of the backbone network;
and performing second training on the backbone network of the pulsar search network based on the training sample set and the sum pulsar label value to realize adjustment of initial parameters of the backbone network of the pulsar search network and obtain target parameters of the backbone network of the pulsar search network.
2. The method of claim 1, wherein combining the time phase distribution image and frequency phase distribution image to form a training sample set for the pulsar search network comprises:
determining noise values of the time phase distribution image and the frequency phase distribution image, and respectively carrying out noise removal processing on the determined time phase distribution image and the determined frequency phase distribution image;
respectively carrying out standardization processing on the time phase distribution image and the frequency phase distribution image subjected to the denoising processing to obtain a time phase distribution image and a frequency phase distribution image which accord with the single-channel gray level image standard;
and combining the time phase distribution image and the frequency phase distribution image which accord with the single-channel gray level image standard to form a training sample set of the pulsar search network.
3. The method of claim 2, wherein said determining noise values of said time phase distribution image and said frequency phase distribution image, respectively, comprises:
determining a dynamic noise threshold matched with the use environment of the pulsar search network according to the position of the target area corresponding to the pulsar search candidate signal;
And carrying out noise reduction processing on the time phase distribution image and the frequency phase distribution image according to the dynamic noise threshold value so as to form a time phase distribution image and a frequency phase distribution image matched with the dynamic noise threshold value.
4. The method of claim 2, wherein said determining noise values of said time phase distribution image and said frequency phase distribution image, respectively, comprises:
determining a fixed noise threshold matched with the use environment of the pulsar search network according to the type of the radio telescope corresponding to the pulsar search candidate signal;
and carrying out noise reduction processing on the time phase distribution image and the frequency phase distribution image according to the fixed noise threshold value so as to form a time phase distribution image and a frequency phase distribution image matched with the fixed noise threshold value.
5. The method of claim 1, wherein the acquiring the pulse profile image of the candidate corresponding to the pulsar search candidate signal comprises:
acquiring an original pulse contour image of a candidate corresponding to the pulsar search candidate signal;
Performing standardization processing on the sequence value of the original pulse contour image to obtain the standard sequence value of the original pulse contour image;
and adjusting the standard sequence value of the original pulse contour image to obtain the pulse contour image corresponding to the pulsar search candidate signal so as to realize matching with the accuracy of the pulsar search network.
6. The method of claim 5, wherein the first training of the backbone network of the pulsar search network based on the set of training samples and the pulse profile image of the candidate comprises:
determining a label value of the pulse contour image of the candidate based on the pulse contour image of the candidate;
determining a first loss function corresponding to the first training based on the label value of the pulse contour image through a pre-training full-connection layer network corresponding to the backbone network;
bringing training samples in the training sample set into the first loss function;
and when the first loss function meets a first convergence condition, determining initial network parameters of the backbone network.
7. The method of claim 1, wherein the performing a second training on the backbone network of the pulsar search network based on the set of training samples and the sum pulsar label values to achieve adjusting initial parameters of the backbone network of the pulsar search network to obtain target parameters of the backbone network of the pulsar search network comprises:
determining a second loss function corresponding to the second training based on the pulsar label value through a classification task full-connection layer network corresponding to the backbone network;
bringing training samples in the training sample set into the second loss function;
when the second loss function meets a second convergence condition, determining updating parameters of the backbone network;
and based on the updated parameters of the backbone network, adjusting initial parameters of the backbone network of the pulsar search network to obtain target parameters of the backbone network of the pulsar search network.
8. A pulsar search method, said method comprising:
receiving a pulsar search candidate signal without labels;
acquiring a time phase distribution image and a frequency phase distribution image corresponding to the pulsar search candidate signals;
Processing the time phase distribution image and the frequency phase distribution image through a backbone network of a pulsar search network, and determining an image processing result;
and processing the image processing result through the classification task full-connection layer network of the pulsar search network to obtain a prediction result of the pulsar search candidate signal.
9. A pulsar search network training apparatus, said apparatus comprising:
the information transmission module is used for acquiring a time phase distribution image and a frequency phase distribution image corresponding to the pulsar search candidate signals;
the information processing module is used for combining the time phase distribution image and the frequency phase distribution image to form a training sample set of the pulsar search network;
the information processing module is used for acquiring the pulse contour image and the pulsar label value of the candidate corresponding to the acquired pulsar search candidate signal;
the information processing module is used for performing first training on a backbone network of the pulsar search network based on the training sample set and the pulse contour image of the candidate body, and determining initial network parameters of the backbone network;
The information processing module is used for performing second training on the main network of the pulsar search network based on the training sample set and the sum pulsar label value so as to adjust initial parameters of the main network of the pulsar search network and obtain target parameters of the main network of the pulsar search network.
10. A pulsar search device, said device comprising:
the signal transmission module is used for receiving the pulsar search candidate signals without labels;
the signal processing module is used for acquiring a time phase distribution image and a frequency phase distribution image corresponding to the pulsar search candidate signals;
the signal processing module is used for processing the time phase distribution image and the frequency phase distribution image through a backbone network of a pulsar search network and determining an image processing result;
and the signal processing module is used for processing the image processing result through the classification task full-connection layer network of the pulsar search network to obtain the prediction result of the pulsar search candidate signal.
11. An electronic device, the electronic device comprising:
A memory for storing executable instructions;
a processor configured to implement the pulsar search network training method of any one of claims 1 to 7, or implement the pulsar search method of claim 8, when executing the executable instructions stored in the memory.
12. A computer program product comprising a computer program or instructions which, when executed by a processor, implements the pulsar search network training method of any one of claims 1 to 7, or implements the pulsar search method of claim 8.
13. A computer readable storage medium storing executable instructions which when executed by a processor implement the pulsar search network training method of any one of claims 1 to 7, or implement the pulsar search method of claim 8.
CN202210226661.0A 2022-03-09 2022-03-09 Pulsar search network training method, pulsar search method, device, and equipment Active CN116796826B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210226661.0A CN116796826B (en) 2022-03-09 2022-03-09 Pulsar search network training method, pulsar search method, device, and equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210226661.0A CN116796826B (en) 2022-03-09 2022-03-09 Pulsar search network training method, pulsar search method, device, and equipment

Publications (2)

Publication Number Publication Date
CN116796826A true CN116796826A (en) 2023-09-22
CN116796826B CN116796826B (en) 2025-02-11

Family

ID=88046566

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210226661.0A Active CN116796826B (en) 2022-03-09 2022-03-09 Pulsar search network training method, pulsar search method, device, and equipment

Country Status (1)

Country Link
CN (1) CN116796826B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN120599387A (en) * 2025-08-07 2025-09-05 之江实验室 End-to-end pulsar candidate identification method, device, and computer equipment

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109272026A (en) * 2018-08-30 2019-01-25 昆明理工大学 A kind of pulsar recognition methods based on convolutional neural networks
CN109272027A (en) * 2018-08-30 2019-01-25 昆明理工大学 A kind of pulsar candidate's body recognition methods based on artificial neural network ensembles
CN110059804A (en) * 2019-04-15 2019-07-26 北京迈格威科技有限公司 Network training method, data processing method and device to be searched
CN111189445A (en) * 2020-01-14 2020-05-22 哈尔滨工业大学 Pulsar identification method based on stochastic resonance
CN111985615A (en) * 2020-08-13 2020-11-24 北京北斗天巡科技有限公司 Pulsar search method based on transfer learning lightweight neural network
CN112797989A (en) * 2021-03-29 2021-05-14 中国人民解放军国防科技大学 A Fast Search Method for Pulsar Frequency Parameters

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109272026A (en) * 2018-08-30 2019-01-25 昆明理工大学 A kind of pulsar recognition methods based on convolutional neural networks
CN109272027A (en) * 2018-08-30 2019-01-25 昆明理工大学 A kind of pulsar candidate's body recognition methods based on artificial neural network ensembles
CN110059804A (en) * 2019-04-15 2019-07-26 北京迈格威科技有限公司 Network training method, data processing method and device to be searched
CN111189445A (en) * 2020-01-14 2020-05-22 哈尔滨工业大学 Pulsar identification method based on stochastic resonance
CN111985615A (en) * 2020-08-13 2020-11-24 北京北斗天巡科技有限公司 Pulsar search method based on transfer learning lightweight neural network
CN112797989A (en) * 2021-03-29 2021-05-14 中国人民解放军国防科技大学 A Fast Search Method for Pulsar Frequency Parameters

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
王元超: "基于机器学习的脉冲星候选样本分类算法研究", 《中国优秀博士论文全文库》, 15 February 2021 (2021-02-15), pages 4 *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN120599387A (en) * 2025-08-07 2025-09-05 之江实验室 End-to-end pulsar candidate identification method, device, and computer equipment

Also Published As

Publication number Publication date
CN116796826B (en) 2025-02-11

Similar Documents

Publication Publication Date Title
CN113705769B (en) A neural network training method and device
Yuan et al. A comprehensive review of binary neural network
CN111782840B (en) Image question-answering method, device, computer equipment and medium
CN113469088B (en) SAR image ship target detection method and system under passive interference scene
US11604960B2 (en) Differential bit width neural architecture search
US20250209843A1 (en) Data processing method and apparatus
CN112699991A (en) Method, electronic device, and computer-readable medium for accelerating information processing for neural network training
Imani et al. Fach: Fpga-based acceleration of hyperdimensional computing by reducing computational complexity
CN117456297A (en) Image generation method, neural network compression method and related devices and equipment
CN112905795A (en) Text intention classification method, device and readable medium
US12475565B2 (en) Amodal instance segmentation using diffusion models
Sun et al. Fast object detection based on binary deep convolution neural networks
EP4318322A1 (en) Data processing method and related device
US11948078B2 (en) Joint representation learning from images and text
WO2022156561A1 (en) Method and device for natural language processing
CN111027576A (en) Cooperative significance detection method based on cooperative significance generation type countermeasure network
KR20220047228A (en) Method and apparatus for generating image classification model, electronic device, storage medium, computer program, roadside device and cloud control platform
US20220335209A1 (en) Systems, apparatus, articles of manufacture, and methods to generate digitized handwriting with user style adaptations
CN112418320A (en) A kind of enterprise association relationship identification method, device and storage medium
US20250094818A1 (en) Data denoising method and related device
WO2021012263A1 (en) Systems and methods for end-to-end deep reinforcement learning based coreference resolution
US20240005164A1 (en) Neural Network Training Method and Related Device
WO2019118639A1 (en) Residual binary neural network
CN112532251B (en) A method and device for data processing
Wang et al. Gated convolutional LSTM for speech commands recognition

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
REG Reference to a national code

Ref country code: HK

Ref legal event code: DE

Ref document number: 40094470

Country of ref document: HK

GR01 Patent grant
GR01 Patent grant