[go: up one dir, main page]

CN111126249A - A pedestrian re-identification method and device combining big data and Bayesian - Google Patents

A pedestrian re-identification method and device combining big data and Bayesian Download PDF

Info

Publication number
CN111126249A
CN111126249A CN201911327696.8A CN201911327696A CN111126249A CN 111126249 A CN111126249 A CN 111126249A CN 201911327696 A CN201911327696 A CN 201911327696A CN 111126249 A CN111126249 A CN 111126249A
Authority
CN
China
Prior art keywords
pedestrian
model
bayesian
identification
module
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201911327696.8A
Other languages
Chinese (zh)
Inventor
李宁
张斯尧
罗茜
王思远
蒋杰
张�诚
李乾
谢喜林
黄晋
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Jiuling Software Technology Co ltd
Original Assignee
Shenzhen Jiuling Software Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Jiuling Software Technology Co ltd filed Critical Shenzhen Jiuling Software Technology Co ltd
Priority to CN201911327696.8A priority Critical patent/CN111126249A/en
Publication of CN111126249A publication Critical patent/CN111126249A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/103Static body considered as a whole, e.g. static pedestrian or occupant recognition
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/50Information retrieval; Database structures therefor; File system structures therefor of still image data
    • G06F16/58Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/583Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/22Matching criteria, e.g. proximity measures
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F18/2415Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on parametric or probabilistic models, e.g. based on likelihood ratio or false acceptance rate versus a false rejection rate
    • G06F18/24155Bayesian classification

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Artificial Intelligence (AREA)
  • Library & Information Science (AREA)
  • Probability & Statistics with Applications (AREA)
  • Human Computer Interaction (AREA)
  • Multimedia (AREA)
  • Databases & Information Systems (AREA)
  • Image Analysis (AREA)

Abstract

本发明公开了一种大数据和贝叶斯相结合的行人重识别方法及装置,所述方法包括:利用行人图像数据库对行人重识别系统模型进行分布式训练;对查询对象和排名列表中的多个候选对象进行基于贝叶斯查询扩展的重识别再排序;将再排序后的查询对象和候选对象进行PTGAN处理;将进行PTGAN处理后的查询对象和候选对象输入训练好的贝叶斯模型,通过训练数据中的图像距离,计算每个候选对象真实匹配概率,并对候选对象进行重新排序;根据推理线索模型调整行人重识别系统模型的目标参数的参数值;通过将待识别图像输入训练好的行人重识别系统模型,搜索出相似度最高的行人图像。本发明解决了现有技术中的行人重识别方法跨摄像头的检索难度大,重识别准确率低的问题。

Figure 201911327696

The invention discloses a pedestrian re-identification method and device combining big data and Bayesian. The method includes: using a pedestrian image database to perform distributed training on a pedestrian re-identification system model; Multiple candidate objects are re-identified and re-ranked based on Bayesian query expansion; the re-ranked query objects and candidate objects are processed by PTGAN; the query objects and candidate objects processed by PTGAN are input into the trained Bayesian model , calculate the true matching probability of each candidate object through the image distance in the training data, and reorder the candidate objects; adjust the parameter value of the target parameter of the pedestrian re-identification system model according to the inference clue model; A good pedestrian re-identification system model searches for the pedestrian image with the highest similarity. The invention solves the problems that the pedestrian re-identification method in the prior art is difficult to retrieve across cameras and has a low re-identification accuracy rate.

Figure 201911327696

Description

一种大数据和贝叶斯相结合的行人重识别方法及装置A pedestrian re-identification method and device combining big data and Bayesian

技术领域technical field

本发明涉及计算机视觉和智慧城市技术领域,具体涉及一种大数据和贝叶斯相结合的行人重识别方法、装置、终端设备及计算机可读介质。The invention relates to the technical fields of computer vision and smart cities, and in particular to a pedestrian re-identification method, device, terminal device and computer-readable medium combining big data and Bayesian.

背景技术Background technique

随着人工智能、计算机视觉和硬件技术的不断发展,视频图像处理技术已经广泛应用于智能城市系统中。With the continuous development of artificial intelligence, computer vision and hardware technology, video image processing technology has been widely used in smart city systems.

行人重识别(Person Re-identification)也称行人再识别,简称为Re-ID。是利用计算机视觉技术判断图像或者视频序列中是否存在特定行人的技术。广泛被认为是一个图像检索的子问题。给定一个监控行人图像,检索跨设备下的该行人图像。由于不同摄像设备之间的差异,同时行人兼具刚性和柔性的特性,外观易受穿着、尺度、遮挡、姿态和视角等影响,使得行人重识别成为计算机视觉领域中一个既具有研究价值同时又极具挑战性的热门课题。Pedestrian Re-identification (Person Re-identification) is also called Pedestrian Re-Identification, or Re-ID for short. It is a technology that uses computer vision technology to determine whether there is a specific pedestrian in an image or video sequence. Widely regarded as a sub-problem of image retrieval. Given a surveillance pedestrian image, retrieve the pedestrian image across devices. Due to the differences between different camera devices, pedestrians are both rigid and flexible, and their appearance is easily affected by clothing, scale, occlusion, posture, and perspective, making pedestrian re-identification a field of computer vision. Challenging topic.

目前来说,虽然行人重识别的检测能力已经显著提升,但是在实际场合中很多具有挑战性的问题还没被完全解决:比如在复杂的场景,光线差异,视角和姿势的改变,大量的行人在一个监控摄像头网络中等情况。在这些情况下,跨摄像头的检索通常难度会很大,同时前期进行视频图像样本训练时的标注工作代价昂贵,需要耗费大量的人力,并且往往现有算法通常无法达到预期效果,重识别准确率较低。At present, although the detection ability of pedestrian re-identification has been significantly improved, many challenging problems have not been fully solved in practical situations: for example, in complex scenes, light differences, changes in perspective and posture, a large number of pedestrians In a surveillance camera network medium situation. In these cases, cross-camera retrieval is usually very difficult. At the same time, the labeling work during the training of video image samples in the early stage is expensive and requires a lot of manpower, and the existing algorithms often fail to achieve the expected results. lower.

发明内容SUMMARY OF THE INVENTION

有鉴于此,本发明的目的在于提供一种大数据和贝叶斯相结合的行人重识别方法、装置、终端设备及计算机可读介质,能够提高不同摄像头下行人重识别的准确率,解决了现有技术中的行人重识别方法跨摄像头的检索难度大,重识别准确率低的问题。In view of this, the purpose of the present invention is to provide a pedestrian re-identification method, device, terminal device and computer-readable medium combining big data and Bayesian, which can improve the accuracy rate of pedestrian re-identification with different cameras, and solve the problem. The pedestrian re-identification method in the prior art is difficult to retrieve across cameras, and the re-identification accuracy is low.

本发明实施例的第一方面提供了一种大数据和贝叶斯相结合的行人重识别方法,包括:A first aspect of the embodiments of the present invention provides a pedestrian re-identification method combining big data and Bayesian, including:

利用行人图像数据库对行人重识别系统模型进行分布式训练,得到训练之后的所述行人重识别系统模型,其中,所述行人图像数据库包括多个匹配图像组,所述匹配图像组包括至少两个匹配图像;Distributed training is performed on the pedestrian re-identification system model by using a pedestrian image database, and the trained pedestrian re-identification system model is obtained, wherein the pedestrian image database includes a plurality of matching image groups, and the matching image group includes at least two match image;

将查询对象输入所述行人重识别系统模型,得到多个候选对象的排名列表;Input the query object into the pedestrian re-identification system model to obtain a ranking list of multiple candidate objects;

对所述查询对象和排名列表中的多个候选对象进行基于贝叶斯查询扩展的重识别再排序;Re-identification and re-ranking based on Bayesian query expansion is performed on the query object and the plurality of candidate objects in the ranking list;

将再排序后的查询对象和候选对象进行PTGAN处理,实现行人前景不变的前提下实现背景差异区域的迁移;The reordered query objects and candidate objects are processed by PTGAN to realize the migration of the background difference area under the premise of the pedestrian foreground unchanged;

将进行PTGAN处理后的查询对象和候选对象输入训练好的贝叶斯模型,通过训练数据中的图像距离,计算每个候选对象真实匹配概率,并对所述候选对象进行重新排序;Input the query object and candidate object processed by PTGAN into the trained Bayesian model, calculate the true matching probability of each candidate object through the image distance in the training data, and reorder the candidate objects;

对进行PTGAN处理后的查询对象和进行重新排序后的候选对象进行多维度特征提取并确定推理线索模型;Multi-dimensional feature extraction is performed on the query object after PTGAN processing and the candidate object after reordering, and the inference clue model is determined;

使用推理算法对所述推理线索模型进行调整并确定最后的推理线索模型;using an inference algorithm to adjust the inference cue model and determine a final inference cue model;

根据所述推理线索模型调整所述行人重识别系统模型的目标参数的参数值;Adjust the parameter value of the target parameter of the pedestrian re-identification system model according to the inference clue model;

通过将待识别图像输入训练好的行人重识别系统模型,搜索出相似度最高的行人图像。By inputting the image to be recognized into the trained pedestrian re-identification system model, the pedestrian image with the highest similarity is searched.

进一步地,利用行人图像数据库对行人重识别系统模型进行分布式训练,得到训练之后的所述行人重识别系统模型,包括:Further, using the pedestrian image database to perform distributed training on the pedestrian re-identification system model, to obtain the pedestrian re-identification system model after the training, including:

通过使用多个处理器增大批量大小对所述行人重识别系统模型进行迭代训练;iteratively train the person re-identification system model by increasing the batch size using multiple processors;

根据线性缩放和预热策略算法对所述行人重识别系统模型进行迭代训练;Iteratively train the pedestrian re-identification system model according to the linear scaling and warm-up strategy algorithm;

应用适应率缩放(LARS)对所述行人重识别系统模型中的每一层网络使用不同的学习率。Applying adaptation rate scaling (LARS) uses different learning rates for each layer of the network in the person re-id system model.

进一步地,对所述查询对象和排名列表中的多个候选对象进行基于贝叶斯查询扩展的重识别再排序,包括:Further, re-identification and re-ranking based on Bayesian query expansion is performed on the query object and multiple candidate objects in the ranking list, including:

利用行人图像数据库训练贝叶斯模型,得到训练后的贝叶斯模型;Use the pedestrian image database to train the Bayesian model to obtain the trained Bayesian model;

根据所述查询对象和多个候选对象图像之间的距离,通过所述训练后的贝叶斯模型预测每个候选对象的真实匹配概率;According to the distance between the query object and a plurality of candidate object images, the true matching probability of each candidate object is predicted by the trained Bayesian model;

根据所述每个候选对象的真实匹配概率进行查询扩展,通过所述查询扩展生成新的排名列表。Query expansion is performed according to the true matching probability of each candidate object, and a new ranking list is generated through the query expansion.

进一步地,所述对进行PTGAN处理后的查询对象和进行重新排序后的候选对象进行多维度特征提取并确定推理线索模型,包括:Further, the multi-dimensional feature extraction is performed on the query object after PTGAN processing and the reordered candidate object and the inference clue model is determined, including:

提取行人的外观特征;Extract the appearance features of pedestrians;

提取行人的面部特征;Extract the facial features of pedestrians;

根据行人在不同视频头的时间和定位特征构建定位分支Markov链,根据定位分支Markov链训练推理线索模型。The positioning branch Markov chain is constructed according to the time and positioning features of pedestrians in different video heads, and the inference cue model is trained according to the positioning branch Markov chain.

本发明实施例的第二方面提供了一种大数据和贝叶斯相结合的行人重识别装置,其特征在于,包括:A second aspect of the embodiments of the present invention provides a pedestrian re-identification device combining big data and Bayesian, characterized in that it includes:

分布式训练模块,用于利用行人图像数据库对行人重识别系统模型进行分布式训练,得到训练之后的所述行人重识别系统模型,其中,所述行人图像数据库包括多个匹配图像组,所述匹配图像组包括至少两个匹配图像;The distributed training module is used to perform distributed training on the pedestrian re-identification system model by using a pedestrian image database to obtain the pedestrian re-identification system model after training, wherein the pedestrian image database includes a plurality of matching image groups, and the The matched image group includes at least two matched images;

排名列表获取模块,用于将查询对象输入所述行人重识别系统模型,得到多个候选对象的排名列表;a ranking list acquisition module, used for inputting the query object into the pedestrian re-identification system model to obtain a ranking list of multiple candidate objects;

重识别模块,用于对所述查询对象和排名列表中的多个候选对象进行基于贝叶斯查询扩展的重识别再排序;A re-identification module for performing re-identification and re-ordering based on Bayesian query expansion on the query object and multiple candidate objects in the ranking list;

PTGAN处理模块,用于将再排序后的查询对象和候选对象进行PTGAN处理,实现行人前景不变的前提下实现背景差异区域的迁移;The PTGAN processing module is used to perform PTGAN processing on the reordered query objects and candidate objects, so as to realize the migration of the background difference area under the premise that the pedestrian foreground remains unchanged;

训练模块,用于将进行PTGAN处理后的查询对象和候选对象输入训练好的贝叶斯模型,通过训练数据中的图像距离,计算每个候选对象真实匹配概率,并对所述候选对象进行重新排序;The training module is used to input the query object and candidate object processed by PTGAN into the trained Bayesian model, calculate the true matching probability of each candidate object through the image distance in the training data, and re-engineer the candidate object. sort;

推理线索模块,用于对进行PTGAN处理后的查询对象和进行重新排序后的候选对象进行多维度特征提取并确定推理线索模型;The inference clue module is used to extract multi-dimensional features for the query object processed by PTGAN and the candidate object after reordering and determine the inference clue model;

推理线索调整模块,用于使用推理算法对所述推理线索模型进行调整并确定最后的推理线索模型;an inference clue adjustment module, used for using an inference algorithm to adjust the inference clue model and to determine the final inference clue model;

模型调整模块,用于根据所述推理线索模型调整所述行人重识别系统模型的目标参数的参数值;a model adjustment module, configured to adjust the parameter value of the target parameter of the pedestrian re-identification system model according to the inference clue model;

识别模块,用于通过将待识别图像输入训练好的行人重识别系统模型,搜索出相似度最高的行人图像。The recognition module is used to search for the pedestrian image with the highest similarity by inputting the image to be recognized into the trained pedestrian re-identification system model.

进一步地,所述分布式训练模块包括:Further, the distributed training module includes:

处理器增加模块,用于通过使用多个处理器增大批量大小对所述行人重识别系统模型进行迭代训练;a processor augmentation module for iteratively training the person re-identification system model by using a plurality of processors to increase the batch size;

批量算法模块,用于根据线性缩放和预热策略算法对所述行人重识别系统模型进行迭代训练;The batch algorithm module is used for iteratively training the pedestrian re-identification system model according to the linear scaling and warm-up strategy algorithm;

学习率调整模块,用于应用适应率缩放(LARS)对所述行人重识别系统模型中的每一层网络使用不同的学习率。A learning rate adjustment module for applying adaptation rate scaling (LARS) to use a different learning rate for each layer of the network in the person re-identification system model.

进一步地,所述重识别模块包括:Further, the re-identification module includes:

贝叶斯训练模块,用于利用行人图像数据库训练贝叶斯模型,得到训练后的贝叶斯模型;The Bayesian training module is used to train the Bayesian model using the pedestrian image database to obtain the trained Bayesian model;

预测模块,用于根据所述查询对象和多个候选对象图像之间的距离,通过所述训练后的贝叶斯模型预测每个候选对象的真实匹配概率;a prediction module, configured to predict the true matching probability of each candidate object through the trained Bayesian model according to the distance between the query object and a plurality of candidate object images;

查询扩展模块,用于根据所述每个候选对象的真实匹配概率进行查询扩展,通过所述查询扩展生成新的排名列表。A query expansion module, configured to perform query expansion according to the true matching probability of each candidate object, and generate a new ranking list through the query expansion.

进一步地,所述推理线索模块包括:Further, the inference clue module includes:

外观提取模块,用于提取行人的外观特征;Appearance extraction module, used to extract the appearance features of pedestrians;

面部提取模块,用于提取行人的面部特征;The face extraction module is used to extract the facial features of pedestrians;

定位分支模块,用于根据行人在不同视频头的时间和定位特征构建定位分支Markov链,根据定位分支Markov链训练推理线索模型。The positioning branch module is used to construct the positioning branch Markov chain according to the time and positioning features of pedestrians in different video heads, and train the inference clue model according to the positioning branch Markov chain.

本发明实施例的第三方面提供了一种终端设备,包括存储器、处理器以及存储在所述存储器中并可在所述处理器上运行的计算机程序,所述处理器执行所述计算机程序时实现上述大数据和贝叶斯相结合的行人重识别方法的步骤。A third aspect of the embodiments of the present invention provides a terminal device, including a memory, a processor, and a computer program stored in the memory and executable on the processor, when the processor executes the computer program Steps to implement the above method of person re-identification combining big data and Bayesian.

本发明实施例的第四方面提供了一种计算机可读介质,所述计算机可读介质存储有计算机程序,所述计算机程序被处理执行时实现上述大数据和贝叶斯相结合的行人重识别方法的步骤。A fourth aspect of the embodiments of the present invention provides a computer-readable medium, where the computer-readable medium stores a computer program, and when the computer program is processed and executed, realizes person re-identification combining the above-mentioned big data and Bayesian steps of the method.

在本发明实施例中,通过对行人重识别系统模型进行分布式训练,大大提高了模型训练的速度,同时,通过基于贝叶斯查询扩展的重识别再排序以及PTGAN处理,提高了复杂条件下行人重识别的准确率、提高了系统的鲁棒性。解决了现有技术中的行人重识别方法跨摄像头的检索难度大,重识别准确率低的问题。In the embodiment of the present invention, by performing distributed training on the pedestrian re-identification system model, the speed of model training is greatly improved, and at the same time, through the re-identification and re-ordering based on Bayesian query expansion and PTGAN processing, the downlink of complex conditions is improved. The accuracy of person re-identification improves the robustness of the system. The problem of the pedestrian re-identification method in the prior art that the retrieval across cameras is difficult and the re-identification accuracy rate is low is solved.

附图说明Description of drawings

为了更清楚地说明本发明实施例中的技术方案,下面将对实施例或现有技术描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本发明的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动性的前提下,还可以根据这些附图获得其他的附图。In order to illustrate the technical solutions in the embodiments of the present invention more clearly, the following briefly introduces the accompanying drawings that need to be used in the description of the embodiments or the prior art. Obviously, the drawings in the following description are only for the present invention. In some embodiments, for those of ordinary skill in the art, other drawings can also be obtained according to these drawings without any creative effort.

图1是本发明实施例提供的一种大数据和贝叶斯相结合的行人重识别方法的流程图;Fig. 1 is a flow chart of a pedestrian re-identification method combining big data and Bayesian provided by an embodiment of the present invention;

图2是本发明实施例提供的不同行人重识别方法实时转换效果对比图;2 is a comparison diagram of real-time conversion effects of different pedestrian re-identification methods provided by an embodiment of the present invention;

图3是本发明实施例提供的一种大数据和贝叶斯相结合的行人重识别装置的结构示意图;3 is a schematic structural diagram of a pedestrian re-identification device combining big data and Bayesian provided by an embodiment of the present invention;

图4是本发明实施例提供的分布式训练模块的细化结构图;4 is a detailed structural diagram of a distributed training module provided by an embodiment of the present invention;

图5是本发明实施例提供的重识别模块的细化结构图;5 is a detailed structural diagram of a re-identification module provided by an embodiment of the present invention;

图6是本发明实施例提供的推理线索模块的细化结构图;6 is a detailed structural diagram of an inference clue module provided by an embodiment of the present invention;

图7是本发明实施例提供的终端设备的示意图。FIG. 7 is a schematic diagram of a terminal device provided by an embodiment of the present invention.

具体实施方式Detailed ways

以下描述中,为了说明而不是为了限定,提出了诸如特定系统结构、技术之类的具体细节,以便透彻理解本发明实施例。然而,本领域的技术人员应当清楚,在没有这些具体细节的其它实施例中也可以实现本发明。在其它情况中,省略对众所周知的系统、装置、电路以及方法的详细说明,以免不必要的细节妨碍本发明的描述。In the following description, for the purpose of illustration rather than limitation, specific details such as specific system structures and technologies are set forth in order to provide a thorough understanding of the embodiments of the present invention. However, it will be apparent to those skilled in the art that the present invention may be practiced in other embodiments without these specific details. In other instances, detailed descriptions of well-known systems, devices, circuits, and methods are omitted so as not to obscure the description of the present invention with unnecessary detail.

为了说明本发明所述的技术方案,下面通过具体实施例来进行说明。In order to illustrate the technical solutions of the present invention, the following specific embodiments are used for description.

请参阅图1,图1是本发明实施例提供的大数据和贝叶斯相结合的行人重识别方法的流程图。如图1所示,本实施例的大数据和贝叶斯相结合的行人重识别方法包括以下步骤:Please refer to FIG. 1. FIG. 1 is a flowchart of a method for pedestrian re-identification combining big data and Bayesian provided by an embodiment of the present invention. As shown in FIG. 1 , the pedestrian re-identification method combining big data and Bayesian in this embodiment includes the following steps:

步骤S102,利用行人图像数据库对行人重识别系统模型进行分布式训练,得到训练之后的行人重识别系统模型,其中,行人图像数据库包括多个匹配图像组,匹配图像组包括至少两个匹配图像。Step S102, using the pedestrian image database to perform distributed training on the pedestrian re-identification system model to obtain a trained pedestrian re-identification system model, wherein the pedestrian image database includes multiple matching image groups, and the matching image group includes at least two matching images.

进一步地,利用行人图像数据库对行人重识别系统模型进行分布式训练,得到训练之后的所述行人重识别系统模型,包括:Further, using the pedestrian image database to perform distributed training on the pedestrian re-identification system model, to obtain the pedestrian re-identification system model after the training, including:

步骤1,通过使用多个处理器增大批量大小对行人重识别系统模型进行迭代训练。Step 1, iteratively train the person re-ID system model by using multiple processors to increase the batch size.

通过迭代算法,将算法扩展使用到更多的处理器,并在每次迭代时加载更多的行人图像数据,以此来减少总训练时间;Reduce total training time by iterating the algorithm by scaling the algorithm to use more processors and loading more pedestrian image data at each iteration;

通常来说,在一定范围内,更大的批量将使单个GPU的速度更高。原因是低级矩阵计算库将更有效。对于使用ImageNet训练Res-Net 50模型,每个GPU的最佳批量大小为512。如果想要使用许多GPU并使每个GPU都有效,则需要更大的批量大小。例如,如果有16个GPU,那么应该将批量大小设置为16×512=8192。理想情况下,如果固定总数量访问量,随着处理器数量的增加相应呈线性增加批量大小,那么改进的SGD(随机梯度下降)迭代次数将会线性减小,每次迭代的时间成本保持不变,因此总时间也将随处理器数量线性减少。Generally speaking, larger batch sizes will make a single GPU faster, within certain limits. The reason is that a low-level matrix computation library will be more efficient. For training a Res-Net 50 model with ImageNet, the optimal batch size per GPU is 512. If you want to use many GPUs and make each GPU efficient, you will need a larger batch size. For example, if there are 16 GPUs, then the batch size should be set to 16×512=8192. Ideally, if the total number of accesses is fixed, and the batch size increases linearly with the number of processors, then the number of iterations of the improved SGD (stochastic gradient descent) will decrease linearly, and the time cost of each iteration remains constant. so the total time will also decrease linearly with the number of processors.

具体的改进的随机梯度下降(SGD)迭代算法如下:令w代表DNN的权重、X代表训练数据、n为X中的样本数,而Y代表训练数据X的标注。我们令xi为X的样本,(xi,w)为xi和其标注yi(i∈{1,2,...,n))所计算出的损失。本发明使用如交叉熵函数那样的损失函数。DNN训练的目标是最小化方程(1)中的损失函数。公式如下:The specific improved stochastic gradient descent (SGD) iterative algorithm is as follows: let w represent the weight of the DNN, X represent the training data, n be the number of samples in X, and Y represent the labeling of the training data X. We let x i be a sample of X, and ( xi , w) be the loss computed by xi and its label y i (i∈{1,2,...,n)). The present invention uses a loss function such as a cross-entropy function. The goal of DNN training is to minimize the loss function in Equation (1). The formula is as follows:

Figure BDA0002328798720000051
Figure BDA0002328798720000051

在第t次迭代中,本发明算法使用前向和反向传播以求得损失函数对权重的梯度。然后,使用这个梯度来更新权重,根据梯度更新权重的方程(2)如下:In the t-th iteration, the algorithm of the present invention uses forward and backward propagation to obtain the gradient of the loss function to the weight. Then, using this gradient to update the weights, the equation (2) for updating the weights according to the gradient is as follows:

Figure BDA0002328798720000061
Figure BDA0002328798720000061

其中η为学习率。本发明算法令第t次迭代的批量大小为Bt,且Bt的大小为b。然后就可以基于以下方程(3)更新权重:where η is the learning rate. The algorithm of the present invention sets the batch size of the t-th iteration as B t and the size of B t as b. The weights can then be updated based on the following equation (3):

Figure BDA0002328798720000062
Figure BDA0002328798720000062

这种方法叫作小批量随机梯度下降。为了简化表达方式,我们可以说方程(4)中的更新规则代表我们使用权重的梯度

Figure BDA0002328798720000063
更新权重wt为wt+1。This method is called mini-batch stochastic gradient descent. To simplify the expression, we can say that the update rule in equation (4) represents the gradient of our use weights
Figure BDA0002328798720000063
The update weight wt is wt +1 .

Figure BDA0002328798720000064
Figure BDA0002328798720000064

用此方法,进行迭代,同时尽可能多的用到处理器,能够大幅度线性减少训练时间。Using this method, iterating and using as many processors as possible at the same time can greatly reduce the training time linearly.

步骤2,根据线性缩放和预热策略算法对行人重识别系统模型进行迭代训练。Step 2: Iteratively train the pedestrian re-identification system model according to the linear scaling and warm-up strategy algorithm.

在训练大的批量的时候,需要确保在运行相同数量的时间段(epoch)的情况下,实现与小的批量差不多的测试精度。在这里我们固定了时间段(epoch)的数量,因为:在统计上,一个时间段(epoch)意味着算法会触及整个数据集一次;而在计算上,固定时间段(epoch)的数量意味着固定浮点运算的数量。训练大批量的方法包括两种技术:When training large batches, you need to ensure that you achieve about the same test accuracy as small batches while running the same number of epochs. Here we fixed the number of epochs because: Statistically, an epoch means that the algorithm touches the entire dataset once; computationally, a fixed number of epochs means that Fixed number of floating point operations. Methods for training large batches include two techniques:

(1)线性缩放:将批量从B增加到kB,那么也应该将学习率从η增加到kη。(1) Linear scaling: increase the batch from B to kB, then the learning rate should also be increased from η to kη.

(2)预热策略:如果使用较大的学习率(η),应该从小的η值开始,然后在前几个时间段(epoch)将其增加到大的η。(2) Warm-up strategy: If you use a large learning rate (η), you should start with a small value of η and then increase it to a large η in the first few epochs (epochs).

通过线性缩放和预热策略,可以在一定范围内使用相对较大的批量数据图像。With linear scaling and warm-up strategies, relatively large batches of data images can be used within a certain range.

步骤3,应用适应率缩放(LARS)对行人重识别系统模型中的每一层网络使用不同的学习率。Step 3. Apply adaptation rate scaling (LARS) to use different learning rates for each layer of the network in the person re-id system model.

通过应用适应率缩放(LARS)对大批量训练层级进行相应训练,得出最后的快速训练模型。The final fast-training model is obtained by applying adaptation rate scaling (LARS) to the corresponding training of the large batch training layers.

为了提高大批量训练的准确性,本发明方法使用了一种新的更新学习率(LR)规则。在这里必须考虑单机情况,使用

Figure BDA0002328798720000065
来更新权重。使用数据并行方法,可以用相同的方式处理多机器版本。In order to improve the accuracy of large batch training, the method of the present invention uses a new update learning rate (LR) rule. The single machine situation must be considered here, using
Figure BDA0002328798720000065
to update the weights. Using a data-parallel approach, multi-machine versions can be processed in the same way.

每个层都有自己的权重w和梯度

Figure BDA0002328798720000066
标准SGD算法对所有层使用相同的LR(η)。然而,从日常实验中,可以观察到不同的层可能需要不同的LR。原因是||w||2和
Figure BDA0002328798720000067
之间的比率不同的层有很大的不同。Each layer has its own weight w and gradient
Figure BDA0002328798720000066
The standard SGD algorithm uses the same LR(n) for all layers. However, from daily experiments, it can be observed that different layers may require different LRs. The reason is that ||w||2 and
Figure BDA0002328798720000067
The ratios between the different layers are very different.

本发明使用LARS算法来解决这个问题。基本LR规则在等式(1)中定义。l是缩放因子,本算法中在AlexNet和ResNet训练中将l设置为0.001。γ是用户的调整参数。通常一个好的γ,值都在[1,50]之间。在这个等式中,不同的层可以有不同的LR。向SGD添加动量(用μ表示)和权重衰减(用β表示),并对LARS使用以下序列:The present invention uses the LARS algorithm to solve this problem. The basic LR rule is defined in equation (1). l is the scaling factor. In this algorithm, l is set to 0.001 in AlexNet and ResNet training. γ is the tuning parameter of the user. Usually a good γ, the value is between [1, 50]. In this equation, different layers can have different LRs. Add momentum (denoted by μ) and weight decay (denoted by β) to SGD, and use the following sequence for LARS:

得到每个可学习参数的本地LR,get the local LR for each learnable parameter,

得到每个层的真实LR,为η=γ×α;Get the real LR of each layer, which is η=γ×α;

通过

Figure BDA0002328798720000071
更新梯度;pass
Figure BDA0002328798720000071
update gradient;

通过

Figure BDA0002328798720000072
更新加速项a;pass
Figure BDA0002328798720000072
Update acceleration item a;

用w=w-a来更新权重。The weights are updated with w=w-a.

使用这种方法预热(warmup),用有大的批量的SGD可以实现与基准相同的精度。为了扩展到更大的批量大小(例如32k),需要将本地响应规范化(LRN)更改为批量归一化(BN)。本发明方法在每个卷积层之后添加BN。LARS可以帮助ResNet-50保持高的测试精度。当前的方法(线性缩放和预热)对于批量大小为16k和32k的精度要低得多。Using this approach to warmup, SGD with large batch sizes can achieve the same accuracy as the baseline. To scale to larger batch sizes (e.g. 32k), local response normalization (LRN) needs to be changed to batch normalization (BN). The inventive method adds BN after each convolutional layer. LARS can help ResNet-50 maintain high test accuracy. Current methods (linear scaling and warmup) are much less accurate for batch sizes of 16k and 32k.

步骤S104,将查询对象输入行人重识别系统模型,得到多个候选对象的排名列表。Step S104, the query object is input into the pedestrian re-identification system model to obtain a ranking list of multiple candidate objects.

其中,行人重识别系统模型可以为现有的任意MA-CNN或者是RNN等行人重识别网络模型。Among them, the person re-identification system model can be any existing person re-identification network model such as MA-CNN or RNN.

步骤S106,对查询对象和排名列表中的多个候选对象进行基于贝叶斯查询扩展的重识别再排序。Step S106, performing re-identification and re-ranking based on Bayesian query expansion on the query object and the multiple candidate objects in the ranking list.

进一步地,对所述查询对象和排名列表中的多个候选对象进行基于贝叶斯查询扩展的重识别再排序,包括:Further, re-identification and re-ranking based on Bayesian query expansion is performed on the query object and multiple candidate objects in the ranking list, including:

步骤一,利用行人图像数据库训练贝叶斯模型,得到训练后的贝叶斯模型;Step 1, using the pedestrian image database to train the Bayesian model to obtain the trained Bayesian model;

步骤二,根据所述查询对象和多个候选对象图像之间的距离,通过所述训练后的贝叶斯模型预测每个候选对象的真实匹配概率;Step 2, according to the distance between the query object and a plurality of candidate object images, predict the true matching probability of each candidate object through the trained Bayesian model;

步骤三,根据所述每个候选对象的真实匹配概率进行查询扩展,通过所述查询扩展生成新的排名列表。Step 3: Perform query expansion according to the true matching probability of each candidate object, and generate a new ranking list through the query expansion.

BQE使用来自初始图库秩列表的信息生成新查询,用于再次检索图库图像。具体来说,数据集分为查询、图库和培训数据三部分。在离线过程中,首先对训练数据进行贝叶斯后验估计训练。在给定距离度量的情况下,贝叶斯模型可以预测候选对象的真实匹配概率。在线检索时,通过计算查询与图库图像的相似性,可以得到一个初始秩列表。根据排序表,利用贝叶斯模型计算出每个候选对象的真实匹配概率。然后,将具有高概率的初始秩列表中的图像特征与原始查询合并,从而启动一个新的查询来执行另一轮检索。在新一轮检索之后,可以再次利用查询扩展过程,从而得到迭代算法。形式上,每个图像均由一个d维特征向量表示,表示为x∈Rd。令

Figure BDA0002328798720000081
为训练集,
Figure BDA0002328798720000082
为图库集。然后,将训练图像
Figure BDA0002328798720000083
查询图像q以及图库图像
Figure BDA0002328798720000084
的标识分别表示为
Figure BDA0002328798720000085
lq以及
Figure BDA0002328798720000086
Figure BDA0002328798720000087
为查询图像q和图库图像
Figure BDA0002328798720000088
之间的距离。然后,初始秩列表表示为
Figure BDA0002328798720000089
Figure BDA00023287987200000810
其中
Figure BDA00023287987200000811
由此,基于离线训练的贝叶斯模型可以对初始秩列表进行重新排序。BQE uses information from the initial gallery rank list to generate new queries for retrieving gallery images again. Specifically, the dataset is divided into three parts: query, gallery and training data. In the offline process, Bayesian posterior estimation training is first performed on the training data. Given a distance metric, a Bayesian model can predict the true matching probability of a candidate object. During online retrieval, an initial rank list can be obtained by calculating the similarity between the query and gallery images. According to the ranking table, the true matching probability of each candidate object is calculated using the Bayesian model. Then, image features from the initial rank list with high probability are merged with the original query, thereby starting a new query to perform another round of retrieval. After a new round of retrieval, the query expansion process can be used again, resulting in an iterative algorithm. Formally, each image is represented by a d-dimensional feature vector, denoted as x ∈ R d . make
Figure BDA0002328798720000081
is the training set,
Figure BDA0002328798720000082
for the gallery set. Then, the training image
Figure BDA0002328798720000083
Query image q as well as gallery images
Figure BDA0002328798720000084
are identified as
Figure BDA0002328798720000085
l q and
Figure BDA0002328798720000086
Assume
Figure BDA0002328798720000087
For query images q and gallery images
Figure BDA0002328798720000088
the distance between. Then, the initial rank list is expressed as
Figure BDA0002328798720000089
Figure BDA00023287987200000810
in
Figure BDA00023287987200000811
Thus, the initial rank list can be reordered based on the offline-trained Bayesian model.

步骤S108,将再排序后的查询对象和候选对象进行PTGAN处理,实现行人前景不变的前提下实现背景差异区域的迁移。In step S108, PTGAN processing is performed on the reordered query objects and candidate objects, so as to realize the migration of the background difference area under the premise that the pedestrian foreground remains unchanged.

PTGAN(Person Transfer GAN)是一个针对于重识别Re-ID问题的生成对抗网络。本发明中,PTGAN最大的特点就是在尽可能保证行人前景不变的前提下实现背景区域差异的迁移。首先PTGAN网路的损失函数包括两部分:PTGAN (Person Transfer GAN) is a generative adversarial network for re-identification Re-ID problem. In the present invention, the biggest feature of PTGAN is to realize the migration of background area differences on the premise that the pedestrian foreground is kept unchanged as much as possible. First of all, the loss function of the PTGAN network consists of two parts:

Figure BDA00023287987200000812
Figure BDA00023287987200000812

其中LStyle代表生成的风格损失,或者说区域差异domain损失,就是生成的图像是否像新的数据集风格。LID代表生成图像的身份损失,就是验证生成的图像是否和原始图像是同一个人。该处的λ1是平衡两个损失的权重。这两个损失定义如下:Among them, L Style represents the generated style loss, or the regional difference domain loss, which is whether the generated image resembles the new dataset style. L ID stands for the identity loss of the generated image, which is to verify whether the generated image is the same person as the original image. λ1 here is the weight that balances the two losses. The two losses are defined as follows:

首先,本发明所说PTGAN的损失函数(Loss)分为两部分;第一部分是LStyle,其具体公式如下:First, the loss function (Loss) of PTGAN mentioned in the present invention is divided into two parts; the first part is L Style , and its specific formula is as follows:

Figure BDA00023287987200000813
Figure BDA00023287987200000813

其中,

Figure BDA00023287987200000814
代表标准对抗性损失,LCyc代表周期一致性损失,A、B为两帧做GAN处理的图像,令G为图像A到B风格映射功能函数,
Figure BDA00023287987200000815
为B到A的风格映射功能函数,λ2为分割损失和身份损失的权重。in,
Figure BDA00023287987200000814
Represents the standard adversarial loss, L Cyc represents the cycle consistency loss, A and B are the images processed by GAN for two frames, let G be the image A to B style mapping function,
Figure BDA00023287987200000815
is the style mapping function from B to A, and λ2 is the weight of segmentation loss and identity loss.

以上几部分都是正常的PTGAN的损失,目的是为了保证生成的图片和期望的数据集的差异区域(domain)是一样的。The above parts are the losses of normal PTGAN, the purpose is to ensure that the difference area (domain) of the generated image and the expected dataset is the same.

其次,为了保证图片迁移过程中前景不变,先用PSPNet对视频图像进行了一个前景分割,得到一个mask(面具层)区域。通常来说,传统的生成对抗网络如CycleGAN等并不是用于Re-ID任务,因此也不需要保证前景物体的身份信息不变,这样的结果就是前景可能模糊之类的质量很差,更糟糕的现象是行人的外观可能改变。为了解决这个问题,本发明提出LID损失,用PSPNet提取的前景,这个前景就是一个面具层mask,最后身份信息损失为:Secondly, in order to ensure that the foreground remains unchanged during the image transfer process, PSPNet is used to perform a foreground segmentation on the video image to obtain a mask (mask layer) area. Generally speaking, traditional generative adversarial networks such as CycleGAN are not used for Re-ID tasks, so there is no need to ensure that the identity information of foreground objects remains unchanged. The result is that the quality of the foreground may be blurred, and the quality is even worse. The phenomenon is that the appearance of pedestrians may change. In order to solve this problem, the present invention proposes L ID loss, the foreground extracted by PSPNet, this foreground is a mask layer mask, and the final loss of identity information is:

Figure BDA0002328798720000091
Figure BDA0002328798720000091

其中M(a)和M(b)是两个分割出来的前景面具层,身份信息损失函数(Loss)将会约束行人前景在迁移过程中尽可能的保持不变。Among them, M(a) and M(b) are the two segmented foreground mask layers, and the identity information loss function (Loss) will constrain the pedestrian foreground to remain as unchanged as possible during the migration process.

其中,G(a)是图像a中转移的行人图像,

Figure BDA0002328798720000092
是是图像b中转移的行人图像,
Figure BDA0002328798720000093
为A的数据分布,
Figure BDA0002328798720000094
为B的数据分布,M(a)和M(b)是两个分割出来的面具层区域。where G(a) is the transferred pedestrian image in image a,
Figure BDA0002328798720000092
is the transferred pedestrian image in image b,
Figure BDA0002328798720000093
is the data distribution of A,
Figure BDA0002328798720000094
For the data distribution of B, M(a) and M(b) are the two segmented mask layer regions.

图2显示了不同行人重识别方法实时转换效果对比图,其中,第一行图片为待转换的图片,第四行显示了PTGAN转换的结果,可以看出,与使用Cycle-GAN转换结果的第三行图片相比,PTGAN生成的图像质量更高的。例如,人的外观保持不变,风格被有效地转移到另一个摄像头上。自动生成阴影,道路标记和背景,与另一个摄像头拍摄的效果相似。同时,PTGAN可以很好地处理由PSPNet产生的噪声分割结果。可以看出,本发明算法直观上和传统的环形生成对抗网络等(CycleGAN)相比能够更好的保证行人的身份信息。Figure 2 shows a comparison chart of real-time conversion effects of different pedestrian re-identification methods, in which the first row of pictures is the picture to be converted, and the fourth row shows the result of PTGAN conversion. Compared with the three-line pictures, the images generated by PTGAN are of higher quality. For example, the appearance of the person remains the same and the style is effectively transferred to another camera. Shadows, road markings and backgrounds are automatically generated, similar to those captured by another camera. Meanwhile, PTGAN can handle the noisy segmentation results produced by PSPNet well. It can be seen that the algorithm of the present invention can better guarantee the identity information of pedestrians intuitively compared with the traditional ring generative adversarial network (CycleGAN).

步骤S110,将进行PTGAN处理后的查询对象和候选对象输入训练好的贝叶斯模型,通过训练数据中的图像距离,计算每个候选对象真实匹配概率,并对候选对象进行重新排序。Step S110 , input the query object and candidate object after PTGAN processing into the trained Bayesian model, calculate the true matching probability of each candidate object through the image distance in the training data, and reorder the candidate objects.

本质上,贝叶斯模型表现了真实匹配和虚假匹配的匹配分数分布。该模型是在训练集上创建的,并运用于测试期间以估计顶部图像与查询真正匹配的概率。Essentially, a Bayesian model represents the distribution of match scores for true matches and false matches. The model is created on the training set and used during testing to estimate the probability that the top image truly matches the query.

对于从行人重识别系统返回的排名列表中的每个图像x,都有一个由学习度量计算的距离。由于距离查询距离较小的图像将列在顶部,需要知道是否可以使用顶部图像对图像列表重新排序,以提高性能。选择候选图像至关重要,因为错误匹配将对性能产生相反的影响。当给定查询与图像之间的距离P(x|d(x,q))时,由于相同或相异特征的图像通常具有明显不同的距离范围,因此本发明借助距离来区分候选对象。本发明采用贝叶斯模型估计排序表中图像相关性的概率。For each image x in the ranking list returned from the person re-id system, there is a distance computed by the learned metric. Since images with a smaller distance from the query will be listed at the top, you need to know if you can use the top image to reorder the image list to improve performance. The selection of candidate images is critical, as a false match will have an opposite impact on performance. When the distance P(x|d(x,q)) between the query and the image is given, since images with the same or different features usually have significantly different distance ranges, the present invention uses the distance to distinguish candidate objects. The present invention uses a Bayesian model to estimate the probability of image correlation in the ranking table.

具体而言,对于查询q和图库图像x,基于两图像之间的距离,通常计算两个图像属于同一特征的概率,即

Figure BDA0002328798720000095
Specifically, for a query q and a gallery image x, based on the distance between the two images, the probability that the two images belong to the same feature is usually calculated, namely
Figure BDA0002328798720000095

根据贝叶斯定理,概率可以改写如下:According to Bayes' theorem, the probability can be rewritten as follows:

Figure BDA0002328798720000096
Figure BDA0002328798720000096

其中

Figure BDA0002328798720000097
可以通过以下公式计算,in
Figure BDA0002328798720000097
It can be calculated by the following formula,

Figure BDA0002328798720000098
Figure BDA0002328798720000098

本发明利用训练数据来估计概率。可以直接使用

Figure BDA0002328798720000099
Figure BDA00023287987200000910
来计算
Figure BDA00023287987200000911
Figure BDA00023287987200000912
的近似值。为了计算
Figure BDA00023287987200000913
Figure BDA00023287987200000914
需要计算训练数据中每个图像之间的距离,并使用距离范围来替代距离确切值。本发明将距离范围
Figure BDA0002328798720000101
值)划分成M个区间,然后计算每个间隔内的候选数。假设
Figure BDA0002328798720000102
在[0.2,0.3]这个区间内,则
Figure BDA0002328798720000103
可由候选数除以该区间中红柱的频率来计算。
Figure BDA0002328798720000104
的计算方法与之类似。间隔M的数量是根据数据集的大小来选择的。实际操作中,如果测试阶段的距离大于训练阶段的上限(或小于下限),则使用上限(或下限)的结果。The present invention utilizes training data to estimate probabilities. can be used directly
Figure BDA0002328798720000099
and
Figure BDA00023287987200000910
to calculate
Figure BDA00023287987200000911
and
Figure BDA00023287987200000912
approximate value. in order to calculate
Figure BDA00023287987200000913
and
Figure BDA00023287987200000914
The distance between each image in the training data needs to be calculated and the distance range is used instead of the exact distance value. The present invention will range the distance
Figure BDA0002328798720000101
value) is divided into M intervals, and then the number of candidates in each interval is calculated. Assumption
Figure BDA0002328798720000102
In the interval [0.2, 0.3], then
Figure BDA0002328798720000103
It can be calculated by dividing the number of candidates by the frequency of the red bars in the interval.
Figure BDA0002328798720000104
The calculation method is similar. The number of intervals M is chosen according to the size of the dataset. In practice, if the distance in the test phase is greater than the upper limit (or smaller than the lower limit) in the training phase, the upper (or lower) result is used.

对于查询扩展,将引发一个新查询对候选项重新排序。只有k个高概率的候选项被合并为新查询,其中K≤真实匹配的数量且k<<n。K的值根据实际情况调整。特征池化的策略是具有多样性的。For query expansion, a new query is invoked to reorder the candidates. Only k high-probability candidates are merged into a new query, where K≤number of true matches and k<<n. The value of K is adjusted according to the actual situation. Feature pooling strategies are diverse.

查询扩展有两种简单的策略:平均查询扩展(AQE)和最大查询扩展(MQE)。对于这两种方法,分别使用平均池和最大池来融合查询图像和顶部候选对象的特征。对于AQE,扩展查询计算为:There are two simple strategies for query expansion: Average Query Expansion (AQE) and Maximum Query Expansion (MQE). For both methods, average pooling and max pooling are used to fuse the features of the query image and top candidates, respectively. For AQE, the extended query is calculated as:

Figure BDA0002328798720000105
Figure BDA0002328798720000105

这些策略的不足之处在于其有效性在很大程度上依赖于初始排序表的质量和参数k的值。当初始排序表不满足或k值较大时,将使用错误匹配来构造新的查询,这将影响查询的精度。The disadvantage of these strategies is that their effectiveness depends heavily on the quality of the initial sorting table and the value of the parameter k. When the initial sorted table is not satisfied or the value of k is large, false matches will be used to construct a new query, which will affect the precision of the query.

为了克服这一不足,本发明在进行特征池化时为每个候选对象分配了不同的权重。然后,通过将前K个图像和查询q与概率合二为一来计算初始查询q的扩展探针qnew。在这里,本发明简单地使用带权重的平均池,其中权重就是概率。公式如下:To overcome this deficiency, the present invention assigns different weights to each candidate object during feature pooling. Then, an extended probe qnew for the initial query q is computed by unifying the top K images and the query q with the probability. Here, the present invention simply uses weighted average pooling, where the weights are the probabilities. The formula is as follows:

Figure BDA0002328798720000106
Figure BDA0002328798720000106

最后使用这个新查询计算距离并重新排列初始排名列表。然后将结果进行更多的迭代。通常扩展后的查询将产生一个更好的排名列表,从而可以产生一个更好的查询。本发明可以重复地执行生成排名列表、特征池和查询扩展的过程。通过重复BQE,效果将得到加强。将T表示为迭代次数。Finally use this new query to calculate the distance and rearrange the initial ranking list. Then iterate over the result for more iterations. Often the expanded query will produce a better ranked list, which can result in a better query. The present invention can repeatedly perform the process of generating ranked lists, feature pools, and query expansion. By repeating the BQE, the effect will be enhanced. Denote T as the number of iterations.

假设训练集和图库集的大小分别为M和N。贝叶斯模型用复杂度

Figure BDA0002328798720000107
离线计算。对于查询扩展过程,需要计算概率并构造新的查询。生成新查询的时间复杂度是
Figure BDA0002328798720000108
其中K是池化图像的数量。由于参数K小于真匹配数,并且K≤N,所以可以将复杂度限制为
Figure BDA0002328798720000109
然后用复杂度
Figure BDA00023287987200001010
计算信任图像的成对距离。得出结果,对于一个查询,计算复杂度为
Figure BDA00023287987200001011
Suppose the training set and gallery set are of size M and N, respectively. Bayesian Models with Complexity
Figure BDA0002328798720000107
Offline computing. For the query expansion process, probabilities need to be calculated and new queries constructed. The time complexity of generating a new query is
Figure BDA0002328798720000108
where K is the number of pooled images. Since the parameter K is less than the number of true matches, and K≤N, the complexity can be limited to
Figure BDA0002328798720000109
Then use the complexity
Figure BDA00023287987200001010
Calculate the pairwise distance of the trust images. As a result, for a query, the computational complexity is
Figure BDA00023287987200001011

步骤S112,对进行PTGAN处理后的查询对象和进行重新排序后的候选对象进行多维度特征提取并确定推理线索模型。Step S112 , perform multi-dimensional feature extraction on the query object after PTGAN processing and the reordered candidate object, and determine an inference clue model.

本发明使用外观、面部和可能的目的地线索,每个时间戳的特征都是单独提取的,用于跨摄像头的所有检测。The present invention uses appearance, face and possible destination cues, and features for each timestamp are extracted individually for all detections across cameras.

首先从人的检测中提取基于外观的属性,它们以外观的形式捕捉个体的特质和特征。图像表现的共同点是卷积神经网络(CNN)。本发明使用在ImageNet上预先训练过的AlexNet模型作为外观特征的提取器。这是通过移除顶部输出层并使用最后一个完全连接层的激活作为特征(长度4096)来完成的。AlexNet体系结构包括五个卷几层、三个完全连接层和三个紧跟第一、第二和第五卷积层的最大池层。第一卷积层有96个大小为11×11的滤波器,第二层有256个大小为5×5的滤波器,第三、第四和第五层彼此连接而不存在任何干涉池,并且分别具有384/384和256个大小为3×3的滤波器。完全连接层L学习非线性函数

Figure BDA0002328798720000111
其中
Figure BDA0002328798720000112
W和b是输入数据Xi的隐含观测量,分别有各自的权重与偏差,而f是激活隐藏层的校正线性单元。基于上述步骤,对每个时间戳的连续帧视频图像中的行人进行外观特征提取。Appearance-based attributes are first extracted from person detection, which capture individual traits and characteristics in the form of appearance. The common denominator of image representations is Convolutional Neural Networks (CNN). The present invention uses the AlexNet model pre-trained on ImageNet as the appearance feature extractor. This is done by removing the top output layer and using the activations of the last fully connected layer as features (length 4096). The AlexNet architecture consists of five convolutional layers, three fully connected layers, and three max-pooling layers followed by the first, second, and fifth convolutional layers. The first convolutional layer has 96 filters of size 11×11, the second layer has 256 filters of size 5×5, the third, fourth and fifth layers are connected to each other without any interference pooling, and have 384/384 and 256 filters of size 3×3, respectively. Fully connected layer L learns nonlinear functions
Figure BDA0002328798720000111
in
Figure BDA0002328798720000112
W and b are the implicit observations of the input data Xi, with their own weights and biases, respectively, and f is the rectified linear unit that activates the hidden layer. Based on the above steps, appearance feature extraction is performed on pedestrians in consecutive frames of video images at each timestamp.

其次,提取面部特征,人脸生物识别是一种用于身份识别和验证的已建立的生物识别技术。人脸形态可以用于重识别因为它本质上是一种非接触生物特征,且是可以远程提取的。本发明使用ImageNet上预先训练的VGG-16模型从面部边界框中提取面部特征。这是通过移除顶部的输出层并使用最后一个完全连接层的激活作为面部特征(长度4096)来完成的。VGG-16是一种卷积神经网络,其结构由13个卷积层和3个完全连接层组成,过滤器尺寸为3×3。池将运用于具有2×2像素窗口的卷积层之间,步幅为2。训练集的平均减法用作预处理步骤。Second, facial features are extracted, and facial biometrics is an established biometric technology used for identification and verification. Face morphology can be used for re-identification because it is essentially a non-contact biometric and can be extracted remotely. The present invention extracts facial features from facial bounding boxes using the VGG-16 model pre-trained on ImageNet. This is done by removing the top output layer and using the activations of the last fully connected layer as facial features (length 4096). VGG-16 is a convolutional neural network whose structure consists of 13 convolutional layers and 3 fully connected layers with a filter size of 3 × 3. Pooling will be applied between convolutional layers with a 2×2 pixel window with a stride of 2. The mean subtraction of the training set is used as a preprocessing step.

同时,本发明描述了位置约束,它本质上是线性的,并且预测了摄像机内部和穿过摄像机之间的最可能路径。对于多个摄像头中的重识别和跟踪,有关可能目的地的知识被当做某个人出现在另一个摄像头视野中的先验判断。通常,通过学习摄像机网络中出现的重复模式来模拟过渡概率分布。从特定网格空间退出摄像机视图的个人很可能会从另一个特定网格空间进入另一个摄像机视图。本发明将状态转移概率分布建模为Markov链,每个摄像机视图被分为n个状态,假设有k个摄像机,则状态的总数N=n×k。一个Markov链被描述为一个n×n的转移概率矩阵p,每个条目在区间[0,1]内,每一行的条目之和加起来为1。At the same time, the present invention describes position constraints, which are linear in nature and predict the most probable paths within and between cameras. For re-identification and tracking across multiple cameras, knowledge about possible destinations is taken as a priori that a person is present in another camera's field of view. Typically, transition probability distributions are modeled by learning recurring patterns that occur in the camera network. An individual exiting a camera view from a particular grid space is likely to enter another camera view from another particular grid space. The present invention models the state transition probability distribution as a Markov chain, and each camera view is divided into n states. Assuming that there are k cameras, the total number of states is N=n×k. A Markov chain is described as an n×n transition probability matrix p with each entry in the interval [0,1] and the sum of the entries in each row adding up to 1.

Figure BDA0002328798720000113
Figure BDA0002328798720000113

Figure BDA0002328798720000114
Figure BDA0002328798720000114

因此,利用Markov性质,将状态Si和Sj之间转换的概率分布估计为:Therefore, using the Markov property, the probability distribution of transitions between states S i and S j is estimated as:

Figure BDA0002328798720000121
Figure BDA0002328798720000121

进行上述多尺度特征提取后,训练出推理线索模型。After the above-mentioned multi-scale feature extraction, an inference cue model is trained.

步骤S114,使用推理算法对推理线索模型进行调整并确定最后的推理线索模型。Step S114, use an inference algorithm to adjust the inference clue model and determine the final inference clue model.

在每个时间步长中,重识别的问题都可以用关联矩阵来表示,其中每一行表示一个以前看到的实体,列包含当前活动的实体。根据相关实体的特征或属性,将每行与列之间进行最佳关联的任务可以表示为一个线性规划问题,如下所示:At each time step, the problem of re-identification can be represented by an association matrix, where each row represents a previously seen entity and the column contains the currently active entity. The task of optimally correlating each row and column according to the characteristics or properties of the related entities can be formulated as a linear programming problem as follows:

Figure BDA0002328798720000122
Figure BDA0002328798720000122

s.t W∈[0,1],W1=1,1TW=1st W∈[0, 1], W1=1, 1 T W=1

其中p是关联矩阵或概率矩阵,用于存储被关联实体的匹配概率,w是要优化的权重矩阵。图3描述了建议的推理算法是如何在关联矩阵P上工作的。关联矩阵中的匹配概率是使用预训练Alexnet和VGG-16模型分别计算的每个中层属性和面特征的余弦距离,或者是位置分数,即实体之间可能移动模式的转换概率模型。where p is the affinity matrix or probability matrix to store the matching probabilities of the associated entities, and w is the weight matrix to be optimized. Figure 3 depicts how the proposed inference algorithm works on the association matrix P. The matching probability in the association matrix is the cosine distance of each mid-level attribute and face feature computed using pretrained Alexnet and VGG-16 models, respectively, or the location score, a transition probability model of possible movement patterns between entities.

约束w1=1的作用是规范列与列之间的匹配概率,并强制它们对每个先前的实体求和为1。从这个约束的表达式来看,很明显,对于每个先前实体的关联概率集只有一个极大值。这意味着每个以前的实体最多只能与一个当前实体关联。因此,选择权重矩阵w的值本质上减少为最佳关联分配1的值,因此,计算最佳可能关联等价于按顺序选择最大匹配概率的贪婪方法。最后,结合各特征提取的约束条件,确定最后的推理线索模型。The effect of the constraint w1=1 is to normalize the matching probabilities from column to column and force them to sum to 1 for each previous entity. From the expression of this constraint, it is clear that there is only one maximum value for the set of associated probabilities for each previous entity. This means that each previous entity can only be associated with at most one current entity. Therefore, choosing the value of the weight matrix w essentially reduces the assignment of a value of 1 to the best association, so computing the best possible association is equivalent to the greedy method of choosing the largest matching probability in order. Finally, combined with the constraints of each feature extraction, the final inference clue model is determined.

整体目标函数可以表示为:The overall objective function can be expressed as:

Figure BDA0002328798720000123
Figure BDA0002328798720000123

其中Θ表示推理模型中的参数。L1,L2和L3分别表示面部,外观,定位分支中的分类损失。λ1,λ2,λ3表示相应损失的权重。where Θ denotes the parameters in the inference model. L1 , L2 and L3 denote the classification losses in the face, appearance, localization branches, respectively. λ 1 , λ 2 , λ 3 represent the weights of the corresponding losses.

步骤S116,根据推理线索模型调整行人重识别系统模型的目标参数的参数值。Step S116, adjust the parameter value of the target parameter of the pedestrian re-identification system model according to the inference clue model.

步骤S118,通过将待识别图像输入训练好的行人重识别系统模型,搜索出相似度最高的行人图像。Step S118 , by inputting the image to be recognized into the trained pedestrian re-identification system model, search out the pedestrian image with the highest similarity.

在本发明实施例中,通过对行人重识别系统模型进行分布式训练,大大提高了模型训练的速度,同时,通过基于贝叶斯查询扩展的重识别再排序以及PTGAN处理,提高了复杂条件下行人重识别的准确率、提高了系统的鲁棒性。解决了现有技术中的行人重识别方法跨摄像头的检索难度大,重识别准确率低的问题。In the embodiment of the present invention, by performing distributed training on the pedestrian re-identification system model, the speed of model training is greatly improved, and at the same time, through the re-identification and re-ordering based on Bayesian query expansion and PTGAN processing, the downlink of complex conditions is improved. The accuracy of person re-identification improves the robustness of the system. The problem of the pedestrian re-identification method in the prior art that the retrieval across cameras is difficult and the re-identification accuracy rate is low is solved.

请参阅图3,图3是本发明实施例提供的一种大数据和贝叶斯相结合的行人重识别装置的结构框图。如图3所示,本实施例的大数据和贝叶斯相结合的行人重识别20包括排名分布式训练模块202、排名列表获取模块204、重识别模块206、PTGAN处理模块208、训练模块210、推理线索模块212、推理线索调整模块214、模型调整模块216和识别模块218。分布式训练模块202、排名列表获取模块204、重识别模块206、PTGAN处理模块208、训练模块210、推理线索模块212、推理线索调整模块214、模型调整模块216和识别模块218分别用于执行图1中的S102、S104、S106、S108、S110、S112、S114、S116、S118中的具体方法,详情可参见图1的相关介绍,在此仅作简单描述:Please refer to FIG. 3. FIG. 3 is a structural block diagram of a person re-identification device combining big data and Bayesian provided by an embodiment of the present invention. As shown in FIG. 3 , the person re-identification 20 combining big data and Bayesian in this embodiment includes a ranking distributed training module 202 , a ranking list obtaining module 204 , a re-identification module 206 , a PTGAN processing module 208 , and a training module 210 , an inference clue module 212 , an inference clue adjustment module 214 , a model adjustment module 216 and a recognition module 218 . The distributed training module 202, the ranking list acquisition module 204, the re-identification module 206, the PTGAN processing module 208, the training module 210, the inference clue module 212, the inference clue adjustment module 214, the model adjustment module 216, and the identification module 218 are respectively used to execute the graph For the specific methods in S102, S104, S106, S108, S110, S112, S114, S116, and S118 in 1, please refer to the relevant introduction in Figure 1 for details, which are only briefly described here:

分布式训练模块202,用于利用行人图像数据库对行人重识别系统模型进行分布式训练,得到训练之后的行人重识别系统模型,其中,行人图像数据库包括多个匹配图像组,匹配图像组包括至少两个匹配图像;The distributed training module 202 is used to perform distributed training on the pedestrian re-identification system model by using the pedestrian image database to obtain the pedestrian re-identification system model after training, wherein the pedestrian image database includes a plurality of matching image groups, and the matching image group includes at least two matching images;

排名列表获取模块204,用于将查询对象输入行人重识别系统模型,得到多个候选对象的排名列表;The ranking list obtaining module 204 is used to input the query object into the pedestrian re-identification system model to obtain a ranking list of multiple candidate objects;

重识别模块206,用于对查询对象和排名列表中的多个候选对象进行基于贝叶斯查询扩展的重识别再排序;A re-identification module 206, configured to perform re-identification and re-ranking based on Bayesian query expansion on the query object and a plurality of candidate objects in the ranking list;

PTGAN处理模块208,用于将再排序后的查询对象和候选对象进行PTGAN处理,实现行人前景不变的前提下实现背景差异区域的迁移;The PTGAN processing module 208 is used to perform PTGAN processing on the reordered query objects and candidate objects, so as to realize the migration of the background difference area under the premise that the pedestrian foreground remains unchanged;

训练模块210,用于将进行PTGAN处理后的查询对象和候选对象输入训练好的贝叶斯模型,通过训练数据中的图像距离,计算每个候选对象真实匹配概率,并对候选对象进行重新排序;The training module 210 is used to input the query object and candidate object after PTGAN processing into the trained Bayesian model, calculate the true matching probability of each candidate object through the image distance in the training data, and reorder the candidate objects ;

推理线索模块212,用于对进行PTGAN处理后的查询对象和进行重新排序后的候选对象进行多维度特征提取并确定推理线索模型;The inference clue module 212 is used to perform multi-dimensional feature extraction on the query object after PTGAN processing and the reordered candidate object and determine the inference clue model;

推理线索调整模块214,用于使用推理算法对推理线索模型进行调整并确定最后的推理线索模型;an inference clue adjustment module 214, configured to use an inference algorithm to adjust the inference clue model and determine the final inference clue model;

模型调整模块216,用于根据推理线索模型调整行人重识别系统模型的目标参数的参数值;A model adjustment module 216, configured to adjust the parameter value of the target parameter of the pedestrian re-identification system model according to the inference clue model;

识别模块218,用于通过将待识别图像输入训练好的行人重识别系统模型,搜索出相似度最高的行人图像。The identification module 218 is configured to search for a pedestrian image with the highest similarity by inputting the image to be identified into the trained pedestrian re-identification system model.

进一步地,可参见图4,所述分布式训练模块202包括:Further, referring to FIG. 4 , the distributed training module 202 includes:

处理器增加模块2021,用于通过使用多个处理器增大批量大小对所述行人重识别系统模型进行迭代训练;a processor increasing module 2021, configured to iteratively train the pedestrian re-identification system model by increasing the batch size by using multiple processors;

批量算法模块2022,用于根据线性缩放和预热策略算法对所述行人重识别系统模型进行迭代训练;A batch algorithm module 2022, configured to iteratively train the pedestrian re-identification system model according to the linear scaling and warm-up strategy algorithm;

学习率调整模块2023,用于应用适应率缩放(LARS)对所述行人重识别系统模型中的每一层网络使用不同的学习率。The learning rate adjustment module 2023 is used for applying adaptation rate scaling (LARS) to use different learning rates for each layer of the network in the person re-identification system model.

进一步地,可参见图5,所述重识别模块206包括:Further, referring to FIG. 5 , the re-identification module 206 includes:

贝叶斯训练模块2061,用于利用行人图像数据库训练贝叶斯模型,得到训练后的贝叶斯模型;The Bayesian training module 2061 is used to train the Bayesian model by using the pedestrian image database to obtain the trained Bayesian model;

预测模块2062,用于根据所述查询对象和多个候选对象图像之间的距离,通过所述训练后的贝叶斯模型预测每个候选对象的真实匹配概率;A prediction module 2062, configured to predict the true matching probability of each candidate object through the trained Bayesian model according to the distance between the query object and multiple candidate object images;

查询扩展模块2063,用于根据所述每个候选对象的真实匹配概率进行查询扩展,通过所述查询扩展生成新的排名列表。The query expansion module 2063 is configured to perform query expansion according to the true matching probability of each candidate object, and generate a new ranking list through the query expansion.

进一步地,可参见图6,所述推理线索模块212包括:Further, referring to FIG. 6 , the reasoning clue module 212 includes:

外观提取模块2121,用于提取行人的外观特征;Appearance extraction module 2121, used to extract the appearance features of pedestrians;

面部提取模块2122,用于提取行人的面部特征;The face extraction module 2122 is used to extract the facial features of pedestrians;

定位分支模块2123,用于根据行人在不同视频头的时间和定位特征构建定位分支Markov链,根据定位分支Markov链训练推理线索模型。The positioning branch module 2123 is used to construct a positioning branch Markov chain according to the time and positioning features of pedestrians in different video heads, and train an inference clue model according to the positioning branch Markov chain.

在本发明实施例中,通过对行人重识别系统模型进行分布式训练,大大提高了模型训练的速度,同时,通过基于贝叶斯查询扩展的重识别再排序以及PTGAN处理,提高了复杂条件下行人重识别的准确率、提高了系统的鲁棒性。解决了现有技术中的行人重识别方法跨摄像头的检索难度大,重识别准确率低的问题。In the embodiment of the present invention, by performing distributed training on the pedestrian re-identification system model, the speed of model training is greatly improved, and at the same time, through the re-identification and re-ordering based on Bayesian query expansion and PTGAN processing, the downlink of complex conditions is improved. The accuracy of person re-identification improves the robustness of the system. The problem of the pedestrian re-identification method in the prior art that the retrieval across cameras is difficult and the re-identification accuracy rate is low is solved.

图7是本发明一实施例提供的终端设备的示意图。如图7所示,该实施例的终端设备10包括:处理器100、存储器101以及存储在所述存储器101中并可在所述处理器100上运行的计算机程序102,例如进行大数据和贝叶斯相结合的行人重识别的程序。所述处理器100执行所述计算机程序102时实现上述方法实施例中的步骤,例如,图1所示的S102、S104、S106、S108、S110、S112、S114、S116、S118的步骤。或者,所述处理器100执行所述计算机程序102时实现上述各装置实施例中各模块/单元的功能,例如图7所示的分布式训练模块202、排名列表获取模块204、重识别模块206、PTGAN处理模块208、训练模块210、推理线索模块212、推理线索调整模块214、模型调整模块216和识别模块218的功能。FIG. 7 is a schematic diagram of a terminal device provided by an embodiment of the present invention. As shown in FIG. 7 , the terminal device 10 in this embodiment includes: a processor 100, a memory 101, and a computer program 102 stored in the memory 101 and running on the processor 100, for example, to perform big data and A procedure for pedestrian re-identification combined with Yeas. When the processor 100 executes the computer program 102 , the steps in the above method embodiments are implemented, for example, the steps of S102 , S104 , S106 , S108 , S110 , S112 , S114 , S116 , and S118 shown in FIG. 1 . Alternatively, when the processor 100 executes the computer program 102, the functions of the modules/units in the above device embodiments are implemented, such as the distributed training module 202, the ranking list obtaining module 204, and the re-identification module 206 shown in FIG. 7 . , PTGAN processing module 208, training module 210, inference clue module 212, inference clue adjustment module 214, model adjustment module 216 and recognition module 218 functions.

示例性的,所述计算机程序102可以被分割成一个或多个模块/单元,所述一个或者多个模块/单元被存储在所述存储器101中,并由所述处理器100执行,以完成本发明。所述一个或多个模块/单元可以是能够完成特定功能的一系列计算机程序指令段,该指令段用于描述所述计算机程序102在终端设备10中的执行过程。例如,排名分布式训练模块202、排名列表获取模块204、重识别模块206、PTGAN处理模块208、训练模块210、推理线索模块212、推理线索调整模块214、模型调整模块216和识别模块218。(虚拟装置中的模块),各模块具体功能如下:Exemplarily, the computer program 102 may be divided into one or more modules/units, and the one or more modules/units are stored in the memory 101 and executed by the processor 100 to complete the this invention. The one or more modules/units may be a series of computer program instruction segments capable of accomplishing specific functions, and the instruction segments are used to describe the execution process of the computer program 102 in the terminal device 10 . For example, ranking distributed training module 202 , ranking list acquisition module 204 , re-identification module 206 , PTGAN processing module 208 , training module 210 , inference cue module 212 , inference cue adjustment module 214 , model adjustment module 216 , and recognition module 218 . (Module in the virtual device), the specific functions of each module are as follows:

分布式训练模块202,用于利用行人图像数据库对行人重识别系统模型进行分布式训练,得到训练之后的行人重识别系统模型,其中,行人图像数据库包括多个匹配图像组,匹配图像组包括至少两个匹配图像;The distributed training module 202 is used to perform distributed training on the pedestrian re-identification system model by using the pedestrian image database to obtain the pedestrian re-identification system model after training, wherein the pedestrian image database includes a plurality of matching image groups, and the matching image group includes at least two matching images;

排名列表获取模块204,用于将查询对象输入行人重识别系统模型,得到多个候选对象的排名列表;The ranking list obtaining module 204 is used to input the query object into the pedestrian re-identification system model to obtain a ranking list of multiple candidate objects;

重识别模块206,用于对查询对象和排名列表中的多个候选对象进行基于贝叶斯查询扩展的重识别再排序;A re-identification module 206, configured to perform re-identification and re-ranking based on Bayesian query expansion on the query object and a plurality of candidate objects in the ranking list;

PTGAN处理模块208,用于将再排序后的查询对象和候选对象进行PTGAN处理,实现行人前景不变的前提下实现背景差异区域的迁移;The PTGAN processing module 208 is used to perform PTGAN processing on the reordered query objects and candidate objects, so as to realize the migration of the background difference area under the premise that the pedestrian foreground remains unchanged;

训练模块210,用于将进行PTGAN处理后的查询对象和候选对象输入训练好的贝叶斯模型,通过训练数据中的图像距离,计算每个候选对象真实匹配概率,并对候选对象进行重新排序;The training module 210 is used to input the query object and candidate object after PTGAN processing into the trained Bayesian model, calculate the true matching probability of each candidate object through the image distance in the training data, and reorder the candidate objects ;

推理线索模块212,用于对进行PTGAN处理后的查询对象和进行重新排序后的候选对象进行多维度特征提取并确定推理线索模型;The inference clue module 212 is used to perform multi-dimensional feature extraction on the query object after PTGAN processing and the reordered candidate object and determine the inference clue model;

推理线索调整模块214,用于使用推理算法对推理线索模型进行调整并确定最后的推理线索模型;an inference clue adjustment module 214, configured to use an inference algorithm to adjust the inference clue model and determine the final inference clue model;

模型调整模块216,用于根据推理线索模型调整行人重识别系统模型的目标参数的参数值;A model adjustment module 216, configured to adjust the parameter value of the target parameter of the pedestrian re-identification system model according to the inference clue model;

识别模块218,用于通过将待识别图像输入训练好的行人重识别系统模型,搜索出相似度最高的行人图像。The identification module 218 is configured to search for a pedestrian image with the highest similarity by inputting the image to be identified into the trained pedestrian re-identification system model.

所述终端设备10可以是桌上型计算机、笔记本、掌上电脑及云端服务器等计算设备。终端设备10可包括,但不仅限于,处理器100、存储器101。本领域技术人员可以理解,图7仅仅是终端设备10的示例,并不构成对终端设备10的限定,可以包括比图示更多或更少的部件,或者组合某些部件,或者不同的部件,例如所述终端设备还可以包括输入输出设备、网络接入设备、总线等。The terminal device 10 may be a computing device such as a desktop computer, a notebook, a palmtop computer, and a cloud server. The terminal device 10 may include, but is not limited to, a processor 100 and a memory 101 . Those skilled in the art can understand that FIG. 7 is only an example of the terminal device 10, and does not constitute a limitation on the terminal device 10, and may include more or less components than the one shown, or combine some components, or different components For example, the terminal device may further include an input and output device, a network access device, a bus, and the like.

所述处理器100可以是中央处理单元(Central Processing Unit,CPU),还可以是其他通用处理器、数字信号处理器(Digital Signal Processor,DSP)、专用集成电路(Application Specific Integrated Circuit,ASIC)、现成可编程门阵列(Field-Programmable Gate Array,FPGA)或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件等。通用处理器可以是微处理器或者该处理器也可以是任何常规的处理器等。The processor 100 may be a central processing unit (Central Processing Unit, CPU), or other general-purpose processors, a digital signal processor (Digital Signal Processor, DSP), an application specific integrated circuit (Application Specific Integrated Circuit, ASIC), Off-the-shelf programmable gate array (Field-Programmable Gate Array, FPGA) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components, and the like. A general purpose processor may be a microprocessor or the processor may be any conventional processor or the like.

所述存储器101可以是终端设备10的内部存储单元,例如终端设备10的硬盘或内存。所述存储器101也可以是终端设备10的外部存储设备,例如所述终端设备10上配备的插接式硬盘,智能存储卡(Smart Media Card,SMC),安全数字(Secure Digital,SD)卡,闪存卡(Flash Card)等。进一步地,所述存储器101还可以既包括终端设备10的内部存储单元也包括外部存储设备。所述存储器101用于存储所述计算机程序以及终端设备10所需的其他程序和数据。所述存储器101还可以用于暂时地存储已经输出或者将要输出的数据。The memory 101 may be an internal storage unit of the terminal device 10 , such as a hard disk or a memory of the terminal device 10 . The memory 101 may also be an external storage device of the terminal device 10, such as a plug-in hard disk, a smart memory card (Smart Media Card, SMC), a secure digital (Secure Digital, SD) card equipped on the terminal device 10, Flash card (Flash Card) and so on. Further, the memory 101 may also include both an internal storage unit of the terminal device 10 and an external storage device. The memory 101 is used to store the computer program and other programs and data required by the terminal device 10 . The memory 101 may also be used to temporarily store data that has been output or will be output.

所属领域的技术人员可以清楚地了解到,为了描述的方便和简洁,仅以上述各功能单元、模块的划分进行举例说明,实际应用中,可以根据需要而将上述功能分配由不同的功能单元、模块完成,即将所述装置的内部结构划分成不同的功能单元或模块,以完成以上描述的全部或者部分功能。实施例中的各功能单元、模块可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中,上述集成的单元既可以采用硬件的形式实现,也可以采用软件功能单元的形式实现。另外,各功能单元、模块的具体名称也只是为了便于相互区分,并不用于限制本申请的保护范围。上述系统中单元、模块的具体工作过程,可以参考前述方法实施例中的对应过程,在此不再赘述。Those skilled in the art can clearly understand that, for the convenience and simplicity of description, only the division of the above-mentioned functional units and modules is used as an example. Module completion, that is, dividing the internal structure of the device into different functional units or modules to complete all or part of the functions described above. Each functional unit and module in the embodiment may be integrated in one processing unit, or each unit may exist physically alone, or two or more units may be integrated in one unit, and the above-mentioned integrated units may adopt hardware. It can also be realized in the form of software functional units. In addition, the specific names of the functional units and modules are only for the convenience of distinguishing from each other, and are not used to limit the protection scope of the present application. For the specific working processes of the units and modules in the above-mentioned system, reference may be made to the corresponding processes in the foregoing method embodiments, which will not be repeated here.

在上述实施例中,对各个实施例的描述都各有侧重,某个实施例中没有详述或记载的部分,可以参见其它实施例的相关描述。In the foregoing embodiments, the description of each embodiment has its own emphasis. For parts that are not described or described in detail in a certain embodiment, reference may be made to the relevant descriptions of other embodiments.

本领域普通技术人员可以意识到,结合本文中所公开的实施例描述的各示例的单元及算法步骤,能够以电子硬件、或者计算机软件和电子硬件的结合来实现。这些功能究竟以硬件还是软件方式来执行,取决于技术方案的特定应用和设计约束条件。专业技术人员可以对每个特定的应用来使用不同方法来实现所描述的功能,但是这种实现不应认为超出本发明的范围。Those of ordinary skill in the art can realize that the units and algorithm steps of each example described in conjunction with the embodiments disclosed herein can be implemented in electronic hardware, or a combination of computer software and electronic hardware. Whether these functions are performed in hardware or software depends on the specific application and design constraints of the technical solution. Skilled artisans may implement the described functionality using different methods for each particular application, but such implementations should not be considered beyond the scope of the present invention.

在本发明所提供的实施例中,应该理解到,所揭露的装置/终端设备和方法,可以通过其它的方式实现。例如,以上所描述的装置/终端设备实施例仅仅是示意性的,例如,所述模块或单元的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式,例如多个单元或组件可以结合或者可以集成到另一个系统,或一些特征可以忽略,或不执行。另一点,所显示或讨论的相互之间的耦合或直接耦合或通讯连接可以是通过一些接口,装置或单元的间接耦合或通讯连接,可以是电性,机械或其它的形式。In the embodiments provided by the present invention, it should be understood that the disclosed apparatus/terminal device and method may be implemented in other manners. For example, the apparatus/terminal device embodiments described above are only illustrative. For example, the division of the modules or units is only a logical function division. In actual implementation, there may be other division methods, such as multiple units. Or components may be combined or may be integrated into another system, or some features may be omitted, or not implemented. On the other hand, the shown or discussed mutual coupling or direct coupling or communication connection may be through some interfaces, indirect coupling or communication connection of devices or units, and may be in electrical, mechanical or other forms.

所述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部单元来实现本实施例方案的目的。The units described as separate components may or may not be physically separated, and components displayed as units may or may not be physical units, that is, may be located in one place, or may be distributed to multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution in this embodiment.

另外,在本发明各个实施例中的各功能单元可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中。上述集成的单元既可以采用硬件的形式实现,也可以采用软件功能单元的形式实现。In addition, each functional unit in each embodiment of the present invention may be integrated into one processing unit, or each unit may exist physically alone, or two or more units may be integrated into one unit. The above-mentioned integrated units may be implemented in the form of hardware, or may be implemented in the form of software functional units.

所述集成的模块/单元如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读取存储介质中。基于这样的理解,本发明实现上述实施例方法中的全部或部分流程,也可以通过计算机程序来指令相关的硬件来完成,所述的计算机程序可存储于一计算机可读存储介质中,该计算机程序在被处理器执行时,可实现上述各个方法实施例的步骤。其中,所述计算机程序包括计算机程序代码,所述计算机程序代码可以为源代码形式、对象代码形式、可执行文件或某些中间形式等。所述计算机可读介质可以包括:能够携带所述计算机程序代码的任何实体或装置、记录介质、U盘、移动硬盘、磁碟、光盘、计算机存储器、只读存储器(ROM,Read-Only Memory)、随机存取存储器(RAM,Random Access Memory)、电载波信号、电信信号以及软件分发介质等。需要说明的是,所述计算机可读介质包含的内容可以根据司法管辖区内立法和专利实践的要求进行适当的增减,例如在某些司法管辖区,根据立法和专利实践,计算机可读介质不包括电载波信号和电信信号。The integrated modules/units, if implemented in the form of software functional units and sold or used as independent products, may be stored in a computer-readable storage medium. Based on this understanding, the present invention can implement all or part of the processes in the methods of the above embodiments, and can also be completed by instructing relevant hardware through a computer program, and the computer program can be stored in a computer-readable storage medium. When the program is executed by the processor, the steps of the foregoing method embodiments can be implemented. Wherein, the computer program includes computer program code, and the computer program code may be in the form of source code, object code, executable file or some intermediate form, and the like. The computer-readable medium may include: any entity or device capable of carrying the computer program code, a recording medium, a U disk, a removable hard disk, a magnetic disk, an optical disk, a computer memory, a read-only memory (ROM, Read-Only Memory) , Random Access Memory (RAM, Random Access Memory), electric carrier signal, telecommunication signal and software distribution medium, etc. It should be noted that the content contained in the computer-readable media may be appropriately increased or decreased according to the requirements of legislation and patent practice in the jurisdiction, for example, in some jurisdictions, according to legislation and patent practice, the computer-readable media Electric carrier signals and telecommunication signals are not included.

以上所述实施例仅用以说明本发明的技术方案,而非对其限制;尽管参照前述实施例对本发明进行了详细的说明,本领域的普通技术人员应当理解:其依然可以对前述各实施例所记载的技术方案进行修改,或者对其中部分技术特征进行等同替换;而这些修改或者替换,并不使相应技术方案的本质脱离本发明各实施例技术方案的精神和范围,均应包含在本发明的保护范围之内。The above-mentioned embodiments are only used to illustrate the technical solutions of the present invention, but not to limit them; although the present invention has been described in detail with reference to the foregoing embodiments, those of ordinary skill in the art should understand that: it is still possible to implement the foregoing implementations. The technical solutions described in the examples are modified, or some technical features thereof are equivalently replaced; and these modifications or replacements do not make the essence of the corresponding technical solutions deviate from the spirit and scope of the technical solutions of the embodiments of the present invention, and should be included in the within the protection scope of the present invention.

Claims (10)

1.一种大数据和贝叶斯相结合的行人重识别方法,其特征在于,包括:1. a pedestrian re-identification method combining big data and Bayesian, is characterized in that, comprises: 利用行人图像数据库对行人重识别系统模型进行分布式训练,得到训练之后的所述行人重识别系统模型,其中,所述行人图像数据库包括多个匹配图像组,所述匹配图像组包括至少两个匹配图像;Distributed training is performed on the pedestrian re-identification system model by using a pedestrian image database, and the trained pedestrian re-identification system model is obtained, wherein the pedestrian image database includes a plurality of matching image groups, and the matching image group includes at least two match image; 将查询对象输入所述行人重识别系统模型,得到多个候选对象的排名列表;Input the query object into the pedestrian re-identification system model to obtain a ranking list of multiple candidate objects; 对所述查询对象和排名列表中的多个候选对象进行基于贝叶斯查询扩展的重识别再排序;Re-identification and re-ranking based on Bayesian query expansion is performed on the query object and the plurality of candidate objects in the ranking list; 将再排序后的查询对象和候选对象进行PTGAN处理,实现行人前景不变的前提下实现背景差异区域的迁移;The reordered query objects and candidate objects are processed by PTGAN to realize the migration of the background difference area under the premise of the pedestrian foreground unchanged; 将进行PTGAN处理后的查询对象和候选对象输入训练好的贝叶斯模型,通过训练数据中的图像距离,计算每个候选对象真实匹配概率,并对所述候选对象进行重新排序;Input the query object and candidate object processed by PTGAN into the trained Bayesian model, calculate the true matching probability of each candidate object through the image distance in the training data, and reorder the candidate objects; 对进行PTGAN处理后的查询对象和进行重新排序后的候选对象进行多维度特征提取并确定推理线索模型;Multi-dimensional feature extraction is performed on the query object after PTGAN processing and the candidate object after reordering, and the inference clue model is determined; 使用推理算法对所述推理线索模型进行调整并确定最后的推理线索模型;using an inference algorithm to adjust the inference cue model and determine a final inference cue model; 根据所述推理线索模型调整所述行人重识别系统模型的目标参数的参数值;Adjust the parameter value of the target parameter of the pedestrian re-identification system model according to the inference clue model; 通过将待识别图像输入训练好的行人重识别系统模型,搜索出相似度最高的行人图像。By inputting the image to be recognized into the trained pedestrian re-identification system model, the pedestrian image with the highest similarity is searched. 2.根据权利要求1所述的大数据和贝叶斯相结合的行人重识别方法,其特征在于,利用行人图像数据库对行人重识别系统模型进行分布式训练,得到训练之后的所述行人重识别系统模型,包括:2. the pedestrian re-identification method combining big data and Bayesian according to claim 1, is characterized in that, utilizes pedestrian image database to carry out distributed training to pedestrian re-identification system model, obtains described pedestrian re-identification after training. Identify system models, including: 通过使用多个处理器增大批量大小对所述行人重识别系统模型进行迭代训练;iteratively train the person re-identification system model by increasing the batch size using multiple processors; 根据线性缩放和预热策略算法对所述行人重识别系统模型进行迭代训练;Iteratively train the pedestrian re-identification system model according to the linear scaling and warm-up strategy algorithm; 应用适应率缩放(LARS)对所述行人重识别系统模型中的每一层网络使用不同的学习率。Applying adaptation rate scaling (LARS) uses different learning rates for each layer of the network in the person re-id system model. 3.根据权利要求1所述的大数据和贝叶斯相结合的行人重识别方法,其特征在于,对所述查询对象和排名列表中的多个候选对象进行基于贝叶斯查询扩展的重识别再排序,包括:3. The pedestrian re-identification method combining big data and Bayesian according to claim 1, is characterized in that, carrying out the re-identification based on Bayesian query expansion on the query object and a plurality of candidate objects in the ranking list. Identify reordering, including: 利用行人图像数据库训练贝叶斯模型,得到训练后的贝叶斯模型;Use the pedestrian image database to train the Bayesian model to obtain the trained Bayesian model; 根据所述查询对象和多个候选对象图像之间的距离,通过所述训练后的贝叶斯模型预测每个候选对象的真实匹配概率;According to the distance between the query object and a plurality of candidate object images, the true matching probability of each candidate object is predicted by the trained Bayesian model; 根据所述每个候选对象的真实匹配概率进行查询扩展,通过所述查询扩展生成新的排名列表。Query expansion is performed according to the true matching probability of each candidate object, and a new ranking list is generated through the query expansion. 4.根据权利要求3所述的大数据和贝叶斯相结合的行人重识别方法,其特征在于,所述对进行PTGAN处理后的查询对象和进行重新排序后的候选对象进行多维度特征提取并确定推理线索模型,包括:4. the pedestrian re-identification method combining big data and Bayesian according to claim 3, it is characterized in that, the described query object after carrying out PTGAN processing and the candidate object after carrying out reordering are carried out multi-dimensional feature extraction And identify inference cue models, including: 提取行人的外观特征;Extract the appearance features of pedestrians; 提取行人的面部特征;Extract the facial features of pedestrians; 根据行人在不同视频头的时间和定位特征构建定位分支Markov链,根据定位分支Markov链训练推理线索模型。The positioning branch Markov chain is constructed according to the time and positioning features of pedestrians in different video heads, and the inference cue model is trained according to the positioning branch Markov chain. 5.一种大数据和贝叶斯相结合的行人重识别装置,其特征在于,包括:5. A pedestrian re-identification device combining big data and Bayesian features, comprising: 分布式训练模块,用于利用行人图像数据库对行人重识别系统模型进行分布式训练,得到训练之后的所述行人重识别系统模型,其中,所述行人图像数据库包括多个匹配图像组,所述匹配图像组包括至少两个匹配图像;The distributed training module is used to perform distributed training on the pedestrian re-identification system model by using a pedestrian image database to obtain the pedestrian re-identification system model after training, wherein the pedestrian image database includes a plurality of matching image groups, and the The matched image group includes at least two matched images; 排名列表获取模块,用于将查询对象输入所述行人重识别系统模型,得到多个候选对象的排名列表;a ranking list acquisition module, used for inputting the query object into the pedestrian re-identification system model to obtain a ranking list of multiple candidate objects; 重识别模块,用于对所述查询对象和排名列表中的多个候选对象进行基于贝叶斯查询扩展的重识别再排序;A re-identification module for performing re-identification and re-ordering based on Bayesian query expansion on the query object and multiple candidate objects in the ranking list; PTGAN处理模块,用于将再排序后的查询对象和候选对象进行PTGAN处理,实现行人前景不变的前提下实现背景差异区域的迁移;The PTGAN processing module is used to perform PTGAN processing on the reordered query objects and candidate objects, so as to realize the migration of the background difference area under the premise that the pedestrian foreground remains unchanged; 训练模块,用于将进行PTGAN处理后的查询对象和候选对象输入训练好的贝叶斯模型,通过训练数据中的图像距离,计算每个候选对象真实匹配概率,并对所述候选对象进行重新排序;The training module is used to input the query object and candidate object processed by PTGAN into the trained Bayesian model, calculate the true matching probability of each candidate object through the image distance in the training data, and re-engineer the candidate object. sort; 推理线索模块,用于对进行PTGAN处理后的查询对象和进行重新排序后的候选对象进行多维度特征提取并确定推理线索模型;The inference clue module is used to extract multi-dimensional features for the query object processed by PTGAN and the candidate object after reordering and determine the inference clue model; 推理线索调整模块,用于使用推理算法对所述推理线索模型进行调整并确定最后的推理线索模型;an inference clue adjustment module, used for using an inference algorithm to adjust the inference clue model and to determine the final inference clue model; 模型调整模块,用于根据所述推理线索模型调整所述行人重识别系统模型的目标参数的参数值;a model adjustment module, configured to adjust the parameter value of the target parameter of the pedestrian re-identification system model according to the inference clue model; 识别模块,用于通过将待识别图像输入训练好的行人重识别系统模型,搜索出相似度最高的行人图像。The recognition module is used to search for the pedestrian image with the highest similarity by inputting the image to be recognized into the trained pedestrian re-identification system model. 6.根据权利要求5所述的大数据和贝叶斯相结合的行人重识别装置,其特征在于,所述分布式训练模块包括:6. The pedestrian re-identification device combined with big data and Bayesian according to claim 5, is characterized in that, described distributed training module comprises: 处理器增加模块,用于通过使用多个处理器增大批量大小对所述行人重识别系统模型进行迭代训练;a processor augmentation module for iteratively training the person re-identification system model by using a plurality of processors to increase the batch size; 批量算法模块,用于根据线性缩放和预热策略算法对所述行人重识别系统模型进行迭代训练;The batch algorithm module is used for iteratively training the pedestrian re-identification system model according to the linear scaling and warm-up strategy algorithm; 学习率调整模块,用于应用适应率缩放(LARS)对所述行人重识别系统模型中的每一层网络使用不同的学习率。A learning rate adjustment module for applying adaptation rate scaling (LARS) to use a different learning rate for each layer of the network in the person re-identification system model. 7.根据权利要求5所述的大数据和贝叶斯相结合的行人重识别装置,其特征在于,所述重识别模块包括:7. The pedestrian re-identification device combining big data and Bayesian according to claim 5, wherein the re-identification module comprises: 贝叶斯训练模块,用于利用行人图像数据库训练贝叶斯模型,得到训练后的贝叶斯模型;The Bayesian training module is used to train the Bayesian model using the pedestrian image database to obtain the trained Bayesian model; 预测模块,用于根据所述查询对象和多个候选对象图像之间的距离,通过所述训练后的贝叶斯模型预测每个候选对象的真实匹配概率;a prediction module, configured to predict the true matching probability of each candidate object through the trained Bayesian model according to the distance between the query object and a plurality of candidate object images; 查询扩展模块,用于根据所述每个候选对象的真实匹配概率进行查询扩展,通过所述查询扩展生成新的排名列表。A query expansion module, configured to perform query expansion according to the true matching probability of each candidate object, and generate a new ranking list through the query expansion. 8.根据权利要求5所述的大数据和贝叶斯相结合的行人重识别装置,其特征在于,所述推理线索模块包括:8. The pedestrian re-identification device combining big data and Bayesian according to claim 5, wherein the inference clue module comprises: 外观提取模块,用于提取行人的外观特征;Appearance extraction module, used to extract the appearance features of pedestrians; 面部提取模块,用于提取行人的面部特征;The face extraction module is used to extract the facial features of pedestrians; 定位分支模块,用于根据行人在不同视频头的时间和定位特征构建定位分支Markov链,根据定位分支Markov链训练推理线索模型。The positioning branch module is used to construct the positioning branch Markov chain according to the time and positioning features of pedestrians in different video heads, and train the inference clue model according to the positioning branch Markov chain. 9.一种终端设备,包括存储器、处理器以及存储在所述存储器中并可在所述处理器上运行的计算机程序,其特征在于,所述处理器执行所述计算机程序时实现如权利要求1-4中任一项所述方法的步骤。9. A terminal device, comprising a memory, a processor and a computer program stored in the memory and running on the processor, wherein the processor implements the computer program as claimed in the claims when executing the computer program The steps of any one of 1-4. 10.一种计算机可读介质,所述计算机可读介质存储有计算机程序,其特征在于,所述计算机程序被处理执行时实现如权利要求1-4中任一项所述方法的步骤。10. A computer-readable medium storing a computer program, characterized in that, when the computer program is processed and executed, the steps of the method according to any one of claims 1-4 are implemented.
CN201911327696.8A 2019-12-20 2019-12-20 A pedestrian re-identification method and device combining big data and Bayesian Pending CN111126249A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201911327696.8A CN111126249A (en) 2019-12-20 2019-12-20 A pedestrian re-identification method and device combining big data and Bayesian

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201911327696.8A CN111126249A (en) 2019-12-20 2019-12-20 A pedestrian re-identification method and device combining big data and Bayesian

Publications (1)

Publication Number Publication Date
CN111126249A true CN111126249A (en) 2020-05-08

Family

ID=70500734

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201911327696.8A Pending CN111126249A (en) 2019-12-20 2019-12-20 A pedestrian re-identification method and device combining big data and Bayesian

Country Status (1)

Country Link
CN (1) CN111126249A (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111860230A (en) * 2020-07-03 2020-10-30 北京航空航天大学 An automatic detection system and method based on the behavior of video surveillance personnel not wearing masks
CN112613474A (en) * 2020-12-30 2021-04-06 珠海大横琴科技发展有限公司 Pedestrian re-identification method and device
CN114283471A (en) * 2021-12-16 2022-04-05 武汉大学 A Multimodal Ranking Optimization Method for Re-identification of Heterogeneous Face Images
CN114612985A (en) * 2022-03-17 2022-06-10 中国科学技术大学 Portrait query method, system, device and storage medium
CN115858846A (en) * 2023-02-16 2023-03-28 云南派动科技有限公司 Deep learning-based skier image retrieval method and system
CN117808790A (en) * 2024-01-04 2024-04-02 东南大学 PSPNet and Bayesian optimization-based comprehensive pipe gallery disease identification method, system and equipment

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108256439A (en) * 2017-12-26 2018-07-06 北京大学 A kind of pedestrian image generation method and system based on cycle production confrontation network
CN110457992A (en) * 2019-06-25 2019-11-15 上海高重信息科技有限公司 Pedestrian based on Bayes's optimisation technique recognition methods, device and system again
CN110555428A (en) * 2019-09-12 2019-12-10 腾讯科技(深圳)有限公司 Pedestrian re-identification method, device, server and storage medium

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108256439A (en) * 2017-12-26 2018-07-06 北京大学 A kind of pedestrian image generation method and system based on cycle production confrontation network
CN110457992A (en) * 2019-06-25 2019-11-15 上海高重信息科技有限公司 Pedestrian based on Bayes's optimisation technique recognition methods, device and system again
CN110555428A (en) * 2019-09-12 2019-12-10 腾讯科技(深圳)有限公司 Pedestrian re-identification method, device, server and storage medium

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
YUTIAN LIN 等: "Bayesian query expansion for multi-camera person re-identification" *
郑娇: "基于时空转移模型的监控目标交接算法研究" *

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111860230A (en) * 2020-07-03 2020-10-30 北京航空航天大学 An automatic detection system and method based on the behavior of video surveillance personnel not wearing masks
CN111860230B (en) * 2020-07-03 2022-12-06 北京航空航天大学 Automatic detection system and method based on behavior of video monitoring personnel not wearing mask
CN112613474A (en) * 2020-12-30 2021-04-06 珠海大横琴科技发展有限公司 Pedestrian re-identification method and device
CN114283471A (en) * 2021-12-16 2022-04-05 武汉大学 A Multimodal Ranking Optimization Method for Re-identification of Heterogeneous Face Images
CN114283471B (en) * 2021-12-16 2024-04-02 武汉大学 Multi-mode ordering optimization method for heterogeneous face image re-recognition
CN114612985A (en) * 2022-03-17 2022-06-10 中国科学技术大学 Portrait query method, system, device and storage medium
CN114612985B (en) * 2022-03-17 2024-04-02 中国科学技术大学 A portrait query method, system, equipment and storage medium
CN115858846A (en) * 2023-02-16 2023-03-28 云南派动科技有限公司 Deep learning-based skier image retrieval method and system
CN115858846B (en) * 2023-02-16 2023-04-21 云南派动科技有限公司 Skier image retrieval method and system based on deep learning
CN117808790A (en) * 2024-01-04 2024-04-02 东南大学 PSPNet and Bayesian optimization-based comprehensive pipe gallery disease identification method, system and equipment
CN117808790B (en) * 2024-01-04 2025-02-28 东南大学 Comprehensive pipeline corridor disease identification method, system and equipment based on PSPNet and Bayesian optimization

Similar Documents

Publication Publication Date Title
CN111709311B (en) Pedestrian re-identification method based on multi-scale convolution feature fusion
Singh et al. A deeply coupled ConvNet for human activity recognition using dynamic and RGB images
CN111797893B (en) Neural network training method, image classification system and related equipment
CN110298404B (en) A Target Tracking Method Based on Triple Siamese Hash Network Learning
CN111126249A (en) A pedestrian re-identification method and device combining big data and Bayesian
CN113361549B (en) A model updating method and related device
WO2021022521A1 (en) Method for processing data, and method and device for training neural network model
US20180260415A1 (en) Instance-level image retrieval with a region proposal network
CN110163117B (en) Pedestrian re-identification method based on self-excitation discriminant feature learning
Xia et al. Loop closure detection for visual SLAM using PCANet features
CN109871821B (en) Pedestrian re-identification method, device, device and storage medium for adaptive network
CN108229347A (en) Method and apparatus for deep permutation of quasi-Gibbs structure sampling for human recognition
CN114358205B (en) Model training method, model training device, terminal device and storage medium
CN110222718B (en) Image processing methods and devices
CN110751027B (en) Pedestrian re-identification method based on deep multi-instance learning
CN111104911A (en) Pedestrian re-identification method and device based on big data training
CN114462479B (en) Model training methods, retrieval methods, models, devices, and media
CN117315770A (en) Human behavior recognition method, device and storage medium based on skeleton points
Kowdiki et al. Adaptive hough transform with optimized deep learning followed by dynamic time warping for hand gesture recognition
Fan Research and realization of video target detection system based on deep learning
Huang et al. Extreme learning machine with multi-scale local receptive fields for texture classification
Wang et al. A novel multiface recognition method with short training time and lightweight based on ABASNet and H-softmax
CN111695531A (en) Cross-domain pedestrian re-identification method based on heterogeneous convolutional network
KR20120080629A (en) Method of computing global-to-local metrics for recognition
CN111291785B (en) Target detection method, device, equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20200508

RJ01 Rejection of invention patent application after publication