[go: up one dir, main page]

CN117036418A - Image processing method, device and equipment - Google Patents

Image processing method, device and equipment Download PDF

Info

Publication number
CN117036418A
CN117036418A CN202210464887.4A CN202210464887A CN117036418A CN 117036418 A CN117036418 A CN 117036418A CN 202210464887 A CN202210464887 A CN 202210464887A CN 117036418 A CN117036418 A CN 117036418A
Authority
CN
China
Prior art keywords
target
identified
images
frame
image
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202210464887.4A
Other languages
Chinese (zh)
Inventor
李波
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangzhou Shiyuan Electronics Thecnology Co Ltd
Guangzhou Shirui Electronics Co Ltd
Original Assignee
Guangzhou Shiyuan Electronics Thecnology Co Ltd
Guangzhou Shirui Electronics Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangzhou Shiyuan Electronics Thecnology Co Ltd, Guangzhou Shirui Electronics Co Ltd filed Critical Guangzhou Shiyuan Electronics Thecnology Co Ltd
Priority to CN202210464887.4A priority Critical patent/CN117036418A/en
Priority to PCT/CN2023/091396 priority patent/WO2023208154A1/en
Publication of CN117036418A publication Critical patent/CN117036418A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/10Services
    • G06Q50/20Education
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/10Services
    • G06Q50/20Education
    • G06Q50/205Education administration or guidance
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/20Analysis of motion
    • G06T7/254Analysis of motion involving subtraction of images
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/764Arrangements for image or video recognition or understanding using pattern recognition or machine learning using classification, e.g. of video objects
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/16Human faces, e.g. facial parts, sketches or expressions
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/30Subject of image; Context of image processing
    • G06T2207/30196Human being; Person
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/30Subject of image; Context of image processing
    • G06T2207/30196Human being; Person
    • G06T2207/30201Face

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Tourism & Hospitality (AREA)
  • Strategic Management (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Educational Technology (AREA)
  • Educational Administration (AREA)
  • Human Resources & Organizations (AREA)
  • Primary Health Care (AREA)
  • General Business, Economics & Management (AREA)
  • Marketing (AREA)
  • Human Computer Interaction (AREA)
  • Economics (AREA)
  • Artificial Intelligence (AREA)
  • Computing Systems (AREA)
  • Databases & Information Systems (AREA)
  • Evolutionary Computation (AREA)
  • Medical Informatics (AREA)
  • Oral & Maxillofacial Surgery (AREA)
  • Software Systems (AREA)
  • Image Analysis (AREA)
  • Television Signal Processing For Recording (AREA)

Abstract

The application provides an image processing method, an image processing device and image processing equipment, and relates to an image processing technology, wherein the method comprises the following steps: identifying the multi-frame images to obtain a parameter list of each image; the parameter list comprises parameter information of an object to be identified in the image. For two adjacent frame images in the multi-frame images, determining a target to be identified in the next frame image in the two adjacent frame images according to a parameter list of the previous frame image in the two adjacent frame images and a parameter list of the next frame image in the two adjacent frame images; the target to be identified in the next frame of images in the two adjacent frames is the target to be identified corresponding to the target to be identified in the previous frame of images in the two adjacent frames. Generating an identification code corresponding to the target to be identified in the last frame of image in the multi-frame images, and generating target information corresponding to the identification code. The application solves the technical problem that the behavior information corresponding to the identification code cannot be acquired due to the large tracking difficulty of the identification code of the user.

Description

图像处理方法、装置及设备Image processing methods, devices and equipment

技术领域Technical field

本申请涉及图像处理技术,尤其涉及一种图像处理方法、装置及设备。The present application relates to image processing technology, and in particular, to an image processing method, device and equipment.

背景技术Background technique

目前,为了了解学生上课时的课堂行为,需要对每个学生进行跟踪识别。Currently, in order to understand students' classroom behavior during class, each student needs to be tracked and identified.

现有技术中,对每个学生进行跟踪识别时,是通过目标检测、动作识别技术识别学生的课堂行为,并且根据目标跟踪算法计算出整节课中每个学生产生的动作次数。In the existing technology, when tracking and identifying each student, the student's classroom behavior is identified through target detection and action recognition technology, and the number of actions produced by each student in the entire class is calculated based on the target tracking algorithm.

然而现有技术中,由于根据目标跟踪算法计算出整节课中每个学生产生的动作次数时,在课堂中学生会有干扰行为,例如干扰行为包括低头、学生之间遮挡等,导致无法准确识别出学生的识别码,进而导致无法获取学生的课堂行为。However, in the existing technology, when the number of actions produced by each student in the entire class is calculated based on the target tracking algorithm, students will have disturbing behaviors in the classroom, such as lowering their heads, blocking between students, etc., resulting in the inability to accurately identify The student's identification code is obtained, resulting in the inability to obtain the student's classroom behavior.

发明内容Contents of the invention

本申请提供一种图像处理方法、装置及设备,用以解决用户的识别码追踪难度较大导致无法获取识别码对应的行为信息的技术问题。This application provides an image processing method, device and equipment to solve the technical problem of being unable to obtain behavioral information corresponding to the identification code due to difficulty in tracking the user's identification code.

第一方面,本申请提供一种图像处理方法,包括:In the first aspect, this application provides an image processing method, including:

对多帧图像进行识别处理,得到每一所述图像的参数列表;其中,所述参数列表中包括图像中的待识别目标的参数信息;Perform recognition processing on multiple frame images to obtain a parameter list for each image; wherein the parameter list includes parameter information of the target to be recognized in the image;

针对所述多帧图像中的相邻两帧图像,根据所述相邻两帧图像中前一帧图像的参数列表、以及所述相邻两帧图像中后一帧图像的参数列表,确定所述相邻两帧图像中后一帧图像中的待识别目标;其中,所述相邻两帧图像中后一帧图像中的待识别目标,为与所述相邻两帧图像中前一帧图像中的待识别目标对应的待识别目标;For two adjacent frames of images in the multi-frame images, the parameter list of the previous frame of the image in the two adjacent frames of images and the parameter list of the subsequent frame of the image in the two adjacent frames of images are determined. The target to be recognized in the latter frame of the two adjacent images is the same as the target in the previous frame of the two adjacent images. The target to be recognized corresponding to the target to be recognized in the image;

生成所述多帧图像中最后一帧图像中的待识别目标对应的识别码,并生成与所述识别码对应的目标信息;其中,所述识别码表征待识别目标的身份标识,所述目标信息表征待识别目标发出预设动作的次数。Generate an identification code corresponding to the target to be identified in the last frame of the multi-frame image, and generate target information corresponding to the identification code; wherein the identification code represents the identity of the target to be identified, and the target The information represents the number of times the target to be identified has issued a preset action.

进一步地,针对所述多帧图像中的相邻两帧图像,根据所述相邻两帧图像中前一帧图像的参数列表、以及所述相邻两帧图像中后一帧图像的参数列表,确定所述相邻两帧图像中后一帧图像中的待识别目标,包括:Further, for two adjacent frames of images in the multi-frame images, according to the parameter list of the previous frame of the image in the two adjacent frames of images, and the parameter list of the subsequent frame of the image in the two adjacent frames of images. , determining the target to be recognized in the latter frame of the two adjacent images, including:

根据多帧图像中第一帧图像的参数列表,确定第一帧图像中的待识别目标对应的高斯分布;其中,所述高斯分布表征待识别目标的位置信息;According to the parameter list of the first frame image in the multi-frame image, determine the Gaussian distribution corresponding to the target to be recognized in the first frame image; wherein the Gaussian distribution represents the position information of the target to be recognized;

针对所述多帧图像中的相邻两帧图像,将所述相邻两帧图像中后一帧图像的参数列表,与所述相邻两帧图像中前一帧图像中的待识别目标对应的高斯分布进行概率计算处理,得到相邻两帧图像中后一帧图像中的待识别目标对应的概率结果信息;其中,所述概率结果信息用于指示相邻两帧图像中后一帧图像中的待识别目标对应的高斯分布;For two adjacent frames in the multi-frame images, the parameter list of the latter image in the two adjacent frames corresponds to the target to be recognized in the previous frame of the two adjacent images. The Gaussian distribution is used for probability calculation processing to obtain the probability result information corresponding to the target to be identified in the latter frame of the two adjacent images; wherein the probability result information is used to indicate the latter frame of the two adjacent images. Gaussian distribution corresponding to the target to be identified in ;

根据所述概率结果信息,确定所述相邻两帧图像中后一帧图像中的待识别目标。According to the probability result information, the target to be recognized in the latter frame of the two adjacent frames of images is determined.

进一步地,根据所述概率结果信息,确定所述相邻两帧图像中后一帧图像中的待识别目标,包括:Further, based on the probability result information, determining the target to be recognized in the latter frame of the two adjacent frames of images includes:

若确定所述概率结果信息为得到似然概率值,其中,所述似然概率值表征相邻两帧图像中后一帧图像中待识别目标的参数信息,分别与相邻两帧图像中前一帧图像中的多个高斯分布之间的似然概率值,则确定待识别目标对应的最大似然概率值;其中,所述最大似然概率值用于指示相邻两帧图像中后一帧图像中的待识别目标对应的高斯分布;If it is determined that the probability result information is to obtain a likelihood probability value, wherein the likelihood probability value represents the parameter information of the target to be identified in the latter frame of two adjacent frames of images, respectively, and the likelihood probability value of the previous two adjacent frames of images. The likelihood probability value between multiple Gaussian distributions in one frame of image is used to determine the maximum likelihood probability value corresponding to the target to be identified; wherein, the maximum likelihood probability value is used to indicate the latter of two adjacent frames of images. Gaussian distribution corresponding to the target to be recognized in the frame image;

根据待识别目标对应的最大似然概率值,确定相邻两帧图像中后一帧图像中的待识别目标。According to the maximum likelihood probability value corresponding to the target to be recognized, the target to be recognized in the latter frame of the two adjacent frames of images is determined.

进一步地,待识别目标对应的参数信息包括人脸宽度值;所述方法还包括:Further, the parameter information corresponding to the target to be recognized includes the face width value; the method further includes:

根据预设的第一约束条件信息,确定相邻两帧图像中后一帧图像中的待识别目标,与待识别目标对应的高斯分布所在位置之间的距离;其中,所述第一约束条件信息表征待识别目标对应的位置信息;According to the preset first constraint information, determine the distance between the target to be identified in the latter frame of the two adjacent images and the location of the Gaussian distribution corresponding to the target to be identified; wherein, the first constraint The information represents the location information corresponding to the target to be identified;

若确定所述距离大于待识别目标的人脸宽度值,则将相邻两帧图像中后一帧图像中的待识别目标对应的高斯分布,替换为相邻两帧图像中前一帧图像中的待识别目标对应的高斯分布,得到更新后的相邻两帧图像中后一帧图像中的待识别目标对应的高斯分布;If it is determined that the distance is greater than the face width value of the target to be recognized, then the Gaussian distribution corresponding to the target to be recognized in the latter image of the two adjacent frames is replaced with the Gaussian distribution of the target in the previous frame of the two adjacent images. The Gaussian distribution corresponding to the target to be recognized is obtained, and the Gaussian distribution corresponding to the target to be recognized in the latter frame of the updated two adjacent frames of images is obtained;

根据更新后的相邻两帧图像中后一帧图像中的待识别目标对应的高斯分布,确定相邻两帧图像中后一帧图像中的待识别目标。According to the Gaussian distribution corresponding to the target to be recognized in the latter image of the two adjacent frames after updating, the target to be recognized in the latter image of the two adjacent frames is determined.

进一步地,根据所述概率结果信息,确定所述相邻两帧图像中后一帧图像中的待识别目标,包括:Further, based on the probability result information, determining the target to be recognized in the latter frame of the two adjacent frames of images includes:

若确定所述概率结果信息为未得到似然概率值,其中,所述似然概率值表征相邻两帧图像中后一帧图像中待识别目标的参数信息,分别与相邻两帧图像中前一帧图像中的多个高斯分布之间的似然概率值,则确定待识别目标为新增目标;If it is determined that the probability result information is that the likelihood probability value has not been obtained, wherein the likelihood probability value represents the parameter information of the target to be identified in the latter frame of the two adjacent frames of images, respectively, and the likelihood probability value of the two adjacent frames of images. Based on the likelihood probability values between multiple Gaussian distributions in the previous frame image, the target to be identified is determined to be a new target;

根据新增目标对应的参数信息,确定相邻两帧图像中后一帧图像中的新增目标对应的高斯分布;According to the parameter information corresponding to the new target, determine the Gaussian distribution corresponding to the new target in the latter frame of the two adjacent frames;

根据相邻两帧图像中后一帧图像中的新增目标对应的高斯分布,确定所述相邻两帧图像中后一帧图像中的待识别目标。According to the Gaussian distribution corresponding to the new target in the latter frame of the two adjacent frames of images, the target to be recognized in the latter frame of the two adjacent images is determined.

进一步地,根据所述概率结果信息,确定所述相邻两帧图像中后一帧图像中的待识别目标,包括:Further, based on the probability result information, determining the target to be recognized in the latter frame of the two adjacent frames of images includes:

若确定所述概率结果信息为未得到与高斯分布对应的参数信息,则确定所述高斯分布对应的待识别目标为消失状态;If it is determined that the probability result information is that parameter information corresponding to the Gaussian distribution has not been obtained, then it is determined that the target to be identified corresponding to the Gaussian distribution is in a disappearing state;

确定与高斯分布所在位置相邻且预设数量的待识别目标、以及所述待识别目标对应的参数信息;其中,所述高斯分布包括第一人脸面积,所述待识别目标对应的参数信息包括第二人脸面积;Determine a preset number of targets to be identified adjacent to the location of the Gaussian distribution, and parameter information corresponding to the target to be identified; wherein the Gaussian distribution includes the first face area, and the parameter information corresponding to the target to be identified Including the second face area;

根据预设的第二约束条件信息,在预设数量的第二人脸面积中,确定与所述第一人脸面积相等的目标人脸面积;其中,所述第二约束条件信息表征待识别目标的人脸面积的像素占比;According to the preset second constraint information, among the preset number of second face areas, determine the target face area that is equal to the first face area; wherein the second constraint information represents the to-be-recognized The pixel ratio of the target’s face area;

根据与所述目标人脸面积对应的参数信息,确定相邻两帧图像中后一帧图像中的待识别目标对应的高斯分布;According to the parameter information corresponding to the target face area, determine the Gaussian distribution corresponding to the target to be recognized in the latter image of the two adjacent frames;

根据相邻两帧图像中后一帧图像中的待识别目标对应的高斯分布,确定所述相邻两帧图像中后一帧图像中的待识别目标。According to the Gaussian distribution corresponding to the target to be recognized in the latter frame of the two adjacent frames of images, the target to be recognized in the latter frame of the two adjacent images is determined.

进一步地,对多帧图像进行识别处理,得到每一所述图像的参数列表,包括:Further, perform recognition processing on multiple frames of images to obtain a parameter list for each image, including:

根据预设的深度目标监测网络,分别对多帧图像进行识别处理,得到每一所述图像的参数列表;其中,所述深度目标检测网络用于指示目标识别信息,所述参数列表中包括图像中的待识别目标的参数信息。According to the preset depth target detection network, multiple frame images are recognized and processed respectively to obtain a parameter list of each image; wherein, the depth target detection network is used to indicate target identification information, and the parameter list includes images Parameter information of the target to be identified in .

进一步地,待识别目标对应的参数信息包括人脸信息及人体信息,其中,所述人脸信息包括人脸中心横坐标、人脸中心纵坐标、人脸宽度值、人脸高度值、以及人脸面积,所述人体信息包括人体中心横坐标、人体中心纵坐标、人体宽度值、人体高度值、以及人体面积。Further, the parameter information corresponding to the target to be recognized includes face information and human body information, wherein the face information includes the abscissa coordinate of the face center, the ordinate of the face center, the face width value, the face height value, and the face height value. Face area, the human body information includes the human body center abscissa, the human body center ordinate, the human body width value, the human body height value, and the human body area.

第二方面,本申请提供一种图像处理装置,包括:In a second aspect, this application provides an image processing device, including:

识别单元,用于对多帧图像进行识别处理,得到每一所述图像的参数列表;其中,所述参数列表中包括图像中的待识别目标的参数信息;A recognition unit, configured to perform recognition processing on multiple frame images to obtain a parameter list for each image; wherein the parameter list includes parameter information of the target to be recognized in the image;

第一确定单元,用于针对所述多帧图像中的相邻两帧图像,根据所述相邻两帧图像中前一帧图像的参数列表、以及所述相邻两帧图像中后一帧图像的参数列表,确定所述相邻两帧图像中后一帧图像中的待识别目标;其中,所述相邻两帧图像中后一帧图像中的待识别目标,为与所述相邻两帧图像中前一帧图像中的待识别目标对应的待识别目标;The first determination unit is configured to determine, for two adjacent frames of the multi-frame images, according to the parameter list of the previous frame of the image in the two adjacent frames of images, and the subsequent frame of the two adjacent frames of images. The parameter list of the image determines the target to be recognized in the latter frame of the two adjacent images; wherein the target to be recognized in the latter frame of the two adjacent images is the target that is the same as the adjacent one. The target to be recognized corresponding to the target to be recognized in the previous frame of the two images;

第一生成单元,用于生成所述多帧图像中最后一帧图像中的待识别目标对应的识别码;A first generation unit configured to generate an identification code corresponding to the target to be identified in the last frame of the multi-frame image;

第二生成单元,用于生成与所述识别码对应的目标信息;其中,所述识别码表征待识别目标的身份标识,所述目标信息表征待识别目标发出预设动作的次数。The second generation unit is configured to generate target information corresponding to the identification code; wherein the identification code represents the identity of the target to be identified, and the target information represents the number of times the target to be identified has issued a preset action.

进一步地,所述第一确定单元,包括:Further, the first determining unit includes:

第一确定模块,用于根据多帧图像中第一帧图像的参数列表,确定第一帧图像中的待识别目标对应的高斯分布;其中,所述高斯分布表征待识别目标的位置信息;The first determination module is configured to determine the Gaussian distribution corresponding to the target to be identified in the first frame image according to the parameter list of the first frame image in the multi-frame image; wherein the Gaussian distribution represents the position information of the target to be identified;

计算模块,用于针对所述多帧图像中的相邻两帧图像,将所述相邻两帧图像中后一帧图像的参数列表,与所述相邻两帧图像中前一帧图像中的待识别目标对应的高斯分布进行概率计算处理,得到相邻两帧图像中后一帧图像中的待识别目标对应的概率结果信息;其中,所述概率结果信息用于指示相邻两帧图像中后一帧图像中的待识别目标对应的高斯分布;A calculation module configured to, for two adjacent frames of images in the multi-frame images, compare the parameter list of the latter frame of the image in the two adjacent frames of images with the parameter list of the previous frame of the image in the two adjacent frames of images. The Gaussian distribution corresponding to the target to be identified is subjected to probability calculation processing to obtain the probability result information corresponding to the target to be identified in the latter image of the two adjacent frames; wherein the probability result information is used to indicate the two adjacent frames of images. Gaussian distribution corresponding to the target to be recognized in the next frame of image;

第二确定模块,用于根据所述概率结果信息,确定所述相邻两帧图像中后一帧图像中的待识别目标。The second determination module is configured to determine the target to be recognized in the latter frame of the two adjacent frames of images based on the probability result information.

进一步地,所述第二确定模块,包括:Further, the second determination module includes:

第一确定子模块,用于若确定所述概率结果信息为得到似然概率值,其中,所述似然概率值表征相邻两帧图像中后一帧图像中待识别目标的参数信息,分别与相邻两帧图像中前一帧图像中的多个高斯分布之间的似然概率值,则确定待识别目标对应的最大似然概率值;其中,所述最大似然概率值用于指示相邻两帧图像中后一帧图像中的待识别目标对应的高斯分布;The first determination sub-module is used to obtain a likelihood probability value if it is determined that the probability result information is, where the likelihood probability value represents the parameter information of the target to be identified in the latter frame of the two adjacent frames of images, respectively. and the likelihood probability value between the multiple Gaussian distributions in the previous frame image in the two adjacent frames of images, then the maximum likelihood probability value corresponding to the target to be identified is determined; wherein, the maximum likelihood probability value is used to indicate Gaussian distribution corresponding to the target to be recognized in the latter image of two adjacent frames;

第二确定子模块,用于根据待识别目标对应的最大似然概率值,确定相邻两帧图像中后一帧图像中的待识别目标。The second determination sub-module is used to determine the target to be recognized in the latter frame of the two adjacent frames of images based on the maximum likelihood probability value corresponding to the target to be recognized.

进一步地,待识别目标对应的参数信息包括人脸宽度值;所述方法还包括:Further, the parameter information corresponding to the target to be recognized includes the face width value; the method further includes:

第二确定单元,用于根据预设的第一约束条件信息,确定相邻两帧图像中后一帧图像中的待识别目标,与待识别目标对应的高斯分布所在位置之间的距离;其中,所述第一约束条件信息表征待识别目标对应的位置信息;The second determination unit is used to determine the distance between the target to be identified in the latter frame of the two adjacent images and the position of the Gaussian distribution corresponding to the target to be identified based on the preset first constraint information; wherein , the first constraint information represents the position information corresponding to the target to be identified;

更新单元,用于若确定所述距离大于待识别目标的人脸宽度值,则将相邻两帧图像中后一帧图像中的待识别目标对应的高斯分布,替换为相邻两帧图像中前一帧图像中的待识别目标对应的高斯分布,得到更新后的相邻两帧图像中后一帧图像中的待识别目标对应的高斯分布;An update unit, configured to replace the Gaussian distribution corresponding to the target to be recognized in the latter frame of the two adjacent frames of images with the Gaussian distribution of the target to be identified in the latter frame of the two adjacent frames of images if it is determined that the distance is greater than the face width value of the target to be identified. The Gaussian distribution corresponding to the target to be recognized in the previous frame of the image is obtained, and the Gaussian distribution corresponding to the target to be recognized in the latter frame of the two adjacent frames of images is obtained;

第三确定单元,用于根据更新后的相邻两帧图像中后一帧图像中的待识别目标对应的高斯分布,确定相邻两帧图像中后一帧图像中的待识别目标。The third determination unit is configured to determine the target to be recognized in the latter image of the two adjacent frames according to the updated Gaussian distribution corresponding to the target to be recognized in the latter image of the two adjacent frames.

进一步地,所述第二确定模块,包括:Further, the second determination module includes:

第三确定子模块,用于若确定所述概率结果信息为未得到似然概率值,其中,所述似然概率值表征相邻两帧图像中后一帧图像中待识别目标的参数信息,分别与相邻两帧图像中前一帧图像中的多个高斯分布之间的似然概率值,则确定待识别目标为新增目标;The third determination sub-module is used to determine that the probability result information is that the likelihood probability value has not been obtained, wherein the likelihood probability value represents the parameter information of the target to be identified in the latter frame of the two adjacent frames of images, If the likelihood probability values between the multiple Gaussian distributions in the previous frame of the two adjacent images are determined, the target to be identified is determined to be a new target;

第四确定子模块,用于根据新增目标对应的参数信息,确定相邻两帧图像中后一帧图像中的新增目标对应的高斯分布;The fourth determination sub-module is used to determine the Gaussian distribution corresponding to the new target in the latter frame of the two adjacent frames of images based on the parameter information corresponding to the new target;

第五确定子模块,用于根据相邻两帧图像中后一帧图像中的新增目标对应的高斯分布,确定所述相邻两帧图像中后一帧图像中的待识别目标。The fifth determination sub-module is used to determine the target to be recognized in the latter frame of the two adjacent frames of images according to the Gaussian distribution corresponding to the new target in the latter frame of the two adjacent frames of images.

进一步地,所述第二确定模块,包括:Further, the second determination module includes:

第六确定子模块,用于若确定所述概率结果信息为未得到与高斯分布对应的参数信息,则确定所述高斯分布对应的待识别目标为消失状态;The sixth determination sub-module is used to determine that the target to be identified corresponding to the Gaussian distribution is in a disappearing state if it is determined that the probability result information is not obtained.

第七确定子模块,用于确定与高斯分布所在位置相邻且预设数量的待识别目标、以及所述待识别目标对应的参数信息;其中,所述高斯分布包括第一人脸面积,所述待识别目标对应的参数信息包括第二人脸面积;The seventh determination sub-module is used to determine a preset number of targets to be identified adjacent to the location of the Gaussian distribution, and parameter information corresponding to the targets to be identified; wherein the Gaussian distribution includes the first face area, so The parameter information corresponding to the target to be recognized includes the second face area;

第八确定子模块,用于根据预设的第二约束条件信息,在预设数量的第二人脸面积中,确定与所述第一人脸面积相等的目标人脸面积;其中,所述第二约束条件信息表征待识别目标的人脸面积的像素占比;The eighth determination sub-module is used to determine a target face area equal to the first face area among a preset number of second face areas according to the preset second constraint information; wherein, the The second constraint information represents the pixel ratio of the face area of the target to be recognized;

第九确定子模块,用于根据与所述目标人脸面积对应的参数信息,确定相邻两帧图像中后一帧图像中的待识别目标对应的高斯分布;The ninth determination sub-module is used to determine the Gaussian distribution corresponding to the target to be recognized in the latter frame of the two adjacent frames of images based on the parameter information corresponding to the target face area;

第十确定子模块,用于根据相邻两帧图像中后一帧图像中的待识别目标对应的高斯分布,确定所述相邻两帧图像中后一帧图像中的待识别目标。The tenth determination sub-module is used to determine the target to be recognized in the latter frame of the two adjacent frames of images according to the Gaussian distribution corresponding to the target to be recognized in the latter frame of the two adjacent images.

进一步地,所述识别单元,具体用于:Further, the identification unit is specifically used for:

根据预设的深度目标监测网络,分别对多帧图像进行识别处理,得到每一所述图像的参数列表;其中,所述深度目标检测网络用于指示目标识别信息,所述参数列表中包括图像中的待识别目标的参数信息。According to the preset depth target detection network, multiple frame images are recognized and processed respectively to obtain a parameter list of each image; wherein, the depth target detection network is used to indicate target identification information, and the parameter list includes images Parameter information of the target to be identified in .

进一步地,待识别目标对应的参数信息包括人脸信息及人体信息,其中,所述人脸信息包括人脸中心横坐标、人脸中心纵坐标、人脸宽度值、人脸高度值、以及人脸面积,所述人体信息包括人体中心横坐标、人体中心纵坐标、人体宽度值、人体高度值、以及人体面积。Further, the parameter information corresponding to the target to be recognized includes face information and human body information, wherein the face information includes the abscissa coordinate of the face center, the ordinate of the face center, the face width value, the face height value, and the face height value. Face area, the human body information includes the human body center abscissa, the human body center ordinate, the human body width value, the human body height value, and the human body area.

第三方面,本申请提供一种电子设备,包括存储器、处理器,所述存储器中存储有可在所述处理器上运行的计算机程序,所述处理器执行所述计算机程序时实现第一方面所述的方法。In a third aspect, the present application provides an electronic device, including a memory and a processor. The memory stores a computer program that can run on the processor. When the processor executes the computer program, the first aspect is implemented. the method described.

第四方面,本申请提供一种计算机可读存储介质,所述计算机可读存储介质中存储有计算机执行指令,所述计算机执行指令被处理器执行时用于实现第一方面所述的方法。In a fourth aspect, the present application provides a computer-readable storage medium in which computer-executable instructions are stored, and when executed by a processor, the computer-executable instructions are used to implement the method described in the first aspect.

第五方面,本申请提供一种计算机程序产品,包括计算机程序,该计算机程序被处理器执行时实现第一方面所述的方法。In a fifth aspect, the present application provides a computer program product, including a computer program that implements the method described in the first aspect when executed by a processor.

本申请提供的一种图像处理方法、装置及设备,对多帧图像进行识别处理,得到每一图像的参数列表;其中,参数列表中包括图像中的待识别目标的参数信息。针对多帧图像中的相邻两帧图像,根据相邻两帧图像中前一帧图像的参数列表、以及相邻两帧图像中后一帧图像的参数列表,确定相邻两帧图像中后一帧图像中的待识别目标;其中,相邻两帧图像中后一帧图像中的待识别目标,为与相邻两帧图像中前一帧图像中的待识别目标对应的待识别目标。生成多帧图像中最后一帧图像中的待识别目标对应的识别码,并生成与识别码对应的目标信息;其中,识别码表征待识别目标的身份标识,目标信息表征待识别目标发出预设动作的次数。本方案中,图像中包括多个待识别目标,对多帧图像进行识别处理,得到每一图像的参数列表,参数列表中包括图像中的待识别目标的参数信息。然后,根据多帧图像中的相邻两帧图像,可以确定出相邻两帧图像中后一帧图像中的待识别目标,与相邻两帧图像中前一帧图像中的某一待识别目标是同一个待识别目标。然后将确定出的相邻两帧图像中后一帧图像中的待识别目标,作为下一组相邻两帧图像中前一帧图像的待识别目标,并结合下一组相邻两帧图像中后一帧图像中的参数列表,确定下一组相邻两帧图像中后一帧图像中的待识别目标,以此类推,直至确定多帧图像中最后一帧图像中的待识别目标,最后生成多帧图像中最后一帧图像中的待识别目标对应的识别码,并生成与识别码对应的目标信息,进而根据目标信息显示待识别目标发出预设动作的次数。因此,可以确定出位于多帧图像中的同一待识别目标,并生成待识别目标对应的识别码,极大的提高了跟踪待识别目标的稳定性,解决了用户的识别码追踪难度较大导致无法获取识别码对应的行为信息的技术问题。This application provides an image processing method, device and equipment that performs recognition processing on multiple frames of images to obtain a parameter list for each image; wherein the parameter list includes parameter information of the target to be recognized in the image. For two adjacent frames of images in a multi-frame image, determine the latter of the two adjacent frames of images based on the parameter list of the previous frame of the image and the parameter list of the latter frame of the two adjacent images. The target to be recognized in one frame of image; wherein, the target to be recognized in the latter frame of two adjacent frames of images is the target to be recognized corresponding to the target to be recognized in the previous frame of the two adjacent images. Generate an identification code corresponding to the target to be identified in the last frame of the multi-frame image, and generate target information corresponding to the identification code; wherein, the identification code represents the identity of the target to be identified, and the target information represents the preset issued by the target to be identified. The number of actions. In this solution, the image includes multiple targets to be recognized, and the multi-frame images are recognized and processed to obtain a parameter list of each image. The parameter list includes parameter information of the target to be recognized in the image. Then, based on the two adjacent frames in the multi-frame images, it is possible to determine the target to be identified in the latter frame of the two adjacent images and the target to be identified in the previous frame of the two adjacent images. The target is the same target to be identified. Then the determined target to be recognized in the latter frame of the two adjacent frames is used as the target to be recognized in the previous frame of the next set of two adjacent frames, and combined with the next set of two adjacent frames. The parameter list in the next frame of images is used to determine the target to be recognized in the next frame of the next two adjacent frames, and so on, until the target to be recognized in the last frame of the multi-frame image is determined. Finally, an identification code corresponding to the target to be identified in the last frame of the multi-frame image is generated, and target information corresponding to the identification code is generated, and then the number of times the target to be identified has issued a preset action is displayed based on the target information. Therefore, the same target to be identified in multiple frames of images can be determined, and the identification code corresponding to the target to be identified can be generated, which greatly improves the stability of tracking the target to be identified and solves the problem of difficulty in tracking the user's identification code. Technical problem of not being able to obtain the behavioral information corresponding to the identification code.

附图说明Description of the drawings

此处的附图被并入说明书中并构成本说明书的一部分,示出了符合本公开的实施例,并与说明书一起用于解释本公开的原理。The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments consistent with the disclosure and together with the description, serve to explain the principles of the disclosure.

图1为本申请实施例提供的一种图像处理方法的流程示意图;Figure 1 is a schematic flowchart of an image processing method provided by an embodiment of the present application;

图2为本申请实施例提供的另一种图像处理方法的流程示意图;Figure 2 is a schematic flow chart of another image processing method provided by an embodiment of the present application;

图3为本申请实施例提供的一种图像处理装置的结构示意图;Figure 3 is a schematic structural diagram of an image processing device provided by an embodiment of the present application;

图4为本申请实施例提供的另一种图像处理装置的结构示意图;Figure 4 is a schematic structural diagram of another image processing device provided by an embodiment of the present application;

图5为本申请实施例提供的一种电子设备的结构示意图;Figure 5 is a schematic structural diagram of an electronic device provided by an embodiment of the present application;

图6为本申请实施例提供的一种电子设备的框图。Figure 6 is a block diagram of an electronic device provided by an embodiment of the present application.

通过上述附图,已示出本公开明确的实施例,后文中将有更详细的描述。这些附图和文字描述并不是为了通过任何方式限制本公开构思的范围,而是通过参考特定实施例为本领域技术人员说明本公开的概念。Specific embodiments of the present disclosure have been shown through the above-mentioned drawings and will be described in more detail below. These drawings and written description are not intended to limit the scope of the disclosed concepts in any way, but rather to illustrate the concepts of the present disclosure to those skilled in the art with reference to the specific embodiments.

具体实施方式Detailed ways

这里将详细地对示例性实施例进行说明,其示例表示在附图中。下面的描述涉及附图时,除非另有表示,不同附图中的相同数字表示相同或相似的要素。以下示例性实施例中所描述的实施方式并不代表与本公开相一致的所有实施方式。Exemplary embodiments will be described in detail herein, examples of which are illustrated in the accompanying drawings. When the following description refers to the drawings, the same numbers in different drawings refer to the same or similar elements unless otherwise indicated. The implementations described in the following exemplary embodiments do not represent all implementations consistent with the present disclosure.

一个示例中,为了了解学生上课时的课堂行为,需要对每个学生进行跟踪识别。现有技术中,对每个学生进行跟踪识别时,是通过目标检测、动作识别技术识别学生的课堂行为,并且根据目标跟踪算法计算出整节课中每个学生产生的动作次数。然而现有技术中,由于根据目标跟踪算法计算出整节课中每个学生产生的动作次数时,在课堂中学生会有干扰行为,例如干扰行为包括低头、学生之间遮挡等,导致无法准确识别出学生的识别码,进而导致无法获取学生的课堂行为。In one example, in order to understand students' classroom behavior during class, each student needs to be tracked and identified. In the existing technology, when tracking and identifying each student, the student's classroom behavior is identified through target detection and action recognition technology, and the number of actions produced by each student in the entire class is calculated based on the target tracking algorithm. However, in the existing technology, when the number of actions produced by each student in the entire class is calculated based on the target tracking algorithm, students will have disturbing behaviors in the classroom, such as lowering their heads, blocking between students, etc., resulting in the inability to accurately identify The student's identification code is obtained, resulting in the inability to obtain the student's classroom behavior.

本申请提供的一种图像处理方法、装置及设备,旨在解决现有技术的如上技术问题。This application provides an image processing method, device and equipment, aiming to solve the above technical problems of the prior art.

下面以具体地实施例对本申请的技术方案以及本申请的技术方案如何解决上述技术问题进行详细说明。下面这几个具体的实施例可以相互结合,对于相同或相似的概念或过程可能在某些实施例中不再赘述。下面将结合附图,对本申请的实施例进行描述。The technical solution of the present application and how the technical solution of the present application solves the above technical problems will be described in detail below with specific embodiments. The following specific embodiments can be combined with each other, and the same or similar concepts or processes may not be described again in some embodiments. The embodiments of the present application will be described below with reference to the accompanying drawings.

图1为本申请实施例提供的一种图像处理方法的流程示意图,如图1所示,该包括:Figure 1 is a schematic flowchart of an image processing method provided by an embodiment of the present application. As shown in Figure 1, the method includes:

101、对多帧图像进行识别处理,得到每一图像的参数列表;其中,参数列表中包括图像中的待识别目标的参数信息。101. Perform recognition processing on multiple frame images to obtain a parameter list of each image; where the parameter list includes parameter information of the target to be recognized in the image.

示例性地,本实施例的执行主体可以为电子设备、或者终端设备、或者图像处理装置或设备、或者其他可以执行本实施例的装置或设备,对此不做限制。本实施例中以执行主体为电子设备进行介绍。For example, the execution subject of this embodiment may be an electronic device, or a terminal device, or an image processing device or device, or other device or device that can execute this embodiment, without limitation. In this embodiment, the execution subject is an electronic device.

首先,需要获取多帧图像,然后对多帧图像进行识别处理。可以拍摄得到图像,或者从存储器中获取图像;或者,接收其他设备传输的图像。然后可以对多帧图像进行识别处理,得到每一图像的参数列表,参数列表中包括图像中的待识别目标的参数信息,例如,识别处理包括通过预设的深度目标监测网络进行处理;本申请的技术方案为多目标跟踪算法,除了本申请的技术方案,多目标跟踪算法还包括SORT算法、DeepSORT算法。First, multiple frames of images need to be obtained, and then the multi-frame images need to be recognized and processed. Images can be captured, or images can be obtained from memory; or images transmitted by other devices can be received. Then the multi-frame images can be recognized and processed to obtain a parameter list of each image. The parameter list includes parameter information of the target to be recognized in the image. For example, the recognition process includes processing through a preset depth target monitoring network; this application The technical solution is a multi-target tracking algorithm. In addition to the technical solution of this application, the multi-target tracking algorithm also includes the SORT algorithm and the DeepSORT algorithm.

举例来说,根据预设的深度目标监测网络,电子设备分别对多帧图像进行识别处理,检测出每一帧图像的人体人脸,并获得每个人体人脸的bounding box(bbox)坐标及宽高,每一帧都输出一个参数列表bbox,每个参数列表有10个维度特征,分别是:人脸中心横坐标x、人脸中心纵坐标y、人脸bbox宽度值、人脸bbox高度值、人脸bbox面积、人体中心横坐标x、人体中心纵坐标y、人体bbox宽度值、人体bbox高度值、以及人体bbox面积。For example, based on the preset deep target monitoring network, the electronic device performs recognition processing on multiple frames of images, detects the human face in each frame of image, and obtains the bounding box (bbox) coordinates and Width and height, each frame outputs a parameter list bbox. Each parameter list has 10 dimensional features, which are: face center abscissa x, face center ordinate y, face bbox width value, face bbox height value, face bbox area, human body center abscissa x, human body center ordinate y, human bbox width value, human bbox height value, and human bbox area.

102、针对多帧图像中的相邻两帧图像,根据相邻两帧图像中前一帧图像的参数列表、以及相邻两帧图像中后一帧图像的参数列表,确定相邻两帧图像中后一帧图像中的待识别目标;其中,相邻两帧图像中后一帧图像中的待识别目标,为与相邻两帧图像中前一帧图像中的待识别目标对应的待识别目标。102. For two adjacent frame images in the multi-frame image, determine the two adjacent frame images according to the parameter list of the previous frame image in the two adjacent frame images and the parameter list of the subsequent frame image in the two adjacent frame images. The target to be recognized in the latter frame of the image; where the target to be recognized in the latter frame of the two adjacent images is the target to be recognized corresponding to the target to be recognized in the previous frame of the two adjacent images. Target.

示例性地,先根据多帧图像中的第一帧图像的参数列表X1,参数列表X1的长度为n,初始化N(N=n+10,10为预留的占位符)个多维(10维)高斯分布G,高斯分布G为每一待识别目标的均值和方差。例如,高斯分布G包括:G1、G2、G3…,其中,G1代表第1个人的高斯分布,G2代表第2个人的高斯分布,G3代表第3个人的高斯分布。Exemplarily, first, according to the parameter list X 1 of the first frame image in the multi-frame image, the length of the parameter list (10-dimensional) Gaussian distribution G, which is the mean and variance of each target to be identified. For example, the Gaussian distribution G includes: G 1 , G 2 , G 3 ..., where G 1 represents the Gaussian distribution of the first person, G 2 represents the Gaussian distribution of the second person, and G 3 represents the Gaussian distribution of the third person.

然后,针对多帧图像中的相邻两帧图像,基于预设的公式,将相邻两帧图像中后一帧图像的参数列表,与相邻两帧图像中前一帧图像中的待识别目标对应的高斯分布进行概率计算处理,得到相邻两帧图像中后一帧图像中的待识别目标对应的概率结果信息,概率结果信息用于指示相邻两帧图像中后一帧图像中的待识别目标对应的高斯分布,预设的公式如下:Then, for two adjacent frames in the multi-frame images, based on the preset formula, the parameter list of the latter frame of the two adjacent images is compared with the parameter list to be recognized in the previous frame of the two adjacent images. The Gaussian distribution corresponding to the target is subjected to probability calculation processing to obtain the probability result information corresponding to the target to be identified in the latter frame of the two adjacent images. The probability result information is used to indicate the probability result information in the latter frame of the two adjacent images. The preset formula for the Gaussian distribution corresponding to the target to be identified is as follows:

其中,Pi,j表示相邻两帧图像中后一帧图像中的第i个待识别目标,与相邻两帧图像中前一帧图像中的第j个高斯分布之间的似然概率;Xa b表示第a帧图像中的第b个待识别目标的参数信息,Gi表示相邻两帧图像中前一帧图像中的第i个待识别目标的高斯分布。Among them, P i,j represents the likelihood probability between the i-th target to be recognized in the latter frame of the two adjacent images and the j-th Gaussian distribution in the previous frame of the two adjacent images. ; _ _

最后,根据概率结果信息,确定相邻两帧图像中后一帧图像中的待识别目标,其中,相邻两帧图像中后一帧图像中的待识别目标,为与相邻两帧图像中前一帧图像中的待识别目标对应的待识别目标,即将相邻两帧图像中后一帧图像中的待识别目标,替换为与相邻两帧图像中前一帧图像中的待识别目标对应的高斯分布。Finally, based on the probability result information, the target to be recognized in the latter frame of the two adjacent frames of images is determined, where the target to be recognized in the latter frame of the two adjacent frames of images is the same as the target in the latter frame of the two adjacent images. The target to be recognized corresponding to the target to be recognized in the previous frame of the image is to replace the target to be recognized in the latter frame of the two adjacent frames of images with the target to be recognized in the previous frame of the image in the two adjacent frames of images. the corresponding Gaussian distribution.

103、生成多帧图像中最后一帧图像中的待识别目标对应的识别码,并生成与识别码对应的目标信息;其中,识别码表征待识别目标的身份标识,目标信息表征待识别目标发出预设动作的次数。103. Generate an identification code corresponding to the target to be identified in the last frame of the multi-frame image, and generate target information corresponding to the identification code; wherein, the identification code represents the identity of the target to be identified, and the target information represents the target sent by the target to be identified. The number of preset actions.

示例性地,识别码表示待识别目标的身份标识,预设动作是指电子设备预先存储的动作,例如,预设动作包括举手等。电子设备可以根据预先存储的总人数、以及多帧图像中最后一帧图像中的待识别目标对应的高斯分布,生成多帧图像中最后一帧图像中的待识别目标对应的识别码,并生成与每一识别码对应的目标信息,目标信息表征待识别目标发出预设动作的次数。For example, the identification code represents the identity of the target to be identified, and the preset action refers to an action pre-stored by the electronic device. For example, the preset action includes raising a hand. The electronic device can generate an identification code corresponding to the target to be identified in the last frame of the multi-frame image based on the pre-stored total number of people and the Gaussian distribution corresponding to the target to be identified in the last frame of the multi-frame image, and generate Target information corresponding to each identification code, the target information represents the number of times the target to be identified has issued a preset action.

举例来说,总人数为30人,电子设备可以根据30人、以及多帧图像中最后一帧图像中的30个待识别目标分别对应的高斯分布,生成多帧图像中最后一帧图像中的待识别目标对应的识别码,例如,第1个待识别目标的识别码(即ID)为1、第2个待识别目标的识别码(即ID)为2…以此类推,并生成与每一识别码对应的目标信息。For example, if the total number of people is 30, the electronic device can generate the target in the last frame of the multi-frame image based on the Gaussian distributions corresponding to the 30 people and the 30 targets to be recognized in the last frame of the multi-frame image. The identification code corresponding to the target to be identified, for example, the identification code (i.e., ID) of the first target to be identified is 1, the identification code (i.e., ID) of the second target to be identified is 2... and so on, and generates the corresponding identification code for each target. Target information corresponding to an identification code.

本申请实施例中,对多帧图像进行识别处理,得到每一图像的参数列表;其中,参数列表中包括图像中的待识别目标的参数信息。针对多帧图像中的相邻两帧图像,根据相邻两帧图像中前一帧图像的参数列表、以及相邻两帧图像中后一帧图像的参数列表,确定相邻两帧图像中后一帧图像中的待识别目标;其中,相邻两帧图像中后一帧图像中的待识别目标,为与相邻两帧图像中前一帧图像中的待识别目标对应的待识别目标。生成多帧图像中最后一帧图像中的待识别目标对应的识别码,并生成与识别码对应的目标信息;其中,识别码表征待识别目标的身份标识,目标信息表征待识别目标发出预设动作的次数。本方案中,图像中包括多个待识别目标,对多帧图像进行识别处理,得到每一图像的参数列表,参数列表中包括图像中的待识别目标的参数信息。然后,根据多帧图像中的相邻两帧图像,可以确定出相邻两帧图像中后一帧图像中的待识别目标,与相邻两帧图像中前一帧图像中的某一待识别目标是同一个待识别目标。然后将确定出的相邻两帧图像中后一帧图像中的待识别目标,作为下一组相邻两帧图像中前一帧图像的待识别目标,并结合下一组相邻两帧图像中后一帧图像中的参数列表,确定下一组相邻两帧图像中后一帧图像中的待识别目标,以此类推,直至确定多帧图像中最后一帧图像中的待识别目标,最后生成多帧图像中最后一帧图像中的待识别目标对应的识别码,并生成与识别码对应的目标信息,进而根据目标信息显示待识别目标发出预设动作的次数。因此,可以确定出位于多帧图像中的同一待识别目标,并生成待识别目标对应的识别码,极大的提高了跟踪待识别目标的稳定性,解决了用户的识别码追踪难度较大导致无法获取识别码对应的行为信息的技术问题。In the embodiment of the present application, multiple frame images are subjected to recognition processing to obtain a parameter list of each image; wherein the parameter list includes parameter information of the target to be recognized in the image. For two adjacent frames of images in a multi-frame image, determine the latter of the two adjacent frames of images based on the parameter list of the previous frame of the image and the parameter list of the latter frame of the two adjacent images. The target to be recognized in one frame of image; wherein, the target to be recognized in the latter frame of two adjacent frames of images is the target to be recognized corresponding to the target to be recognized in the previous frame of the two adjacent images. Generate an identification code corresponding to the target to be identified in the last frame of the multi-frame image, and generate target information corresponding to the identification code; wherein, the identification code represents the identity of the target to be identified, and the target information represents the preset issued by the target to be identified. The number of actions. In this solution, the image includes multiple targets to be recognized, and the multi-frame images are recognized and processed to obtain a parameter list of each image. The parameter list includes parameter information of the target to be recognized in the image. Then, based on the two adjacent frames in the multi-frame images, it is possible to determine the target to be identified in the latter frame of the two adjacent images and the target to be identified in the previous frame of the two adjacent images. The target is the same target to be identified. Then the determined target to be recognized in the latter frame of the two adjacent frames is used as the target to be recognized in the previous frame of the next set of two adjacent frames, and combined with the next set of two adjacent frames. The parameter list in the next frame of images is used to determine the target to be recognized in the next frame of the next two adjacent frames, and so on, until the target to be recognized in the last frame of the multi-frame image is determined. Finally, an identification code corresponding to the target to be identified in the last frame of the multi-frame image is generated, and target information corresponding to the identification code is generated, and then the number of times the target to be identified has issued a preset action is displayed based on the target information. Therefore, the same target to be identified in multiple frames of images can be determined, and the identification code corresponding to the target to be identified can be generated, which greatly improves the stability of tracking the target to be identified and solves the problem of difficulty in tracking the user's identification code. Technical problem of not being able to obtain the behavioral information corresponding to the identification code.

图2为本申请实施例提供的另一种图像处理方法的流程示意图,如图2所示,该方法包括:Figure 2 is a schematic flow chart of another image processing method provided by an embodiment of the present application. As shown in Figure 2, the method includes:

201、根据预设的深度目标监测网络,分别对多帧图像进行识别处理,得到每一图像的参数列表;其中,深度目标检测网络用于指示目标识别信息,参数列表中包括图像中的待识别目标的参数信息。201. According to the preset depth target detection network, perform recognition processing on multiple frames of images respectively to obtain a parameter list for each image; among them, the depth target detection network is used to indicate target identification information, and the parameter list includes the objects to be recognized in the image. Target parameter information.

一个示例中,待识别目标对应的参数信息包括人脸信息及人体信息,其中,人脸信息包括人脸中心横坐标、人脸中心纵坐标、人脸宽度值、人脸高度值、以及人脸面积,人体信息包括人体中心横坐标、人体中心纵坐标、人体宽度值、人体高度值、以及人体面积。In one example, the parameter information corresponding to the target to be recognized includes face information and human body information, where the face information includes the abscissa of the face center, the ordinate of the face center, the face width value, the face height value, and the face Area, human body information includes the horizontal coordinate of the human body center, the vertical coordinate of the human body center, the human body width value, the human body height value, and the human body area.

示例性地,本步骤可以参见图1中的步骤101,不再赘述。For example, this step can refer to step 101 in Figure 1 and will not be described again.

202、根据多帧图像中第一帧图像的参数列表,确定第一帧图像中的待识别目标对应的高斯分布;其中,高斯分布表征待识别目标的位置信息。202. According to the parameter list of the first frame image in the multi-frame image, determine the Gaussian distribution corresponding to the target to be recognized in the first frame image; wherein the Gaussian distribution represents the position information of the target to be recognized.

示例性地,电子设备根据多帧图像中的第一帧图像的参数列表X1,参数列表X1的长度为n,初始化N(N=n+10,10为预留的占位符)个多维(10维)高斯分布G,高斯分布G为每一待识别目标的均值和方差。例如,高斯分布G包括:G1、G2、G3…,其中,G1代表第1个人的高斯分布,G2代表第2个人的高斯分布,G3代表第3个人的高斯分布。Exemplarily, the electronic device initializes N (N=n+10, 10 is a reserved placeholder) parameters according to the parameter list X 1 of the first frame image in the multi-frame image. The length of the parameter list X 1 is n. Multidimensional (10-dimensional) Gaussian distribution G, which is the mean and variance of each target to be identified. For example, the Gaussian distribution G includes: G 1 , G 2 , G 3 ..., where G 1 represents the Gaussian distribution of the first person, G 2 represents the Gaussian distribution of the second person, and G 3 represents the Gaussian distribution of the third person.

203、针对多帧图像中的相邻两帧图像,将相邻两帧图像中后一帧图像的参数列表,与相邻两帧图像中前一帧图像中的待识别目标对应的高斯分布进行概率计算处理,得到相邻两帧图像中后一帧图像中的待识别目标对应的概率结果信息;其中,概率结果信息用于指示相邻两帧图像中后一帧图像中的待识别目标对应的高斯分布。203. For two adjacent frames in the multi-frame image, compare the parameter list of the latter frame of the two adjacent images with the Gaussian distribution corresponding to the target to be recognized in the previous frame of the two adjacent images. Probability calculation processing is performed to obtain the probability result information corresponding to the target to be identified in the latter frame of the two adjacent images; wherein the probability result information is used to indicate the correspondence of the target to be identified in the latter frame of the two adjacent images. Gaussian distribution.

204、根据概率结果信息,确定相邻两帧图像中后一帧图像中的待识别目标。204. Based on the probability result information, determine the target to be recognized in the latter frame of the two adjacent frames of images.

步骤204包括三种实现方式:Step 204 includes three implementation methods:

步骤204的第一种实现方式:若确定概率结果信息为得到似然概率值,其中,似然概率值表征相邻两帧图像中后一帧图像中待识别目标的参数信息,分别与相邻两帧图像中前一帧图像中的多个高斯分布之间的似然概率值,则确定待识别目标对应的最大似然概率值;其中,最大似然概率值用于指示相邻两帧图像中后一帧图像中的待识别目标对应的高斯分布;根据待识别目标对应的最大似然概率值,确定相邻两帧图像中后一帧图像中的待识别目标。The first implementation method of step 204: if the probability result information is determined to obtain a likelihood probability value, where the likelihood probability value represents the parameter information of the target to be identified in the latter frame of two adjacent frames of images, respectively with the adjacent The likelihood probability value between the multiple Gaussian distributions in the previous frame image in the two frames of images is used to determine the maximum likelihood probability value corresponding to the target to be identified; where the maximum likelihood probability value is used to indicate two adjacent frames of images. The Gaussian distribution corresponding to the target to be recognized in the latter frame of the image is determined; according to the maximum likelihood probability value corresponding to the target to be recognized, the target to be recognized in the latter frame of the image in the two adjacent frames is determined.

一个示例中,待识别目标对应的参数信息包括人脸宽度值;步骤204的第一种实现方式还包括:根据预设的第一约束条件信息,确定相邻两帧图像中后一帧图像中的待识别目标,与待识别目标对应的高斯分布所在位置之间的距离;其中,第一约束条件信息表征待识别目标对应的位置信息;若确定距离大于待识别目标的人脸宽度值,则将相邻两帧图像中后一帧图像中的待识别目标对应的高斯分布,替换为相邻两帧图像中前一帧图像中的待识别目标对应的高斯分布,得到更新后的相邻两帧图像中后一帧图像中的待识别目标对应的高斯分布;根据更新后的相邻两帧图像中后一帧图像中的待识别目标对应的高斯分布,确定相邻两帧图像中后一帧图像中的待识别目标。In one example, the parameter information corresponding to the target to be recognized includes a face width value; the first implementation of step 204 also includes: determining, according to the preset first constraint information, the width of the face in the latter frame of the two adjacent frames. The distance between the target to be recognized and the position of the Gaussian distribution corresponding to the target to be recognized; where the first constraint information represents the position information corresponding to the target to be recognized; if it is determined that the distance is greater than the face width value of the target to be recognized, then Replace the Gaussian distribution corresponding to the target to be recognized in the latter frame of the two adjacent images with the Gaussian distribution corresponding to the target to be recognized in the previous frame of the two adjacent images to obtain the updated adjacent two frames. The Gaussian distribution corresponding to the target to be recognized in the latter frame of the image; according to the Gaussian distribution corresponding to the target to be recognized in the latter frame of the two adjacent frames of images after updating, determine the latter of the two adjacent frames of images. The target to be recognized in the frame image.

步骤204的第二种实现方式:若确定概率结果信息为未得到似然概率值,其中,似然概率值表征相邻两帧图像中后一帧图像中待识别目标的参数信息,分别与相邻两帧图像中前一帧图像中的多个高斯分布之间的似然概率值,则确定待识别目标为新增目标;根据新增目标对应的参数信息,确定相邻两帧图像中后一帧图像中的新增目标对应的高斯分布;根据相邻两帧图像中后一帧图像中的新增目标对应的高斯分布,确定相邻两帧图像中后一帧图像中的待识别目标。The second implementation method of step 204: If it is determined that the probability result information is that the likelihood probability value is not obtained, where the likelihood probability value represents the parameter information of the target to be identified in the latter frame of the two adjacent frames of images, respectively with the corresponding According to the likelihood probability values between multiple Gaussian distributions in the previous frame of the image in the two adjacent frames, the target to be identified is determined to be a new target; according to the parameter information corresponding to the new target, the target in the two adjacent frames of images is determined. The Gaussian distribution corresponding to the new target in one frame of image; according to the Gaussian distribution corresponding to the new target in the latter frame of two adjacent images, determine the target to be recognized in the latter frame of the two adjacent images. .

步骤204的第三种实现方式:若确定概率结果信息为未得到与高斯分布对应的参数信息,则确定高斯分布对应的待识别目标为消失状态;确定与高斯分布所在位置相邻且预设数量的待识别目标、以及待识别目标对应的参数信息;其中,高斯分布包括第一人脸面积,待识别目标对应的参数信息包括第二人脸面积;根据预设的第二约束条件信息,在预设数量的第二人脸面积中,确定与第一人脸面积相等的目标人脸面积;其中,第二约束条件信息表征待识别目标的人脸面积的像素占比;根据与目标人脸面积对应的参数信息,确定相邻两帧图像中后一帧图像中的待识别目标对应的高斯分布;根据相邻两帧图像中后一帧图像中的待识别目标对应的高斯分布,确定相邻两帧图像中后一帧图像中的待识别目标。The third implementation method of step 204: If it is determined that the probability result information is that parameter information corresponding to the Gaussian distribution has not been obtained, then it is determined that the target to be identified corresponding to the Gaussian distribution is in a disappearing state; it is determined that the target is adjacent to the location of the Gaussian distribution and has a preset number The target to be recognized, and the parameter information corresponding to the target to be recognized; wherein, the Gaussian distribution includes the first face area, and the parameter information corresponding to the target to be recognized includes the second face area; according to the preset second constraint information, in Among the preset number of second face areas, determine the target face area that is equal to the first face area; wherein, the second constraint information represents the pixel ratio of the face area of the target to be recognized; according to the target face area According to the parameter information corresponding to the area, determine the Gaussian distribution corresponding to the target to be recognized in the latter frame of the two adjacent images; determine the Gaussian distribution corresponding to the target to be recognized in the latter frame of the two adjacent images. The target to be recognized in the latter image of the two adjacent frames.

示例性地,针对多帧图像中的相邻两帧图像,基于预设的公式,电子设备将相邻两帧图像中后一帧图像的参数列表,与相邻两帧图像中前一帧图像中的待识别目标对应的高斯分布进行概率计算处理,得到相邻两帧图像中后一帧图像中的待识别目标对应的概率结果信息,概率结果信息用于指示相邻两帧图像中后一帧图像中的待识别目标对应的高斯分布。根据概率结果信息,确定相邻两帧图像中后一帧图像中的待识别目标,其中,相邻两帧图像中后一帧图像中的待识别目标,为与相邻两帧图像中前一帧图像中的待识别目标对应的待识别目标。For example, for two adjacent frames in a multi-frame image, based on a preset formula, the electronic device compares the parameter list of the latter frame of the two adjacent images with the previous frame of the two adjacent images. The Gaussian distribution corresponding to the target to be identified in the image is subjected to probability calculation processing to obtain the probability result information corresponding to the target to be identified in the latter image of the two adjacent frames. The probability result information is used to indicate the latter of the two adjacent frames of images. Gaussian distribution corresponding to the target to be recognized in the frame image. According to the probability result information, the target to be recognized in the latter frame of the two adjacent images is determined, where the target to be recognized in the latter frame of the two adjacent images is the same as the target in the previous frame of the two adjacent images. The target to be recognized corresponding to the target to be recognized in the frame image.

示例性地,在步骤204的第一种实现方式中,如果确定概率结果信息为得到似然概率值,其中,似然概率值表征相邻两帧图像中后一帧图像中每一待识别目标的参数信息,分别与相邻两帧图像中前一帧图像中的多个高斯分布之间的似然概率值,则在多个似然概率值中,确定出待识别目标对应的最大似然概率值,其中,最大似然概率值用于指示相邻两帧图像中后一帧图像中的待识别目标对应的高斯分布。最后,根据待识别目标对应的最大似然概率值,确定相邻两帧图像中后一帧图像中的待识别目标,其中,相邻两帧图像中后一帧图像中的待识别目标,为与相邻两帧图像中前一帧图像中的待识别目标的高斯分布相等的待识别目标。For example, in the first implementation of step 204, if the probability result information is determined to obtain a likelihood probability value, where the likelihood probability value represents each target to be identified in the latter frame of the two adjacent images. parameter information, respectively, and the likelihood probability values between the multiple Gaussian distributions in the previous frame image in the two adjacent frames of images, then among the multiple likelihood probability values, the maximum likelihood corresponding to the target to be identified is determined Probability value, where the maximum likelihood probability value is used to indicate the Gaussian distribution corresponding to the target to be recognized in the latter image of two adjacent frames. Finally, according to the maximum likelihood probability value corresponding to the target to be recognized, the target to be recognized in the latter frame of the two adjacent frames of images is determined, where the target to be recognized in the latter frame of the two adjacent frames of images is The target to be recognized is equal to the Gaussian distribution of the target to be recognized in the previous frame of the two adjacent images.

确定出相邻两帧图像中后一帧图像中的待识别目标后,还可以进一步的校验相邻两帧图像中后一帧图像中的待识别目标的高斯分布。具体的,待识别目标对应的参数信息包括人脸宽度值,电子设备根据预设的第一约束条件信息,确定相邻两帧图像中后一帧图像中的待识别目标,与待识别目标对应的高斯分布所在位置之间的距离,第一约束条件信息表征待识别目标对应的位置信息。然后将待识别目标的距离与待识别目标的人脸宽度值进行比较,如果确定待识别目标的距离大于待识别目标的人脸宽度值,则说明待识别目标在相邻两帧图像中后一帧图像中的位移过大,待识别目标与相邻两帧图像中前一帧图像中的待识别目标不是同一个,因此将相邻两帧图像中后一帧图像中的待识别目标对应的高斯分布,替换为相邻两帧图像中前一帧图像中的待识别目标对应的高斯分布,得到更新后的相邻两帧图像中后一帧图像中的待识别目标对应的高斯分布。最后根据更新后的相邻两帧图像中后一帧图像中的待识别目标对应的高斯分布,确定相邻两帧图像中后一帧图像中的待识别目标。After determining the target to be recognized in the latter frame of the two adjacent images, the Gaussian distribution of the target to be recognized in the latter frame of the two adjacent images can be further verified. Specifically, the parameter information corresponding to the target to be recognized includes the face width value. The electronic device determines the target to be recognized in the latter frame of the two adjacent frames of images according to the preset first constraint information, corresponding to the target to be recognized. The distance between the positions of the Gaussian distribution, and the first constraint information represents the position information corresponding to the target to be identified. Then compare the distance of the target to be recognized with the face width value of the target to be recognized. If it is determined that the distance of the target to be recognized is greater than the face width value of the target to be recognized, it means that the target to be recognized is the last in the two adjacent frames of images. The displacement in the frame image is too large, and the target to be recognized is not the same as the target to be recognized in the previous frame of the two adjacent frames. Therefore, the target to be recognized in the latter frame of the two adjacent frames is corresponding to The Gaussian distribution is replaced with the Gaussian distribution corresponding to the target to be recognized in the previous frame of the two adjacent images, and the Gaussian distribution corresponding to the target to be recognized in the latter image of the two adjacent frames is obtained. Finally, according to the updated Gaussian distribution corresponding to the target to be recognized in the latter frame of the two adjacent frames of images, the target to be recognized in the latter frame of the two adjacent frames of images is determined.

举例来说,如果确定概率结果信息为得到似然概率值,似然概率值表示相邻两帧图像中后一帧图像中第1个待识别目标的参数信息,分别与相邻两帧图像中前一帧图像中的多个高斯分布之间的似然概率值,似然概率值包括:第1个待识别目标的参数信息与相邻两帧图像中前一帧图像中的高斯分布G1之间的似然概率值P1,1、第1个待识别目标的参数信息与相邻两帧图像中前一帧图像中的高斯分布G2之间的似然概率值P1,2、第1个待识别目标的参数信息与相邻两帧图像中前一帧图像中的高斯分布G3之间的似然概率值P1,3。则在多个似然概率值中,确定出待识别目标对应的最大似然概率值P1,3,然后,根据待识别目标对应的最大似然概率值P1,3,可以确定相邻两帧图像中后一帧图像中的第1个待识别目标的高斯分布为G3For example, if the probability result information is determined to obtain a likelihood probability value, the likelihood probability value represents the parameter information of the first target to be recognized in the latter image of two adjacent frames, respectively, and the parameter information of the first target to be identified in the latter image of the two adjacent frames. The likelihood probability value between multiple Gaussian distributions in the previous frame image. The likelihood probability value includes: the parameter information of the first target to be recognized and the Gaussian distribution G 1 in the previous frame image in the two adjacent frames of images. The likelihood probability value P 1,1 between the parameter information of the first target to be identified and the Gaussian distribution G 2 in the previous frame of the two adjacent images, P 1,2 , The likelihood probability value P 1,3 between the parameter information of the first target to be recognized and the Gaussian distribution G 3 in the previous frame of the two adjacent images. Then among multiple likelihood probability values, the maximum likelihood probability value P 1,3 corresponding to the target to be identified is determined. Then, according to the maximum likelihood probability value P 1,3 corresponding to the target to be identified, two adjacent ones can be determined. The Gaussian distribution of the first target to be recognized in the next frame of the image is G 3 .

示例性地,在步骤204的第二种实现方式中,如果确定概率结果信息为未得到似然概率值,其中,似然概率值表征相邻两帧图像中后一帧图像中待识别目标的参数信息,分别与相邻两帧图像中前一帧图像中的多个高斯分布之间的似然概率值,则确定相邻两帧图像中后一帧图像中的待识别目标为新增目标,然后根据新增目标对应的参数信息,计算相邻两帧图像中后一帧图像中的新增目标对应的高斯分布。最后根据相邻两帧图像中后一帧图像中的新增目标对应的高斯分布,确定相邻两帧图像中后一帧图像中的待识别目标。For example, in the second implementation of step 204, if it is determined that the probability result information is that the likelihood probability value is not obtained, where the likelihood probability value represents the target to be identified in the latter frame of the two adjacent images. parameter information, respectively, and the likelihood probability values between multiple Gaussian distributions in the previous frame of the two adjacent images, then it is determined that the target to be identified in the latter frame of the two adjacent frames is the new target. , and then calculate the Gaussian distribution corresponding to the new target in the latter frame of the two adjacent frames of images based on the parameter information corresponding to the new target. Finally, according to the Gaussian distribution corresponding to the new target in the latter image of the two adjacent frames, the target to be recognized in the latter image of the two adjacent frames is determined.

示例性地,在步骤204的第三种实现方式中,如果确定概率结果信息为未得到与高斯分布对应的参数信息,则确定高斯分布对应的待识别目标为消失状态,其中,消失状态包括待识别目标被遮挡、或低头等,确定与高斯分布所在位置相邻且预设数量的待识别目标、以及预设数量的待识别目标对应的参数信息,其中,高斯分布包括第一人脸面积,待识别目标对应的参数信息包括第二人脸面积,预设数量的待识别目标可以是距高斯分布所在位置最近的3个待识别目标,对预设数量不做限制。For example, in the third implementation of step 204, if it is determined that the probability result information is that parameter information corresponding to the Gaussian distribution has not been obtained, it is determined that the target to be identified corresponding to the Gaussian distribution is in a disappearing state, where the disappearing state includes the to-be-identified target. Recognize that the target is blocked or lowered, etc., determine a preset number of targets to be identified adjacent to the location of the Gaussian distribution, and parameter information corresponding to the preset number of targets to be identified, where the Gaussian distribution includes the first face area, The parameter information corresponding to the target to be recognized includes the second face area. The preset number of targets to be recognized can be the three targets to be recognized that are closest to the location of the Gaussian distribution. There is no limit to the preset number.

然后,根据预设的第二约束条件信息,在预设数量的第二人脸面积中,将预设数量的第二人脸面积依次与第一人脸面积进行比较,确定出与第一人脸面积相等的目标人脸面积,第二约束条件信息表征待识别目标的人脸面积的像素占比。其中,由于课室的座位排布通常比较固定,是前后多排的矩阵型布局,当遮挡发生时,通常是前排学生遮挡后排学生,而前排学生在视频画面中的像素占比较大,所以,可以预先存储待识别目标的人脸面积的像素占比。最后,根据与目标人脸面积对应的参数信息,确定相邻两帧图像中后一帧图像中的待识别目标对应的高斯分布,进而根据相邻两帧图像中后一帧图像中的待识别目标对应的高斯分布,确定相邻两帧图像中后一帧图像中的待识别目标。Then, according to the preset second constraint condition information, among the preset number of second face areas, the preset number of second face areas are compared with the first face area in sequence, and the difference between the preset number of second face areas and the first person's face area is determined. The target face area is equal to the face area, and the second constraint information represents the pixel ratio of the face area of the target to be recognized. Among them, because the seating arrangement in the classroom is usually relatively fixed, with a matrix layout of multiple rows in the front and back, when occlusion occurs, it is usually the students in the front row who block the students in the back row, and the students in the front row occupy a larger proportion of the pixels in the video screen. Therefore, the pixel ratio of the face area of the target to be recognized can be stored in advance. Finally, according to the parameter information corresponding to the target face area, the Gaussian distribution corresponding to the target to be recognized in the latter frame of the two adjacent frames of images is determined, and then based on the target to be recognized in the latter frame of the two adjacent frames of images, the Gaussian distribution is determined. The Gaussian distribution corresponding to the target determines the target to be recognized in the latter image of the two adjacent frames.

205、生成多帧图像中最后一帧图像中的待识别目标对应的识别码,并生成与识别码对应的目标信息;其中,识别码表征待识别目标的身份标识,目标信息表征待识别目标发出预设动作的次数。205. Generate an identification code corresponding to the target to be identified in the last frame of the multi-frame image, and generate target information corresponding to the identification code; wherein, the identification code represents the identity of the target to be identified, and the target information represents the target sent by the target to be identified. The number of preset actions.

示例性地,本步骤可以参见图1中的步骤103,不再赘述。For example, this step can refer to step 103 in Figure 1 and will not be described again.

本申请实施例中,根据预设的深度目标监测网络,分别对多帧图像进行识别处理,得到每一图像的参数列表;其中,深度目标检测网络用于指示目标识别信息,参数列表中包括图像中的待识别目标的参数信息。根据多帧图像中第一帧图像的参数列表,确定第一帧图像中的待识别目标对应的高斯分布;其中,高斯分布表征待识别目标的位置信息。针对多帧图像中的相邻两帧图像,将相邻两帧图像中后一帧图像的参数列表,与相邻两帧图像中前一帧图像中的待识别目标对应的高斯分布进行概率计算处理,得到相邻两帧图像中后一帧图像中的待识别目标对应的概率结果信息;其中,概率结果信息用于指示相邻两帧图像中后一帧图像中的待识别目标对应的高斯分布。根据概率结果信息,确定相邻两帧图像中后一帧图像中的待识别目标。生成多帧图像中最后一帧图像中的待识别目标对应的识别码,并生成与识别码对应的目标信息;其中,识别码表征待识别目标的身份标识,目标信息表征待识别目标发出预设动作的次数。因此,结合多项约束条件信息,可以确定出位于多帧图像中的同一待识别目标,并生成待识别目标对应的识别码,极大的提高了跟踪待识别目标的稳定性,解决了用户的识别码追踪难度较大导致无法获取识别码对应的行为信息的技术问题。In the embodiment of the present application, multiple frame images are recognized and processed respectively according to the preset depth target detection network to obtain a parameter list of each image; wherein, the depth target detection network is used to indicate target identification information, and the parameter list includes images Parameter information of the target to be identified in . According to the parameter list of the first frame image in the multi-frame image, the Gaussian distribution corresponding to the target to be recognized in the first frame image is determined; wherein the Gaussian distribution represents the position information of the target to be recognized. For two adjacent frames in a multi-frame image, calculate the probability of the parameter list of the latter frame of the two adjacent images and the Gaussian distribution corresponding to the target to be recognized in the previous frame of the two adjacent images. Process to obtain the probability result information corresponding to the target to be identified in the latter frame of the two adjacent images; where the probability result information is used to indicate the Gaussian corresponding to the target to be identified in the latter frame of the two adjacent images. distributed. According to the probability result information, the target to be recognized in the latter frame of the two adjacent frames of images is determined. Generate an identification code corresponding to the target to be identified in the last frame of the multi-frame image, and generate target information corresponding to the identification code; wherein, the identification code represents the identity of the target to be identified, and the target information represents the preset issued by the target to be identified. The number of actions. Therefore, by combining multiple constraint information, the same target to be identified in multiple frames of images can be determined, and an identification code corresponding to the target to be identified can be generated, which greatly improves the stability of tracking the target to be identified and solves the user's problem. It is difficult to track the identification code, resulting in technical problems such as the inability to obtain the behavioral information corresponding to the identification code.

图3为本申请实施例提供的一种图像处理装置的结构示意图,如图3所示,该装置包括:Figure 3 is a schematic structural diagram of an image processing device provided by an embodiment of the present application. As shown in Figure 3, the device includes:

识别单元31,用于对多帧图像进行识别处理,得到每一图像的参数列表;其中,参数列表中包括图像中的待识别目标的参数信息。The recognition unit 31 is used to perform recognition processing on multiple frame images to obtain a parameter list of each image; wherein the parameter list includes parameter information of the target to be recognized in the image.

第一确定单元32,用于针对多帧图像中的相邻两帧图像,根据相邻两帧图像中前一帧图像的参数列表、以及相邻两帧图像中后一帧图像的参数列表,确定相邻两帧图像中后一帧图像中的待识别目标;其中,相邻两帧图像中后一帧图像中的待识别目标,为与相邻两帧图像中前一帧图像中的待识别目标对应的待识别目标。The first determining unit 32 is configured to, for two adjacent frame images in the multi-frame images, based on the parameter list of the previous frame image in the two adjacent frame images and the parameter list of the subsequent frame image in the two adjacent frame images, Determine the target to be recognized in the latter frame of the two adjacent images; where the target to be recognized in the latter frame of the two adjacent frames is the same as the target to be recognized in the previous frame of the two adjacent images. The target to be recognized corresponding to the recognized target.

第一生成单元33,用于生成多帧图像中最后一帧图像中的待识别目标对应的识别码。The first generation unit 33 is used to generate an identification code corresponding to the target to be identified in the last frame of the multi-frame image.

第二生成单元34,用于生成与识别码对应的目标信息;其中,识别码表征待识别目标的身份标识,目标信息表征待识别目标发出预设动作的次数。The second generation unit 34 is used to generate target information corresponding to the identification code; wherein the identification code represents the identity of the target to be identified, and the target information represents the number of times the target to be identified has issued a preset action.

本实施例的装置,可以执行上述方法中的技术方案,其具体实现过程和技术原理相同,此处不再赘述。The device of this embodiment can execute the technical solution in the above method. Its specific implementation process and technical principles are the same and will not be described again here.

图4为本申请实施例提供的另一种图像处理装置的结构示意图,在图3所示实施例的基础上,如图4所示,第一确定单元32,包括:Figure 4 is a schematic structural diagram of another image processing device provided by an embodiment of the present application. Based on the embodiment shown in Figure 3, as shown in Figure 4, the first determination unit 32 includes:

第一确定模块321,用于根据多帧图像中第一帧图像的参数列表,确定第一帧图像中的待识别目标对应的高斯分布;其中,高斯分布表征待识别目标的位置信息。The first determination module 321 is used to determine the Gaussian distribution corresponding to the target to be recognized in the first frame image according to the parameter list of the first frame image in the multi-frame image; wherein the Gaussian distribution represents the position information of the target to be recognized.

计算模块322,用于针对多帧图像中的相邻两帧图像,将相邻两帧图像中后一帧图像的参数列表,与相邻两帧图像中前一帧图像中的待识别目标对应的高斯分布进行概率计算处理,得到相邻两帧图像中后一帧图像中的待识别目标对应的概率结果信息;其中,概率结果信息用于指示相邻两帧图像中后一帧图像中的待识别目标对应的高斯分布。The calculation module 322 is used to, for two adjacent frames of images in the multi-frame images, correspond the parameter list of the latter frame of the image in the two adjacent frames of images to the target to be recognized in the previous frame of the image of the two adjacent frames of images. Probability calculation processing is performed on the Gaussian distribution to obtain the probability result information corresponding to the target to be identified in the latter frame of the two adjacent images; where the probability result information is used to indicate the probability result information in the latter frame of the two adjacent images. Gaussian distribution corresponding to the target to be recognized.

第二确定模块323,用于根据概率结果信息,确定相邻两帧图像中后一帧图像中的待识别目标。The second determination module 323 is used to determine the target to be recognized in the latter frame of two adjacent frames of images based on the probability result information.

一个示例中,第二确定模块323,包括:In an example, the second determination module 323 includes:

第一确定子模块3231,用于若确定概率结果信息为得到似然概率值,其中,似然概率值表征相邻两帧图像中后一帧图像中待识别目标的参数信息,分别与相邻两帧图像中前一帧图像中的多个高斯分布之间的似然概率值,则确定待识别目标对应的最大似然概率值;其中,最大似然概率值用于指示相邻两帧图像中后一帧图像中的待识别目标对应的高斯分布。The first determination sub-module 3231 is used to obtain the likelihood probability value if the probability result information is determined, where the likelihood probability value represents the parameter information of the target to be identified in the latter frame of the two adjacent frames of images, respectively with the adjacent The likelihood probability value between the multiple Gaussian distributions in the previous frame image in the two frames of images is used to determine the maximum likelihood probability value corresponding to the target to be identified; where the maximum likelihood probability value is used to indicate two adjacent frames of images. Gaussian distribution corresponding to the target to be recognized in the next frame of image.

第二确定子模块3232,用于根据待识别目标对应的最大似然概率值,确定相邻两帧图像中后一帧图像中的待识别目标。The second determination sub-module 3232 is used to determine the target to be recognized in the latter frame of the two adjacent frames of images based on the maximum likelihood probability value corresponding to the target to be recognized.

一个示例中,待识别目标对应的参数信息包括人脸宽度值;该装置还包括:In one example, the parameter information corresponding to the target to be recognized includes a face width value; the device also includes:

第二确定单元41,用于根据预设的第一约束条件信息,确定相邻两帧图像中后一帧图像中的待识别目标,与待识别目标对应的高斯分布所在位置之间的距离;其中,第一约束条件信息表征待识别目标对应的位置信息。The second determination unit 41 is configured to determine the distance between the target to be recognized in the latter frame of the two adjacent frames of images and the location of the Gaussian distribution corresponding to the target to be recognized based on the preset first constraint information; Among them, the first constraint information represents the position information corresponding to the target to be identified.

更新单元42,用于若确定距离大于待识别目标的人脸宽度值,则将相邻两帧图像中后一帧图像中的待识别目标对应的高斯分布,替换为相邻两帧图像中前一帧图像中的待识别目标对应的高斯分布,得到更新后的相邻两帧图像中后一帧图像中的待识别目标对应的高斯分布。The update unit 42 is configured to, if it is determined that the distance is greater than the face width value of the target to be recognized, replace the Gaussian distribution corresponding to the target to be recognized in the latter image of the two adjacent frames with the Gaussian distribution of the target to be recognized in the previous two adjacent frames of images. The Gaussian distribution corresponding to the target to be recognized in one frame of image is obtained, and the Gaussian distribution corresponding to the target to be recognized in the latter frame of two adjacent frames of images is obtained.

第三确定单元43,用于根据更新后的相邻两帧图像中后一帧图像中的待识别目标对应的高斯分布,确定相邻两帧图像中后一帧图像中的待识别目标。The third determination unit 43 is configured to determine the target to be recognized in the latter image of the two adjacent frames according to the updated Gaussian distribution corresponding to the target to be recognized in the latter image of the two adjacent frames.

一个示例中,第二确定模块323,包括:In an example, the second determination module 323 includes:

第三确定子模块3233,用于若确定概率结果信息为未得到似然概率值,其中,似然概率值表征相邻两帧图像中后一帧图像中待识别目标的参数信息,分别与相邻两帧图像中前一帧图像中的多个高斯分布之间的似然概率值,则确定待识别目标为新增目标。The third determination sub-module 3233 is used to determine that the probability result information is that the likelihood probability value has not been obtained, where the likelihood probability value represents the parameter information of the target to be identified in the latter frame of the two adjacent frames of images, respectively with the corresponding If the likelihood probability value between the multiple Gaussian distributions in the previous frame image in the two adjacent frames is determined, the target to be identified is determined to be a new target.

第四确定子模块3234,用于根据新增目标对应的参数信息,确定相邻两帧图像中后一帧图像中的新增目标对应的高斯分布。The fourth determination sub-module 3234 is used to determine the Gaussian distribution corresponding to the new target in the latter frame of the two adjacent frames of images based on the parameter information corresponding to the new target.

第五确定子模块3235,用于根据相邻两帧图像中后一帧图像中的新增目标对应的高斯分布,确定相邻两帧图像中后一帧图像中的待识别目标。The fifth determination sub-module 3235 is used to determine the target to be recognized in the latter frame of the two adjacent frames of images based on the Gaussian distribution corresponding to the newly added target in the latter frame of the two adjacent images.

一个示例中,第二确定模块323,包括:In an example, the second determination module 323 includes:

第六确定子模块3236,用于若确定概率结果信息为未得到与高斯分布对应的参数信息,则确定高斯分布对应的待识别目标为消失状态。The sixth determination sub-module 3236 is used to determine that the target to be identified corresponding to the Gaussian distribution is in a disappeared state if the probability result information is determined to be that parameter information corresponding to the Gaussian distribution has not been obtained.

第七确定子模块3237,用于确定与高斯分布所在位置相邻且预设数量的待识别目标、以及待识别目标对应的参数信息;其中,高斯分布包括第一人脸面积,待识别目标对应的参数信息包括第二人脸面积。The seventh determination sub-module 3237 is used to determine a preset number of targets to be identified adjacent to the location of the Gaussian distribution, and parameter information corresponding to the targets to be identified; wherein the Gaussian distribution includes the first face area, and the target to be identified corresponds to The parameter information includes the second face area.

第八确定子模块3238,用于根据预设的第二约束条件信息,在预设数量的第二人脸面积中,确定与第一人脸面积相等的目标人脸面积;其中,第二约束条件信息表征待识别目标的人脸面积的像素占比。The eighth determination sub-module 3238 is used to determine a target face area equal to the first face area among a preset number of second face areas according to the preset second constraint information; wherein, the second constraint The condition information represents the pixel ratio of the face area of the target to be recognized.

第九确定子模块3239,用于根据与目标人脸面积对应的参数信息,确定相邻两帧图像中后一帧图像中的待识别目标对应的高斯分布。The ninth determination sub-module 3239 is used to determine the Gaussian distribution corresponding to the target to be recognized in the latter frame of the two adjacent frames of images based on the parameter information corresponding to the target face area.

第十确定子模块32310,用于根据相邻两帧图像中后一帧图像中的待识别目标对应的高斯分布,确定相邻两帧图像中后一帧图像中的待识别目标。The tenth determination sub-module 32310 is used to determine the target to be recognized in the latter frame of the two adjacent frames of images according to the Gaussian distribution corresponding to the target to be recognized in the latter frame of the two adjacent images.

一个示例中,识别单元31,具体用于:In an example, the identification unit 31 is specifically used for:

根据预设的深度目标监测网络,分别对多帧图像进行识别处理,得到每一图像的参数列表;其中,深度目标检测网络用于指示目标识别信息,参数列表中包括图像中的待识别目标的参数信息。According to the preset depth target detection network, multiple frame images are recognized and processed respectively to obtain a parameter list for each image; among them, the depth target detection network is used to indicate the target identification information, and the parameter list includes the parameters of the target to be recognized in the image. Parameter information.

一个示例中,待识别目标对应的参数信息包括人脸信息及人体信息,其中,人脸信息包括人脸中心横坐标、人脸中心纵坐标、人脸宽度值、人脸高度值、以及人脸面积,人体信息包括人体中心横坐标、人体中心纵坐标、人体宽度值、人体高度值、以及人体面积。In one example, the parameter information corresponding to the target to be recognized includes face information and human body information, where the face information includes the abscissa of the face center, the ordinate of the face center, the face width value, the face height value, and the face Area, human body information includes the horizontal coordinate of the human body center, the vertical coordinate of the human body center, the human body width value, the human body height value, and the human body area.

本实施例的装置,可以执行上述方法中的技术方案,其具体实现过程和技术原理相同,此处不再赘述。The device of this embodiment can execute the technical solution in the above method. Its specific implementation process and technical principles are the same and will not be described again here.

图5为本申请实施例提供的一种电子设备的结构示意图,如图5所示,电子设备包括:存储器51,处理器52。FIG. 5 is a schematic structural diagram of an electronic device provided by an embodiment of the present application. As shown in FIG. 5 , the electronic device includes: a memory 51 and a processor 52 .

存储器51中存储有可在处理器52上运行的计算机程序。The memory 51 stores computer programs that can be run on the processor 52 .

处理器52被配置为执行如上述实施例提供的方法。The processor 52 is configured to perform the method as provided in the above embodiments.

电子设备还包括接收器53和发送器54。接收器53用于接收外部设备发送的指令和数据,发送器54用于向外部设备发送指令和数据。The electronic device also includes a receiver 53 and a transmitter 54 . The receiver 53 is used to receive instructions and data sent by the external device, and the transmitter 54 is used to send instructions and data to the external device.

图6是本申请实施例提供的一种电子设备的框图,该电子设备可以是移动电话,计算机,数字广播终端,消息收发设备,游戏控制台,平板设备,医疗设备,健身设备,个人数字助理等。Figure 6 is a block diagram of an electronic device provided by an embodiment of the present application. The electronic device may be a mobile phone, a computer, a digital broadcast terminal, a messaging device, a game console, a tablet device, a medical device, a fitness device, or a personal digital assistant. wait.

装置600可以包括以下一个或多个组件:处理组件602,存储器604,电源组件606,多媒体组件608,音频组件610,输入/输出(I/O)接口612,传感器组件614,以及通信组件616。Device 600 may include one or more of the following components: processing component 602, memory 604, power component 606, multimedia component 608, audio component 610, input/output (I/O) interface 612, sensor component 614, and communications component 616.

处理组件602通常控制装置600的整体操作,诸如与显示,电话呼叫,数据通信,相机操作和记录操作相关联的操作。处理组件602可以包括一个或多个处理器620来执行指令,以完成上述的方法的全部或部分步骤。此外,处理组件602可以包括一个或多个模块,便于处理组件602和其他组件之间的交互。例如,处理组件602可以包括多媒体模块,以方便多媒体组件608和处理组件602之间的交互。Processing component 602 generally controls the overall operations of device 600, such as operations associated with display, phone calls, data communications, camera operations, and recording operations. The processing component 602 may include one or more processors 620 to execute instructions to complete all or part of the steps of the above method. Additionally, processing component 602 may include one or more modules that facilitate interaction between processing component 602 and other components. For example, processing component 602 may include a multimedia module to facilitate interaction between multimedia component 608 and processing component 602.

存储器604被配置为存储各种类型的数据以支持在装置600的操作。这些数据的示例包括用于在装置600上操作的任何应用程序或方法的指令,联系人数据,电话簿数据,消息,图片,视频等。存储器604可以由任何类型的易失性或非易失性存储设备或者它们的组合实现,如静态随机存取存储器(SRAM),电可擦除可编程只读存储器(EEPROM),可擦除可编程只读存储器(EPROM),可编程只读存储器(PROM),只读存储器(ROM),磁存储器,快闪存储器,磁盘或光盘。Memory 604 is configured to store various types of data to support operations at device 600 . Examples of such data include instructions for any application or method operating on device 600, contact data, phonebook data, messages, pictures, videos, etc. Memory 604 may be implemented by any type of volatile or non-volatile storage device, or a combination thereof, such as static random access memory (SRAM), electrically erasable programmable read-only memory (EEPROM), erasable programmable read-only memory (EEPROM), Programmable read-only memory (EPROM), programmable read-only memory (PROM), read-only memory (ROM), magnetic memory, flash memory, magnetic or optical disk.

电源组件606为装置600的各种组件提供电力。电源组件606可以包括电源管理系统,一个或多个电源,及其他与为装置600生成、管理和分配电力相关联的组件。Power supply component 606 provides power to the various components of device 600. Power supply components 606 may include a power management system, one or more power supplies, and other components associated with generating, managing, and distributing power to device 600 .

多媒体组件608包括在装置600和用户之间的提供一个输出接口的屏幕。在一些实施例中,屏幕可以包括液晶显示器(LCD)和触摸面板(TP)。如果屏幕包括触摸面板,屏幕可以被实现为触摸屏,以接收来自用户的输入信号。触摸面板包括一个或多个触摸传感器以感测触摸、滑动和触摸面板上的手势。触摸传感器可以不仅感测触摸或滑动动作的边界,而且还检测与触摸或滑动操作相关的持续时间和压力。在一些实施例中,多媒体组件608包括一个前置摄像头和/或后置摄像头。当装置600处于操作模式,如拍摄模式或视频模式时,前置摄像头和/或后置摄像头可以接收外部的多媒体数据。每个前置摄像头和后置摄像头可以是一个固定的光学透镜系统或具有焦距和光学变焦能力。Multimedia component 608 includes a screen that provides an output interface between device 600 and the user. In some embodiments, the screen may include a liquid crystal display (LCD) and a touch panel (TP). If the screen includes a touch panel, the screen may be implemented as a touch screen to receive input signals from the user. The touch panel includes one or more touch sensors to sense touches, swipes, and gestures on the touch panel. A touch sensor can not only sense the boundaries of a touch or swipe action, but also detect the duration and pressure associated with the touch or swipe action. In some embodiments, multimedia component 608 includes a front-facing camera and/or a rear-facing camera. When the device 600 is in an operating mode, such as a shooting mode or a video mode, the front camera and/or the rear camera may receive external multimedia data. Each front-facing camera and rear-facing camera can be a fixed optical lens system or have a focal length and optical zoom capabilities.

音频组件610被配置为输出和/或输入音频信号。例如,音频组件610包括一个麦克风(MIC),当装置600处于操作模式,如呼叫模式、记录模式和语音识别模式时,麦克风被配置为接收外部音频信号。所接收的音频信号可以被进一步存储在存储器604或经由通信组件616发送。在一些实施例中,音频组件610还包括一个扬声器,用于输出音频信号。Audio component 610 is configured to output and/or input audio signals. For example, audio component 610 includes a microphone (MIC) configured to receive external audio signals when device 600 is in operating modes, such as call mode, recording mode, and speech recognition mode. The received audio signals may be further stored in memory 604 or sent via communications component 616 . In some embodiments, audio component 610 also includes a speaker for outputting audio signals.

I/O接口612为处理组件602和外围接口模块之间提供接口,上述外围接口模块可以是键盘,点击轮,按钮等。这些按钮可包括但不限于:主页按钮、音量按钮、启动按钮和锁定按钮。The I/O interface 612 provides an interface between the processing component 602 and a peripheral interface module, which may be a keyboard, a click wheel, a button, etc. These buttons may include, but are not limited to: Home button, Volume buttons, Start button, and Lock button.

传感器组件614包括一个或多个传感器,用于为装置600提供各个方面的状态评估。例如,传感器组件614可以检测到装置600的打开/关闭状态,组件的相对定位,例如组件为装置600的显示器和小键盘,传感器组件614还可以检测装置600或装置600一个组件的位置改变,用户与装置600接触的存在或不存在,装置600方位或加速/减速和装置600的温度变化。传感器组件614可以包括接近传感器,被配置用来在没有任何的物理接触时检测附近物体的存在。传感器组件614还可以包括光传感器,如CMOS或CCD图像传感器,用于在成像应用中使用。在一些实施例中,该传感器组件614还可以包括加速度传感器,陀螺仪传感器,磁传感器,压力传感器或温度传感器。Sensor component 614 includes one or more sensors for providing various aspects of status assessment for device 600 . For example, the sensor component 614 can detect the open/closed state of the device 600, the relative positioning of components, such as the display and keypad of the device 600, the sensor component 614 can also detect the position change of the device 600 or a component of the device 600, the user The presence or absence of contact with device 600, device 600 orientation or acceleration/deceleration and temperature changes of device 600. Sensor assembly 614 may include a proximity sensor configured to detect the presence of nearby objects without any physical contact. Sensor assembly 614 may also include a light sensor, such as a CMOS or CCD image sensor, for use in imaging applications. In some embodiments, the sensor component 614 may also include an acceleration sensor, a gyroscope sensor, a magnetic sensor, a pressure sensor, or a temperature sensor.

通信组件616被配置为便于装置600和其他设备之间有线或无线方式的通信。装置600可以接入基于通信标准的无线网络,如WiFi,2G或3G,或它们的组合。在一个示例性实施例中,通信组件616经由广播信道接收来自外部广播管理系统的广播信号或广播相关信息。在一个示例性实施例中,通信组件616还包括近场通信(NFC)模块,以促进短程通信。例如,在NFC模块可基于射频识别(RFID)技术,红外数据协会(IrDA)技术,超宽带(UWB)技术,蓝牙(BT)技术和其他技术来实现。Communication component 616 is configured to facilitate wired or wireless communication between apparatus 600 and other devices. Device 600 may access a wireless network based on a communication standard, such as WiFi, 2G or 3G, or a combination thereof. In one exemplary embodiment, the communication component 616 receives broadcast signals or broadcast related information from an external broadcast management system via a broadcast channel. In one exemplary embodiment, communications component 616 also includes a near field communications (NFC) module to facilitate short-range communications. For example, the NFC module can be implemented based on radio frequency identification (RFID) technology, infrared data association (IrDA) technology, ultra-wideband (UWB) technology, Bluetooth (BT) technology and other technologies.

在示例性实施例中,装置600可以被一个或多个应用专用集成电路(ASIC)、数字信号处理器(DSP)、数字信号处理设备(DSPD)、可编程逻辑器件(PLD)、现场可编程门阵列(FPGA)、控制器、微控制器、微处理器或其他电子元件实现,用于执行上述方法。In an exemplary embodiment, apparatus 600 may be implemented by one or more application specific integrated circuits (ASICs), digital signal processors (DSPs), digital signal processing devices (DSPDs), programmable logic devices (PLDs), field programmable Gate array (FPGA), controller, microcontroller, microprocessor or other electronic components are implemented for executing the above method.

在示例性实施例中,还提供了一种包括指令的非临时性计算机可读存储介质,例如包括指令的存储器604,上述指令可由装置600的处理器620执行以完成上述方法。例如,非临时性计算机可读存储介质可以是ROM、随机存取存储器(RAM)、CD-ROM、磁带、软盘和光数据存储设备等。In an exemplary embodiment, a non-transitory computer-readable storage medium including instructions, such as a memory 604 including instructions, which are executable by the processor 620 of the device 600 to complete the above method is also provided. For example, non-transitory computer-readable storage media may be ROM, random access memory (RAM), CD-ROM, magnetic tape, floppy disk, optical data storage device, etc.

本申请实施例还提供了一种非临时性计算机可读存储介质,当该存储介质中的指令由电子设备的处理器执行时,使得电子设备能够执行上述实施例提供的方法。Embodiments of the present application also provide a non-transitory computer-readable storage medium. When instructions in the storage medium are executed by a processor of an electronic device, the electronic device can execute the method provided by the above embodiments.

本申请实施例还提供了一种计算机程序产品,计算机程序产品包括:计算机程序,计算机程序存储在可读存储介质中,电子设备的至少一个处理器可以从可读存储介质读取计算机程序,至少一个处理器执行计算机程序使得电子设备执行上述任一实施例提供的方案。Embodiments of the present application also provide a computer program product. The computer program product includes: a computer program. The computer program is stored in a readable storage medium. At least one processor of the electronic device can read the computer program from the readable storage medium. At least A processor executes a computer program so that the electronic device executes the solution provided by any of the above embodiments.

本领域技术人员在考虑说明书及实践这里公开的发明后,将容易想到本公开的其它实施方案。本申请旨在涵盖本公开的任何变型、用途或者适应性变化,这些变型、用途或者适应性变化遵循本公开的一般性原理并包括本公开未公开的本技术领域中的公知常识或惯用技术手段。说明书和实施例仅被视为示例性的,本公开的真正范围和精神由下面的权利要求书指出。Other embodiments of the disclosure will be readily apparent to those skilled in the art from consideration of the specification and practice of the invention disclosed herein. This application is intended to cover any variations, uses, or adaptations of the disclosure that follow the general principles of the disclosure and include common knowledge or customary technical means in the technical field that are not disclosed in the disclosure. . It is intended that the specification and examples be considered as exemplary only, with a true scope and spirit of the disclosure being indicated by the following claims.

应当理解的是,本公开并不局限于上面已经描述并在附图中示出的精确结构,并且可以在不脱离其范围进行各种修改和改变。本公开的范围仅由所附的权利要求书来限制。It is to be understood that the present disclosure is not limited to the precise structures described above and illustrated in the accompanying drawings, and various modifications and changes may be made without departing from the scope thereof. The scope of the disclosure is limited only by the appended claims.

Claims (19)

1. An image processing method, comprising:
performing identification processing on multi-frame images to obtain a parameter list of each image; the parameter list comprises parameter information of an object to be identified in the image;
for two adjacent frame images in the multi-frame images, determining a target to be identified in a later frame image in the two adjacent frame images according to a parameter list of a former frame image in the two adjacent frame images and a parameter list of a later frame image in the two adjacent frame images; the target to be identified in the next frame of images in the two adjacent frames is the target to be identified corresponding to the target to be identified in the previous frame of images in the two adjacent frames;
generating an identification code corresponding to a target to be identified in the last frame of image in the multi-frame image, and generating target information corresponding to the identification code; the identification code characterizes the identity of the object to be identified, and the object information characterizes the times of preset actions sent by the object to be identified.
2. The method according to claim 1, wherein determining, for two adjacent frames of the multi-frame images, the object to be identified in a next frame of the two adjacent frames of images based on the parameter list of a previous frame of the two adjacent frames of images and the parameter list of a next frame of the two adjacent frames of images, comprises:
according to a parameter list of a first frame image in the multi-frame image, determining Gaussian distribution corresponding to a target to be identified in the first frame image; wherein the gaussian distribution characterizes position information of the object to be identified;
aiming at two adjacent frame images in the multi-frame images, carrying out probability calculation processing on a parameter list of a next frame image in the two adjacent frame images and Gaussian distribution corresponding to a target to be identified in a previous frame image in the two adjacent frame images to obtain probability result information corresponding to the target to be identified in the next frame image in the two adjacent frame images; the probability result information is used for indicating Gaussian distribution corresponding to a target to be identified in a later frame of images in two adjacent frames of images;
and determining the target to be identified in the next frame of images in the two adjacent frames of images according to the probability result information.
3. The method according to claim 2, wherein determining the target to be identified in the next image of the two adjacent images according to the probability result information comprises:
if the probability result information is determined to be a likelihood probability value, wherein the likelihood probability value represents parameter information of an object to be identified in a next frame image in two adjacent frame images and likelihood probability values between the parameter information and a plurality of Gaussian distributions in a previous frame image in the two adjacent frame images respectively, determining a maximum likelihood probability value corresponding to the object to be identified; the maximum likelihood probability value is used for indicating Gaussian distribution corresponding to a target to be identified in a later frame image in two adjacent frame images;
and determining the target to be identified in the next frame of images in the two adjacent frames according to the maximum likelihood probability value corresponding to the target to be identified.
4. A method according to claim 3, wherein the parameter information corresponding to the object to be identified comprises a face width value; the method further comprises the steps of:
according to preset first constraint condition information, determining a target to be identified in a later frame of images in two adjacent frames of images, and determining the distance between positions of Gaussian distribution corresponding to the target to be identified; the first constraint condition information characterizes position information corresponding to a target to be identified;
If the distance is determined to be larger than the face width value of the target to be identified, the Gaussian distribution corresponding to the target to be identified in the next frame of images in the two adjacent frames is replaced by the Gaussian distribution corresponding to the target to be identified in the previous frame of images in the two adjacent frames, and the Gaussian distribution corresponding to the target to be identified in the next frame of images in the two adjacent frames after updating is obtained;
and determining the target to be identified in the next frame of images in the two adjacent frames according to the Gaussian distribution corresponding to the target to be identified in the next frame of images in the two adjacent frames after updating.
5. The method according to claim 2, wherein determining the target to be identified in the next image of the two adjacent images according to the probability result information comprises:
if the probability result information is determined to be the likelihood probability value which is not obtained, wherein the likelihood probability value represents parameter information of the target to be identified in the next frame image in the two adjacent frame images and likelihood probability values between the parameter information and a plurality of Gaussian distributions in the previous frame image in the two adjacent frame images respectively, the target to be identified is determined to be a newly added target;
according to the parameter information corresponding to the newly-added target, determining Gaussian distribution corresponding to the newly-added target in the next frame of image in the two adjacent frames of images;
And determining the target to be identified in the next frame image in the two adjacent frame images according to the Gaussian distribution corresponding to the new target in the next frame image in the two adjacent frame images.
6. The method according to claim 2, wherein determining the target to be identified in the next image of the two adjacent images according to the probability result information comprises:
if the probability result information is determined to be that the parameter information corresponding to the Gaussian distribution is not obtained, determining that the target to be identified corresponding to the Gaussian distribution is in a vanishing state;
determining a preset number of targets to be identified adjacent to positions of Gaussian distribution and parameter information corresponding to the targets to be identified; the Gaussian distribution comprises a first face area, and the parameter information corresponding to the target to be identified comprises a second face area;
determining a target face area equal to the first face area in a preset number of second face areas according to preset second constraint condition information; wherein the second constraint condition information characterizes the pixel duty ratio of the face area of the object to be identified;
according to the parameter information corresponding to the target face area, determining Gaussian distribution corresponding to a target to be identified in a later frame of images in two adjacent frames of images;
And determining the target to be identified in the next frame of images in the two adjacent frames according to the Gaussian distribution corresponding to the target to be identified in the next frame of images in the two adjacent frames.
7. The method of claim 1, wherein the identifying of the plurality of images to obtain the parameter list for each of the images comprises:
respectively carrying out identification processing on multiple frames of images according to a preset depth target monitoring network to obtain a parameter list of each image; the depth target detection network is used for indicating target identification information, and the parameter list comprises parameter information of targets to be identified in the image.
8. The method according to any one of claims 1-7, wherein the parameter information corresponding to the object to be identified includes face information and human body information, wherein the face information includes a face center abscissa, a face center ordinate, a face width value, a face height value, and a face area, and the human body information includes a human body center abscissa, a human body center ordinate, a human body width value, a human body height value, and a human body area.
9. An image processing apparatus, comprising:
The identification unit is used for carrying out identification processing on the multi-frame images to obtain a parameter list of each image; the parameter list comprises parameter information of an object to be identified in the image;
the first determining unit is used for determining a target to be identified in a next frame image in the two adjacent frame images according to a parameter list of a previous frame image in the two adjacent frame images and a parameter list of a next frame image in the two adjacent frame images aiming at the two adjacent frame images; the target to be identified in the next frame of images in the two adjacent frames is the target to be identified corresponding to the target to be identified in the previous frame of images in the two adjacent frames;
the first generation unit is used for generating an identification code corresponding to the target to be identified in the last frame of image in the multi-frame images;
a second generation unit, configured to generate target information corresponding to the identification code; the identification code characterizes the identity of the object to be identified, and the object information characterizes the times of preset actions sent by the object to be identified.
10. The apparatus according to claim 9, wherein the first determining unit includes:
The first determining module is used for determining Gaussian distribution corresponding to a target to be identified in a first frame image according to a parameter list of the first frame image in the multi-frame image; wherein the gaussian distribution characterizes position information of the object to be identified;
the computing module is used for carrying out probability computing processing on a parameter list of a next frame image in the two adjacent frame images and Gaussian distribution corresponding to a target to be identified in a previous frame image in the two adjacent frame images aiming at the two adjacent frame images in the multi-frame image to obtain probability result information corresponding to the target to be identified in the next frame image in the two adjacent frame images; the probability result information is used for indicating Gaussian distribution corresponding to a target to be identified in a later frame of images in two adjacent frames of images;
and the second determining module is used for determining the target to be identified in the next frame of image in the two adjacent frames of images according to the probability result information.
11. The apparatus of claim 10, wherein the second determining module comprises:
the first determining submodule is used for determining a maximum likelihood probability value corresponding to the object to be identified if the probability result information is determined to be the likelihood probability value, wherein the likelihood probability value represents parameter information of the object to be identified in a later frame image in two adjacent frame images and likelihood probability values between the parameter information and a plurality of Gaussian distributions in the previous frame image in the two adjacent frame images respectively; the maximum likelihood probability value is used for indicating Gaussian distribution corresponding to a target to be identified in a later frame image in two adjacent frame images;
And the second determining submodule is used for determining the target to be identified in the next frame of images in the two adjacent frames of images according to the maximum likelihood probability value corresponding to the target to be identified.
12. The apparatus according to claim 11, wherein the parameter information corresponding to the object to be identified includes a face width value; the apparatus further comprises:
the second determining unit is used for determining a target to be identified in a next frame of images in two adjacent frames according to preset first constraint condition information, and the distance between positions of Gaussian distribution corresponding to the target to be identified; the first constraint condition information characterizes position information corresponding to a target to be identified;
the updating unit is used for replacing the Gaussian distribution corresponding to the target to be identified in the next frame of images in the next two frames of images with the Gaussian distribution corresponding to the target to be identified in the previous frame of images in the next two frames of images if the distance is determined to be larger than the face width value of the target to be identified, so that the Gaussian distribution corresponding to the target to be identified in the next frame of images in the next two frames of images after updating is obtained;
and the third determining unit is used for determining the target to be identified in the next frame image in the two adjacent frame images according to the Gaussian distribution corresponding to the target to be identified in the next frame image in the two adjacent frame images after updating.
13. The apparatus of claim 10, wherein the second determining module comprises:
a third determining sub-module, configured to determine that the object to be identified is an added object if the probability result information is determined that the likelihood probability value is not obtained, where the likelihood probability value characterizes parameter information of the object to be identified in a next frame of image in two adjacent frames of images, and likelihood probability values between the parameter information and a plurality of gaussian distributions in a previous frame of image in two adjacent frames of images respectively;
a fourth determining submodule, configured to determine gaussian distribution corresponding to a new added target in a next frame of image in two adjacent frames of images according to parameter information corresponding to the new added target;
and a fifth determining submodule, configured to determine a target to be identified in a next frame image in two adjacent frame images according to gaussian distribution corresponding to a new target in the next frame image in the two adjacent frame images.
14. The apparatus of claim 10, wherein the second determining module comprises:
a sixth determining submodule, configured to determine that a target to be identified corresponding to the gaussian distribution is in a vanishing state if it is determined that the probability result information is parameter information corresponding to the gaussian distribution;
A seventh determining submodule, configured to determine a preset number of targets to be identified adjacent to positions where gaussian distributions are located, and parameter information corresponding to the targets to be identified; the Gaussian distribution comprises a first face area, and the parameter information corresponding to the target to be identified comprises a second face area;
an eighth determining submodule, configured to determine a target face area equal to the first face area among a preset number of second face areas according to preset second constraint condition information; wherein the second constraint condition information characterizes the pixel duty ratio of the face area of the object to be identified;
a ninth determining submodule, configured to determine gaussian distribution corresponding to a target to be identified in a later frame of images in two adjacent frames of images according to parameter information corresponding to the target face area;
and the tenth determination submodule is used for determining the target to be identified in the next frame image in the two adjacent frame images according to the Gaussian distribution corresponding to the target to be identified in the next frame image in the two adjacent frame images.
15. The device according to claim 9, characterized in that said identification unit is in particular adapted to:
respectively carrying out identification processing on multiple frames of images according to a preset depth target monitoring network to obtain a parameter list of each image; the depth target detection network is used for indicating target identification information, and the parameter list comprises parameter information of targets to be identified in the image.
16. The apparatus according to any one of claims 9-15, wherein the parameter information corresponding to the object to be identified includes face information and human body information, wherein the face information includes a face center abscissa, a face center ordinate, a face width value, a face height value, and a face area, and the human body information includes a human body center abscissa, a human body center ordinate, a human body width value, a human body height value, and a human body area.
17. An electronic device comprising a memory, a processor, the memory having stored therein a computer program executable on the processor, the processor implementing the method of any of the preceding claims 1-8 when the computer program is executed.
18. A computer readable storage medium having stored therein computer executable instructions which when executed by a processor are adapted to carry out the method of any one of claims 1-8.
19. A computer program product comprising a computer program which, when executed by a processor, implements the method of any of claims 1-8.
CN202210464887.4A 2022-04-29 2022-04-29 Image processing method, device and equipment Pending CN117036418A (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN202210464887.4A CN117036418A (en) 2022-04-29 2022-04-29 Image processing method, device and equipment
PCT/CN2023/091396 WO2023208154A1 (en) 2022-04-29 2023-04-27 Image processing method, apparatus and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210464887.4A CN117036418A (en) 2022-04-29 2022-04-29 Image processing method, device and equipment

Publications (1)

Publication Number Publication Date
CN117036418A true CN117036418A (en) 2023-11-10

Family

ID=88517868

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210464887.4A Pending CN117036418A (en) 2022-04-29 2022-04-29 Image processing method, device and equipment

Country Status (2)

Country Link
CN (1) CN117036418A (en)
WO (1) WO2023208154A1 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN119380262B (en) * 2024-09-05 2025-10-14 北京城建智控科技股份有限公司 Number determination method, device, equipment and storage medium

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2013823A1 (en) * 2006-03-28 2009-01-14 The University Court Of The University Of Edinburgh A method for automatically characterizing the behavior of one or more objects
CN101369346B (en) * 2007-08-13 2010-09-15 北京航空航天大学 A Tracking Method of Adaptive Window for Video Moving Target
CN109829435B (en) * 2019-01-31 2023-04-25 深圳市商汤科技有限公司 Video image processing method, device and computer readable medium
CN111444817B (en) * 2020-03-24 2023-07-07 咪咕文化科技有限公司 Character image recognition method and device, electronic equipment and storage medium

Also Published As

Publication number Publication date
WO2023208154A1 (en) 2023-11-02

Similar Documents

Publication Publication Date Title
RU2577188C1 (en) Method, apparatus and device for image segmentation
EP3125158B1 (en) Method and device for displaying images
CN105528606B (en) Area recognizing method and device
CN106651955B (en) Method and device for positioning target object in picture
CN109522910B (en) Key point detection method and device, electronic device and storage medium
CN105095881B (en) Face recognition method, face recognition device and terminal
EP3333844B1 (en) Display operation method and apparatus, computer program and recording medium
CN111105454A (en) A method, device and medium for obtaining positioning information
CN107944367B (en) Face key point detection method and device
CN107784279B (en) Target tracking method and device
CN106355573A (en) Target object positioning method and device in pictures
CN107169493A (en) information identifying method and device
CN105631803A (en) Method and device for filter processing
CN113920083A (en) Image-based dimension measurement method, device, electronic device, and storage medium
CN105975961B (en) The method, apparatus and terminal of recognition of face
CN107958223A (en) Face identification method and device, mobile equipment, computer-readable recording medium
US9665925B2 (en) Method and terminal device for retargeting images
CN113627277A (en) Method and device for identifying parking space
CN109934168B (en) Face image mapping method and device
WO2023208154A1 (en) Image processing method, apparatus and device
CN108491177A (en) Space appraisal procedure and device
WO2020233201A1 (en) Icon position determination method and device
CN107992894B (en) Image recognition method, device and computer-readable storage medium
CN105635573A (en) Pick-up head visual angle adjusting method and apparatus
CN107239758B (en) Method and device for positioning key points of human face

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination