[go: up one dir, main page]

CN118736683A - A method and system for monitoring operator violations based on a control ball - Google Patents

A method and system for monitoring operator violations based on a control ball Download PDF

Info

Publication number
CN118736683A
CN118736683A CN202411232701.8A CN202411232701A CN118736683A CN 118736683 A CN118736683 A CN 118736683A CN 202411232701 A CN202411232701 A CN 202411232701A CN 118736683 A CN118736683 A CN 118736683A
Authority
CN
China
Prior art keywords
frame image
target frame
key points
key
calculating
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202411232701.8A
Other languages
Chinese (zh)
Other versions
CN118736683B (en
Inventor
杨庭
王磊
严俊
陈闻
翁伟兵
范雄
李万鹏
林聪�
王宇飞
徐遥
李彭昊
曾卓
高俊
王博
易阳
张丽君
王红岩
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hubei Central China Technology Development Of Electric Power Co ltd
Original Assignee
Hubei Central China Technology Development Of Electric Power Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hubei Central China Technology Development Of Electric Power Co ltd filed Critical Hubei Central China Technology Development Of Electric Power Co ltd
Priority to CN202411232701.8A priority Critical patent/CN118736683B/en
Publication of CN118736683A publication Critical patent/CN118736683A/en
Application granted granted Critical
Publication of CN118736683B publication Critical patent/CN118736683B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/20Movements or behaviour, e.g. gesture recognition
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/042Knowledge-based neural networks; Logical representations of neural networks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/0464Convolutional networks [CNN, ConvNet]
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/764Arrangements for image or video recognition or understanding using pattern recognition or machine learning using classification, e.g. of video objects
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/82Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/40Scenes; Scene-specific elements in video content
    • G06V20/46Extracting features or characteristics from the video content, e.g. video fingerprints, representative shots or key frames

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Evolutionary Computation (AREA)
  • General Health & Medical Sciences (AREA)
  • Computing Systems (AREA)
  • Multimedia (AREA)
  • Artificial Intelligence (AREA)
  • Software Systems (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Biomedical Technology (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Mathematical Physics (AREA)
  • Computational Linguistics (AREA)
  • Biophysics (AREA)
  • Medical Informatics (AREA)
  • Molecular Biology (AREA)
  • General Engineering & Computer Science (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Human Computer Interaction (AREA)
  • Social Psychology (AREA)
  • Psychiatry (AREA)
  • Image Analysis (AREA)

Abstract

本发明涉及图像处理领域,本发明涉及一种基于布控球的作业人员违章监测方法及系统,方法包括:对视频分帧;计算目标帧图像的重要程度,重要程度与目标帧图像中人体骨架的关键点的动作显著性正相关;计算第二序列中目标帧图像与其他帧图像的重要程度的差值之和并归一化处理,当归一化后的差值之和小于等于设置的阈值时,将该目标帧图像划分为关键帧;遍历得到所有关键帧;将所有关键帧中的关键点输入到训练好的时空图卷积网络中,输出关键帧中人体动作的分类结果。本发明通过分帧处理并计算每一帧的重要程度,进而筛选出关键帧,并整合深度学习技术监测违章行为,从而减少了需要审核的帧数量,提高了监测的效率和准确性。

The present invention relates to the field of image processing, and the present invention relates to a method and system for monitoring violations of operating personnel based on a control ball, the method comprising: framing a video; calculating the importance of a target frame image, the importance being positively correlated with the action significance of key points of a human skeleton in the target frame image; calculating the sum of the differences in importance between a target frame image and other frame images in a second sequence and normalizing them, and when the sum of the normalized differences is less than or equal to a set threshold, dividing the target frame image into key frames; traversing to obtain all key frames; inputting the key points in all key frames into a trained spatiotemporal graph convolutional network, and outputting the classification results of human actions in the key frames. The present invention processes by framing and calculating the importance of each frame, thereby screening out key frames, and integrating deep learning technology to monitor violations, thereby reducing the number of frames that need to be reviewed and improving the efficiency and accuracy of monitoring.

Description

一种基于布控球的作业人员违章监测方法及系统A method and system for monitoring operator violations based on a control ball

技术领域Technical Field

本发明涉及图像处理领域。本发明涉及一种基于布控球的作业人员违章监测方法及系统。The present invention relates to the field of image processing and a method and system for monitoring violations of operating personnel based on a control ball.

背景技术Background Art

随着施工行业的飞速发展,对于作业人员的安全防护成为了重中之重。但是由于大多数作业人员对于施工过程中的安全防护意识淡薄,而且施工过程中的安全隐患往往不易被人所察觉,如果作业人员没有遵守施工场地的安全条例,如:相互斗殴,非法攀爬,破坏公物等,引发的后果难以预料。With the rapid development of the construction industry, the safety protection of workers has become a top priority. However, since most workers have a weak awareness of safety protection during construction, and the potential safety hazards during construction are often not easily noticed, if workers fail to comply with the safety regulations of the construction site, such as fighting each other, illegal climbing, and damaging public property, the consequences are unpredictable.

布控球作为一种高集成度的监控产品,在施工行业中起到了至关重要的作用。布控球系统可以集成智能分析算法,自动识别作业人员违规行为,如未佩戴安全帽、非法攀爬、斗殴、擅自操作设备等,并立即触发报警,以便管理人员及时干预。As a highly integrated monitoring product, the control ball plays a vital role in the construction industry. The control ball system can integrate intelligent analysis algorithms to automatically identify violations by operators, such as not wearing a safety helmet, illegal climbing, fighting, unauthorized operation of equipment, etc., and immediately trigger an alarm so that management personnel can intervene in time.

公开号为CN117253329A的专利申请文件公开了一种基于发电厂生产区域人员违章监测预警方法及系统。该方法包括:安装监控摄像头对人员活动进行数据采样;利用视频分析技术对人体姿态以及动作进行实时分析;所述利用视频分析技术对人体姿态以及动作进行实时分析是通过循环神经网络实现,所述循环神经网络是通过时间循环和隐藏状态传递来建设输入序列的动作和行为信息,并捕捉到时间上的依赖关系,从而实现对动作的识别和分析;根据违章规则对人员进行违章判定;将违章结果进行告警通知。The patent application document with publication number CN117253329A discloses a method and system for monitoring and warning of personnel violations in the production area of a power plant. The method includes: installing surveillance cameras to sample data on personnel activities; using video analysis technology to perform real-time analysis of human posture and movements; the real-time analysis of human posture and movements using video analysis technology is achieved through a recurrent neural network, which constructs the action and behavior information of the input sequence through time loops and hidden state transfers, and captures the temporal dependencies, thereby realizing the recognition and analysis of actions; judging personnel violations according to violation rules; and notifying the violation results.

然而上述利用视频分析技术时,需要大量的计算资源,特别是在高分辨率和高帧率的视频处理中,同时在模型训练之前需要获取和标注大量的数据,不仅耗时耗力,还需要强大的计算设备支持。However, the use of the above-mentioned video analysis technology requires a large amount of computing resources, especially in high-resolution and high-frame-rate video processing. At the same time, a large amount of data needs to be acquired and labeled before model training, which is not only time-consuming and labor-intensive, but also requires powerful computing equipment support.

发明内容Summary of the invention

为解决上述需要大量的计算资源,且准备数据耗时耗力的技术问题,本发明在如下的多个方面中提供方案。In order to solve the above technical problems that a large amount of computing resources are required and data preparation is time-consuming and labor-intensive, the present invention provides solutions in the following aspects.

在第一方面中,一种基于布控球的作业人员违章监测方法,包括:In a first aspect, a method for monitoring operator violations based on a control ball comprises:

对布控球拍摄的视频进行分帧和预处理,得到每一帧图像;The video shot by the control ball is divided into frames and pre-processed to obtain each frame image;

计算目标帧图像的重要程度,所述目标帧图像为任意一帧图像;所述重要程度与所述目标帧图像中人体骨架的关键点的动作显著性正相关;Calculating the importance of a target frame image, the target frame image being any frame image; the importance being positively correlated with the motion significance of key points of a human skeleton in the target frame image;

以目标帧图像为中心,截取临近的若干帧图像构建第二序列,计算、第二序列中目标帧图像与其他帧图像的重要程度的差值之和并归一化处理,当归一化后的差值之和小于等于设置的阈值时,将该目标帧图像划分为关键帧;遍历得到所有关键帧;Taking the target frame image as the center, intercepting several adjacent frame images to construct a second sequence, calculating and normalizing the sum of the importance differences between the target frame image and other frame images in the second sequence, and when the sum of the normalized differences is less than or equal to the set threshold, the target frame image is divided into a key frame; traversing to obtain all key frames;

将所有关键帧中的关键点输入到训练好的时空图卷积网络中,输出关键帧中人体动作的分类结果。The key points in all key frames are input into the trained spatiotemporal graph convolutional network, and the classification results of human actions in the key frames are output.

有益效果:本发明首先对视频进行分帧和预处理,再通过计算每帧图像中人体骨架的关键点的动作显著性来评估对应帧的图像的重要程度,进而有效地从大量连续视频数据中筛选出具有潜在违章行为的关键帧,减少漏检和误报的情况,并整合深度学习技术监测违章行为,从而减少了需要审核的帧数量,提高了监测的效率和准确性。Beneficial effects: The present invention first divides the video into frames and pre-processes it, and then evaluates the importance of the image of the corresponding frame by calculating the motion significance of the key points of the human skeleton in each frame, thereby effectively screening out key frames with potential violations from a large amount of continuous video data, reducing missed detections and false alarms, and integrating deep learning technology to monitor violations, thereby reducing the number of frames that need to be reviewed and improving monitoring efficiency and accuracy.

优选地,所述重要程度满足关系式:Preferably, the importance satisfies the relationship:

;式中,为目标帧图像的重要程度,为目标帧图像中第个关键点的动作显著性,为第个关键点对应的权重,为目标帧图像中关键点的数量,表示归一化处理。 ; In the formula, is the importance of the target frame image, is the first The action significance of the key points, For the The weight corresponding to the key point is is the number of key points in the target frame image, Indicates normalization processing.

有益效果:考虑到每个关键点的动作显著性,确保了图像中每个关键点的动作对总体的重要程度的贡献都能被充分体现,有助于捕捉到微小但关键的动作变化,从而提高违章行为的识别率。Beneficial effects: Taking into account the motion significance of each key point, it ensures that the contribution of the motion of each key point in the image to the overall importance can be fully reflected, which helps to capture small but critical motion changes, thereby improving the recognition rate of violations.

优选地,所述重要程度为所有关键点的动作显著性与其对应的权重乘积的最大值。Preferably, the importance is the maximum value of the product of the action significance of all key points and their corresponding weights.

有益效果:为不同的关键点设置权重,选取最大加权动作显著性作为重要程度的衡量标准,由于重点关注最显著的动作,即使是短暂的违规动作也难以逃过系统的监测,从而减少了漏检的可能性。Beneficial effect: Weights are set for different key points, and the maximum weighted action significance is selected as the measure of importance. Since the focus is on the most significant actions, even short-term illegal actions are difficult to escape the monitoring of the system, thus reducing the possibility of missed detection.

优选地,所述动作显著性的获取过程包括:Preferably, the process of acquiring the action saliency includes:

计算第个关键点的轨迹变化率和相对变化率,将轨迹变化率和相对变化率加权求和,得到所述动作显著性;Calculate the The trajectory change rate and relative change rate of the key points are weighted and summed to obtain the action significance;

其中,以目标帧图像为中心,截取一定帧的图像构建第一序列,计算第一序列中第个关键点在目标帧图像中分别与第个关键点在其他帧图像中的速率之和的平均值,将所述平均值作为所述轨迹变化率;Among them, taking the target frame image as the center, intercepting a certain frame of images to construct the first sequence, and calculating the The key points in the target frame image are respectively The average value of the sum of the rates of the key points in other frame images is used as the trajectory change rate;

其中,计算第个关键点与目标帧图像中其他所有关键点的第一平均距离,计算第个关键点在第一序列中其他帧图像中的第二平均距离,计算第一平均距离与所有第二平均距离的方差,将所述方差作为相对变化率。Among them, calculate the The first average distance between the key point and all other key points in the target frame image is calculated. The second average distances of the key points in other frame images in the first sequence are calculated, and the variance of the first average distance and all the second average distances is calculated, and the variance is used as the relative change rate.

有益效果:计算速率之和的平均值和平均距离的方差有助于捕捉即使是微小的动作变化,增强对异常行为的敏感性。Beneficial Effects: Calculating the mean of the sum of velocities and the variance of the mean distance helps capture even small changes in motion and increases sensitivity to abnormal behavior.

优选地,所述动作显著性的获取过程包括:Preferably, the process of acquiring the action saliency includes:

使用光流法计算关键点在连续帧之间的运动向量,所述运动向量包括向量幅度;Calculate the motion vector of the key point between consecutive frames using an optical flow method, wherein the motion vector includes a vector magnitude;

计算运动向量幅度的方差,将所述方差作为所述动作显著性。The variance of the motion vector magnitude is calculated, and the variance is used as the motion saliency.

优选地,使用光流法计算关键点在连续帧之间的运动向量,所述运动向量包括向量幅度;计算运动向量幅度的平均值,将所述运动向量幅度的平均值作为所述动作显著性。Preferably, an optical flow method is used to calculate a motion vector of a key point between consecutive frames, wherein the motion vector includes a vector magnitude; an average value of the motion vector magnitudes is calculated, and the average value of the motion vector magnitudes is used as the action saliency.

有益效果:光流法能够准确捕捉关键点在连续帧之间的动态变化,无论是计算方差还是平均值,都可以轻松适应不同的监测环境和需求。Beneficial effects: The optical flow method can accurately capture the dynamic changes of key points between consecutive frames. Whether calculating the variance or the average value, it can easily adapt to different monitoring environments and needs.

优选地,所述时空图卷积网络在训练过程中使用交叉熵损失函数。Preferably, the spatiotemporal graph convolutional network uses a cross entropy loss function during training.

第二方面,一种基于布控球的作业人员违章监测系统,包括:处理器和存储器,所述存储器存储有计算机程序指令,当所述计算机程序指令被所述处理器执行时实现任一项所述的基于布控球的作业人员违章监测方法。In a second aspect, a system for monitoring violations of operating personnel based on a control ball is provided, comprising: a processor and a memory, wherein the memory stores computer program instructions, and when the computer program instructions are executed by the processor, any one of the methods for monitoring violations of operating personnel based on a control ball is implemented.

附图说明BRIEF DESCRIPTION OF THE DRAWINGS

通过参考附图阅读下文的详细描述,本发明示例性实施方式的上述以及其他目的、特征和优点将变得易于理解。在附图中,以示例性而非限制性的方式示出了本发明的若干实施方式,并且相同或对应的标号表示相同或对应的部分,其中:By reading the following detailed description with reference to the accompanying drawings, the above and other objects, features and advantages of the exemplary embodiments of the present invention will become readily understood. In the accompanying drawings, several embodiments of the present invention are shown in an exemplary and non-limiting manner, and the same or corresponding reference numerals represent the same or corresponding parts, wherein:

图1是本发明实施例一种基于布控球的作业人员违章监测方法中步骤S1-步骤S4的方法流程图。FIG. 1 is a method flow chart of steps S1 to S4 in a method for monitoring operator violations based on a control ball according to an embodiment of the present invention.

具体实施方式DETAILED DESCRIPTION

下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的范围。The following will be combined with the drawings in the embodiments of the present invention to clearly and completely describe the technical solutions in the embodiments of the present invention. Obviously, the described embodiments are part of the embodiments of the present invention, not all of the embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those skilled in the art without creative work are within the scope of protection of the present invention.

下面结合附图来详细描述本发明的具体实施方式。The specific embodiments of the present invention are described in detail below with reference to the accompanying drawings.

参照图1,一种基于布控球的作业人员违章监测方法包括步骤S1-步骤S4,具体如下:1 , a method for monitoring operator violations based on a control ball includes steps S1 to S4, which are as follows:

S1:对布控球拍摄的视频进行分帧和预处理,得到每一帧图像。S1: Frame and pre-process the video shot by the control ball to obtain each frame image.

布控球可以实时监控作业人员的行为和姿态,及时发现潜在的违章行为或不安全动作。The control ball can monitor the behavior and posture of workers in real time, and promptly detect potential violations or unsafe actions.

在本发明实施例中,根据实际情况安装布控球在作业环境中,且能够拍摄到作业人员的姿态视频,通过将视频分解为单独的帧,可以更有针对性地分析关键帧,而不是对整个视频序列进行处理,从而提高数据处理的效率。In an embodiment of the present invention, a control ball is installed in the working environment according to actual conditions, and can capture a video of the posture of the workers. By decomposing the video into separate frames, key frames can be analyzed more specifically instead of processing the entire video sequence, thereby improving data processing efficiency.

由于布控球工作时间较长,且所处的作业环境较为复杂,因此在分帧后的图像中可能含有噪声等影响质量的因素,因此,需要对每一帧图像进行预处理。Since the control ball takes a long time to work and the working environment is relatively complex, the framed image may contain noise and other factors that affect the quality. Therefore, each frame image needs to be preprocessed.

具体地,将布控球采集的视频文件输入到OpenCV(Open Source Computer VisionLibrary)中,并对视频进行分帧,得到每一帧图像;Specifically, the video file collected by the control ball is input into OpenCV (Open Source Computer Vision Library), and the video is divided into frames to obtain each frame image;

使用滤波器如高斯滤波去除每一帧图像中的噪声,使用对比度增强算法提升图像的质量。Use filters such as Gaussian filtering to remove noise from each frame of the image, and use contrast enhancement algorithms to improve the quality of the image.

S2:计算目标帧图像的重要程度,所述目标帧图像为任意一帧图像;所述重要程度与所述目标帧图像中人体骨架的关键点的动作显著性正相关。S2: Calculate the importance of a target frame image, where the target frame image is any frame image; the importance is positively correlated with the motion significance of key points of a human skeleton in the target frame image.

在视频分析中,关键点通常指的是图像中显著的、具有特征的点,这些点可以在图像间进行跟踪,用以分析物体的运动、姿态或其他属性。对于人体而言,关键点一般指人体的关节位置,如头部、肩膀、肘部、手腕、髋部、膝盖和脚踝等。在分析时,不仅要考虑单个关键点的运动,还要考虑到关键点之间的相对运动。In video analysis, key points usually refer to prominent and characteristic points in an image, which can be tracked between images to analyze the motion, posture or other attributes of an object. For the human body, key points generally refer to the joint positions of the human body, such as the head, shoulders, elbows, wrists, hips, knees and ankles. When analyzing, not only the movement of individual key points should be considered, but also the relative movement between key points.

COCO(Common Objects in Context)数据集COCO数据集包含91类目标,涵盖了超过80个类别。这些类别包括人、各种动物、交通工具、家具等,不仅用于目标检测,还涉及图像描述生成、人体姿态估计等任务,具有非常高的多样性和应用价值。具体地,首先使用COCO(Common Objects in Context)数据集中的Pose Estimation模型对每一帧图像进行处理,以检测图像中的人体关键点。COCO (Common Objects in Context) dataset The COCO dataset contains 91 types of targets, covering more than 80 categories. These categories include people, various animals, vehicles, furniture, etc., which are not only used for target detection, but also involve tasks such as image description generation and human pose estimation. It has very high diversity and application value. Specifically, the Pose Estimation model in the COCO (Common Objects in Context) dataset is first used to process each frame of the image to detect the key points of the human body in the image.

在本发明实施例中,以每一帧图像最左下角的像素点作为坐标原点,建立直角坐标系,x轴水平向右,y轴垂直向上。In the embodiment of the present invention, a rectangular coordinate system is established with the pixel point at the lower left corner of each frame of image as the coordinate origin, with the x-axis pointing horizontally to the right and the y-axis pointing vertically upward.

为了捕捉动作变化的细节,选择任意一帧图像作为目标帧图像,以目标帧图像为中心,截取一定帧的图像构建第一序列,示例性的,以目标帧图像为中心,按照时间顺序向前向后均截取一帧图像构建第一序列。In order to capture the details of the action changes, any frame image is selected as the target frame image, and a certain frame of images is captured with the target frame image as the center to construct the first sequence. For example, with the target frame image as the center, one frame of images is captured forward and backward in chronological order to construct the first sequence.

示例性的,设定拍摄的视频的帧率为,则每一帧之间的时间间隔为,且每一帧图像中人体的关键点均完整无遮挡。以第帧目标帧图像为例,则第一序列中其他两帧图像分别标记为第帧图像、第帧图像,获取目标帧图像中所有关键点的坐标,其中,选取第个关键点为例,第个关键点的坐标为,同理,获得第个关键点在另外两帧图像中对应的坐标,分别标记为For example, the frame rate of the captured video is set to , then the time interval between each frame is , and the key points of the human body in each frame are complete and unobstructed. The target frame image is taken as an example, and the other two frames in the first sequence are marked as Frame image, Frame image, obtain the coordinates of all key points in the target frame image, among which, select the first For example, the key point The coordinates of the key points are , similarly, get The corresponding coordinates of the key points in the other two frames are marked as , .

则计算第个关键点的轨迹变化率,满足关系式如下:Then calculate the The trajectory change rate of the key points satisfies the following relationship:

式中,为第帧目标帧图像中第个关键点的轨迹变化率,表示第帧图像中的第个关键点与第帧目标帧图像中的第个关键点之间的距离,表示第帧图像中的第个关键点与第帧目标帧图像中的第个关键点之间的距离。In the formula, For the Frame target frame image The trajectory change rate of key points, Indicates The first The key points and The first frame in the target frame image The distance between key points, Indicates The first The key points and The first frame in the target frame image The distance between key points.

其中,反映了连续三帧图像之间同一关键点的速率之和。in, It reflects the sum of the rates of the same key point between three consecutive frames.

需要说明的是,在计算出第个关键点的轨迹变化率之后,还可以进行归一化处理。It should be noted that in calculating the After calculating the trajectory change rate of each key point, normalization can be performed.

另外,计算在该目标帧图像中第个关键点与其他关键点之间的欧氏距离,并将计算的所有欧氏距离相加求和取平均值作为第一平均距离,同理计算分别得到第个关键点在另外两帧图像中的第二平均距离,计算第一平均距离和两个第二平均距离的方差,将计算得到的方差作为第个关键点的相对变化率。In addition, calculate the first The Euclidean distance between the key point and other key points is calculated, and the sum of all the calculated Euclidean distances is taken as the first average distance. Similarly, the calculations are respectively The second average distance of the key point in the other two frames of images, calculate the variance of the first average distance and the two second average distances, and use the calculated variance as the first The relative rate of change of a key point.

将上述计算得到的轨迹变化率和相对变化率加权求和,从而得到第个关键点的动作显著性,该动作显著性满足关系式如下:The trajectory change rate and relative change rate calculated above are weighted and summed to obtain the first The action significance of the key points satisfies the following relationship:

式中,表示在第帧目标帧图像中第个关键点的动作显著性,表示在第帧目标帧图像中第个关键点的轨迹变化率,表示在第帧目标帧图像中第个关键点的相对变化率,均为权重。In the formula, Indicated in Frame target frame image The action significance of the key points, Indicated in Frame target frame image The trajectory change rate of key points, Indicated in Frame target frame image The relative change rate of the key points, , All are weights.

在另一实施例中,在第帧目标帧图像中第个关键点的动作显著性还满足关系式如下:In another embodiment, in the Frame target frame image The action salience of key points It also satisfies the following relationship:

式中,表示在第帧目标帧图像中第个关键点的动作显著性,表示在第帧目标帧图像中第个关键点的轨迹变化率,表示在第帧目标帧图像中第个关键点的相对变化率,表示归一化处理,为逻辑斯蒂函数,将输入映射到一个介于0和1之间的值,输入值越大,输出值越接近1,表示动作显著性越高。In the formula, Indicated in Frame target frame image The action significance of the key points, Indicated in Frame target frame image The trajectory change rate of key points, Indicated in Frame target frame image The relative change rate of the key points, represents normalization processing, It is a logistic function that maps the input to a value between 0 and 1. The larger the input value, the closer the output value is to 1, indicating that the action significance is higher.

在另一实施例中,可以使用光流法计算第个关键点在连续帧之间的运动向量,包括幅度和方向,计算运动向量幅度的平均值或者方差并作为第个关键点的动作显著性。平均值或者方差越大,意味着运动更显著。In another embodiment, the optical flow method can be used to calculate the The motion vector of the key point between consecutive frames, including the magnitude and direction, calculates the average or variance of the motion vector magnitude and uses it as the first The larger the mean or variance, the more significant the motion.

根据上述计算第个关键点的动作显著性的方式同理可计算出第帧目标帧图像中其他所有关键点的动作显著性。进而可计算出第帧目标帧图像的重要程度,满足关系式如下:According to the above calculation The action significance of the key point can be calculated in the same way. The action significance of all other key points in the target frame image can be calculated. The importance of the target frame image satisfies the following relationship:

;式中,为第帧目标帧图像的重要程度,为第帧目标帧图像中第个关键点的动作显著性,为第个关键点对应的权重,为目标帧图像中关键点的数量,表示归一化处理。其中,动作显著性越大,越高,表明该目标帧图像越有可能包含重要的动作或事件。 ; In the formula, For the The importance of the frame target frame image, For the Frame target frame image The action significance of the key points, For the The weight corresponding to the key point is is the number of key points in the target frame image, represents the normalization process. Among them, the action saliency The bigger, The higher the value, the more likely the target frame image is to contain important actions or events.

需要说明的是,可根据经验设置。It should be noted that Can be set based on experience.

在另一实施例中,还可以将所有关键点的动作显著性与其对应的权重乘积的最大值作为该目标帧图像的重要程度。In another embodiment, the maximum value of the product of the action significance of all key points and their corresponding weights may be used as the importance of the target frame image.

S3:以目标帧图像为中心,截取临近的若干帧图像构建第二序列,计算第二序列中目标帧图像与其他帧图像的重要程度的差值之和并归一化处理,当归一化后的差值之和小于等于设置的阈值时,将该目标帧图像划分为关键帧;遍历得到所有关键帧。S3: Taking the target frame image as the center, intercepting several adjacent frame images to construct a second sequence, calculating the sum of the differences in importance between the target frame image and other frame images in the second sequence and normalizing them, and when the sum of the normalized differences is less than or equal to the set threshold, dividing the target frame image into a key frame; traversing to obtain all key frames.

在本发明实施例中,根据上述步骤S2计算的第帧目标帧图像的重要程度,进而可以计算出所有帧图像的重要程度。In the embodiment of the present invention, the first The importance of the target frame image can be calculated, and then the importance of all frame images can be calculated.

具体地,以第帧目标帧图像为中心,截取临近的若干帧图像构建第二序列,计算第二序列中第帧目标帧图像与其他所有帧图像的重要程度的差值之和并归一化处理。当归一化后的差值之和小于等于设置的阈值时,将第帧目标帧图像划分为关键帧;同理可得到所有关键帧。Specifically, the The target frame image is taken as the center, and several adjacent frame images are intercepted to construct the second sequence. The sum of the differences between the importance of the target frame image and all other frame images is normalized. When the sum of the normalized differences is less than or equal to the set threshold, the first The target frame image is divided into key frames; all key frames can be obtained in the same way.

需要说明的是,此处的阈值可根据经验设置为0.6。It should be noted that the threshold here can be set to 0.6 based on experience.

S4:将所有关键帧中的关键点输入到训练好的时空图卷积网络中,输出关键帧中人体动作的分类结果。S4: Input the key points in all key frames into the trained spatiotemporal graph convolutional network and output the classification results of human actions in the key frames.

在本发明实施例中,将上述步骤S3得到的所有关键帧中的关键点输入到训练好的时空图卷积网络(ST-GCN)中,并输出关键帧中人体动作的分类结果,是一种高效的动作识别方法。一般步骤如下:In the embodiment of the present invention, the key points in all key frames obtained in the above step S3 are input into the trained spatiotemporal graph convolutional network (ST-GCN), and the classification results of human actions in the key frames are output, which is an efficient action recognition method. The general steps are as follows:

使用姿态估计算法(如COCO库中的算法)检测每个关键帧中人体的关键点;Use a pose estimation algorithm (such as the one in the COCO library) to detect the key points of the human body in each keyframe;

将关键点及其相互关系构建成图结构,每个关键点是一个节点,节点之间的关系定义了图的边;The key points and their relationships are constructed into a graph structure, where each key point is a node and the relationship between nodes defines the edges of the graph;

提取每个关键点的特征,如位置坐标、速度、加速度等;Extract the features of each key point, such as position coordinates, velocity, acceleration, etc.;

使用标注的动作数据训练ST-GCN网络,学习关键点的时空特征,其中,标注包括关键点坐标、关键点轨迹、动作类别、时间信息等;Use the annotated action data to train the ST-GCN network and learn the spatiotemporal features of key points, where the annotations include key point coordinates, key point trajectories, action categories, time information, etc.

将提取的关键帧中的关键点数据输入到训练好的ST-GCN网络中,网络通过图卷积层学习关键点之间的空间关系,通过时间卷积层学习关键点随时间的变化;The key point data in the extracted key frame is input into the trained ST-GCN network. The network learns the spatial relationship between key points through the graph convolution layer and the change of key points over time through the temporal convolution layer.

网络输出每个关键帧的动作分类结果。The network outputs the action classification result for each keyframe.

训练时空图卷积网络为现有技术,此处不再过多赘述。需要说明的是,在训练时空图卷积网络时,首选使用交叉熵损失函数。Training a spatiotemporal graph convolutional network is an existing technology and will not be described in detail here. It should be noted that when training a spatiotemporal graph convolutional network, the cross entropy loss function is preferably used.

本发明首先对视频进行分帧和预处理,再通过计算每帧图像中人体骨架的关键点的动作显著性来评估对应帧的图像的重要程度,进而有效地从大量连续视频数据中筛选出具有潜在违章行为的关键帧,减少漏检和误报的情况,并整合深度学习技术监测违章行为,从而减少了需要审核的帧数量,提高了监测的效率和准确性。The present invention first divides the video into frames and pre-processes it, and then evaluates the importance of the image of the corresponding frame by calculating the motion significance of the key points of the human skeleton in each frame of the image, thereby effectively screening out key frames with potential violations from a large amount of continuous video data, reducing missed detections and false alarms, and integrating deep learning technology to monitor violations, thereby reducing the number of frames that need to be reviewed and improving the efficiency and accuracy of monitoring.

系统包括处理器和存储器,存储器存储有计算机程序指令,当计算机程序指令被处理器执行时实现根据本发明第一方面的基于布控球的作业人员违章监测方法。The system includes a processor and a memory, wherein the memory stores computer program instructions. When the computer program instructions are executed by the processor, the method for monitoring operator violations based on a control ball according to the first aspect of the present invention is implemented.

系统还包括通信总线和通信接口等本领域技术人员熟知的其他组件,其设置和功能为本领域中已知,因此在此不再赘述。The system also includes other components familiar to those skilled in the art, such as a communication bus and a communication interface. The configuration and functions of these components are known in the art and will not be described in detail here.

在本发明中,前述的存储器可以是任何包含或存储程序的有形介质,该程序可以被指令执行系统、装置或者器件使用或者与其结合使用。例如,计算机可读存储介质可以是任何适当的磁存储介质或者磁光存储介质,比如,阻变式存储器RRAM(Resistive RandomAccess Memory)、动态随机存取存储器DRAM(Dynamic Random Access Memory)、静态随机存取存储器SRAM(Static Random-Access Memory)、增强动态随机存取存储器EDRAM(Enhanced Dynamic Random Access Memory)、高带宽内存HBM(High-Bandwidth Memory)、混合存储立方HMC(Hybrid Memory Cube)等等,或者可以用于存储所需信息并且可以由应用程序、模块或两者访问的任何其他介质。任何这样的计算机存储介质可以是设备的一部分或可访问或可连接到设备。In the present invention, the aforementioned memory may be any tangible medium containing or storing a program, which may be used by or in combination with an instruction execution system, apparatus or device. For example, a computer-readable storage medium may be any appropriate magnetic storage medium or magneto-optical storage medium, such as a resistive random access memory RRAM (Resistive Random Access Memory), a dynamic random access memory DRAM (Dynamic Random Access Memory), a static random access memory SRAM (Static Random-Access Memory), an enhanced dynamic random access memory EDRAM (Enhanced Dynamic Random Access Memory), a high-bandwidth memory HBM (High-Bandwidth Memory), a hybrid memory cube HMC (Hybrid Memory Cube), etc., or any other medium that can be used to store the required information and can be accessed by an application, a module or both. Any such computer storage medium may be part of a device or accessible or connectable to a device.

在本说明书的描述中,“多个”、“若干个”的含义是至少两个,例如两个,三个或更多个等,除非另有明确具体的限定。In the description of this specification, "plurality" or "several" means at least two, such as two, three or more, etc., unless otherwise clearly and specifically defined.

虽然本说明书已经示出和描述了本发明的多个实施例,但对于本领域技术人员显而易见的是,这样的实施例只是以示例的方式提供的。本领域技术人员会在不偏离本发明思想和精神的情况下想到许多更改、改变和替代的方式。应当理解的是在实践本发明的过程中,可以采用对本文所描述的本发明实施例的各种替代方案。Although this specification has shown and described a number of embodiments of the present invention, it will be apparent to those skilled in the art that such embodiments are provided by way of example only. Those skilled in the art will conceive of many modifications, changes and alternatives without departing from the ideas and spirit of the present invention. It should be understood that in the practice of the present invention, various alternatives to the embodiments of the present invention described herein may be employed.

Claims (8)

1. The utility model provides a method for monitoring operators' violations based on control ball, which is characterized by comprising:
framing and preprocessing videos shot by the control ball to obtain each frame of image;
Calculating the importance degree of a target frame image, wherein the target frame image is any frame image; the importance degree is positively related to the action significance of key points of the human skeleton in the target frame image;
Taking the target frame image as a center, intercepting a plurality of adjacent frame images to construct a second sequence, calculating the sum of differences of importance degrees of the target frame image and other frame images in the second sequence, normalizing, and dividing the target frame image into key frames when the sum of the normalized differences is smaller than or equal to a set threshold value; traversing to obtain all key frames;
and inputting the key points in all the key frames into a trained space-time diagram convolution network, and outputting classification results of human actions in the key frames.
2. The method for monitoring the violation of a worker based on a control ball according to claim 1, wherein the importance degree satisfies the relation:
; in the method, in the process of the invention, For the importance level of the target frame image,Is the first in the target frame imageThe significance of the action of the individual key points,Is the firstThe weights corresponding to the individual key points are used,For the number of keypoints in the target frame image,The normalization process is represented.
3. The method for monitoring operator violations based on the control balls according to claim 1, wherein the importance degree is a maximum value of products of action salience of all key points and weights corresponding to the key points.
4. The method for monitoring the violation of the operator based on the control ball according to claim 2, wherein the process for obtaining the action significance comprises the following steps:
Calculate the first The track change rate and the relative change rate of each key point are weighted and summed to obtain the action significance;
Taking the target frame image as the center, intercepting the image of a certain frame to construct a first sequence, and calculating the first sequence The key points are respectively matched with the first in the target frame imageAn average value of the sum of rates of the key points in other frame images, wherein the average value is used as the track change rate;
Wherein, calculate the first Calculating the first average distance between each key point and all other key points in the target frame imageAnd calculating the variance of the first average distance and all the second average distances of the key points in the second average distances of other frame images in the first sequence, and taking the variance as the relative change rate.
5. The method for monitoring the violation of an operator based on a control ball according to claim 3, wherein the process for obtaining the action significance comprises the following steps:
Calculating motion vectors of key points between successive frames using optical streaming, the motion vectors comprising vector magnitudes;
And calculating the variance of the motion vector amplitude, and taking the variance as the motion significance.
6. A method of monitoring operator violations based on a control sphere according to claim 3, characterized in that the motion vectors of key points between successive frames are calculated using an optical flow method, said motion vectors comprising vector magnitudes; and calculating an average value of the motion vector amplitude, and taking the average value of the motion vector amplitude as the motion significance.
7. The method for monitoring operator violations based on a control sphere according to claim 1, wherein said space-time diagram convolution network uses a cross entropy loss function in the training process.
8. An operation personnel monitoring system that violating regulations based on cloth accuse ball, characterized by comprising: a processor and a memory storing computer program instructions that when executed by the processor implement the method of cloth ball based operator violation monitoring of any of claims 1-7.
CN202411232701.8A 2024-09-04 2024-09-04 Method and system for monitoring operator rule violation based on distributed control ball Active CN118736683B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202411232701.8A CN118736683B (en) 2024-09-04 2024-09-04 Method and system for monitoring operator rule violation based on distributed control ball

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202411232701.8A CN118736683B (en) 2024-09-04 2024-09-04 Method and system for monitoring operator rule violation based on distributed control ball

Publications (2)

Publication Number Publication Date
CN118736683A true CN118736683A (en) 2024-10-01
CN118736683B CN118736683B (en) 2024-11-22

Family

ID=92851573

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202411232701.8A Active CN118736683B (en) 2024-09-04 2024-09-04 Method and system for monitoring operator rule violation based on distributed control ball

Country Status (1)

Country Link
CN (1) CN118736683B (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2022000420A1 (en) * 2020-07-02 2022-01-06 浙江大学 Human body action recognition method, human body action recognition system, and device
US20220147737A1 (en) * 2020-11-12 2022-05-12 Disney Enterprises, Inc. Real time kinematic analyses of body motion
CN115761561A (en) * 2022-04-27 2023-03-07 中国科学院沈阳计算技术研究所有限公司 A Violation Video Detection Method Based on Skeleton Behavior Recognition
CN116524384A (en) * 2022-01-18 2023-08-01 北京小米移动软件有限公司 Image processing method, device, electronic equipment and storage medium
CN117789255A (en) * 2024-02-27 2024-03-29 沈阳二一三电子科技有限公司 Pedestrian abnormal behavior video identification method based on attitude estimation

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2022000420A1 (en) * 2020-07-02 2022-01-06 浙江大学 Human body action recognition method, human body action recognition system, and device
US20220147737A1 (en) * 2020-11-12 2022-05-12 Disney Enterprises, Inc. Real time kinematic analyses of body motion
CN116524384A (en) * 2022-01-18 2023-08-01 北京小米移动软件有限公司 Image processing method, device, electronic equipment and storage medium
CN115761561A (en) * 2022-04-27 2023-03-07 中国科学院沈阳计算技术研究所有限公司 A Violation Video Detection Method Based on Skeleton Behavior Recognition
CN117789255A (en) * 2024-02-27 2024-03-29 沈阳二一三电子科技有限公司 Pedestrian abnormal behavior video identification method based on attitude estimation

Also Published As

Publication number Publication date
CN118736683B (en) 2024-11-22

Similar Documents

Publication Publication Date Title
CN114926781B (en) Multi-person time-space domain abnormal behavior positioning method and system supporting real-time monitoring scene
Lestari et al. Fire hotspots detection system on CCTV videos using you only look once (YOLO) method and tiny YOLO model for high buildings evacuation
CN114140746B (en) A camera occlusion detection method in a box, an elevator operation control method and device
Iannizzotto et al. Personal Protection Equipment detection system for embedded devices based on DNN and Fuzzy Logic
CN114218992A (en) Abnormal object detection method and related device
Ji et al. A high-performance framework for personal protective equipment detection on the offshore drilling platform
Chen et al. YOLOv7-WFD: A novel convolutional neural network model for helmet detection in High-Risk workplaces
Abbas et al. A comprehensive review for video anomaly detection on videos
CN116259002A (en) A video-based human risk behavior analysis method
CN111985331A (en) Detection method and device for preventing secret of business from being stolen
CN115482502A (en) Abnormal behavior identification method, system and medium based on characteristic object and human body key point
CN118587648A (en) An intelligent visual monitoring and early warning method for the safety of tank truck entry workers
CN117095459A (en) A deep learning-based human behavior analysis method on construction sites
Elhamod et al. Real-time semantics-based detection of suspicious activities in public spaces
CN118799814B (en) Image processing method and device based on distributed monitoring system and terminal equipment
CN119299332B (en) A safety monitoring system and monitoring method based on machine vision
CN118736683A (en) A method and system for monitoring operator violations based on a control ball
Lao et al. Human running detection: Benchmark and baseline
EP4485369A1 (en) Information processing program, information processing method, and information processing apparatus
CN119342314A (en) A portable explosion-proof pan/tilt camera, monitoring system and method thereof
Gong et al. Human elbow flexion behaviour recognition based on posture estimation in complex scenes
CN118072398A (en) A real-time detection method for factory workers' illegal actions based on YOLOv8-pose
CN117765609A (en) System for detecting factory man-object interaction violation
CN120823566B (en) AI-based video analytics-based safety production monitoring methods and systems
Chou et al. Amodal instance segmentation optimized by metaheuristics for enhanced safety behavior detection on construction sites

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant