[go: up one dir, main page]

WO2024255425A1 - Image acquisition - Google Patents

Image acquisition Download PDF

Info

Publication number
WO2024255425A1
WO2024255425A1 PCT/CN2024/087890 CN2024087890W WO2024255425A1 WO 2024255425 A1 WO2024255425 A1 WO 2024255425A1 CN 2024087890 W CN2024087890 W CN 2024087890W WO 2024255425 A1 WO2024255425 A1 WO 2024255425A1
Authority
WO
WIPO (PCT)
Prior art keywords
image
preset
meet
image quality
generation model
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
PCT/CN2024/087890
Other languages
French (fr)
Chinese (zh)
Inventor
李若愚
唐董琦
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Alipay Hangzhou Information Technology Co Ltd
Original Assignee
Alipay Hangzhou Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Alipay Hangzhou Information Technology Co Ltd filed Critical Alipay Hangzhou Information Technology Co Ltd
Publication of WO2024255425A1 publication Critical patent/WO2024255425A1/en
Anticipated expiration legal-status Critical
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/10Image acquisition
    • G06V10/12Details of acquisition arrangements; Constructional details thereof
    • G06V10/14Optical characteristics of the device performing the acquisition or on the illumination arrangements
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/10Image acquisition
    • G06V10/12Details of acquisition arrangements; Constructional details thereof
    • G06V10/14Optical characteristics of the device performing the acquisition or on the illumination arrangements
    • G06V10/141Control of illumination
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/98Detection or correction of errors, e.g. by rescanning the pattern or by human intervention; Evaluation of the quality of the acquired patterns
    • G06V10/993Evaluation of the quality of the acquired pattern
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/40Document-oriented image-based pattern recognition
    • G06V30/42Document-oriented image-based pattern recognition based on the type of document
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/16Human faces, e.g. facial parts, sketches or expressions
    • G06V40/161Detection; Localisation; Normalisation
    • G06V40/166Detection; Localisation; Normalisation using acquisition arrangements
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N23/00Cameras or camera modules comprising electronic image sensors; Control thereof
    • H04N23/60Control of cameras or camera modules
    • H04N23/64Computer-aided capture of images, e.g. transfer from script file into camera, check of taken image quality, advice or proposal for image composition or decision on when to take image

Definitions

  • This document relates to the field of image recognition technology, and in particular to an image acquisition method, device and electronic equipment.
  • the method usually adopted is to continue shooting until the image obtained meets the user's needs. Therefore, it is necessary to provide a more efficient image acquisition method that can reduce the number of repeated shootings.
  • one or more embodiments of the present specification provide an image acquisition method, comprising: acquiring an image taken of a target object; when the image does not meet a preset image quality condition, inputting the image into a pre-trained cause generation model, determining the position of the image that does not meet the preset image quality condition through the cause generation model, and generating image reshooting guidance information for the determined position, wherein the cause generation model is a model for detecting the cause of the image not meeting the preset image quality condition and providing corresponding improvement suggestions for the detection result; and re-taking the image of the target object according to the image reshooting guidance information.
  • an image acquisition device including: an image acquisition module, which acquires an image shot for a target object; an image reshooting guidance information generation module, which, when the image does not meet a preset image quality condition, inputs the image into a pre-trained cause generation model, determines the position of the image that does not meet the preset image quality condition through the cause generation model, and generates image reshooting guidance information for the determined position, wherein the cause generation model is a model for detecting the cause causing the image to not meet the preset image quality condition and providing corresponding improvement suggestions for the detection result; and a control module, which re-shoots the image of the target object according to the image reshooting guidance information.
  • one or more embodiments of the present specification provide an electronic device, comprising: a processor; and a memory arranged to store computer-executable instructions, wherein when the executable instructions are executed, the processor can: obtain an image taken of a target object; when the image does not meet a preset image quality condition, input the image into a pre-trained cause generation model, determine the position of the image that does not meet the preset image quality condition through the cause generation model, and generate image reshooting guidance information for the determined position, wherein the cause generation model is a model for detecting the cause of the image not meeting the preset image quality condition and providing corresponding improvement suggestions for the detection result; and re-take an image of the target object according to the image reshooting guidance information.
  • one or more embodiments of the present specification provide a storage medium for storing computer-executable instructions, which implement the following process when executed by a processor: obtaining an image taken of a target object; when the image does not meet a preset image quality condition, inputting the image into a pre-trained cause generation model, determining the position of the image that does not meet the preset image quality condition through the cause generation model, and generating image reshooting guidance information for the determined position, wherein the cause generation model is a model for detecting the cause of the image not meeting the preset image quality condition and providing corresponding improvement suggestions for the detection result; and re-taking the image of the target object according to the image reshooting guidance information.
  • FIG1 is a schematic flow chart of an image acquisition method according to an embodiment of the present specification.
  • FIG2 is a schematic flow chart of an image acquisition method according to another embodiment of the present specification.
  • FIG3 is a schematic flow chart of an image acquisition method according to another embodiment of the present specification.
  • FIG4 is a schematic diagram of an image acquisition method according to an embodiment of the present specification.
  • FIG5 is a schematic block diagram of an image acquisition device according to an embodiment of the present specification.
  • FIG. 6 is a schematic block diagram of an electronic device according to an embodiment of the present disclosure.
  • One or more embodiments of the present specification provide an image acquisition method, device, and electronic device to solve the current problems.
  • the embodiment of this specification provides an image acquisition method, and the execution subject of the method can be a terminal device, which can be a certain terminal device such as a mobile phone, a tablet computer, or a computer device such as a laptop or a desktop computer, or an IoT device (specifically, a smart watch, a vehicle-mounted device, etc.).
  • the method can specifically include the following steps.
  • step S102 an image captured of a target object is acquired.
  • the method in one or more embodiments of this specification can be applied to scenarios where certain quality requirements are placed on the captured images, such as: capturing scene images with a mobile phone or smart camera, capturing face images with a mobile phone or smart camera, capturing ID images, etc.
  • the target object in an embodiment of this specification can be an ID, a face, or a scene, etc. In actual applications, the target object can also include multiple, for example, the target object includes two, namely an ID and a face, etc.
  • the target object may include the ID and the face.
  • the ID and the face may be photographed respectively by the camera component of the terminal device to obtain the corresponding ID image and face image.
  • the ID and the face may be captured by the camera component of the terminal device to obtain an image containing both the ID and the face.
  • step S104 when the image does not meet the preset image quality conditions, the image is input into a pre-trained cause generation model, the cause generation model is used to determine the position in the image that does not meet the preset image quality conditions, and generate image reshooting guidance information for the determined position.
  • the cause generation model is a model for detecting the reasons why the image does not meet the preset image quality conditions, and providing corresponding improvement suggestions for the detection results.
  • the preset image quality conditions are determined according to the user's shooting requirements for the target object. For example, when shooting a document or a face in the eKYC process, the preset image quality conditions may include one or more of the following: the shot image reaches a preset pixel size, the shot image reaches a preset resolution, and the shot image reaches a set exposure parameter. Accordingly, images that do not meet the preset image quality conditions usually include: overexposure of highlights (i.e., the highlight area is too bright).
  • the images include images with an exposure higher than a preset exposure threshold value, images that are too dark (i.e., image brightness lower than a preset brightness threshold value and/or image contrast lower than a preset contrast threshold value, etc.), and images that are blurred (i.e., images with pixels lower than a preset pixel threshold value, etc.).
  • the image obtained through step S102 may or may not meet the preset image quality conditions.
  • There are many ways to determine whether the image meets the preset image quality conditions such as: pre-setting multiple image quality conditions, and determining whether each image quality condition is met by comparing one by one, so as to determine whether the image meets the preset drawing quality conditions.
  • a corresponding algorithm such as a classification algorithm or a random forest algorithm, etc.
  • a corresponding model can be constructed using the selected algorithm, and the model can be used to determine whether the captured image meets the preset image quality conditions.
  • a cause generation model is pre-trained.
  • the cause generation model determines the specific location where the quality problem occurs in the image taken last time, detects the cause of the quality problem, and finally generates image reshoot guidance information for the specific location, thereby providing targeted improvement suggestions for the next image shooting.
  • the input data of the cause generation model is an image that does not meet the preset image quality conditions, which may specifically include the image, the information contained in the image, etc.
  • the output result of the cause generation model is the image reshoot guidance information, which can be presented in text form, for example: "The highlight in the upper left corner of the certificate is overexposed, please adjust the highlight area in the upper left corner when shooting", "The overall image is blurred, please keep the phone still when shooting", etc., and the image reshoot guidance information can also be played in the form of voice broadcast.
  • the image reshooting guidance information may at least include: specific locations of quality issues on corresponding shooting objects and shooting suggestions, and may also include reasons for unqualified quality, etc.
  • the pre-trained cause generation model can adopt a training model from image to text, and the training model can be constructed based on a neural network, such as by a convolutional neural network, by a transformer, etc.
  • the loss function of the training cause generation model can adopt a standard loss function in text generation.
  • a method for determining locations in an image that do not meet preset image quality conditions may be based on a generation model that may use a method of overall comparison of the captured image with a standard image, or may use an image segmentation method, i.e., using a segmentation algorithm to determine which pixel locations in the captured image do not meet a preset pixel size.
  • step S106 the target object is re-imaged according to the image re-shooting guidance information.
  • the generated image reshooting guidance information can be used to guide the user to reshoot the image of the target object in a targeted manner, thereby completing the image acquisition operation with fewer shots, which is beneficial to improving image shooting efficiency.
  • the embodiment of the present specification provides an image acquisition method, firstly acquiring an image shot for a target object, then determining whether the image meets a preset image quality condition, and when the image does not meet the preset image quality condition, inputting the image into a pre-trained cause generation model, determining the position in the image that does not meet the preset image quality condition through the cause generation model, and generating image reshoot guidance information for the determined position, and finally reshooting the target object according to the generated image reshoot guidance information, thereby completing image acquisition.
  • the pre-trained cause generation model can detect the cause of the image not meeting the preset image quality condition in the last shooting process, obtain the specific cause of the quality problem, and locate the position in the last shot image that does not meet the preset image quality condition, obtain the specific position that causes the quality problem, and finally generate image reshoot guidance information for the determined position to feedback to the user, thereby providing targeted improvement suggestions for the next shooting, which can effectively reduce the number of repeated shootings, improve image shooting efficiency, and thus improve user experience.
  • step S1022 a plurality of image samples taken for different shooting objects, the reason why each image sample does not meet the preset image quality condition, and the position of each image sample that does not meet the preset image quality condition are obtained.
  • the plurality of image samples may include one or more of the following images: historical images of different photographed subjects, original images of each photographed subject that do not meet preset image quality conditions, and images simulated based on the original images of each photographed subject.
  • multiple image samples acquired for different shooting objects, the reason why each image sample does not meet the preset image quality conditions, and the position of each image sample that does not meet the preset image quality conditions can be directly input into the cause generation model as input data for model training.
  • step S1022 may be performed as the following steps A1 and A2.
  • Step A1 Acquire multiple image samples taken for different objects.
  • Step A2 respectively determining the reason why each image sample does not meet the preset image quality condition and the position of each image sample that does not meet the preset image quality condition.
  • multiple image samples taken for different shooting objects can be first obtained, and the multiple image samples can be used as input data for model training. Then, the cause generation model determines that each image sample does not meet the preset The reasons for the image quality conditions and the positions of each image sample that do not meet the preset image quality conditions are determined, and then the network layer for generating reason text set in the reason generation model is used to generate corresponding image reshooting guidance information based on the reasons why each image sample does not meet the preset image quality conditions and the positions of each image sample that do not meet the preset image quality conditions.
  • the reasons why the image sample does not meet the preset image quality conditions include: a first quality problem and a second quality problem, and the first quality problem is an image quality problem in which the image granularity is greater than or equal to a preset image granularity threshold (or called a large-granularity quality problem), and the second quality problem is an image quality problem in which the image granularity is less than a preset image granularity threshold (or called a fine-granularity quality problem).
  • the processing of obtaining the position in each image sample that does not meet the preset image quality conditions in the above step S1022 can be executed as the following steps B1 and B2.
  • Step B1 When the reason why the image sample does not meet the preset image quality condition is the first quality problem, the position of each image sample that does not meet the preset image quality condition is determined based on the target detection algorithm.
  • Step B2 When the reason why the image sample does not meet the preset image quality condition is the second quality problem, the position of each image sample that does not meet the preset image quality condition is identified based on a semantic segmentation algorithm.
  • the target detection algorithm is an algorithm for finding the target of interest from the image, and classifying and locating the found target of interest.
  • the target of interest can be roughly framed out by the target detection algorithm.
  • the embodiment of this specification does not limit the specific target detection algorithm.
  • the TwoStage detection algorithm or the OneStage detection algorithm based on deep learning can be used.
  • the semantic segmentation algorithm is an algorithm that classifies each pixel in the image, thereby dividing the image into multiple regions containing different categories of information.
  • the semantic segmentation algorithm can accurately depict the outline of the target, and can provide coordinate information and classification information in each pixel.
  • the semantic segmentation algorithm can not only predict the position and category of the target object in the image, but also depict the boundaries between different types of target objects.
  • the embodiment of this specification does not limit the specific semantic segmentation algorithm.
  • the FCN Full Convolutional Network
  • the CRF Conditional Random Field algorithm
  • the position of the image sample that does not meet the preset image quality conditions can be determined based on the result determined by the target detection algorithm; when the reason why the image sample does not meet the preset image quality conditions is a fine-grained quality problem, the semantic segmentation algorithm can be used to identify the pixel points at which positions in the image sample do not meet the requirements, thereby identifying the position of the image sample that does not meet the preset image quality conditions.
  • Different algorithms can be used to determine the position of each image sample that does not meet the preset image quality conditions according to the different reasons why the image sample does not meet the preset image quality conditions. The location of the quality conditions makes the positioning method for image quality problems more flexible, which can not only effectively save resources, but also efficiently and accurately obtain positioning results that meet the requirements.
  • step S1024 based on multiple images, the reasons why each image sample does not meet the preset image quality conditions, and the positions in each image sample that do not meet the preset image quality conditions, the cause generation model is trained using a preset first loss function to obtain a trained cause generation model.
  • the preset first loss function adopts a cross entropy loss function.
  • the preset first loss function is used for cause generation model training, which belongs to a supervised training method. It can not only improve the efficiency of model training, but also improve the accuracy of model training output results.
  • the image acquisition method when the image acquisition method in the embodiment of this specification is used in a blockchain system based on electronic identity authentication information, the image acquisition method may include the following steps S202 - S208 .
  • S202 Collect an image of a designated certificate of the target user and/or an image of the target user taken for the target user to access the blockchain system based on electronic identity authentication information, and use the image of the designated certificate and/or the image of the target user taken as the image taken of the target object.
  • the target user in order to access the blockchain system based on electronic identity authentication information, the target user needs to take a corresponding image (which may be an image of a designated document, an image of the target user, or an image of a designated document and an image of the target user), collect the above image, and use the above image as the image taken of the target object.
  • a corresponding image which may be an image of a designated document, an image of the target user, or an image of a designated document and an image of the target user
  • the image is input into a pre-trained cause generation model, the cause generation model is used to determine the position of the image that does not meet the preset image quality conditions, and generate image reshoot guidance information for the determined position.
  • the cause generation model is a model for detecting the reasons why the image does not meet the preset image quality conditions and providing corresponding improvement suggestions for the detection results.
  • the image information capacity in the blockchain system can be increased while ensuring the security and reliability of the image information.
  • the image acquisition method of the embodiments of this specification may include the following steps S302-S308.
  • Step S302 Acquire an image captured of the target object.
  • step S302 can refer to the relevant content of the above step S102, which will not be repeated here.
  • Step S304 input the image into a pre-trained quality model to obtain a quality assessment result of the image.
  • the quality assessment results of the image include: unqualified quality (i.e., the image does not meet the preset image quality conditions) and qualified quality (i.e., the image meets the preset image quality conditions).
  • the quality model is used to determine whether the image meets the preset image quality conditions based on the image classification method, and the quality model is obtained by model training through image samples of different photographed objects and a preset second loss function.
  • the input data of the quality model are image samples of different photographed objects, i.e., the images obtained in the above step S302, and the output results of the quality model are images with quality assessment results, including qualified quality images and unqualified quality images.
  • the quality model can be constructed using a binary classification algorithm, and the images taken for the target object can be divided into qualified quality images (or high-quality images) and unqualified quality images (or low-quality images) through the quality model.
  • the second loss function may adopt a classification model loss function, specifically a cross entropy loss function, or a BCE-loss (binary cross entropy loss function).
  • the user continues to perform subsequent processes based on the acquired image.
  • Step S306 If the quality assessment result indicates that the image does not meet the preset image quality conditions, the image is input into a pre-trained cause generation model, and the cause generation model is used to determine the position in the image that does not meet the preset image quality conditions, and generate image reshoot guidance information for the determined position.
  • the cause generation model is a model for detecting the reasons why the image does not meet the preset image quality conditions, and providing corresponding improvement suggestions for the detection results.
  • Step S308 re-shooting the image of the target object according to the image re-shooting guidance information.
  • FIG4 takes the eKYC process as an example and shows a schematic principle diagram of an image acquisition method according to an embodiment of this specification.
  • the process of an image acquisition method provided by one or more embodiments of this specification in actual application is as follows.
  • a pre-trained quality model based on image classification is used to perform a binary quality classification on the image taken by the user to determine whether the image taken by the user is a high-quality image or a low-quality image.
  • the causes of quality problems in low-quality images are subdivided.
  • the causes of quality problems may include: highlights, low light, blur, etc.
  • the specific locations of quality problems in low-quality images are located in the cause generation model.
  • the embodiment of the present specification provides an image acquisition method, firstly acquiring an image shot for a target object, then determining whether the image meets a preset image quality condition, and when the image does not meet the preset image quality condition, inputting the image into a pre-trained cause generation model, determining the position in the image that does not meet the preset image quality condition through the cause generation model, and generating image reshoot guidance information for the determined position, and finally reshooting the target object according to the generated image reshoot guidance information, thereby completing image acquisition.
  • the pre-trained cause generation model can detect the cause of the image not meeting the preset image quality condition in the last shooting process, obtain the specific cause of the quality problem, and locate the position in the last shot image that does not meet the preset image quality condition, obtain the specific position that causes the quality problem, and finally generate image reshoot guidance information for the determined position to feedback to the user, thereby providing targeted improvement suggestions for the next shooting, which can effectively reduce the number of repeated shootings, improve image shooting efficiency, and thus improve user experience.
  • one or more embodiments of this specification also provide an image acquisition device, as shown in FIG5 .
  • the image acquisition device includes: an image acquisition module 410 , an image reshooting guidance information generation module 420 and a control module 430 .
  • the image acquisition module 410 acquires an image taken of a target object.
  • the image reshooting guidance information generation module 420 when the image does not meet the preset image quality conditions, inputs the image into a pre-trained cause generation model, determines the position in the image that does not meet the preset image quality conditions through the cause generation model, and generates image reshooting guidance information for the determined position.
  • the cause generation model is a model for detecting the cause of the image not meeting the preset image quality conditions and providing corresponding improvement suggestions for the detection results.
  • the control module 430 re-takes an image of the target object according to the image re-shooting guidance information.
  • the image acquisition module 410 is also used to collect images of the target user's designated certificate and/or the target user taken by the target user to access the blockchain system based on electronic identity authentication information, and use the images of the designated certificate and/or the target user taken as the images taken of the target object; the image acquisition device may also include: a storage module, if the image meets the preset image quality conditions, the image and/or the information contained in the image is stored in the blockchain system.
  • the image acquisition device may also include: a quality assessment result acquisition module, which inputs the image into a pre-trained quality model to obtain a quality assessment result of the image; an image reshooting guidance information generation module 420, which is also used to input the image into a pre-trained cause generation model if the quality assessment result indicates that the image does not meet the preset image quality conditions, determine the position in the image that does not meet the preset image quality conditions through the cause generation model, and generate image reshooting guidance information for the determined position.
  • a quality assessment result acquisition module which inputs the image into a pre-trained quality model to obtain a quality assessment result of the image
  • an image reshooting guidance information generation module 420 which is also used to input the image into a pre-trained cause generation model if the quality assessment result indicates that the image does not meet the preset image quality conditions, determine the position in the image that does not meet the preset image quality conditions through the cause generation model, and generate image reshooting guidance information for the determined position.
  • the embodiment of the present specification provides an image acquisition device, which first acquires an image shot for a target object through an image acquisition module, and then, when the image does not meet the preset image quality conditions, the image retake guidance information generation module inputs the image into a pre-trained cause generation model, determines the position in the image that does not meet the preset image quality conditions through the cause generation model, and generates image retake guidance information for the determined position, and finally controls the image retake guidance information generation module to generate the image retake guidance information for the determined position.
  • the module re-takes the image of the target object according to the generated image retake guidance information, thereby completing the image acquisition.
  • the pre-trained cause generation model can detect the reasons why the image did not meet the preset image quality conditions during the last shooting process, obtain the specific reasons for the quality problem, and locate the position in the last shot that does not meet the preset image quality conditions, obtain the specific position that causes the quality problem, and finally generate image retake guidance information for the determined position and feedback it to the user, so as to provide targeted improvement suggestions for the next shooting, which can effectively reduce the number of repeated shooting, improve image shooting efficiency, and thus improve user experience.
  • the electronic device may have relatively large differences due to different configurations or performances, and may include one or more processors 501 and a memory 502, and the memory 502 may store one or more storage applications or data.
  • the memory 502 may be a short-term storage or a persistent storage.
  • the application stored in the memory 502 may include one or more modules (not shown in the figure), and each module may include a series of computer executable instructions in the electronic device.
  • the processor 501 may be configured to communicate with the memory 502 to execute a series of computer executable instructions in the memory 502 on the electronic device.
  • the electronic device may also include one or more power supplies 503, one or more wired or wireless network interfaces 504, one or more input and output interfaces 505, and one or more keyboards 506.
  • the electronic device includes a memory, and one or more programs, wherein the one or more programs are stored in the memory, and the one or more programs may include one or more modules, and each module may include a series of computer executable instructions in the electronic device, and is configured to be executed by one or more processors.
  • the one or more programs include the following computer executable instructions: obtaining an image taken of a target object; when the image does not meet a preset image quality condition, inputting the image into a pre-trained cause generation model, determining the position in the image that does not meet the preset image quality condition through the cause generation model, and generating image reshooting guidance information for the determined position, the cause generation model is a model for detecting the cause of the image not meeting the preset image quality condition, and providing corresponding improvement suggestions for the detection result; and re-taking the image of the target object according to the image reshooting guidance information.
  • One or more embodiments of the present specification also propose a storage medium for storing computer executable instructions, which implement the following process when executed by a processor: obtaining an image taken of a target object; when the image does not meet the preset image quality conditions, inputting the image into a pre-trained cause generation model;
  • the cause generation model is used to determine the position in the image that does not meet the preset image quality conditions, and generate image reshoot guidance information for the determined position.
  • the cause generation model is a model for detecting the reasons why the image does not meet the preset image quality conditions, and providing corresponding improvement suggestions for the detection results; according to the image reshoot guidance information, the target object is re-imaged.
  • a typical implementation device is a computer.
  • the computer may be, for example, a personal computer, a laptop computer, a cellular phone, a camera phone, a smart phone, a personal digital assistant, a media player, a navigation device, an email device, a game console, a tablet computer, a wearable device, or a combination of any of these devices.
  • one or more embodiments of this specification may be provided as methods, systems, or computer program products. Therefore, one or more embodiments of this specification may take the form of a complete hardware embodiment, a complete software embodiment, or an embodiment combining software and hardware. Moreover, one or more embodiments of this specification may take the form of a computer program product implemented on one or more computer-usable storage media (including but not limited to disk storage, CD-ROM, optical storage, etc.) containing computer-usable program code.
  • computer-usable storage media including but not limited to disk storage, CD-ROM, optical storage, etc.
  • These computer program instructions can be provided to a processor of a general-purpose computer, a special-purpose computer, an embedded processor, or other programmable data processing device to produce a machine, so that the instructions executed by the processor of the computer or other programmable data processing device produce a device for implementing the functions specified in one process or multiple processes in the flowchart and/or one box or multiple boxes in the block diagram.
  • These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing device to work in a specific manner, so that the instructions stored in the computer-readable memory produce a manufactured product including an instruction device that implements the functions specified in one or more processes in the flowchart and/or one or more boxes in the block diagram.
  • These computer program instructions can also be loaded into a computer or other programmable data processing device, so that a series of operation steps are executed on the computer or other programmable device to produce a computer-implemented process, thereby performing a computer-implemented process on the computer or other programmable device.
  • the instructions executed on other programmable devices provide steps for implementing the functions specified in one or more flows of the flowcharts and/or one or more blocks of the block diagrams.
  • a computing device includes one or more processors (CPU), input/output interfaces, network interfaces, and memory.
  • processors CPU
  • input/output interfaces network interfaces
  • memory volatile and non-volatile memory
  • Memory may include non-permanent storage in a computer-readable medium, in the form of random access memory (RAM) and/or non-volatile memory, such as read-only memory (ROM) or flash memory (flash RAM). Memory is an example of a computer-readable medium.
  • RAM random access memory
  • ROM read-only memory
  • flash RAM flash memory
  • Computer readable media include permanent and non-permanent, removable and non-removable media that can be implemented by any method or technology to store information.
  • Information can be computer readable instructions, data structures, program modules or other data.
  • Examples of computer storage media include, but are not limited to, phase change memory (PRAM), static random access memory (SRAM), dynamic random access memory (DRAM), other types of random access memory (RAM), read-only memory (ROM), electrically erasable programmable read-only memory (EEPROM), flash memory or other memory technology, compact disk read-only memory (CD-ROM), digital versatile disk (DVD) or other optical storage, magnetic cassettes, magnetic disk storage or other magnetic storage devices or any other non-transmission media that can be used to store information that can be accessed by a computing device.
  • computer readable media does not include temporary computer readable media (transitory media), such as modulated data signals and carrier waves.
  • program modules include routines, programs, objects, components, data structures, etc. that perform specific tasks or implement specific abstract data types.
  • the present specification may also be practiced in distributed computing environments where tasks are performed by remote processing devices connected through a communication network.
  • program modules may be located in local and remote computer storage media, including storage devices.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Quality & Reliability (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Artificial Intelligence (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Oral & Maxillofacial Surgery (AREA)
  • Human Computer Interaction (AREA)
  • Signal Processing (AREA)
  • Image Analysis (AREA)

Abstract

One or more embodiments of the present description disclose an image acquisition method and apparatus, and an electronic device. The method comprises: acquiring an image captured for a target object; then when the image does not satisfy a preset image quality condition, inputting the image into a pre-trained reason generation model, determining a location, which does not satisfy the preset image quality condition, in the image by means of the reason generation model, and generating image recapture guide information for the determined location, wherein the reason generation model is a model used for detecting a reason causing the image not to satisfy the preset image quality condition, and providing a corresponding improvement suggestion for a detection result; and finally performing image capture on the target object again on the basis of the image recapture guide information.

Description

图像采集Image acquisition 技术领域Technical Field

本文件涉及图像识别技术领域,尤其涉及一种图像采集方法、装置及电子设备。This document relates to the field of image recognition technology, and in particular to an image acquisition method, device and electronic equipment.

背景技术Background Art

在图像识别领域中,对目标拍摄对象进行图像拍摄时,通常会发生一次拍摄所获取的图像无法达到用户满意度的情况。以eKYC(electronic Know Your Customer,企业电子身份认证信息系统)流程为例,在eKYC流程中,用户需要分别对证件和人脸进行拍摄,从而完成身份验证,如果用户拍摄的图片质量偏低(如模糊、高光等情况),则会影响证件识别和人脸识别的准确度,从而导致用户无法顺利完成开户。因此,如何进行图像拍摄,从而获取符合用户满意度的高质量图像,是提高用户体验的关键问题。In the field of image recognition, when capturing an image of a target object, it often happens that the image obtained in one capture fails to meet user satisfaction. Take the eKYC (electronic Know Your Customer, enterprise electronic identity authentication information system) process as an example. In the eKYC process, users need to take photos of their ID and face respectively to complete identity authentication. If the quality of the pictures taken by users is low (such as blur, highlights, etc.), it will affect the accuracy of ID and face recognition, resulting in the user being unable to successfully open an account. Therefore, how to capture images to obtain high-quality images that meet user satisfaction is a key issue in improving user experience.

目前,在一次拍摄所获取的图像无法达到用户满意度时,通常采取的方法是继续拍摄,直到所获取的图像符合用户需求为止。因此,需要提供一种可以减少反复拍摄次数的更加高效的图像采集方法。At present, when the image obtained by one shooting cannot meet the user's satisfaction, the method usually adopted is to continue shooting until the image obtained meets the user's needs. Therefore, it is necessary to provide a more efficient image acquisition method that can reduce the number of repeated shootings.

发明内容Summary of the invention

一方面,本说明书一个或多个实施例提供一种图像采集方法,包括:获取针对目标对象所拍摄的图像;当所述图像不符合预设的图像质量条件时,将所述图像输入预先训练的原因生成模型中,通过所述原因生成模型确定所述图像中不符合预设的图像质量条件的位置,并生成针对确定的位置处的图像重拍引导信息,所述原因生成模型是用于对造成图像不符合预设的图像质量条件的原因进行检测,并为检测结果提供相应的改进建议的模型;根据所述图像重拍引导信息,对所述目标对象重新进行图像拍摄。On the one hand, one or more embodiments of the present specification provide an image acquisition method, comprising: acquiring an image taken of a target object; when the image does not meet a preset image quality condition, inputting the image into a pre-trained cause generation model, determining the position of the image that does not meet the preset image quality condition through the cause generation model, and generating image reshooting guidance information for the determined position, wherein the cause generation model is a model for detecting the cause of the image not meeting the preset image quality condition and providing corresponding improvement suggestions for the detection result; and re-taking the image of the target object according to the image reshooting guidance information.

另一方面,本说明书一个或多个实施例提供一种图像采集装置,包括:图像获取模块,获取针对目标对象所拍摄的图像;图像重拍引导信息生成模块,当所述图像不符合预设的图像质量条件时,将所述图像输入预先训练的原因生成模型中,通过所述原因生成模型确定所述图像中不符合预设的图像质量条件的位置,并生成针对确定的位置处的图像重拍引导信息,所述原因生成模型是用于对造成图像不符合预设的图像质量条件的原因进行检测,并为检测结果提供相应的改进建议的模型;控制模块,根据所述图像重拍引导信息,对所述目标对象重新进行图像拍摄。 On the other hand, one or more embodiments of the present specification provide an image acquisition device, including: an image acquisition module, which acquires an image shot for a target object; an image reshooting guidance information generation module, which, when the image does not meet a preset image quality condition, inputs the image into a pre-trained cause generation model, determines the position of the image that does not meet the preset image quality condition through the cause generation model, and generates image reshooting guidance information for the determined position, wherein the cause generation model is a model for detecting the cause causing the image to not meet the preset image quality condition and providing corresponding improvement suggestions for the detection result; and a control module, which re-shoots the image of the target object according to the image reshooting guidance information.

再一方面,本说明书一个或多个实施例提供一种电子设备,包括:处理器;以及被安排成存储计算机可执行指令的存储器,在所述可执行指令被执行时,能够使得所述处理器:获取针对目标对象所拍摄的图像;当所述图像不符合预设的图像质量条件时,将所述图像输入预先训练的原因生成模型中,通过所述原因生成模型确定所述图像中不符合预设的图像质量条件的位置,并生成针对确定的位置处的图像重拍引导信息,所述原因生成模型是用于对造成图像不符合预设的图像质量条件的原因进行检测,并为检测结果提供相应的改进建议的模型;根据所述图像重拍引导信息,对所述目标对象重新进行图像拍摄。On the other hand, one or more embodiments of the present specification provide an electronic device, comprising: a processor; and a memory arranged to store computer-executable instructions, wherein when the executable instructions are executed, the processor can: obtain an image taken of a target object; when the image does not meet a preset image quality condition, input the image into a pre-trained cause generation model, determine the position of the image that does not meet the preset image quality condition through the cause generation model, and generate image reshooting guidance information for the determined position, wherein the cause generation model is a model for detecting the cause of the image not meeting the preset image quality condition and providing corresponding improvement suggestions for the detection result; and re-take an image of the target object according to the image reshooting guidance information.

又一方面,本说明书一个或多个实施例提供一种存储介质,所述存储介质用于存储计算机可执行指令,所述可执行指令在被处理器执行时实现以下流程:获取针对目标对象所拍摄的图像;当所述图像不符合预设的图像质量条件时,将所述图像输入预先训练的原因生成模型中,通过所述原因生成模型确定所述图像中不符合预设的图像质量条件的位置,并生成针对确定的位置处的图像重拍引导信息,所述原因生成模型是用于对造成图像不符合预设的图像质量条件的原因进行检测,并为检测结果提供相应的改进建议的模型;根据所述图像重拍引导信息,对所述目标对象重新进行图像拍摄。On the other hand, one or more embodiments of the present specification provide a storage medium for storing computer-executable instructions, which implement the following process when executed by a processor: obtaining an image taken of a target object; when the image does not meet a preset image quality condition, inputting the image into a pre-trained cause generation model, determining the position of the image that does not meet the preset image quality condition through the cause generation model, and generating image reshooting guidance information for the determined position, wherein the cause generation model is a model for detecting the cause of the image not meeting the preset image quality condition and providing corresponding improvement suggestions for the detection result; and re-taking the image of the target object according to the image reshooting guidance information.

附图说明BRIEF DESCRIPTION OF THE DRAWINGS

为了更清楚地说明本说明书一个或多个实施例或现有技术中的技术方案,下面将对实施例或现有技术描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本说明书一个或多个实施例中记载的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to more clearly illustrate one or more embodiments of this specification or the technical solutions in the prior art, the drawings required for use in the embodiments or the description of the prior art will be briefly introduced below. Obviously, the drawings described below are only some embodiments recorded in one or more embodiments of this specification. For ordinary technicians in this field, other drawings can be obtained based on these drawings without paying creative work.

图1是根据本说明书一实施例的一种图像采集方法的示意性流程图;FIG1 is a schematic flow chart of an image acquisition method according to an embodiment of the present specification;

图2是根据本说明书另一实施例的一种图像采集方法的示意性流程图;FIG2 is a schematic flow chart of an image acquisition method according to another embodiment of the present specification;

图3是根据本说明书又一实施例的一种图像采集方法的示意性流程图;FIG3 is a schematic flow chart of an image acquisition method according to another embodiment of the present specification;

图4是根据本说明书一实施例的图像采集方法的示意性原理图;FIG4 is a schematic diagram of an image acquisition method according to an embodiment of the present specification;

图5是根据本说明书一实施例的一种图像采集装置的示意性框图;FIG5 is a schematic block diagram of an image acquisition device according to an embodiment of the present specification;

图6是根据本说明一实施例的一种电子设备的示意性框图。 FIG. 6 is a schematic block diagram of an electronic device according to an embodiment of the present disclosure.

具体实施方式DETAILED DESCRIPTION

本说明书一个或多个实施例提供一种图像采集方法、装置及电子设备,以解决目前的问题。One or more embodiments of the present specification provide an image acquisition method, device, and electronic device to solve the current problems.

为了使本技术领域的人员更好地理解本说明书一个或多个实施例中的技术方案,下面将结合本说明书一个或多个实施例中的附图,对本说明书一个或多个实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅仅是本说明书一部分实施例,而不是全部的实施例。基于本说明书一个或多个实施例,本领域普通技术人员在没有作出创造性劳动前提下所获得的所有其他实施例,都应当属于本文件保护的范围。In order to enable those skilled in the art to better understand the technical solutions in one or more embodiments of this specification, the following will be combined with the drawings in one or more embodiments of this specification to clearly and completely describe the technical solutions in one or more embodiments of this specification. Obviously, the described embodiments are only part of the embodiments of this specification, not all of the embodiments. Based on one or more embodiments of this specification, all other embodiments obtained by ordinary technicians in this field without creative work should fall within the scope of protection of this document.

如图1所示,本说明书实施例提供一种图像采集方法,该方法的执行主体可以为终端设备,该终端设备可以如手机、平板电脑等一定终端设备,还可以如笔记本电脑或台式电脑等计算机设备,或者,也可以为IoT设备(具体如智能手表、车载设备等)等。该方法具体可以包括以下步骤。As shown in FIG1 , the embodiment of this specification provides an image acquisition method, and the execution subject of the method can be a terminal device, which can be a certain terminal device such as a mobile phone, a tablet computer, or a computer device such as a laptop or a desktop computer, or an IoT device (specifically, a smart watch, a vehicle-mounted device, etc.). The method can specifically include the following steps.

在步骤S102中,获取针对目标对象所拍摄的图像。In step S102, an image captured of a target object is acquired.

本说明书一个或多个实施例中的方法可以应用于对所拍摄的图像有一定质量要求的情景中,比如:手机或智能相机拍摄景物图像、手机或智能相机拍摄人脸图像、证件图像拍摄等。本说明书一实施例中的目标对象可以为证件、人脸或者景物等,在实际应用中,目标对象也可以包括多个,例如,目标对象包括2个,分别为证件和人脸等。The method in one or more embodiments of this specification can be applied to scenarios where certain quality requirements are placed on the captured images, such as: capturing scene images with a mobile phone or smart camera, capturing face images with a mobile phone or smart camera, capturing ID images, etc. The target object in an embodiment of this specification can be an ID, a face, or a scene, etc. In actual applications, the target object can also include multiple, for example, the target object includes two, namely an ID and a face, etc.

以eKYC流程为例,用户需要分别对证件和人脸进行拍摄从而完成身份验证,根据以上步骤S102,目标对象可以包括证件和人脸,可以通过终端设备的摄像组件分别对证件和人脸进行拍摄,得到相应的证件图像和人脸图像,或者,可以通过终端设备的摄像组件对证件和人脸进行图像采集,得到同时包含证件和人脸的图像等。Taking the eKYC process as an example, the user needs to take photos of the ID and the face respectively to complete the identity verification. According to the above step S102, the target object may include the ID and the face. The ID and the face may be photographed respectively by the camera component of the terminal device to obtain the corresponding ID image and face image. Alternatively, the ID and the face may be captured by the camera component of the terminal device to obtain an image containing both the ID and the face.

在步骤S104中,当图像不符合预设的图像质量条件时,将图像输入预先训练的原因生成模型中,通过原因生成模型确定图像中不符合预设的图像质量条件的位置,并生成针对确定的位置处的图像重拍引导信息,原因生成模型是用于对造成图像不符合预设的图像质量条件的原因进行检测,并为检测结果提供相应的改进建议的模型。In step S104, when the image does not meet the preset image quality conditions, the image is input into a pre-trained cause generation model, the cause generation model is used to determine the position in the image that does not meet the preset image quality conditions, and generate image reshooting guidance information for the determined position. The cause generation model is a model for detecting the reasons why the image does not meet the preset image quality conditions, and providing corresponding improvement suggestions for the detection results.

其中,预设的图像质量条件根据用户对目标对象的拍摄要求确定,例如:在eKYC流程中对证件或人脸进行拍摄时,预设的图像质量条件可以包括拍摄的图像达到预设的像素大小、拍摄的图像达到预设的分辨率和拍摄的图像达到设定的曝光参数等中的一项或多项,相应地,不符合预设的图像质量条件的图像通常包括:高光过曝(即高光区域 的曝光度高于预设曝光度阈值)的图像、过暗(即图像亮度低于预设亮度阈值和/或图像对比度低于预设对比度阈值等)的图像、模糊(即像素低于预设像素阈值等)的图像等。The preset image quality conditions are determined according to the user's shooting requirements for the target object. For example, when shooting a document or a face in the eKYC process, the preset image quality conditions may include one or more of the following: the shot image reaches a preset pixel size, the shot image reaches a preset resolution, and the shot image reaches a set exposure parameter. Accordingly, images that do not meet the preset image quality conditions usually include: overexposure of highlights (i.e., the highlight area is too bright). The images include images with an exposure higher than a preset exposure threshold value, images that are too dark (i.e., image brightness lower than a preset brightness threshold value and/or image contrast lower than a preset contrast threshold value, etc.), and images that are blurred (i.e., images with pixels lower than a preset pixel threshold value, etc.).

需要说明的是,通过步骤S102所获取的图像,可能符合预设的图像质量条件,也可能不符合预设的图像质量条件,判断图像是否符合预设的图像质量条件的方法可以有很多种,比如:预先设定多个图像质量条件,通过逐一比对判断是否满足每个图像质量条件,从而确定图像是否符合预设的图纸质量条件,或者,也可以基于图像质量条件,选取相应的算法(例如分类算法或随机森林算法等),并使用选取的算法构建相应的模型,可以通过该模型判断拍摄的图像是否符合预设的图像质量条件等。It should be noted that the image obtained through step S102 may or may not meet the preset image quality conditions. There are many ways to determine whether the image meets the preset image quality conditions, such as: pre-setting multiple image quality conditions, and determining whether each image quality condition is met by comparing one by one, so as to determine whether the image meets the preset drawing quality conditions. Alternatively, based on the image quality conditions, a corresponding algorithm (such as a classification algorithm or a random forest algorithm, etc.) can be selected, and a corresponding model can be constructed using the selected algorithm, and the model can be used to determine whether the captured image meets the preset image quality conditions.

本说明书一实施例中预先训练一原因生成模型,该原因生成模型通过确定上一次所拍摄的图像中出现质量问题的具体位置,对质量问题进行原因检测,最终生成针对该具体位置的图像重拍引导信息,从而为下一次图像拍摄提供有针对性的改进建议。该原因生成模型的输入数据为不符合预设的图像质量条件的图像,具体可以包括图像、图像中包含的信息等。该原因生成模型的输出结果为图像重拍引导信息,图像重拍引导信息可以采用文本形式的呈现,例如:“证件左上角高光过曝,拍摄时请调整左上角高光区域”、“图像整体模糊,拍摄时请保持手机静止”等,也可以采用语音播报的形式播放该图像重拍引导信息。In one embodiment of the present specification, a cause generation model is pre-trained. The cause generation model determines the specific location where the quality problem occurs in the image taken last time, detects the cause of the quality problem, and finally generates image reshoot guidance information for the specific location, thereby providing targeted improvement suggestions for the next image shooting. The input data of the cause generation model is an image that does not meet the preset image quality conditions, which may specifically include the image, the information contained in the image, etc. The output result of the cause generation model is the image reshoot guidance information, which can be presented in text form, for example: "The highlight in the upper left corner of the certificate is overexposed, please adjust the highlight area in the upper left corner when shooting", "The overall image is blurred, please keep the phone still when shooting", etc., and the image reshoot guidance information can also be played in the form of voice broadcast.

在一种实施方式中,图像重拍引导信息中至少可以包括:基于质量问题在对应的拍摄对象上所处的具体位置以及基于拍摄建议,此外,还可以包括基于质量不合格原因等。In one implementation, the image reshooting guidance information may at least include: specific locations of quality issues on corresponding shooting objects and shooting suggestions, and may also include reasons for unqualified quality, etc.

本说明书一实施例中预先训练的原因生成模型可以采用从图像到文本的训练模型,该训练模型可以基于神经网络构建,具体如通过卷积神经网络构建、通过transformer构建等。训练原因生成模型的损失函数可以采用文本生成中的标准损失函数。In one embodiment of the present specification, the pre-trained cause generation model can adopt a training model from image to text, and the training model can be constructed based on a neural network, such as by a convolutional neural network, by a transformer, etc. The loss function of the training cause generation model can adopt a standard loss function in text generation.

本说明书一实施例中确定图像中不符合预设的图像质量条件的位置的方法,原因生成模型可以采用将所拍摄的图像与标准图像进行整体比对的方式,也可以采用图像分割的方式,即:通过分割算法确定所拍摄的图像中哪些位置的像素点不符合预设的像素大小。In one embodiment of the present specification, a method for determining locations in an image that do not meet preset image quality conditions may be based on a generation model that may use a method of overall comparison of the captured image with a standard image, or may use an image segmentation method, i.e., using a segmentation algorithm to determine which pixel locations in the captured image do not meet a preset pixel size.

在步骤S106中,根据图像重拍引导信息,对目标对象重新进行图像拍摄。In step S106, the target object is re-imaged according to the image re-shooting guidance information.

通过步骤S104生成图像重拍引导信息之后,可以通过所生成的图像重拍引导信息,引导用户有针对性地对目标对象重新进行图像拍摄,从而通过较少的拍摄次数即可完成图像采集操作,有利于提高图像拍摄效率。 After the image reshooting guidance information is generated in step S104, the generated image reshooting guidance information can be used to guide the user to reshoot the image of the target object in a targeted manner, thereby completing the image acquisition operation with fewer shots, which is beneficial to improving image shooting efficiency.

本说明书实施例提供一种图像采集方法,首先获取针对目标对象所拍摄的图像,然后判断图像是否符合预设的图像质量条件,且当图像不符合预设的图像质量条件时,将图像输入预先训练的原因生成模型中,通过原因生成模型确定图像中不符合预设的图像质量条件的位置,并生成针对确定的位置处的图像重拍引导信息,最后根据所生成的图像重拍引导信息,对目标对象重新进行图像拍摄,从而完成图像采集。当图像不符合预设的图像质量条件时,通过预先训练的原因生成模型能够对上一次拍摄过程中造成图像不符合预设的图像质量条件的原因进行检测,获取到导致质量问题的具体原因,并对上一次拍摄的图像中不符合预设的图像质量条件的位置进行定位,获取到导致质量问题的具体位置,最终生成针对确定的位置处的图像重拍引导信息反馈给用户,从而为下一次拍摄提供有针对性的改进建议,能够有效减少重复拍摄的次数,提高图像拍摄效率,进而提高用户体验。The embodiment of the present specification provides an image acquisition method, firstly acquiring an image shot for a target object, then determining whether the image meets a preset image quality condition, and when the image does not meet the preset image quality condition, inputting the image into a pre-trained cause generation model, determining the position in the image that does not meet the preset image quality condition through the cause generation model, and generating image reshoot guidance information for the determined position, and finally reshooting the target object according to the generated image reshoot guidance information, thereby completing image acquisition. When the image does not meet the preset image quality condition, the pre-trained cause generation model can detect the cause of the image not meeting the preset image quality condition in the last shooting process, obtain the specific cause of the quality problem, and locate the position in the last shot image that does not meet the preset image quality condition, obtain the specific position that causes the quality problem, and finally generate image reshoot guidance information for the determined position to feedback to the user, thereby providing targeted improvement suggestions for the next shooting, which can effectively reduce the number of repeated shootings, improve image shooting efficiency, and thus improve user experience.

进一步地,上述步骤S102中原因生成模型的训练方法可以有多种方式,以下提供一种可选的处理方式,具体可以参见以下步骤S1022-S1024。Furthermore, there may be multiple ways to train the cause generation model in the above step S102. An optional processing method is provided below. For details, please refer to the following steps S1022-S1024.

在步骤S1022中,获取针对不同拍摄对象所拍摄的多张图像样本、每张图像样本不符合预设的图像质量条件的原因以及每张图像样本中不符合预设的图像质量条件的位置。In step S1022, a plurality of image samples taken for different shooting objects, the reason why each image sample does not meet the preset image quality condition, and the position of each image sample that does not meet the preset image quality condition are obtained.

在实施中,多张图像样本可以包括以下中的一种或多种图像:不同拍摄对象的历史图像、每个拍摄对象的不符合预设的图像质量条件的原始图像以及基于每个拍摄对象的原始图像模仿拍摄的图像。In implementation, the plurality of image samples may include one or more of the following images: historical images of different photographed subjects, original images of each photographed subject that do not meet preset image quality conditions, and images simulated based on the original images of each photographed subject.

在一种实施方式中,对于所获取的针对不同拍摄对象所拍摄的多张图像样本、每张图像样本不符合预设的图像质量条件的原因以及每张图像样本中不符合预设的图像质量条件的位置,可以直接作为模型训练的输入数据输入原因生成模型中。In one embodiment, multiple image samples acquired for different shooting objects, the reason why each image sample does not meet the preset image quality conditions, and the position of each image sample that does not meet the preset image quality conditions can be directly input into the cause generation model as input data for model training.

在另一种实施方式中,上述步骤S1022的处理可以执行为如下的步骤A1和A2。In another implementation, the processing of step S1022 may be performed as the following steps A1 and A2.

步骤A1:获取针对不同拍摄对象所拍摄的多张图像样本。Step A1: Acquire multiple image samples taken for different objects.

步骤A2:分别确定使得每张图像样本不符合预设的图像质量条件的原因以及每张图像样本中不符合预设的图像质量条件的位置。Step A2: respectively determining the reason why each image sample does not meet the preset image quality condition and the position of each image sample that does not meet the preset image quality condition.

由以上步骤A1和A2可知,在另一种实施方式中,可以首先获取针对不同拍摄对象所拍摄的多张图像样本,以多张图像样本作为模型训练的输入数据,然后,原因生成模型通过其中设置的相应网络层根据多张图像样本分别确定每张图像样本不符合预设的 图像质量条件的原因以及每张图像样本中不符合预设的图像质量条件的位置,之后再通过原因生成模型中设置的用于生成原因文本的网络层以所确定的每张图像样本不符合预设的图像质量条件的原因以及每张图像样本中不符合预设的图像质量条件的位置为基础,生成相应的图像重拍引导信息。It can be seen from the above steps A1 and A2 that in another embodiment, multiple image samples taken for different shooting objects can be first obtained, and the multiple image samples can be used as input data for model training. Then, the cause generation model determines that each image sample does not meet the preset The reasons for the image quality conditions and the positions of each image sample that do not meet the preset image quality conditions are determined, and then the network layer for generating reason text set in the reason generation model is used to generate corresponding image reshooting guidance information based on the reasons why each image sample does not meet the preset image quality conditions and the positions of each image sample that do not meet the preset image quality conditions.

在又一种实施方式中,对于图像样本不符合预设的图像质量条件的原因包括:第一质量问题和第二质量问题,且第一质量问题是图像粒度大于等于预设的图像粒度阈值的图像质量问题(或者称为大粒度的质量问题),第二质量问题是图像粒度小于预设的图像粒度阈值的图像质量问题(或者称为细粒度的质量问题)的情况,上述步骤S1022中获取每张图像样本中不符合预设的图像质量条件的位置的处理,可以执行为如下的步骤B1和B2。In another embodiment, the reasons why the image sample does not meet the preset image quality conditions include: a first quality problem and a second quality problem, and the first quality problem is an image quality problem in which the image granularity is greater than or equal to a preset image granularity threshold (or called a large-granularity quality problem), and the second quality problem is an image quality problem in which the image granularity is less than a preset image granularity threshold (or called a fine-granularity quality problem). The processing of obtaining the position in each image sample that does not meet the preset image quality conditions in the above step S1022 can be executed as the following steps B1 and B2.

步骤B1:当图像样本不符合预设的图像质量条件的原因为第一质量问题时,基于目标检测算法确定每张图像样本中不符合预设的图像质量条件的位置。Step B1: When the reason why the image sample does not meet the preset image quality condition is the first quality problem, the position of each image sample that does not meet the preset image quality condition is determined based on the target detection algorithm.

步骤B2:当图像样本不符合预设的图像质量条件的原因为第二质量问题时,基于语义分割算法识别每张图像样本中不符合预设的图像质量条件的位置。Step B2: When the reason why the image sample does not meet the preset image quality condition is the second quality problem, the position of each image sample that does not meet the preset image quality condition is identified based on a semantic segmentation algorithm.

其中,目标检测算法是从图像中找出感兴趣的目标,并对所找出的感兴趣的目标进行分类和定位的算法,通过目标检测算法可以大致框选出感兴趣的目标。本说明书实施例对具体的目标检测算法不作限定,在步骤B1中可以采用基于深度学习的TwoStage检测算法或者OneStage检测算法等。语义分割算法是对图像中的每个像素点进行分类,从而将图像分割成多个含有不同类别信息的区域的算法,通过语义分割算法可以准确描绘出目标的轮廓,并且可以在每个像素中提供坐标信息和分类信息。语义分割算法不但可以对图像中的目标对象的位置和类别进行预测,还可以描绘出不同类目标对象之间的边界。本说明书实施例对具体的语义分割算法不作限定,在步骤B2中可以采用FCN(Fully Convolutional Network,全卷积神经网络)算法、CRF(Conditional Random Field algorithm,条件随机场)算法等。Among them, the target detection algorithm is an algorithm for finding the target of interest from the image, and classifying and locating the found target of interest. The target of interest can be roughly framed out by the target detection algorithm. The embodiment of this specification does not limit the specific target detection algorithm. In step B1, the TwoStage detection algorithm or the OneStage detection algorithm based on deep learning can be used. The semantic segmentation algorithm is an algorithm that classifies each pixel in the image, thereby dividing the image into multiple regions containing different categories of information. The semantic segmentation algorithm can accurately depict the outline of the target, and can provide coordinate information and classification information in each pixel. The semantic segmentation algorithm can not only predict the position and category of the target object in the image, but also depict the boundaries between different types of target objects. The embodiment of this specification does not limit the specific semantic segmentation algorithm. In step B2, the FCN (Fully Convolutional Network) algorithm, the CRF (Conditional Random Field algorithm) algorithm, etc. can be used.

由以上步骤B1和B2可知,当图像样本不符合预设的图像质量条件的原因属于大粒度的质量问题时,可以基于目标检测算法确定的结果确定图像样本中不符合预设的图像质量条件的位置;当图像样本不符合预设的图像质量条件的原因为细粒度的质量问题时,可以基于语义分割算法,通过识别图像样本中哪些位置的像素点不符合要求,从而识别图像样本中不符合预设的图像质量条件的位置。针对图像样本不符合预设的图像质量条件的原因的不同,可以分别采取不同的算法来确定每张图像样本中不符合预设的图像质 量条件的位置,使得对于图像质量问题的定位方式更加灵活,既能够有效节省资源,还能够高效而精准地获取符合要求的定位结果。It can be seen from the above steps B1 and B2 that when the reason why the image sample does not meet the preset image quality conditions is a large-scale quality problem, the position of the image sample that does not meet the preset image quality conditions can be determined based on the result determined by the target detection algorithm; when the reason why the image sample does not meet the preset image quality conditions is a fine-grained quality problem, the semantic segmentation algorithm can be used to identify the pixel points at which positions in the image sample do not meet the requirements, thereby identifying the position of the image sample that does not meet the preset image quality conditions. Different algorithms can be used to determine the position of each image sample that does not meet the preset image quality conditions according to the different reasons why the image sample does not meet the preset image quality conditions. The location of the quality conditions makes the positioning method for image quality problems more flexible, which can not only effectively save resources, but also efficiently and accurately obtain positioning results that meet the requirements.

在步骤S1024中,基于多张图像、每张图像样本不符合预设的图像质量条件的原因以及每张图像样本中不符合预设的图像质量条件的位置,通过预设的第一损失函数对原因生成模型进行训练,得到训练后的原因生成模型。In step S1024, based on multiple images, the reasons why each image sample does not meet the preset image quality conditions, and the positions in each image sample that do not meet the preset image quality conditions, the cause generation model is trained using a preset first loss function to obtain a trained cause generation model.

在一种实施方式中,预设的第一损失函数采用交叉熵损失函数,该预设的第一损失函数用于原因生成模型训练,属于有监督训练方式,既能够提高模型训练的效率,又可以提高模型训练输出结果的准确性。In one embodiment, the preset first loss function adopts a cross entropy loss function. The preset first loss function is used for cause generation model training, which belongs to a supervised training method. It can not only improve the efficiency of model training, but also improve the accuracy of model training output results.

进一步地,如图2所示,当本说明书实施例中的图像采集方法用于基于电子身份认证信息的区块链系统时,该图像采集方法可以包括以下步骤S202-S208。Further, as shown in FIG. 2 , when the image acquisition method in the embodiment of this specification is used in a blockchain system based on electronic identity authentication information, the image acquisition method may include the following steps S202 - S208 .

S202:采集目标用户为接入基于电子身份认证信息的区块链系统所拍摄的目标用户的指定证件的图像和/或目标用户的图像,将所拍摄的指定证件的图像和/或目标用户的图像作为对目标对象所拍摄的图像。S202: Collect an image of a designated certificate of the target user and/or an image of the target user taken for the target user to access the blockchain system based on electronic identity authentication information, and use the image of the designated certificate and/or the image of the target user taken as the image taken of the target object.

在实施中,为了接入基于电子身份认证信息的区块链系统,目标用户需要拍摄相应的图像(可能是指定证件的图像,也可能是目标用户的图像,还可能是指定证件的图像和目标用户的图像),采集到上述图像,并将上述图像作为目标对象所拍摄的图像。In implementation, in order to access the blockchain system based on electronic identity authentication information, the target user needs to take a corresponding image (which may be an image of a designated document, an image of the target user, or an image of a designated document and an image of the target user), collect the above image, and use the above image as the image taken of the target object.

S204:当图像不符合预设的图像质量条件时,将图像输入预先训练的原因生成模型中,通过原因生成模型确定图像中不符合预设的图像质量条件的位置,并生成针对确定的位置处的图像重拍引导信息,原因生成模型是用于对造成图像不符合预设的图像质量条件的原因进行检测,并为检测结果提供相应的改进建议的模型。S204: When the image does not meet the preset image quality conditions, the image is input into a pre-trained cause generation model, the cause generation model is used to determine the position of the image that does not meet the preset image quality conditions, and generate image reshoot guidance information for the determined position. The cause generation model is a model for detecting the reasons why the image does not meet the preset image quality conditions and providing corresponding improvement suggestions for the detection results.

S206:根据图像重拍引导信息,对目标对象重新进行图像拍摄。S206: Re-shooting the image of the target object according to the image re-shooting guidance information.

上述步骤S204-S206的具体处理过程可以参见上述步骤S104-S106的相关内容,在此不再赘述。The specific processing process of the above steps S204-S206 can refer to the relevant content of the above steps S104-S106, which will not be repeated here.

S208:如果图像符合预设的图像质量条件,则将图像和/或图像中包含的信息存储于区块链系统中。S208: If the image meets the preset image quality condition, the image and/or the information contained in the image is stored in the blockchain system.

当图像符合预设的图像质量条件时,通过将图像和/或图像中包含的信息存储于区块链系统中,能够在确保图像信息的安全可信的基础上,增加区块链系统中的图像信息容量。 When an image meets the preset image quality conditions, by storing the image and/or the information contained in the image in the blockchain system, the image information capacity in the blockchain system can be increased while ensuring the security and reliability of the image information.

进一步地,如图3所示,本说明书实施例中对所获取的针对目标对象所拍摄的图像,判断其是否符合预设的图像质量条件的方法可以有多种方式,通过质量模型判断所拍摄的图像是否符合预设的图像质量条件是一种可选的处理方法,当通过质量模型判断所拍摄的图像是否符合预设的图像质量条件时,本说明书实施例的图像采集方法可以包括如下步骤S302-S308。Further, as shown in FIG3 , there may be a variety of methods for determining whether an image captured of a target object meets preset image quality conditions in the embodiments of this specification. Determining whether the captured image meets preset image quality conditions through a quality model is an optional processing method. When determining whether the captured image meets preset image quality conditions through a quality model, the image acquisition method of the embodiments of this specification may include the following steps S302-S308.

步骤S302:获取针对目标对象所拍摄的图像。Step S302: Acquire an image captured of the target object.

步骤S302的具体处理过程可以参加上述步骤S102的相关内容,在此不再赘述。The specific processing process of step S302 can refer to the relevant content of the above step S102, which will not be repeated here.

步骤S304:将图像输入预先训练的质量模型中,得到图像的质量评估结果。Step S304: input the image into a pre-trained quality model to obtain a quality assessment result of the image.

其中,图像的质量评估结果包括:质量不合格(即图像不符合预设的图像质量条件)与质量合格(即图像符合预设的图像质量条件)。该质量模型用于基于图像分类的方法确定图像是否符合预设的图像质量条件,且质量模型通过不同拍摄对象的图像样本和预设的第二损失函数进行模型训练得到。质量模型的输入数据为不同拍摄对象的图像样本,即上述步骤S302所获取的图像,质量模型的输出结果为带有质量评估结果的图像,包括质量合格图像与质量不合格图像。The quality assessment results of the image include: unqualified quality (i.e., the image does not meet the preset image quality conditions) and qualified quality (i.e., the image meets the preset image quality conditions). The quality model is used to determine whether the image meets the preset image quality conditions based on the image classification method, and the quality model is obtained by model training through image samples of different photographed objects and a preset second loss function. The input data of the quality model are image samples of different photographed objects, i.e., the images obtained in the above step S302, and the output results of the quality model are images with quality assessment results, including qualified quality images and unqualified quality images.

通过训练一质量模型获取图像的质量评估结果,能够更加快速而精准地判断当前图像是否符合预设的图像质量条件,从而提高图像采集效率。By training a quality model to obtain the quality assessment results of the image, it is possible to more quickly and accurately determine whether the current image meets the preset image quality conditions, thereby improving image acquisition efficiency.

在一种实施方式中,上述质量模型可以采用二分类算法构建,通过上述质量模型可以将针对目标对象所拍摄的图像划分为质量合格图像(或称为高质量图像)与质量不合格图像(或称为低质量图像)。In one embodiment, the quality model can be constructed using a binary classification algorithm, and the images taken for the target object can be divided into qualified quality images (or high-quality images) and unqualified quality images (or low-quality images) through the quality model.

在一种实施例方式中,第二损失函数可以采用分类模型损失函数,具体可以采用交叉熵损失函数,也可以采用BCE-loss(二分类交叉熵损失函数)。In one embodiment, the second loss function may adopt a classification model loss function, specifically a cross entropy loss function, or a BCE-loss (binary cross entropy loss function).

如果质量评估结果指示图像符合预设的图像质量条件,用户根据所获取的图像继续执行后续流程。If the quality assessment result indicates that the image meets the preset image quality condition, the user continues to perform subsequent processes based on the acquired image.

步骤S306:如果质量评估结果指示图像不符合预设的图像质量条件,则将图像输入预先训练的原因生成模型中,通过原因生成模型确定图像中不符合预设的图像质量条件的位置,并生成针对确定的位置处的图像重拍引导信息,原因生成模型是用于对造成图像不符合预设的图像质量条件的原因进行检测,并为检测结果提供相应的改进建议的模型。 Step S306: If the quality assessment result indicates that the image does not meet the preset image quality conditions, the image is input into a pre-trained cause generation model, and the cause generation model is used to determine the position in the image that does not meet the preset image quality conditions, and generate image reshoot guidance information for the determined position. The cause generation model is a model for detecting the reasons why the image does not meet the preset image quality conditions, and providing corresponding improvement suggestions for the detection results.

步骤S308:根据图像重拍引导信息,对目标对象重新进行图像拍摄。Step S308: re-shooting the image of the target object according to the image re-shooting guidance information.

上述步骤S306-S308的具体处理过程可以参加上述步骤S104-S106的相关内容,在此不再赘述。The specific processing process of the above steps S306-S308 can refer to the relevant content of the above steps S104-S106, which will not be repeated here.

示例性地,图4以eKYC流程为例,示出了根据本说明书一实施例的图像采集方法的示意性原理图,由图4可知,本说明书一个或多个实施例提供的一种图像采集方法在实际应用中的流程如下。Exemplarily, FIG4 takes the eKYC process as an example and shows a schematic principle diagram of an image acquisition method according to an embodiment of this specification. As can be seen from FIG4, the process of an image acquisition method provided by one or more embodiments of this specification in actual application is as follows.

1)在eKYC流程中,用户完成证件或人脸的拍摄。1) In the eKYC process, the user completes the photo taking of ID or face.

2)获取到用户拍摄的图像后,使用一预先训练的基于图像分类的质量模型,对用户拍摄的图像进行质量二分类,确定用户拍摄的图像为高质量图像或低质量图像。2) After obtaining the image taken by the user, a pre-trained quality model based on image classification is used to perform a binary quality classification on the image taken by the user to determine whether the image taken by the user is a high-quality image or a low-quality image.

3)如果分类结果是高质量图像,则用户继续执行后续的流程;如果分类结果是低质量图像,将图像输入一预先训练的原因生成模型。3) If the classification result is a high-quality image, the user continues to execute the subsequent process; if the classification result is a low-quality image, the image is input into a pre-trained cause generation model.

4)在原因生成模型中,对低质量图像的质量问题原因进行细分,质量问题原因可以包括:高光、低光、模糊等,同时在原因生成模型中对低质量图像中有质量问题的具体位置进行定位。4) In the cause generation model, the causes of quality problems in low-quality images are subdivided. The causes of quality problems may include: highlights, low light, blur, etc. At the same time, the specific locations of quality problems in low-quality images are located in the cause generation model.

5)获得细分后的质量问题原因和具体位置后,通过原因生成模型进行文本生成,得到图像重拍引导信息,如“证件左上角高光过曝”,“图像整体模糊,拍摄时请保持手机静止”等,并将上述图像重拍引导信息反馈给用户。5) After obtaining the segmented causes and specific locations of the quality problems, text generation is performed through the cause generation model to obtain image reshoot guidance information, such as "the highlight in the upper left corner of the document is overexposed", "the overall image is blurred, please keep the phone still when shooting", etc., and the above image reshoot guidance information is fed back to the user.

6)用户根据所获取的图像重拍引导信息进行图像重拍,并继续eKYC流程。6) The user retakes the image according to the obtained image retake guidance information and continues the eKYC process.

本说明书实施例提供一种图像采集方法,首先获取针对目标对象所拍摄的图像,然后判断图像是否符合预设的图像质量条件,且当图像不符合预设的图像质量条件时,将图像输入预先训练的原因生成模型中,通过原因生成模型确定图像中不符合预设的图像质量条件的位置,并生成针对确定的位置处的图像重拍引导信息,最后根据所生成的图像重拍引导信息,对目标对象重新进行图像拍摄,从而完成图像采集。当图像不符合预设的图像质量条件时,通过预先训练的原因生成模型能够对上一次拍摄过程中造成图像不符合预设的图像质量条件的原因进行检测,获取到导致质量问题的具体原因,并对上一次拍摄的图像中不符合预设的图像质量条件的位置进行定位,获取到导致质量问题的具体位置,最终生成针对确定的位置处的图像重拍引导信息反馈给用户,从而为下一次拍摄提供有针对性的改进建议,能够有效减少重复拍摄的次数,提高图像拍摄效率,进而提高用户体验。 The embodiment of the present specification provides an image acquisition method, firstly acquiring an image shot for a target object, then determining whether the image meets a preset image quality condition, and when the image does not meet the preset image quality condition, inputting the image into a pre-trained cause generation model, determining the position in the image that does not meet the preset image quality condition through the cause generation model, and generating image reshoot guidance information for the determined position, and finally reshooting the target object according to the generated image reshoot guidance information, thereby completing image acquisition. When the image does not meet the preset image quality condition, the pre-trained cause generation model can detect the cause of the image not meeting the preset image quality condition in the last shooting process, obtain the specific cause of the quality problem, and locate the position in the last shot image that does not meet the preset image quality condition, obtain the specific position that causes the quality problem, and finally generate image reshoot guidance information for the determined position to feedback to the user, thereby providing targeted improvement suggestions for the next shooting, which can effectively reduce the number of repeated shootings, improve image shooting efficiency, and thus improve user experience.

综上,已经对本主题的特定实施例进行了描述。其它实施例在所附权利要求书的范围内。在一些情况下,在权利要求书中记载的动作可以按照不同的顺序来执行并且仍然可以实现期望的结果。另外,在附图中描绘的过程不一定要求示出的特定顺序或者连续顺序,以实现期望的结果。在某些实施方式中,多任务处理和并行处理可以是有利的。In summary, specific embodiments of the present subject matter have been described. Other embodiments are within the scope of the appended claims. In some cases, the actions recorded in the claims can be performed in a different order and still achieve the desired results. In addition, the processes depicted in the accompanying drawings do not necessarily require the specific order or sequential order shown to achieve the desired results. In some embodiments, multitasking and parallel processing can be advantageous.

以上为本说明书一个或多个实施例提供的图像采集方法,基于同样的思路,本说明书一个或多个实施例还提供一种图像采集装置,如图5所示。The above is an image acquisition method provided by one or more embodiments of this specification. Based on the same idea, one or more embodiments of this specification also provide an image acquisition device, as shown in FIG5 .

该图像采集装置包括:图像获取模块410、图像重拍引导信息生成模块420和控制模块430。The image acquisition device includes: an image acquisition module 410 , an image reshooting guidance information generation module 420 and a control module 430 .

图像获取模块410,获取针对目标对象所拍摄的图像。The image acquisition module 410 acquires an image taken of a target object.

图像重拍引导信息生成模块420,当图像不符合预设的图像质量条件时,将图像输入预先训练的原因生成模型中,通过原因生成模型确定图像中不符合预设的图像质量条件的位置,并生成针对确定的位置处的图像重拍引导信息,原因生成模型是用于对造成图像不符合预设的图像质量条件的原因进行检测,并为检测结果提供相应的改进建议的模型。The image reshooting guidance information generation module 420, when the image does not meet the preset image quality conditions, inputs the image into a pre-trained cause generation model, determines the position in the image that does not meet the preset image quality conditions through the cause generation model, and generates image reshooting guidance information for the determined position. The cause generation model is a model for detecting the cause of the image not meeting the preset image quality conditions and providing corresponding improvement suggestions for the detection results.

控制模块430,根据图像重拍引导信息,对目标对象重新进行图像拍摄。The control module 430 re-takes an image of the target object according to the image re-shooting guidance information.

在一个实施例中,图像获取模块410,还用于采集目标用户为接入基于电子身份认证信息的区块链系统所拍摄的目标用户的指定证件的图像和/或目标用户的图像,将所拍摄的指定证件的图像和/或目标用户的图像作为对目标对象所拍摄的图像;图像采集装置还可以包括:存储模块,如果图像符合预设的图像质量条件,则将图像和/或图像中包含的信息存储于区块链系统中。In one embodiment, the image acquisition module 410 is also used to collect images of the target user's designated certificate and/or the target user taken by the target user to access the blockchain system based on electronic identity authentication information, and use the images of the designated certificate and/or the target user taken as the images taken of the target object; the image acquisition device may also include: a storage module, if the image meets the preset image quality conditions, the image and/or the information contained in the image is stored in the blockchain system.

在一个实施例中,图像采集装置还可以包括:质量评估结果获取模块,将图像输入预先训练的质量模型中,得到图像的质量评估结果;图像重拍引导信息生成模块420,还用于如果质量评估结果指示图像不符合预设的图像质量条件,则将图像输入预先训练的原因生成模型中,通过原因生成模型确定图像中不符合预设的图像质量条件的位置,并生成针对确定的位置处的图像重拍引导信息。In one embodiment, the image acquisition device may also include: a quality assessment result acquisition module, which inputs the image into a pre-trained quality model to obtain a quality assessment result of the image; an image reshooting guidance information generation module 420, which is also used to input the image into a pre-trained cause generation model if the quality assessment result indicates that the image does not meet the preset image quality conditions, determine the position in the image that does not meet the preset image quality conditions through the cause generation model, and generate image reshooting guidance information for the determined position.

本说明书实施例提供一种图像采集装置,首先通过图像获取模块获取针对目标对象所拍摄的图像,然后当图像不符合预设的图像质量条件时,通过图像重拍引导信息生成模块将图像输入预先训练的原因生成模型中,经由原因生成模型确定图像中不符合预设的图像质量条件的位置,并生成针对确定的位置处的图像重拍引导信息,最后通过控制 模块根据所生成的图像重拍引导信息,对目标对象重新进行图像拍摄,从而完成图像采集。当图像不符合预设的图像质量条件时,通过预先训练的原因生成模型能够对上一次拍摄过程中造成图像不符合预设的图像质量条件的原因进行检测,获取到导致质量问题的具体原因,并对上一次拍摄的图像中不符合预设的图像质量条件的位置进行定位,获取到导致质量问题的具体位置,最终生成针对确定的位置处的图像重拍引导信息反馈给用户,从而为下一次拍摄提供有针对性的改进建议,能够有效减少重复拍摄的次数,提高图像拍摄效率,进而提高用户体验。The embodiment of the present specification provides an image acquisition device, which first acquires an image shot for a target object through an image acquisition module, and then, when the image does not meet the preset image quality conditions, the image retake guidance information generation module inputs the image into a pre-trained cause generation model, determines the position in the image that does not meet the preset image quality conditions through the cause generation model, and generates image retake guidance information for the determined position, and finally controls the image retake guidance information generation module to generate the image retake guidance information for the determined position. The module re-takes the image of the target object according to the generated image retake guidance information, thereby completing the image acquisition. When the image does not meet the preset image quality conditions, the pre-trained cause generation model can detect the reasons why the image did not meet the preset image quality conditions during the last shooting process, obtain the specific reasons for the quality problem, and locate the position in the last shot that does not meet the preset image quality conditions, obtain the specific position that causes the quality problem, and finally generate image retake guidance information for the determined position and feedback it to the user, so as to provide targeted improvement suggestions for the next shooting, which can effectively reduce the number of repeated shooting, improve image shooting efficiency, and thus improve user experience.

本领域的技术人员应可理解,上述图像采集装置能够用来实现前文所述的图像采集方法,其中的细节描述应与前文方法部分描述类似,为避免繁琐,此处不另赘述。Those skilled in the art should understand that the above-mentioned image acquisition device can be used to implement the image acquisition method described above, and the detailed description thereof should be similar to the description of the method part above, and will not be further described here to avoid tediousness.

基于同样的思路,本说明书一个或多个实施例还提供一种电子设备,如图6所示。电子设备可因配置或性能不同而产生比较大的差异,可以包括一个或一个以上的处理器501和存储器502,存储器502中可以存储有一个或一个以上存储应用程序或数据。其中,存储器502可以是短暂存储或持久存储。存储在存储器502的应用程序可以包括一个或一个以上模块(图示未示出),每个模块可以包括对电子设备中的一系列计算机可执行指令。更进一步地,处理器501可以设置为与存储器502通信,在电子设备上执行存储器502中的一系列计算机可执行指令。电子设备还可以包括一个或一个以上电源503,一个或一个以上有线或无线网络接口504,一个或一个以上输入输出接口505,一个或一个以上键盘506。Based on the same idea, one or more embodiments of the present specification also provide an electronic device, as shown in FIG6 . The electronic device may have relatively large differences due to different configurations or performances, and may include one or more processors 501 and a memory 502, and the memory 502 may store one or more storage applications or data. Among them, the memory 502 may be a short-term storage or a persistent storage. The application stored in the memory 502 may include one or more modules (not shown in the figure), and each module may include a series of computer executable instructions in the electronic device. Furthermore, the processor 501 may be configured to communicate with the memory 502 to execute a series of computer executable instructions in the memory 502 on the electronic device. The electronic device may also include one or more power supplies 503, one or more wired or wireless network interfaces 504, one or more input and output interfaces 505, and one or more keyboards 506.

具体在本实施例中,电子设备包括有存储器,以及一个或一个以上的程序,其中一个或者一个以上程序存储于存储器中,且一个或者一个以上程序可以包括一个或一个以上模块,且每个模块可以包括对电子设备中的一系列计算机可执行指令,且经配置以由一个或者一个以上处理器执行该一个或者一个以上程序包含用于进行以下计算机可执行指令:获取针对目标对象所拍摄的图像;当图像不符合预设的图像质量条件时,将图像输入预先训练的原因生成模型中,通过原因生成模型确定图像中不符合预设的图像质量条件的位置,并生成针对确定的位置处的图像重拍引导信息,原因生成模型是用于对造成图像不符合预设的图像质量条件的原因进行检测,并为检测结果提供相应的改进建议的模型;根据图像重拍引导信息,对目标对象重新进行图像拍摄。Specifically in this embodiment, the electronic device includes a memory, and one or more programs, wherein the one or more programs are stored in the memory, and the one or more programs may include one or more modules, and each module may include a series of computer executable instructions in the electronic device, and is configured to be executed by one or more processors. The one or more programs include the following computer executable instructions: obtaining an image taken of a target object; when the image does not meet a preset image quality condition, inputting the image into a pre-trained cause generation model, determining the position in the image that does not meet the preset image quality condition through the cause generation model, and generating image reshooting guidance information for the determined position, the cause generation model is a model for detecting the cause of the image not meeting the preset image quality condition, and providing corresponding improvement suggestions for the detection result; and re-taking the image of the target object according to the image reshooting guidance information.

本说明书一个或多个实施例还提出了一种存储介质,该存储介质用于存储计算机可执行指令,可执行指令在被处理器执行时实现以下流程:获取针对目标对象所拍摄的图像;当图像不符合预设的图像质量条件时,将图像输入预先训练的原因生成模型中,通 过原因生成模型确定图像中不符合预设的图像质量条件的位置,并生成针对确定的位置处的图像重拍引导信息,原因生成模型是用于对造成图像不符合预设的图像质量条件的原因进行检测,并为检测结果提供相应的改进建议的模型;根据图像重拍引导信息,对目标对象重新进行图像拍摄。One or more embodiments of the present specification also propose a storage medium for storing computer executable instructions, which implement the following process when executed by a processor: obtaining an image taken of a target object; when the image does not meet the preset image quality conditions, inputting the image into a pre-trained cause generation model; The cause generation model is used to determine the position in the image that does not meet the preset image quality conditions, and generate image reshoot guidance information for the determined position. The cause generation model is a model for detecting the reasons why the image does not meet the preset image quality conditions, and providing corresponding improvement suggestions for the detection results; according to the image reshoot guidance information, the target object is re-imaged.

上述实施例阐明的系统、装置、模块或单元,具体可以由计算机芯片或实体实现,或者由具有某种功能的产品来实现。一种典型的实现设备为计算机。具体的,计算机例如可以为个人计算机、膝上型计算机、蜂窝电话、相机电话、智能电话、个人数字助理、媒体播放器、导航设备、电子邮件设备、游戏控制台、平板计算机、可穿戴设备或者这些设备中的任何设备的组合。The systems, devices, modules or units described in the above embodiments may be implemented by computer chips or entities, or by products with certain functions. A typical implementation device is a computer. Specifically, the computer may be, for example, a personal computer, a laptop computer, a cellular phone, a camera phone, a smart phone, a personal digital assistant, a media player, a navigation device, an email device, a game console, a tablet computer, a wearable device, or a combination of any of these devices.

为了描述的方便,描述以上装置时以功能分为各种单元分别描述。当然,在实施本说明书一个或多个实施例时可以把各单元的功能在同一个或多个软件和/或硬件中实现。For the convenience of description, the above devices are described in terms of functions and are divided into various units. Of course, when implementing one or more embodiments of this specification, the functions of each unit can be implemented in the same or multiple software and/or hardware.

本领域内的技术人员应明白,本说明书一个或多个实施例可提供为方法、系统、或计算机程序产品。因此,本说明书一个或多个实施例可采用完全硬件实施例、完全软件实施例、或结合软件和硬件方面的实施例的形式。而且,本说明书一个或多个实施例可采用在一个或多个其中包含有计算机可用程序代码的计算机可用存储介质(包括但不限于磁盘存储器、CD-ROM、光学存储器等)上实施的计算机程序产品的形式。Those skilled in the art will appreciate that one or more embodiments of this specification may be provided as methods, systems, or computer program products. Therefore, one or more embodiments of this specification may take the form of a complete hardware embodiment, a complete software embodiment, or an embodiment combining software and hardware. Moreover, one or more embodiments of this specification may take the form of a computer program product implemented on one or more computer-usable storage media (including but not limited to disk storage, CD-ROM, optical storage, etc.) containing computer-usable program code.

本说明书一个或多个实施例是参照根据本说明书实施例的方法、设备(系统)、和计算机程序产品的流程图和/或方框图来描述的。应理解可由计算机程序指令实现流程图和/或方框图中的每一流程和/或方框、以及流程图和/或方框图中的流程和/或方框的结合。可提供这些计算机程序指令到通用计算机、专用计算机、嵌入式处理机或其他可编程数据处理设备的处理器以产生一个机器,使得通过计算机或其他可编程数据处理设备的处理器执行的指令产生用于实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能的装置。One or more embodiments of the present specification are described with reference to the flowcharts and/or block diagrams of the methods, devices (systems), and computer program products according to the embodiments of the present specification. It should be understood that each process and/or box in the flowchart and/or block diagram, as well as the combination of the processes and/or boxes in the flowchart and/or block diagram, can be implemented by computer program instructions. These computer program instructions can be provided to a processor of a general-purpose computer, a special-purpose computer, an embedded processor, or other programmable data processing device to produce a machine, so that the instructions executed by the processor of the computer or other programmable data processing device produce a device for implementing the functions specified in one process or multiple processes in the flowchart and/or one box or multiple boxes in the block diagram.

这些计算机程序指令也可存储在能引导计算机或其他可编程数据处理设备以特定方式工作的计算机可读存储器中,使得存储在该计算机可读存储器中的指令产生包括指令装置的制造品,该指令装置实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能。These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing device to work in a specific manner, so that the instructions stored in the computer-readable memory produce a manufactured product including an instruction device that implements the functions specified in one or more processes in the flowchart and/or one or more boxes in the block diagram.

这些计算机程序指令也可装载到计算机或其他可编程数据处理设备上,使得在计算机或其他可编程设备上执行一系列操作步骤以产生计算机实现的处理,从而在计算机或 其他可编程设备上执行的指令提供用于实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能的步骤。These computer program instructions can also be loaded into a computer or other programmable data processing device, so that a series of operation steps are executed on the computer or other programmable device to produce a computer-implemented process, thereby performing a computer-implemented process on the computer or other programmable device. The instructions executed on other programmable devices provide steps for implementing the functions specified in one or more flows of the flowcharts and/or one or more blocks of the block diagrams.

在一个典型的配置中,计算设备包括一个或多个处理器(CPU)、输入/输出接口、网络接口和内存。In a typical configuration, a computing device includes one or more processors (CPU), input/output interfaces, network interfaces, and memory.

内存可能包括计算机可读介质中的非永久性存储器,随机存取存储器(RAM)和/或非易失性内存等形式,如只读存储器(ROM)或闪存(flash RAM)。内存是计算机可读介质的示例。Memory may include non-permanent storage in a computer-readable medium, in the form of random access memory (RAM) and/or non-volatile memory, such as read-only memory (ROM) or flash memory (flash RAM). Memory is an example of a computer-readable medium.

计算机可读介质包括永久性和非永久性、可移动和非可移动媒体可以由任何方法或技术来实现信息存储。信息可以是计算机可读指令、数据结构、程序的模块或其他数据。计算机的存储介质的例子包括,但不限于相变内存(PRAM)、静态随机存取存储器(SRAM)、动态随机存取存储器(DRAM)、其他类型的随机存取存储器(RAM)、只读存储器(ROM)、电可擦除可编程只读存储器(EEPROM)、快闪记忆体或其他内存技术、只读光盘只读存储器(CD-ROM)、数字多功能光盘(DVD)或其他光学存储、磁盒式磁带,磁带磁盘存储或其他磁性存储设备或任何其他非传输介质,可用于存储可以被计算设备访问的信息。按照本文中的界定,计算机可读介质不包括暂存电脑可读媒体(transitory media),如调制的数据信号和载波。Computer readable media include permanent and non-permanent, removable and non-removable media that can be implemented by any method or technology to store information. Information can be computer readable instructions, data structures, program modules or other data. Examples of computer storage media include, but are not limited to, phase change memory (PRAM), static random access memory (SRAM), dynamic random access memory (DRAM), other types of random access memory (RAM), read-only memory (ROM), electrically erasable programmable read-only memory (EEPROM), flash memory or other memory technology, compact disk read-only memory (CD-ROM), digital versatile disk (DVD) or other optical storage, magnetic cassettes, magnetic disk storage or other magnetic storage devices or any other non-transmission media that can be used to store information that can be accessed by a computing device. As defined in this article, computer readable media does not include temporary computer readable media (transitory media), such as modulated data signals and carrier waves.

还需要说明的是,术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含,从而使得包括一系列要素的过程、方法、商品或者设备不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种过程、方法、商品或者设备所固有的要素。在没有更多限制的情况下,由语句“包括一个……”限定的要素,并不排除在包括所述要素的过程、方法、商品或者设备中还存在另外的相同要素。It should also be noted that the terms "include", "comprises" or any other variations thereof are intended to cover non-exclusive inclusion, so that a process, method, commodity or device including a series of elements includes not only those elements, but also other elements not explicitly listed, or also includes elements inherent to such process, method, commodity or device. In the absence of more restrictions, the elements defined by the sentence "comprises a ..." do not exclude the existence of other identical elements in the process, method, commodity or device including the elements.

本说明书一个或多个实施例可以在由计算机执行的计算机可执行指令的一般上下文中描述,例如程序模块。一般地,程序模块包括执行特定任务或实现特定抽象数据类型的例程、程序、对象、组件、数据结构等等。也可以在分布式计算环境中实践本说明书,在这些分布式计算环境中,由通过通信网络而被连接的远程处理设备来执行任务。在分布式计算环境中,程序模块可以位于包括存储设备在内的本地和远程计算机存储介质中。One or more embodiments of the present specification may be described in the general context of computer-executable instructions executed by a computer, such as program modules. Generally, program modules include routines, programs, objects, components, data structures, etc. that perform specific tasks or implement specific abstract data types. The present specification may also be practiced in distributed computing environments where tasks are performed by remote processing devices connected through a communication network. In a distributed computing environment, program modules may be located in local and remote computer storage media, including storage devices.

本说明书中的各个实施例均采用递进的方式描述,各个实施例之间相同相似的部分互相参见即可,每个实施例重点说明的都是与其他实施例的不同之处。尤其,对于系统实施例而言,由于其基本相似于方法实施例,所以描述的比较简单,相关之处参见方法 实施例的部分说明即可。Each embodiment in this specification is described in a progressive manner. The same or similar parts between the embodiments can be referred to each other. Each embodiment focuses on the differences from other embodiments. In particular, for the system embodiment, since it is basically similar to the method embodiment, the description is relatively simple. For the relevant parts, refer to the method embodiment. A partial description of the embodiments will suffice.

以上所述仅为本说明书一个或多个实施例而已,并不用于限制本申请。对于本领域技术人员来说,本说明书一个或多个实施例可以有各种更改和变化。凡在本说明书一个或多个实施例的精神和原理之内所作的任何修改、等同替换、改进等,均应包含在本说明书一个或多个实施例的权利要求范围之内。 The above description is only one or more embodiments of this specification and is not intended to limit this application. For those skilled in the art, one or more embodiments of this specification may have various changes and variations. Any modification, equivalent replacement, improvement, etc. made within the spirit and principle of one or more embodiments of this specification should be included in the scope of the claims of one or more embodiments of this specification.

Claims (10)

一种图像采集方法,包括:An image acquisition method, comprising: 获取针对目标对象所拍摄的图像;Acquire an image taken of a target object; 当所述图像不符合预设的图像质量条件时,将所述图像输入预先训练的原因生成模型中,通过所述原因生成模型确定所述图像中不符合预设的图像质量条件的位置,并生成针对确定的位置处的图像重拍引导信息,所述原因生成模型是用于对造成图像不符合预设的图像质量条件的原因进行检测,并为检测结果提供相应的改进建议的模型;When the image does not meet the preset image quality condition, the image is input into a pre-trained cause generation model, the position of the image that does not meet the preset image quality condition is determined by the cause generation model, and image reshoot guidance information for the determined position is generated, wherein the cause generation model is a model for detecting the cause of the image not meeting the preset image quality condition and providing corresponding improvement suggestions for the detection result; 根据所述图像重拍引导信息,对所述目标对象重新进行图像拍摄。The target object is imaged again according to the image reshooting guidance information. 根据权利要求1所述的方法,所述原因生成模型的训练方法包括:According to the method of claim 1, the training method of the cause generation model comprises: 获取针对不同拍摄对象所拍摄的多张图像样本、每张图像样本不符合预设的图像质量条件的原因以及每张图像样本中不符合预设的图像质量条件的位置;Acquire multiple image samples taken for different photographed objects, the reason why each image sample does not meet the preset image quality condition, and the position of each image sample that does not meet the preset image quality condition; 基于所述多张图像样本、每张图像样本不符合预设的图像质量条件的原因以及每张图像样本中不符合预设的图像质量条件的位置,通过预设的第一损失函数对原因生成模型进行训练,得到训练后的原因生成模型。Based on the multiple image samples, the reason why each image sample does not meet the preset image quality conditions, and the position in each image sample that does not meet the preset image quality conditions, the cause generation model is trained using a preset first loss function to obtain a trained cause generation model. 根据权利要求2所述的方法,所述获取针对不同拍摄对象所拍摄的多张图像样本、每张图像样本不符合预设的图像质量条件的原因以及每张图像样本中不符合预设的图像质量条件的位置,包括:According to the method of claim 2, the step of obtaining a plurality of image samples taken for different photographed objects, the reason why each image sample does not meet the preset image quality condition, and the position in each image sample that does not meet the preset image quality condition comprises: 获取针对不同拍摄对象所拍摄的多张图像样本;Acquire multiple image samples taken for different objects; 分别确定使得每张图像样本不符合预设的图像质量条件的原因以及每张图像样本中不符合预设的图像质量条件的位置。The reason why each image sample does not meet the preset image quality condition and the position of each image sample that does not meet the preset image quality condition are determined respectively. 根据权利要求2所述的方法,所述图像样本不符合预设的图像质量条件的原因包括:第一质量问题和第二质量问题,其中,第一质量问题是图像粒度大于等于预设的图像粒度阈值的图像质量问题,第二质量问题是图像粒度小于预设的图像粒度阈值的图像质量问题;According to the method of claim 2, the reason why the image sample does not meet the preset image quality condition includes: a first quality problem and a second quality problem, wherein the first quality problem is an image quality problem in which the image granularity is greater than or equal to a preset image granularity threshold, and the second quality problem is an image quality problem in which the image granularity is less than the preset image granularity threshold; 所述确定每张图像样本中不符合预设的图像质量条件的位置,包括:Determining the position of each image sample that does not meet the preset image quality condition includes: 当所述图像样本不符合预设的图像质量条件的原因为第一质量问题时,基于目标检测算法确定每张图像样本中不符合预设的图像质量条件的位置;When the reason why the image samples do not meet the preset image quality condition is a first quality problem, determining a position in each image sample that does not meet the preset image quality condition based on a target detection algorithm; 当所述图像样本不符合预设的图像质量条件的原因为第二质量问题时,基于语义分割算法识别每张图像样本中不符合预设的图像质量条件的位置。When the reason why the image samples do not meet the preset image quality condition is the second quality problem, a position in each image sample that does not meet the preset image quality condition is identified based on a semantic segmentation algorithm. 根据权利要求1所述的方法,所述获取针对目标对象所拍摄的图像,包括:According to the method of claim 1, the acquiring of the image taken of the target object comprises: 采集目标用户为接入基于电子身份认证信息的区块链系统所拍摄的所述目标用户 的指定证件的图像和/或所述目标用户的图像,将所拍摄的指定证件的图像和/或所述目标用户的图像作为对目标对象所拍摄的图像;The target user is the target user photographed by the blockchain system based on electronic identity authentication information. The image of the designated certificate and/or the image of the target user is used as the image taken of the target object; 所述方法还包括:The method further comprises: 如果所述图像符合预设的图像质量条件,则将所述图像和/或所述图像中包含的信息存储于所述区块链系统中。If the image meets the preset image quality condition, the image and/or the information contained in the image is stored in the blockchain system. 根据权利要求2所述的方法,所述预设的第一损失函数采用交叉熵损失函数。According to the method according to claim 2, the preset first loss function adopts a cross entropy loss function. 根据权利要求1所述的方法,所述方法还包括:The method according to claim 1, further comprising: 将所述图像输入预先训练的质量模型中,得到所述图像的质量评估结果;Inputting the image into a pre-trained quality model to obtain a quality assessment result of the image; 所述当所述图像不符合预设的图像质量条件时,将所述图像输入预先训练的原因生成模型中,通过所述原因生成模型确定所述图像中不符合预设的图像质量条件的位置,并生成针对确定的位置处的图像重拍引导信息,包括:When the image does not meet the preset image quality condition, the image is input into a pre-trained cause generation model, a position in the image that does not meet the preset image quality condition is determined by the cause generation model, and image reshooting guidance information for the determined position is generated, including: 如果所述质量评估结果指示所述图像不符合预设的图像质量条件,则将所述图像输入预先训练的原因生成模型中,通过所述原因生成模型确定所述图像中不符合预设的图像质量条件的位置,并生成针对确定的位置处的图像重拍引导信息。If the quality assessment result indicates that the image does not meet the preset image quality conditions, the image is input into a pre-trained cause generation model, the position of the image that does not meet the preset image quality conditions is determined by the cause generation model, and image reshoot guidance information is generated for the determined position. 根据权利要求7所述的方法,所述质量模型是采用二分类算法构建的模型。According to the method according to claim 7, the quality model is a model constructed using a binary classification algorithm. 一种图像采集装置,包括:An image acquisition device, comprising: 图像获取模块,获取针对目标对象所拍摄的图像;An image acquisition module, which acquires images taken of a target object; 图像重拍引导信息生成模块,当所述图像不符合预设的图像质量条件时,将所述图像输入预先训练的原因生成模型中,通过所述原因生成模型确定所述图像中不符合预设的图像质量条件的位置,并生成针对确定的位置处的图像重拍引导信息,所述原因生成模型是用于对造成图像不符合预设的图像质量条件的原因进行检测,并为检测结果提供相应的改进建议的模型;an image retake guidance information generating module, which, when the image does not meet the preset image quality condition, inputs the image into a pre-trained cause generation model, determines the position of the image that does not meet the preset image quality condition through the cause generation model, and generates image retake guidance information for the determined position, wherein the cause generation model is a model for detecting the cause causing the image to not meet the preset image quality condition and providing corresponding improvement suggestions for the detection result; 控制模块,根据所述图像重拍引导信息,对所述目标对象重新进行图像拍摄。A control module retakes an image of the target object according to the image retake guidance information. 一种电子设备,包括:An electronic device, comprising: 处理器;以及Processor; and 被安排成存储计算机可执行指令的存储器,在所述可执行指令被执行时,能够使得所述处理器:a memory arranged to store computer executable instructions which, when executed, cause the processor to: 获取针对目标对象所拍摄的图像;Acquire an image taken of a target object; 当所述图像不符合预设的图像质量条件时,将所述图像输入预先训练的原因生成模型中,通过所述原因生成模型确定所述图像中不符合预设的图像质量条件的位置,并生成针对确定的位置处的图像重拍引导信息,所述原因生成模型是用于对造成图像不符合 预设的图像质量条件的原因进行检测,并为检测结果提供相应的改进建议的模型;When the image does not meet the preset image quality condition, the image is input into a pre-trained cause generation model, the position of the image that does not meet the preset image quality condition is determined by the cause generation model, and image reshoot guidance information for the determined position is generated, wherein the cause generation model is used to determine the cause of the image not meeting the preset image quality condition. A model that detects the causes of preset image quality conditions and provides corresponding improvement suggestions for the detection results; 根据所述图像重拍引导信息,对所述目标对象重新进行图像拍摄。 The target object is imaged again according to the image reshooting guidance information.
PCT/CN2024/087890 2023-06-15 2024-04-16 Image acquisition Pending WO2024255425A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202310710587.4A CN116758258A (en) 2023-06-15 2023-06-15 Image acquisition method, device and electronic equipment
CN202310710587.4 2023-06-15

Publications (1)

Publication Number Publication Date
WO2024255425A1 true WO2024255425A1 (en) 2024-12-19

Family

ID=87954725

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2024/087890 Pending WO2024255425A1 (en) 2023-06-15 2024-04-16 Image acquisition

Country Status (2)

Country Link
CN (1) CN116758258A (en)
WO (1) WO2024255425A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN120782401A (en) * 2025-09-11 2025-10-14 张家港中理外轮理货有限公司 Information processing method for port container number management

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116758258A (en) * 2023-06-15 2023-09-15 支付宝(杭州)信息技术有限公司 Image acquisition method, device and electronic equipment

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108961006A (en) * 2018-07-09 2018-12-07 广州智乐物联网技术有限公司 A kind of identity card real-name authentication system based on block chain
CN112733802A (en) * 2021-01-25 2021-04-30 腾讯科技(深圳)有限公司 Image occlusion detection method and device, electronic equipment and storage medium
CN113139935A (en) * 2021-03-30 2021-07-20 宁波市眼科医院 Slit-lamp picture quality monitoring system
CN114241557A (en) * 2021-12-13 2022-03-25 深圳绿米联创科技有限公司 Image recognition method, device and equipment, smart door lock and medium
CN115035059A (en) * 2022-06-06 2022-09-09 京东方科技集团股份有限公司 Defect detection method, device, defect detection system, equipment and medium
US20220395245A1 (en) * 2021-06-14 2022-12-15 Konica Minolta, Inc. Image processing apparatus and recording medium
CN116758258A (en) * 2023-06-15 2023-09-15 支付宝(杭州)信息技术有限公司 Image acquisition method, device and electronic equipment

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108346139A (en) * 2018-01-09 2018-07-31 阿里巴巴集团控股有限公司 A kind of method for screening images and device
CN112019739A (en) * 2020-08-03 2020-12-01 RealMe重庆移动通信有限公司 Shooting control method and device, electronic equipment and storage medium

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108961006A (en) * 2018-07-09 2018-12-07 广州智乐物联网技术有限公司 A kind of identity card real-name authentication system based on block chain
CN112733802A (en) * 2021-01-25 2021-04-30 腾讯科技(深圳)有限公司 Image occlusion detection method and device, electronic equipment and storage medium
CN113139935A (en) * 2021-03-30 2021-07-20 宁波市眼科医院 Slit-lamp picture quality monitoring system
US20220395245A1 (en) * 2021-06-14 2022-12-15 Konica Minolta, Inc. Image processing apparatus and recording medium
CN114241557A (en) * 2021-12-13 2022-03-25 深圳绿米联创科技有限公司 Image recognition method, device and equipment, smart door lock and medium
CN115035059A (en) * 2022-06-06 2022-09-09 京东方科技集团股份有限公司 Defect detection method, device, defect detection system, equipment and medium
CN116758258A (en) * 2023-06-15 2023-09-15 支付宝(杭州)信息技术有限公司 Image acquisition method, device and electronic equipment

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN120782401A (en) * 2025-09-11 2025-10-14 张家港中理外轮理货有限公司 Information processing method for port container number management

Also Published As

Publication number Publication date
CN116758258A (en) 2023-09-15

Similar Documents

Publication Publication Date Title
KR102354692B1 (en) Rule-based video importance analysis
US9183429B2 (en) Method and apparatus for facial recognition
CN111191568B (en) Method, device, equipment and medium for identifying flip image
US8842889B1 (en) System and method for automatic face recognition
KR101647691B1 (en) Method for hybrid-based video clustering and server implementing the same
US11734347B2 (en) Video retrieval method and apparatus, device and storage medium
WO2020114105A1 (en) Comparison method based on multiple facial images, apparatus and electronic device
US11947631B2 (en) Reverse image search based on deep neural network (DNN) model and image-feature detection model
CN117523323B (en) A detection method and device for generating an image
CN110223320B (en) Object detection and tracking method and detection and tracking device
CN118840695A (en) Target behavior identification method, system and equipment
WO2024255425A1 (en) Image acquisition
KR102799866B1 (en) Method and apparatus for anomaly detection by reward-based retraining of GAN
CN113518217B (en) Object identification method, device, server and medium
CN113496243A (en) Background music obtaining method and related product
CN113033347B (en) An interactive risk assessment method based on human behavior and scenario analysis
CN114037880B (en) Data processing method, device, electronic device and storage medium
CN111898529B (en) Face detection method and device, electronic equipment and computer readable medium
CN120029539B (en) Method and device for storing image and storage medium
CN111343391A (en) Video capture method and electronic device using same
EP4323892A1 (en) Reverse image search based on deep neural network (dnn) model and image-feature detection model
CN118329134A (en) Target object tracking method, device, equipment and storage medium
CN117953202A (en) Method and device for detecting object in image, electronic equipment and readable storage medium
CN114819142A (en) Screen image recognition method and device, training method and device of model of screen image recognition method and device, and electronic equipment
CN120321431A (en) Video frame selection method, device, electronic device, storage medium and program product

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 24822380

Country of ref document: EP

Kind code of ref document: A1