[go: up one dir, main page]

WO2025152229A1 - Scanning light field self-supervised network reconstruction method and apparatus, electronic device, and medium - Google Patents

Scanning light field self-supervised network reconstruction method and apparatus, electronic device, and medium

Info

Publication number
WO2025152229A1
WO2025152229A1 PCT/CN2024/078130 CN2024078130W WO2025152229A1 WO 2025152229 A1 WO2025152229 A1 WO 2025152229A1 CN 2024078130 W CN2024078130 W CN 2024078130W WO 2025152229 A1 WO2025152229 A1 WO 2025152229A1
Authority
WO
WIPO (PCT)
Prior art keywords
light field
network
supervised
reconstruction
self
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
PCT/CN2024/078130
Other languages
French (fr)
Chinese (zh)
Inventor
戴琼海
卢志
曾昀敏
吴嘉敏
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tsinghua University
Original Assignee
Tsinghua University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tsinghua University filed Critical Tsinghua University
Publication of WO2025152229A1 publication Critical patent/WO2025152229A1/en
Pending legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T17/00Three dimensional [3D] modelling, e.g. data description of 3D objects
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2200/00Indexing scheme for image data processing or generation, in general
    • G06T2200/04Indexing scheme for image data processing or generation, in general involving 3D image data

Definitions

  • the present application relates to the field of computational imaging technology, and in particular to a method, device, electronic device and medium for self-supervised network reconstruction of a scanned light field.
  • Light fields or scanned light fields have attracted much attention because they can achieve large-scale rapid 3D imaging with a single or a small number of captured images. They are widely used in biological applications including 3D calcium imaging, among which reconstruction is an important step in scanning light field imaging and one of the main factors affecting image quality.
  • the 3D structure can be restored from the captured image according to the point spread function and light propagation model of the optical system through traditional deconvolution methods based on the light propagation model, or through network reconstruction methods based on deep learning, using light field or scanned light field data as input and high-resolution true value images as supervision, so that the network can learn the reconstruction process.
  • a first aspect of the present application provides a method for self-supervised network reconstruction of a scanned light field, comprising the following steps: acquiring scanned light field data and a point spread function of an optical system that captures the scanned light field data; preprocessing the scanned light field data to obtain preprocessed data; and constructing a self-supervised reconstruction network based on the preprocessed data and the point spread function until a preset iteration stop condition is reached to obtain a scanned light field self-supervised network reconstruction result.
  • the scanned light field data is preprocessed to obtain preprocessed data, including: rearranging the pixel points of the scanned light field data based on the position behind the microlens of the optical system to generate multi-angle data, wherein all pixel points at the same position behind the microlens are arranged in sequence according to spatial order.
  • preprocessing the scanned light field data to obtain preprocessed data further includes: performing a linear transformation on the multi-angle data so that the pixel values are within a preset range to obtain changed data.
  • the third aspect of the present application provides an electronic device, comprising: a memory, a processor, and a computer program stored in the memory and executable on the processor, wherein the processor executes the program to implement the scanning light field self-supervised network reconstruction method as described in the above embodiment.
  • the structure of the scanning light field self-supervised network reconstruction method includes: a scanning light field data acquisition unit, a data preprocessing unit, a self-supervised reconstruction network training unit and a self-supervised reconstruction network testing unit.
  • a scanning light field data unit is obtained to provide data for training and testing of the network, and a point spread function of the optical system that captured the data is passed to a random multi-angle forward transmission module for self-supervised reconstruction network training, wherein the data and the corresponding point spread function can be captured by a scanning optical microscope or downloaded from a public dataset;
  • the self-supervised reconstruction network training unit includes the network input forward transmission function, the random multi-angle forward transmission function and the self-supervised loss function return transmission function. Completing the above three steps is called an iteration.
  • the selected data often does not overlap with the data used by the self-supervised reconstruction network training unit, and the size is not necessarily the same. If the size is different, the selected data should be cropped with overlap and cropped into several images that meet the size requirements of the network input layer. Let the network predict separately and then splice the prediction results.
  • FIG2 is a flow chart of a scanning light field self-supervised network reconstruction method provided in an embodiment of the present application.
  • the scanned light field self-supervised network reconstruction method includes the following steps:
  • step S201 scanning light field data and a point spread function of an optical system for capturing the scanning light field data are obtained.
  • the light field refers to the changing distribution of information such as light intensity, direction and phase in space and time
  • the scanned light field data refers to a series of images or measurements obtained by sampling the light field from different angles and positions.
  • the embodiment of the present application can rearrange the pixel points of the scanned light field data according to their positions behind the microlens, and all the pixel points at the same position behind the microlens are arranged in sequence according to the spatial order to form 3D multi-angle data.
  • the multi-angle data can be linearly transformed so that the pixel value is between 0 and 1 to obtain normalized data.
  • the normalized data can be subjected to random angle rotation, flipping, cropping and other operations to obtain the final data after amplification.
  • the embodiment of the present application can pre-process the scanned light field data, such as data rearrangement, data normalization, data By performing amplification and other processing, the preprocessed data is obtained, thereby improving the precision and accuracy of the data, improving the clarity of the image, and further improving the authenticity of the reconstruction results.
  • the scanned light field data is preprocessed to obtain preprocessed data, including: rearranging the pixel points of the scanned light field data based on the position behind the microlens of the optical system to generate multi-angle data, wherein all pixel points at the same position behind the microlens are arranged in sequence according to spatial order.
  • the embodiment of the present application can rearrange the pixel points of the scanned light field data based on the position behind the microlens of the optical system, and all the pixel points at the same position behind the microlens are arranged in sequence in spatial order, thereby forming 3D multi-angle data.
  • the scanned light field data includes 5 viewing angles scanned from left to right in the horizontal direction, each viewing angle has 10 ⁇ 10 pixels, and for the first viewing angle, after rearrangement, the 10 pixel points in the first row will correspond to a specific spatial position. Similarly, the 10 pixel points in the second row will correspond to the next spatial position, and so on. Subsequently, for other viewing angles, the pixel points will be arranged in sequence according to the same spatial position.
  • the embodiment of the present application can rearrange the pixel points of the scanned light field data based on the position behind the microlens of the optical system to generate multi-angle data, and can obtain 3D multi-angle data based on the rearrangement of the position behind the microlens, thereby providing more comprehensive and accurate information, reducing artifacts, improving detail restoration, and ensuring the accuracy of the reconstruction results.
  • preprocessing the scanned light field data to obtain preprocessed data further includes: performing a linear transformation on the multi-angle data so that the pixel values are within a preset range to obtain changed data.
  • the embodiments of the present application can perform a linear transformation on multi-angle data so that the pixel value is between a preset range to obtain the changed data. For example, if the pixel value needs to be scaled to between 0 and 1, a linear transformation can be performed to map it from the current pixel value range to the target range to obtain the changed data.
  • step S203 a self-supervised reconstruction network is constructed according to the preprocessed data and the point spread function until a preset iteration stop condition is reached to obtain a scanned light field self-supervised network reconstruction result.
  • the preset iteration stop condition may be that the number of iterations reaches a preset iteration threshold, such as the number of iterations reaches 50,000 times.
  • the embodiment of the present application can use the preprocessed data for supervised reconstruction network training or supervised reconstruction network testing, wherein the self-supervised reconstruction network training includes network input forward transmission, random multi-angle forward transmission and self-supervised loss function back transmission. Completion of the above three steps is called an iteration.
  • the network training When the network training is not completed, it enters the next iteration. That is, the next batch of data is forwarded through the network, randomly transmitted at multiple angles, and transmitted back through the self-supervised loss function; the self-supervised reconstruction network test is performed after the self-supervised reconstruction network training is completed, and is used to test the final performance of the selected data on the network.
  • the embodiment of the present application can use a scanning light field instrument (scanning magnification is 3, the number of pixels behind the microlens is 13 ⁇ 13) to shoot zebrafish embryo data, and then rearrange, normalize and amplify the data to generate 1,000 multi-angle images with a size of 60 ⁇ 60 ⁇ 169 for self-supervised reconstruction network training; use the Keras deep learning framework based on Tensorflow and the Python programming language to build a self-supervised reconstruction network.
  • scanning magnification is 3, the number of pixels behind the microlens is 13 ⁇ 13
  • the embodiments of the present application can utilize preprocessed data and point spread functions to gradually improve the ability to reconstruct light field images during the training process. Through iterative training, the adaptability and generalization ability to different samples and scenes can be improved, thereby improving the accuracy and efficiency of reconstruction.
  • a self-supervised reconstruction network is constructed based on the preprocessed data and the point spread function until a preset iteration stop condition is reached to obtain a scanned light field self-supervised network reconstruction result, including: selecting any image processing network, inputting multi-angle data and forwarding it to obtain a network output; randomly selecting several angles of the point spread function, and forward projecting the network output.
  • the embodiment of the present application can randomly select two point spread function angles for forward projection, convolve the point spread function of the corresponding angle with the network output to obtain the forward projection, and can adjust the size of the forward projection according to the scanning magnification and the number of pixels behind the microlens.
  • the embodiment of the present application adds a random multi-angle forward transmission module and uses the information of the point spread function to simulate the optical forward transmission process of the network output.
  • the specific steps are as follows:
  • N-M angles are randomly selected in each iteration, and the other M angles are randomly selected from the N angles selected in the previous iteration, and they are guaranteed not to overlap with the N-M angles selected in this iteration;
  • the above selection method may cause slightly different network convergence speeds in different embodiments, but the final effect is similar in theory, and the selection of some angles for calculation is to save memory and video memory overhead and avoid memory or video memory overflow during network training. After a sufficient number of training iterations, due to the randomness of angle selection, the network optimization result should theoretically be the same as the result of selecting all angles for calculation each time.
  • step S1 for each angle selected in step S1, the part of the point spread function corresponding to the angle, that is, a 3D vector, is convolved with the 3D network output to obtain the forward projection.
  • Proj 1 , ..., Proj N can be concatenated into a 3D vector Proj_tmp to obtain a forward projection of N angles.
  • step S1 to step S3 the embodiment of the present application completes the optical forward transmission simulation of the network output Output. If the part corresponding to N angles is selected from the network input and recorded as Input, then Input and Proj have the same size.
  • the embodiment of the present application randomly selects the point spread function angle for forward projection, and adjusts the size of the forward projection according to the scanning magnification and the number of pixels behind the microlens, which can improve the comprehensiveness and diversity of the projection results, thereby improving the applicability of the scanning light field technology.
  • Step S4 Calculate the self-supervised loss function.
  • Proj is the adjusted forward projection
  • Input is the part of the network input corresponding to the angle
  • Output is the network output
  • L is the self-supervised loss function
  • MSE is the mean square error between the forward projection and the network input
  • Hess is the second norm of the second-order derivative of the network output
  • Cont is the axial continuity constraint of the network output
  • is the weight of the mean square error
  • is the weight of the second norm of the second-order derivative
  • is the weight of the continuity constraint.
  • Output is the network output, which is a three-dimensional vector, the third dimension represents the axial direction, Cont is the axial continuity constraint of the network output, N is the number of axial pixels output by the network, and Sum( ⁇ ) is the sum of each component of the two-dimensional vector.
  • the embodiments of the present application can quantify the error between the network reconstruction result and the real light field data by calculating the mean square error between the network output and the forward projection, and can gradually reduce the mean square error by optimizing the self-supervised loss function, thereby improving the accuracy of the reconstruction result.
  • the second norm of the second-order derivative of the network output and the axial continuity constraint it helps to suppress noise and artifacts in the reconstruction result and improve the smoothness and stability of the reconstruction result.
  • the scanning light field data can be pre-processed by obtaining the scanning light field data and the point spread function of the optical system that shoots the scanning light field data, and a self-supervised reconstruction network is constructed according to the pre-processed data and the point spread function until a preset iteration stop condition is reached, thereby obtaining the scanning light field self-supervised network reconstruction result, which can be continuously calibrated and adjusted to improve the accuracy of the data, thereby improving
  • the stability and reliability of the system are improved, and the need for manual intervention is reduced, avoiding the influence of subjective factors and operational errors.
  • FIG5 is a schematic diagram of the structure of a scanning light field self-supervisory network reconstruction device according to an embodiment of the present application.
  • the scanning light field self-supervised network reconstruction device 10 includes: an acquisition module 100 , a processing module 200 and a reconstruction module 300 .
  • the acquisition module 100 is used to acquire the scanning light field data and the point spread function of the optical system that captures the scanning light field data.
  • the processing module 200 is used to pre-process the scanned light field data to obtain pre-processed data.
  • the reconstruction module 300 is used to construct a self-supervised reconstruction network according to the preprocessed data and the point spread function until a preset iteration stop condition is reached to obtain a scanned light field self-supervised network reconstruction result.
  • the processing module 200 includes: a generating unit.
  • the generating unit is used to rearrange the pixel points of the scanned light field data based on the position behind the microlens of the optical system to generate multi-angle data, wherein all the pixel points at the same position behind the microlens are arranged in sequence according to the spatial order.
  • the processing module 200 further includes: a transformation unit.
  • the transformation unit is used to perform linear transformation on the multi-angle data so that the pixel value is within a preset range to obtain the changed data.
  • the forward transmission unit is used to select any image processing network, input multi-angle data and forward transmit it to obtain network output;
  • the projection unit is used to randomly select several angles of the point spread function and obtain the forward projection of the network output.
  • a second calculation unit is used to calculate the second norm of the second-order derivative of the network output and the axial continuity constraint of the network output;
  • FIG6 is a schematic diagram of the structure of an electronic device provided in an embodiment of the present application.
  • the electronic device may include:
  • a memory 601 a processor 602 , and a computer program stored in the memory 601 and executable on the processor 602 .
  • the scanning light field self-supervised network reconstruction method provided in the above embodiment is implemented.
  • the communication interface 603 is used for communication between the memory 601 and the processor 602 .
  • the memory 601 is used to store computer programs that can be executed on the processor 602 .
  • Memory 601 may include high-speed RAM memory, and may also include non-volatile memory (non-volatile memory), such as at least one disk storage.
  • the bus can be an Industry Standard Architecture (ISA) bus, a Peripheral Component Interconnect (PCI) bus or an Extended Industry Standard Architecture (EISA) bus.
  • ISA Industry Standard Architecture
  • PCI Peripheral Component Interconnect
  • EISA Extended Industry Standard Architecture
  • the bus can be divided into an address bus, a data bus, a control bus, etc. For ease of representation, only one thick line is used in FIG6, but it does not mean that there is only one bus or one type of bus.
  • the memory 601, the processor 602 and the communication interface 603 are integrated on a chip, the memory 601, the processor 602 and the communication interface 603 can communicate with each other through an internal interface.
  • Processor 602 may be a central processing unit (CPU), or an application specific integrated circuit (ASIC), or one or more integrated circuits configured to implement the embodiments of the present application.
  • CPU central processing unit
  • ASIC application specific integrated circuit
  • the present application also provides a computer-readable storage medium on which a computer program is stored.
  • the above scanned light field self-supervised network reconstruction method is implemented when the processor is executed.
  • first and second are used for descriptive purposes only and should not be understood as indicating or implying relative importance or implicitly indicating the number of technical features indicated. Therefore, a feature defined as “first” or “second” may explicitly or implicitly include at least one of the features.
  • N means at least two, such as two, three, etc., unless otherwise clearly and specifically defined.
  • Any process or method description in a flowchart or otherwise described herein may be understood to represent a module, fragment or portion of code comprising one or N executable instructions for implementing the steps of a custom logical function or process, and the scope of the preferred embodiments of the present application includes alternative implementations in which functions may not be performed in the order shown or discussed, including performing functions in a substantially simultaneous manner or in reverse order depending on the functions involved, which should be understood by technicians in the technical field to which the embodiments of the present application belong.

Landscapes

  • Physics & Mathematics (AREA)
  • Engineering & Computer Science (AREA)
  • Computer Graphics (AREA)
  • Geometry (AREA)
  • Software Systems (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Microscoopes, Condenser (AREA)

Abstract

The present application relates to the technical field of computational imaging, and in particular to a scanning light field self-supervised network reconstruction method and apparatus, an electronic device, and a medium. The method comprises: acquiring scanning light field data and a point spread function of an optical system capturing the scanning light field data; preprocessing the scanning light field data to obtain preprocessed data; and constructing a self-supervised reconstruction network on the basis of the preprocessed data and the point spread function until a preset iteration stop condition is reached, to obtain a scanning light field self-supervised network reconstruction result. Thus, the present application solves the problems in the related art that the need to provide high-resolution ground truth images leads to increased costs, longer times consumed, and reduced reconstruction efficiency, and the existence of serious reconstruction artifacts at the original object plane easily leads to loss or distortion of details on object surfaces and reduced quality and accuracy of reconstruction results.

Description

扫描光场自监督网络重建方法、装置、电子设备及介质Scanning light field self-supervised network reconstruction method, device, electronic device and medium

相关申请的交叉引用CROSS-REFERENCE TO RELATED APPLICATIONS

本申请基于申请号为202410074899.5,申请日为2024年01月18日申请的中国专利申请提出,并要求该中国专利申请的优先权,该中国专利申请的全部内容在此引入本申请作为参考。This application is based on the Chinese patent application with application number 202410074899.5 and application date January 18, 2024, and claims the priority of the Chinese patent application. The entire content of the Chinese patent application is hereby introduced into this application as a reference.

技术领域Technical Field

本申请涉及计算成像技术领域,特别涉及一种扫描光场自监督网络重建方法、装置、电子设备及介质。The present application relates to the field of computational imaging technology, and in particular to a method, device, electronic device and medium for self-supervised network reconstruction of a scanned light field.

背景技术Background Art

光场或扫描光场因能够用单张或少量拍摄图像实现大范围快速3D成像而备受关注,在包括3D钙成像等的生物应用中使用广泛,其中,重建是扫描光场成像的重要步骤,是影响图像质量的主要因素之一。相关技术中,可以通过基于光线传播模型的传统解卷积方法,根据光学系统的点扩散函数和光线传播模型,从拍摄到的图像还原出3D结构,也可以通过基于深度学习的网络重建方法,用光场或扫描光场数据作为输入,高分辨率的真值图像作为监督,从而能够使网络学习重建过程。Light fields or scanned light fields have attracted much attention because they can achieve large-scale rapid 3D imaging with a single or a small number of captured images. They are widely used in biological applications including 3D calcium imaging, among which reconstruction is an important step in scanning light field imaging and one of the main factors affecting image quality. In related technologies, the 3D structure can be restored from the captured image according to the point spread function and light propagation model of the optical system through traditional deconvolution methods based on the light propagation model, or through network reconstruction methods based on deep learning, using light field or scanned light field data as input and high-resolution true value images as supervision, so that the network can learn the reconstruction process.

然而,相关技术中,由于需要提供高分辨率的真值图像,导致需要额外用其他显微方法来拍摄同一组样本,从而造成成本增加,耗时较长,降低了重建效率,且由于存在原始物平面处的严重重建伪影等问题,容易导致物体表面的细节丢失或扭曲,从而影响重建结果的质量和准确性。However, in the related art, due to the need to provide high-resolution true-value images, other microscopic methods need to be used to photograph the same set of samples, which increases costs, takes a long time, and reduces reconstruction efficiency. In addition, due to the existence of serious reconstruction artifacts at the original object plane and other problems, it is easy to cause the details of the object surface to be lost or distorted, thereby affecting the quality and accuracy of the reconstruction results.

发明内容Summary of the invention

本申请提供一种扫描光场自监督网络重建方法、装置、电子设备及存储介质,以解决相关技术中,由于需要提供高分辨率的真值图像,导致成本增加,耗时较长,降低了重建效率,且由于存在原始物平面处的严重重建伪影,容易导致物体表面的细节丢失或扭曲,降低重建结果的质量和准确性等问题。The present application provides a scanning light field self-supervised network reconstruction method, device, electronic device and storage medium to solve the problems in the related art that the need to provide a high-resolution true value image leads to increased costs, long time consumption, reduced reconstruction efficiency, and the presence of serious reconstruction artifacts at the original object plane, which easily leads to loss or distortion of details on the object surface, reducing the quality and accuracy of the reconstruction result.

本申请第一方面实施例提供一种扫描光场自监督网络重建方法,包括以下步骤:获取扫描光场数据与拍摄扫描光场数据的光学系统的点扩散函数;对扫描光场数据进行预处理,得到预处理后的数据;以及根据预处理后的数据和点扩散函数构建自监督重建网络,直至达到预设迭代停止条件,得到扫描光场自监督网络重建结果。 A first aspect of the present application provides a method for self-supervised network reconstruction of a scanned light field, comprising the following steps: acquiring scanned light field data and a point spread function of an optical system that captures the scanned light field data; preprocessing the scanned light field data to obtain preprocessed data; and constructing a self-supervised reconstruction network based on the preprocessed data and the point spread function until a preset iteration stop condition is reached to obtain a scanned light field self-supervised network reconstruction result.

可选地,在本申请的一个实施例中,对扫描光场数据进行预处理,得到预处理后的数据,包括:将扫描光场数据的像素点基于在光学系统的微透镜后的位置重新排列,生成多角度数据,其中,所有在微透镜后同一位置的像素点按照空间顺序依次排列。Optionally, in one embodiment of the present application, the scanned light field data is preprocessed to obtain preprocessed data, including: rearranging the pixel points of the scanned light field data based on the position behind the microlens of the optical system to generate multi-angle data, wherein all pixel points at the same position behind the microlens are arranged in sequence according to spatial order.

可选地,在本申请的一个实施例中,对扫描光场数据进行预处理,得到预处理后的数据,还包括:对多角度数据做线性变换,以使像素值位于预设范围之间,得到变化后的数据。Optionally, in one embodiment of the present application, preprocessing the scanned light field data to obtain preprocessed data further includes: performing a linear transformation on the multi-angle data so that the pixel values are within a preset range to obtain changed data.

可选地,在本申请的一个实施例中,根据预处理后的数据和点扩散函数构建自监督重建网络,直至达到预设迭代停止条件,得到扫描光场自监督网络重建结果,包括:选择任一图像处理网络,将多角度数据输入并前传得到网络输出;随机挑选点扩散函数的几个角度,对网络输出得到前向投影。Optionally, in one embodiment of the present application, a self-supervised reconstruction network is constructed based on the preprocessed data and the point spread function until a preset iteration stop condition is reached to obtain a scanned light field self-supervised network reconstruction result, including: selecting any image processing network, inputting multi-angle data and forwarding it to obtain a network output; randomly selecting several angles of the point spread function, and forward projecting the network output.

可选地,在本申请的一个实施例中,根据预处理后的数据和点扩散函数构建自监督重建网络,直至达到预设迭代停止条件,得到扫描光场自监督网络重建结果,还包括:计算网络输出和前向投影的对应角度之间的均方误差;计算网络输出的二阶导的二范数和网络输出的轴向连续性约束;对均方误差、二范数和轴向连续性约束进行加权,计算自监督重建网络的自监督损失函数,并回传更新网络参数,得到扫描光场自监督网络重建结果。Optionally, in one embodiment of the present application, a self-supervised reconstruction network is constructed based on the preprocessed data and the point spread function until a preset iteration stop condition is reached to obtain a scanned light field self-supervised network reconstruction result, and also includes: calculating the mean square error between the network output and the corresponding angle of the forward projection; calculating the second norm of the second-order derivative of the network output and the axial continuity constraint of the network output; weighting the mean square error, the second norm and the axial continuity constraint, calculating the self-supervised loss function of the self-supervised reconstruction network, and transmitting back to update the network parameters to obtain the scanned light field self-supervised network reconstruction result.

本申请第二方面实施例提供一种扫描光场自监督网络重建装置,包括:获取模块,用于获取扫描光场数据与拍摄所述扫描光场数据的光学系统的点扩散函数;处理模块,用于对所述扫描光场数据进行预处理,得到预处理后的数据;以及重建模块,用于根据所述预处理后的数据和所述点扩散函数构建自监督重建网络,直至达到预设迭代停止条件,得到扫描光场自监督网络重建结果。A second aspect of the present application provides a scanning light field self-supervised network reconstruction device, including: an acquisition module, used to acquire scanning light field data and a point spread function of an optical system that shoots the scanning light field data; a processing module, used to preprocess the scanning light field data to obtain preprocessed data; and a reconstruction module, used to construct a self-supervised reconstruction network according to the preprocessed data and the point spread function until a preset iteration stop condition is reached to obtain a scanning light field self-supervised network reconstruction result.

可选地,在本申请的一个实施例中,所述处理模块包括:生成单元,用于将所述扫描光场数据的像素点基于在所述光学系统的微透镜后的位置重新排列,生成多角度数据,其中,所有在所述微透镜后同一位置的像素点按照空间顺序依次排列。Optionally, in one embodiment of the present application, the processing module includes: a generation unit, used to rearrange the pixel points of the scanned light field data based on the position behind the microlens of the optical system to generate multi-angle data, wherein all pixel points at the same position behind the microlens are arranged in sequence according to spatial order.

可选地,在本申请的一个实施例中,所述处理模块还包括:变换单元,用于对所述多角度数据做线性变换,以使像素值位于预设范围之间,得到变化后的数据。Optionally, in one embodiment of the present application, the processing module further includes: a transformation unit, configured to perform a linear transformation on the multi-angle data so that the pixel value is within a preset range to obtain changed data.

可选地,在本申请的一个实施例中,所述重建模块包括:前传单元,用于选择任一图像处理网络,将所述多角度数据输入并前传得到网络输出;投影单元,用于随机挑选点扩散函数的几个角度,对所述网络输出得到前向投影。Optionally, in one embodiment of the present application, the reconstruction module includes: a forward transmission unit, used to select any image processing network, input the multi-angle data and forward transmit it to obtain the network output; a projection unit, used to randomly select several angles of the point spread function and obtain a forward projection of the network output.

可选地,在本申请的一个实施例中,所述重建模块还包括:第一计算单元,用于计算所述网络输出和所述前向投影的对应角度之间的均方误差;第二计算单元,用于计算所述网络输出的二阶导的二范数和所述网络输出的轴向连续性约束;加权单元,用于对所述均方误差、所述二范数和所述轴向连续性约束进行加权,计算所述自监督重建网络的自监督 损失函数,并回传更新网络参数,得到所述扫描光场自监督网络重建结果。Optionally, in one embodiment of the present application, the reconstruction module further includes: a first calculation unit, used to calculate the mean square error between the network output and the corresponding angle of the forward projection; a second calculation unit, used to calculate the second norm of the second-order derivative of the network output and the axial continuity constraint of the network output; a weighting unit, used to weight the mean square error, the second norm and the axial continuity constraint to calculate the self-supervised reconstruction network. The loss function is then fed back to update the network parameters to obtain the scanned light field self-supervised network reconstruction result.

本申请第三方面实施例提供一种电子设备,包括:存储器、处理器及存储在所述存储器上并可在所述处理器上运行的计算机程序,所述处理器执行所述程序,以实现如上述实施例所述的扫描光场自监督网络重建方法。The third aspect of the present application provides an electronic device, comprising: a memory, a processor, and a computer program stored in the memory and executable on the processor, wherein the processor executes the program to implement the scanning light field self-supervised network reconstruction method as described in the above embodiment.

本申请第四方面实施例提供一种计算机可读存储介质,所述计算机可读存储介质存储计算机程序,该程序被处理器执行时实现如上的扫描光场自监督网络重建方法。The fourth aspect of the present application provides a computer-readable storage medium, which stores a computer program. When the program is executed by a processor, it implements the above-mentioned scanning light field self-supervised network reconstruction method.

本申请实施例可以通过获取扫描光场数据与拍摄扫描光场数据的光学系统的点扩散函数,对扫描光场数据进行预处理,并根据预处理后的数据和点扩散函数构建自监督重建网络,直至达到预设迭代停止条件,从而得到扫描光场自监督网络重建结果,能够进行不断地校准和调整,提高数据的准确性,进而提高系统的稳定性和可靠性,且减少对人工干预的需求,避免主观因素和操作误差的影响。由此,解决了相关技术中,由于需要提供高分辨率的真值图像,导致成本增加,耗时较长,降低了重建效率,且由于存在原始物平面处的严重重建伪影,容易导致物体表面的细节丢失或扭曲,降低重建结果的质量和准确性等问题。The embodiment of the present application can pre-process the scanned light field data by obtaining the scanned light field data and the point spread function of the optical system that shoots the scanned light field data, and construct a self-supervised reconstruction network based on the pre-processed data and the point spread function until the preset iteration stop condition is reached, thereby obtaining the scanned light field self-supervised network reconstruction result, which can be continuously calibrated and adjusted to improve the accuracy of the data, thereby improving the stability and reliability of the system, and reducing the need for manual intervention, avoiding the influence of subjective factors and operational errors. Thus, the problems in the related art that the need to provide a high-resolution true value image leads to increased costs, longer time consumption, and reduced reconstruction efficiency, and that the existence of serious reconstruction artifacts at the original object plane easily leads to loss or distortion of details on the object surface, reducing the quality and accuracy of the reconstruction results, etc.

本申请附加的方面和优点将在下面的描述中部分给出,部分将从下面的描述中变得明显,或通过本申请的实践了解到。Additional aspects and advantages of the present application will be given in part in the description below, and in part will become apparent from the description below, or will be learned through the practice of the present application.

附图说明BRIEF DESCRIPTION OF THE DRAWINGS

本申请上述的和/或附加的方面和优点从下面结合附图对实施例的描述中将变得明显和容易理解,其中:The above and/or additional aspects and advantages of the present application will become apparent and easily understood from the following description of the embodiments in conjunction with the accompanying drawings, in which:

图1为根据本申请一个实施例的扫描光场自监督网络重建方法的结构示意图;FIG1 is a schematic diagram of the structure of a scanning light field self-supervised network reconstruction method according to an embodiment of the present application;

图2为根据本申请实施例提供的一种扫描光场自监督网络重建方法的流程图;FIG2 is a flow chart of a scanning light field self-supervised network reconstruction method provided according to an embodiment of the present application;

图3为根据本申请一个实施例的扫描光场原始图像与经过网络重建的图像的对比示意图;FIG3 is a schematic diagram showing a comparison between an original image of a scanned light field and an image reconstructed through a network according to an embodiment of the present application;

图4为根据本申请一个实施例的随机多角度前传与自监督损失函数计算过程的原理示意图;FIG4 is a schematic diagram of the principle of random multi-angle forward transmission and self-supervised loss function calculation process according to an embodiment of the present application;

图5为根据本申请实施例提供的一种扫描光场自监督网络重建装置的结构示意图;FIG5 is a schematic diagram of the structure of a scanning light field self-supervisory network reconstruction device provided according to an embodiment of the present application;

图6为根据本申请实施例提供的电子设备的结构示意图。FIG6 is a schematic diagram of the structure of an electronic device provided according to an embodiment of the present application.

具体实施方式DETAILED DESCRIPTION

下面详细描述本申请的实施例,所述实施例的示例在附图中示出,其中自始至终相同或类似的标号表示相同或类似的元件或具有相同或类似功能的元件。下面通过参考附图描 述的实施例是示例性的,旨在用于解释本申请,而不能理解为对本申请的限制。The embodiments of the present application are described in detail below, and examples of the embodiments are shown in the accompanying drawings, wherein the same or similar reference numerals throughout represent the same or similar elements or elements having the same or similar functions. The embodiments described are exemplary and are intended to be used to explain the present application, but should not be construed as limiting the present application.

下面参考附图描述本申请实施例的扫描光场自监督网络重建方法、装置、电子设备及存储介质。针对上述背景技术中心提到的相关技术中,由于需要提供高分辨率的真值图像,导致成本增加,耗时较长,降低了重建效率,且由于存在原始物平面处的严重重建伪影,容易导致物体表面的细节丢失或扭曲,降低重建结果的质量和准确性的问题,本申请提供了一种扫描光场自监督网络重建方法,在该方法中,可以通过获取扫描光场数据与拍摄扫描光场数据的光学系统的点扩散函数,对扫描光场数据进行预处理,并根据预处理后的数据和点扩散函数构建自监督重建网络,直至达到预设迭代停止条件,从而得到扫描光场自监督网络重建结果,能够进行不断地校准和调整,提高数据的准确性,进而提高系统的稳定性和可靠性,且减少对人工干预的需求,避免主观因素和操作误差的影响。由此,解决了相关技术中,由于需要提供高分辨率的真值图像,导致成本增加,耗时较长,降低了重建效率,且由于存在原始物平面处的严重重建伪影,容易导致物体表面的细节丢失或扭曲,降低重建结果的质量和准确性等问题。The following describes the scanning light field self-supervised network reconstruction method, device, electronic device and storage medium of the embodiment of the present application with reference to the accompanying drawings. In view of the related technologies mentioned in the background technology center, the need to provide high-resolution true value images leads to increased costs, longer time consumption, and reduced reconstruction efficiency. In addition, due to the existence of serious reconstruction artifacts at the original object plane, it is easy to cause the details of the object surface to be lost or distorted, thereby reducing the quality and accuracy of the reconstruction results. The present application provides a scanning light field self-supervised network reconstruction method, in which the scanning light field data can be pre-processed by obtaining the scanning light field data and the point spread function of the optical system that shoots the scanning light field data, and a self-supervised reconstruction network is constructed based on the pre-processed data and the point spread function until the preset iteration stop condition is reached, thereby obtaining the scanning light field self-supervised network reconstruction result, which can be continuously calibrated and adjusted to improve the accuracy of the data, thereby improving the stability and reliability of the system, and reducing the need for manual intervention, avoiding the influence of subjective factors and operational errors. Thereby, the problems in the related technology of the need to provide high-resolution true-value images, which increases costs, takes a long time, reduces reconstruction efficiency, and the existence of serious reconstruction artifacts at the original object plane, which easily leads to loss or distortion of details on the object surface, thereby reducing the quality and accuracy of the reconstruction results, are solved.

在对本申请实施例提供的扫描光场自监督网络重建方法进行解释说明之前,先对本申请实施例涉及的扫描光场自监督网络重建方法的结构进行说明。Before explaining the scanning light field self-supervised network reconstruction method provided in the embodiment of the present application, the structure of the scanning light field self-supervised network reconstruction method involved in the embodiment of the present application is first explained.

如图1所示,扫描光场自监督网络重建方法的结构包括:获取扫描光场数据单元、数据预处理单元、自监督重建网络训练单元和自监督重建网络测试单元。As shown in FIG1 , the structure of the scanning light field self-supervised network reconstruction method includes: a scanning light field data acquisition unit, a data preprocessing unit, a self-supervised reconstruction network training unit and a self-supervised reconstruction network testing unit.

其中,获取扫描光场数据单元,用于为网络的训练和测试提供数据,并将拍摄数据的光学系统的点扩散函数传递给自监督重建网络训练的随机多角度前传模块,其中,数据和对应的点扩散函数可以用扫描光学显微仪器拍摄或从公开数据集下载;Among them, a scanning light field data unit is obtained to provide data for training and testing of the network, and a point spread function of the optical system that captured the data is passed to a random multi-angle forward transmission module for self-supervised reconstruction network training, wherein the data and the corresponding point spread function can be captured by a scanning optical microscope or downloaded from a public dataset;

数据预处理单元,包括数据重排功能、数据归一化功能和数据扩增功能。The data preprocessing unit includes data rearrangement function, data normalization function and data amplification function.

其中,数据重排功能,用于将扫描光场数据的像素点根据在微透镜后的位置重新排列,所有在微透镜后同一位置的像素点按照空间顺序依次排列,形成3D多角度数据;数据归一化功能,用于将多角度数据做线性变换,使得像素值位于0-1之间,得到归一化数据;数据扩增功能,用于对归一化数据进行随机角度旋转、翻转、剪裁等操作,得到扩增后的最终数据,最终数据将用于自监督重建网络训练或自监督重建网络测试;Among them, the data rearrangement function is used to rearrange the pixel points of the scanned light field data according to the position behind the microlens, and all the pixel points at the same position behind the microlens are arranged in sequence according to the spatial order to form 3D multi-angle data; the data normalization function is used to perform linear transformation on the multi-angle data so that the pixel value is between 0 and 1 to obtain normalized data; the data augmentation function is used to perform random angle rotation, flipping, cropping and other operations on the normalized data to obtain the final data after augmentation, and the final data will be used for self-supervised reconstruction network training or self-supervised reconstruction network testing;

自监督重建网络训练单元,包括网络输入前传功能、随机多角度前传功能和自监督损失函数回传功能,完成以上三步即称作一次迭代。The self-supervised reconstruction network training unit includes the network input forward transmission function, the random multi-angle forward transmission function and the self-supervised loss function return transmission function. Completing the above three steps is called an iteration.

其中,网络输入前传功能,通过输入层接收数据预处理单元输出的最终数据,并让其通过网络,通过输出层得到网络输出;随机多角度前传功能,用于随机挑选点扩散函数的几个角度,将对应角度的点扩散函数与网络输出做卷积得到前向投影,并根 据扫描倍率与微透镜后的像素点个数调整前向投影的大小;自监督损失函数回传功能,用于首先计算自监督损失函数值,然后进行梯度回传来更新网络参数,其中,自监督损失函数值由调整后的前向投影与网络输入的对应角度之间的均方误差,和网络输出的二阶导的二范数,和网络输出的轴向连续性约束加权形成,通常设定一个迭代次数阈值,不超过该值时,认为网络训练没有结束。当网络训练没有结束时,进入下一次迭代,即下一批数据进行网络输入前传、随机多角度前传和自监督损失函数回传;Among them, the network input forward transmission function receives the final data output by the data preprocessing unit through the input layer, passes it through the network, and obtains the network output through the output layer; the random multi-angle forward transmission function is used to randomly select several angles of the point spread function, convolve the point spread function of the corresponding angle with the network output to obtain the forward projection, and then The size of the forward projection is adjusted according to the scanning magnification and the number of pixels behind the microlens; the self-supervised loss function feedback function is used to first calculate the value of the self-supervised loss function, and then perform gradient feedback to update the network parameters, where the self-supervised loss function value is formed by the mean square error between the adjusted forward projection and the corresponding angle of the network input, the second norm of the second-order derivative of the network output, and the axial continuity constraint of the network output. Usually, a threshold of the number of iterations is set. When it does not exceed this value, it is considered that the network training has not ended. When the network training has not ended, it enters the next iteration, that is, the next batch of data is forwarded to the network input, forwarded at random multiple angles, and back-transmitted to the self-supervised loss function;

自监督重建网络测试单元,在自监督重建网络训练完成后进行,用于测试选定数据在网络上的最终表现。The self-supervised reconstruction network testing unit is performed after the self-supervised reconstruction network training is completed, and is used to test the final performance of the selected data on the network.

其中,选定数据往往与自监督重建网络训练单元所使用的数据不重合,且尺寸不一定相同,若尺寸不同,应当对选定数据进行有重叠的裁剪,裁剪为若干张符合网络输入层尺寸要求的图像,让网络分别预测,然后拼接预测结果。Among them, the selected data often does not overlap with the data used by the self-supervised reconstruction network training unit, and the size is not necessarily the same. If the size is different, the selected data should be cropped with overlap and cropped into several images that meet the size requirements of the network input layer. Let the network predict separately and then splice the prediction results.

接下来,对本申请实施例的扫描光场自监督网络重建方法进行详细赘述。Next, the scanning light field self-supervised network reconstruction method of the embodiment of the present application is described in detail.

具体而言,图2为本申请实施例所提供的一种扫描光场自监督网络重建方法的流程示意图。Specifically, FIG2 is a flow chart of a scanning light field self-supervised network reconstruction method provided in an embodiment of the present application.

如图2所示,该扫描光场自监督网络重建方法包括以下步骤:As shown in FIG2 , the scanned light field self-supervised network reconstruction method includes the following steps:

在步骤S201中,获取扫描光场数据与拍摄扫描光场数据的光学系统的点扩散函数。In step S201 , scanning light field data and a point spread function of an optical system for capturing the scanning light field data are obtained.

可以理解的是,光场指的是光的强度、方向和相位等信息在空间和时间上的变化分布,扫描光场数据指的是从不同角度和位置下对光场进行采样得到的一系列图像或测量值。It can be understood that the light field refers to the changing distribution of information such as light intensity, direction and phase in space and time, and the scanned light field data refers to a series of images or measurements obtained by sampling the light field from different angles and positions.

具体地,本申请实施例可以通过扫描光学显微仪器获取扫描光场数据与拍摄扫描光场数据的光学系统的点扩散函数,例如通过扫描光学显微仪器,可以获取细胞在不同角度和位置下的图像信息,包括光源亮度、光线走向等,并通过计算测量可以得到拍摄扫描光场数据的光学系统的点扩散函数。Specifically, the embodiments of the present application can obtain scanning light field data and the point spread function of the optical system that captures the scanning light field data through a scanning optical microscope. For example, through a scanning optical microscope, image information of cells at different angles and positions can be obtained, including light source brightness, light direction, etc., and the point spread function of the optical system that captures the scanning light field data can be obtained through calculation and measurement.

本申请实施例通过获取扫描光场数据与拍摄扫描光场数据的光学系统的点扩散函数,提高了图像信息的全面性,并为后续处理提供数据基础。The embodiment of the present application improves the comprehensiveness of image information and provides a data basis for subsequent processing by acquiring scanning light field data and a point spread function of an optical system that captures the scanning light field data.

在步骤S202中,对扫描光场数据进行预处理,得到预处理后的数据。In step S202, the scanned light field data is preprocessed to obtain preprocessed data.

可以理解的是,预处理包括数据重排、数据归一化、数据扩增等。It can be understood that preprocessing includes data rearrangement, data normalization, data amplification, etc.

具体地,本申请实施例可以将扫描光场数据的像素点,根据在微透镜后的位置重新排列,所有在微透镜后同一位置的像素点按照空间顺序依次排列,形成3D多角度数据,可以将多角度数据做线性变换,使得像素值位于0-1之间,得到归一化数据,可以对归一化数据进行随机角度旋转、翻转、剪裁等操作,得到扩增后的最终数据。Specifically, the embodiment of the present application can rearrange the pixel points of the scanned light field data according to their positions behind the microlens, and all the pixel points at the same position behind the microlens are arranged in sequence according to the spatial order to form 3D multi-angle data. The multi-angle data can be linearly transformed so that the pixel value is between 0 and 1 to obtain normalized data. The normalized data can be subjected to random angle rotation, flipping, cropping and other operations to obtain the final data after amplification.

本申请实施例可以通过对扫描光场数据进行预处理,如数据重排、数据归一化、数据 扩增等处理,得到预处理后的数据,从而可以提高数据的精确度和准确度,改善图像的清晰度,进一步提高重建结果的真实性。The embodiment of the present application can pre-process the scanned light field data, such as data rearrangement, data normalization, data By performing amplification and other processing, the preprocessed data is obtained, thereby improving the precision and accuracy of the data, improving the clarity of the image, and further improving the authenticity of the reconstruction results.

可选地,在本申请的一个实施例中,对扫描光场数据进行预处理,得到预处理后的数据,包括:将扫描光场数据的像素点基于在光学系统的微透镜后的位置重新排列,生成多角度数据,其中,所有在微透镜后同一位置的像素点按照空间顺序依次排列。Optionally, in one embodiment of the present application, the scanned light field data is preprocessed to obtain preprocessed data, including: rearranging the pixel points of the scanned light field data based on the position behind the microlens of the optical system to generate multi-angle data, wherein all pixel points at the same position behind the microlens are arranged in sequence according to spatial order.

具体地,本申请实施例可以将扫描光场数据的像素点,基于在光学系统的微透镜后的位置重新排列,且所有在微透镜后同一位置的像素点按照空间顺序依次排列,从而可以形成3D多角度数据,例如,扫描光场数据包括在水平方向上从左到右扫描的5个视角,每个视角有10×10个像素,对于第一个视角,在重新排列后,第一行的10个像素点会对应一个特定的空间位置,同样地,第二行的10个像素点会对应下一个空间位置,依此类推,随后,对于其他视角,也会依次按照相同的空间位置进行像素点的排列。Specifically, the embodiment of the present application can rearrange the pixel points of the scanned light field data based on the position behind the microlens of the optical system, and all the pixel points at the same position behind the microlens are arranged in sequence in spatial order, thereby forming 3D multi-angle data. For example, the scanned light field data includes 5 viewing angles scanned from left to right in the horizontal direction, each viewing angle has 10×10 pixels, and for the first viewing angle, after rearrangement, the 10 pixel points in the first row will correspond to a specific spatial position. Similarly, the 10 pixel points in the second row will correspond to the next spatial position, and so on. Subsequently, for other viewing angles, the pixel points will be arranged in sequence according to the same spatial position.

本申请实施例可以将扫描光场数据的像素点基于在光学系统的微透镜后的位置重新排列,生成多角度数据,可以获得基于微透镜后位置重新排列的3D多角度数据,进而提供更全面和准确的信息,能够减少伪影、提高细节还原,确保重建结果的准确性。The embodiment of the present application can rearrange the pixel points of the scanned light field data based on the position behind the microlens of the optical system to generate multi-angle data, and can obtain 3D multi-angle data based on the rearrangement of the position behind the microlens, thereby providing more comprehensive and accurate information, reducing artifacts, improving detail restoration, and ensuring the accuracy of the reconstruction results.

可选地,在本申请的一个实施例中,对扫描光场数据进行预处理,得到预处理后的数据,还包括:对多角度数据做线性变换,以使像素值位于预设范围之间,得到变化后的数据。Optionally, in one embodiment of the present application, preprocessing the scanned light field data to obtain preprocessed data further includes: performing a linear transformation on the multi-angle data so that the pixel values are within a preset range to obtain changed data.

可以理解的是,预设范围指的是预先设置的数据的像素值范围,如将像素值缩放到0到1之间。It can be understood that the preset range refers to a preset pixel value range of data, such as scaling the pixel value to between 0 and 1.

具体地,本申请实施例可以对多角度数据做线性变换,以使像素值位于预设范围之间,得到变化后的数据,例如,需要将像素值缩放到0到1之间,可以进行线性变换从当前的像素值范围将其映射到目标范围内,从而得到变化后的数据。Specifically, the embodiments of the present application can perform a linear transformation on multi-angle data so that the pixel value is between a preset range to obtain the changed data. For example, if the pixel value needs to be scaled to between 0 and 1, a linear transformation can be performed to map it from the current pixel value range to the target range to obtain the changed data.

本申请实施例通过对多角度数据做线性变换,以使像素值位于预设范围之间,得到变化后的数据,可以有效提高图像的对比度和显示效果,有助于识别和分析图像中的细节,减少视觉不一致性。The embodiment of the present application performs a linear transformation on multi-angle data so that the pixel value is within a preset range to obtain the changed data, which can effectively improve the contrast and display effect of the image, help identify and analyze the details in the image, and reduce visual inconsistency.

在步骤S203中,根据预处理后的数据和点扩散函数构建自监督重建网络,直至达到预设迭代停止条件,得到扫描光场自监督网络重建结果。In step S203, a self-supervised reconstruction network is constructed according to the preprocessed data and the point spread function until a preset iteration stop condition is reached to obtain a scanned light field self-supervised network reconstruction result.

可以理解的是,预设迭代停止条件可以为迭代次数达到预设迭代次数阈值,如迭代次数达到50000次等。It can be understood that the preset iteration stop condition may be that the number of iterations reaches a preset iteration threshold, such as the number of iterations reaches 50,000 times.

具体地,本申请实施例可以利用预处理后的数据,用于有监督重建网络训练或有监督重建网络测试,其中,自监督重建网络训练,包括网络输入前传、随机多角度前传和自监督损失函数回传,完成以上三步即称作一次迭代,当网络训练没有结束时进入下一次迭代, 即下一批数据通过网络、随机多角度前传和自监督损失函数回传;自监督重建网络测试,在自监督重建网络训练完成后进行,用于测试选定数据在网络上的最终表现。Specifically, the embodiment of the present application can use the preprocessed data for supervised reconstruction network training or supervised reconstruction network testing, wherein the self-supervised reconstruction network training includes network input forward transmission, random multi-angle forward transmission and self-supervised loss function back transmission. Completion of the above three steps is called an iteration. When the network training is not completed, it enters the next iteration. That is, the next batch of data is forwarded through the network, randomly transmitted at multiple angles, and transmitted back through the self-supervised loss function; the self-supervised reconstruction network test is performed after the self-supervised reconstruction network training is completed, and is used to test the final performance of the selected data on the network.

举例而言,结合图3所示,本申请实施例可以根据预处理后的数据和点扩散函数构建自监督重建网络,直至达到预设迭代停止条件,得到扫描光场自监督网络重建结果,其中,经过网络重建的图像是3D图像,但显示为沿z轴的投影。具体地,本申请实施例可以使用扫描光场仪器(扫描倍率为3,微透镜后的像素点个数为13×13)拍摄斑马鱼胚胎数据,然后对数据进行重排、归一化和扩增,产生1,000张尺寸为60×60×169的多角度图像,用于自监督重建网络训练;使用基于Tensorflow的Keras深度学习框架和Python编程语言搭建自监督重建网络,具体地,在输入层后使用双线性插值将网络输入调整为目标大小,然后经过一个U-Net,进一步地,通过训练网络,其中,初始学习率为1×10-4,训练批大小为3,使用Adam优化器进行反向传播迭代优化,共训练50,000个迭代,每10,000个迭代学习率降为原来的一半,从而可以将多角度测试图像输入训练完成的自监督重建网络,得到重建图像。For example, in combination with what is shown in FIG3 , an embodiment of the present application can construct a self-supervised reconstruction network based on the preprocessed data and the point spread function until a preset iteration stop condition is reached, thereby obtaining a scanned light field self-supervised network reconstruction result, wherein the image reconstructed by the network is a 3D image, but is displayed as a projection along the z-axis. Specifically, the embodiment of the present application can use a scanning light field instrument (scanning magnification is 3, the number of pixels behind the microlens is 13×13) to shoot zebrafish embryo data, and then rearrange, normalize and amplify the data to generate 1,000 multi-angle images with a size of 60×60×169 for self-supervised reconstruction network training; use the Keras deep learning framework based on Tensorflow and the Python programming language to build a self-supervised reconstruction network. Specifically, after the input layer, bilinear interpolation is used to adjust the network input to the target size, and then pass through a U-Net, and further, through the training network, wherein the initial learning rate is 1× 10-4 , the training batch size is 3, and the Adam optimizer is used for back-propagation iterative optimization, and a total of 50,000 iterations are trained, and the learning rate is reduced to half of the original every 10,000 iterations, so that the multi-angle test image can be input into the trained self-supervised reconstruction network to obtain a reconstructed image.

本申请实施例可以利用预处理后的数据和点扩散函数,在训练过程中逐步提升对光场图像的重建能力,通过迭代训练,提高对不同样本和场景的适应度和泛化能力,进而能够提升重建的准确性和效率。The embodiments of the present application can utilize preprocessed data and point spread functions to gradually improve the ability to reconstruct light field images during the training process. Through iterative training, the adaptability and generalization ability to different samples and scenes can be improved, thereby improving the accuracy and efficiency of reconstruction.

可选地,在本申请的一个实施例中,根据预处理后的数据和点扩散函数构建自监督重建网络,直至达到预设迭代停止条件,得到扫描光场自监督网络重建结果,包括:选择任一图像处理网络,将多角度数据输入并前传得到网络输出;随机挑选点扩散函数的几个角度,对网络输出得到前向投影。Optionally, in one embodiment of the present application, a self-supervised reconstruction network is constructed based on the preprocessed data and the point spread function until a preset iteration stop condition is reached to obtain a scanned light field self-supervised network reconstruction result, including: selecting any image processing network, inputting multi-angle data and forwarding it to obtain a network output; randomly selecting several angles of the point spread function, and forward projecting the network output.

在实际执行过程中,本申请实施例可以随机挑选两个点扩散函数角度进行前向投影,将对应角度的点扩散函数与网络输出做卷积得到前向投影,并可以根据扫描倍率与微透镜后的像素点个数调整前向投影的大小。During the actual implementation process, the embodiment of the present application can randomly select two point spread function angles for forward projection, convolve the point spread function of the corresponding angle with the network output to obtain the forward projection, and can adjust the size of the forward projection according to the scanning magnification and the number of pixels behind the microlens.

具体地,结合图4所示,本申请实施例通过加入随机多角度前传模块,使用点扩散函数的信息,进而可以模拟网络输出的光学前传过程,具体步骤如下:Specifically, in conjunction with FIG. 4 , the embodiment of the present application adds a random multi-angle forward transmission module and uses the information of the point spread function to simulate the optical forward transmission process of the network output. The specific steps are as follows:

步骤S1:随机挑选角度。Step S1: Randomly select an angle.

可以理解的是,光场或扫描光场显微系统的核心是相机前的微透镜阵列,每个微透镜后有Nnum×Nnum个相机感光像素点,也被称作角度,因为不同位置的像素点对应不同的角度,且微透镜边缘的角度信息少、光强度低,所以只使用以微透镜光心为圆心、半径为r的圆内的角度,其中,r一般比Nnum的一半少一到两个像素点,也可以根据计算资源和实际需要来指定,从圆内角度中再挑选N个角度进行后续计算,其中,挑选方法可以如下所示: It can be understood that the core of the light field or scanning light field microscopy system is the microlens array in front of the camera. There are Nnum×Nnum camera photosensitive pixels behind each microlens, also known as angles. Because pixels at different positions correspond to different angles, and the angle information at the edge of the microlens is small and the light intensity is low, only the angles within the circle with the microlens optical center as the center and a radius of r are used, where r is generally one or two pixels less than half of Nnum. It can also be specified based on computing resources and actual needs, and N angles are selected from the angles within the circle for subsequent calculations. The selection method can be as follows:

(1)每次迭代都随机挑选N个角度;(1) N angles are randomly selected in each iteration;

(2)每次迭代有M个角度固定,不随着迭代而改变,另外随机挑选N-M个角度,且不与固定的M个角度重合。例如,M=1,固定微透镜光心所在的角度,另外随机挑选除微透镜光心所在角度外的N-1个角度;(2) Each iteration has M fixed angles that do not change with the iteration, and N-M angles are randomly selected, and they do not overlap with the fixed M angles. For example, M = 1, the angle where the optical center of the microlens is fixed, and N-1 angles other than the angle where the optical center of the microlens is randomly selected;

(3)每次迭代随机挑选N-M个角度,另外M个角度从上一次迭代中挑选的N个角度中随机挑选,且保证不与本次迭代挑选的N-M个角度重合;(3) N-M angles are randomly selected in each iteration, and the other M angles are randomly selected from the N angles selected in the previous iteration, and they are guaranteed not to overlap with the N-M angles selected in this iteration;

(4)将所有可挑选的角度排列成的圆等分成N份,每次迭代都从N份中各随机挑选一个角度;(4) Divide the circle formed by arranging all the selectable angles into N equal parts, and randomly select an angle from each of the N parts in each iteration;

值得注意的是,以上挑选方法可能在不同实施例中使网络收敛速度略有不同,但最终达到的效果从理论上是相似的,且挑选部分角度进行计算是为了节省内存与显存的开销,避免网络训练过程中内存或显存溢出的问题,在训练迭代次数足够多后,由于角度挑选的随机性,网络优化结果理论上应当与每次挑选所有角度计算的结果相同。It is worth noting that the above selection method may cause slightly different network convergence speeds in different embodiments, but the final effect is similar in theory, and the selection of some angles for calculation is to save memory and video memory overhead and avoid memory or video memory overflow during network training. After a sufficient number of training iterations, due to the randomness of angle selection, the network optimization result should theoretically be the same as the result of selecting all angles for calculation each time.

步骤S2:将对应角度的点扩散函数与网络输出做卷积。Step S2: Convolve the point spread function of the corresponding angle with the network output.

其中,对步骤S1中挑选的每个角度,取点扩散函数对应角度的部分,即一个3D向量,与3D网络输出做卷积,得到前向投影,在实际执行过程中,可以用频域操作替代空域卷积,可以如下所示:
Proji=iFFTshift{iFFT[FFTshift(FFT(Output))·FFTshift(FFT(PSFi))]}
Among them, for each angle selected in step S1, the part of the point spread function corresponding to the angle, that is, a 3D vector, is convolved with the 3D network output to obtain the forward projection. In the actual execution process, the frequency domain operation can be used instead of the spatial domain convolution, which can be shown as follows:
Proj i =iFFTshift{iFFT[FFTshift(FFT(Output))·FFTshift(FFT(PSF i ))]}

其中,Proji为前向投影的第i个角度分量,是一个2D图像;PSFi为点扩散函数的第i个角度分量;Output为网络输出;FFT为快速傅里叶变换;FFTshift为移动频谱使得零频处于中央的操作;iFFT为FFT的逆操作;iFFTshift为FFTshift的逆操作。Among them, Proj i is the i-th angular component of the forward projection, which is a 2D image; PSF i is the i-th angular component of the point spread function; Output is the network output; FFT is the fast Fourier transform; FFTshift is the operation of moving the spectrum so that the zero frequency is in the center; iFFT is the inverse operation of FFT; iFFTshift is the inverse operation of FFTshift.

进而,可以将Proj1,...,ProjN拼接成3D向量Proj_tmp,得到N个角度的前向投影。Furthermore, Proj 1 , ..., Proj N can be concatenated into a 3D vector Proj_tmp to obtain a forward projection of N angles.

步骤S3:根据扫描倍率与微透镜后的像素点个数调整前向投影的大小。Step S3: adjusting the size of the forward projection according to the scanning magnification and the number of pixels behind the microlens.

具体地,调整后的前向投影与步骤S2中拼接得到的前向投影的关系可以由如下表达式决定:
Proj=ImResize(Proj_tmp,scanning/Nnum)
Specifically, the relationship between the adjusted forward projection and the forward projection obtained by splicing in step S2 can be determined by the following expression:
Proj=ImResize(Proj_tmp,scanning/Nnum)

其中,Proj为调整后的前向投影;Proj_tmp为步骤S2中拼接得到的前向投影;ImResize(Img,scale)为某种调整图像尺寸的方法,如双线性插值、最临近插值等,将Img调整为原大小的scale倍;scanning为扫描倍率;Nnum为微透镜后一列或一排像素点的个数。Among them, Proj is the adjusted forward projection; Proj_tmp is the forward projection obtained by splicing in step S2; ImResize(Img,scale) is a method of adjusting the image size, such as bilinear interpolation, nearest neighbor interpolation, etc., which adjusts Img to scale times the original size; scanning is the scanning magnification; Nnum is the number of pixels in one column or one row behind the microlens.

值得注意的是,在步骤S1至步骤S3中,本申请实施例完成了对网络输出Output的光学前传模拟,若从网络输入中挑选对应N个角度的部分,记为Input,那么Input和Proj具有相同的尺寸。 It is worth noting that in step S1 to step S3, the embodiment of the present application completes the optical forward transmission simulation of the network output Output. If the part corresponding to N angles is selected from the network input and recorded as Input, then Input and Proj have the same size.

本申请实施例通过随机挑选点扩散函数角度进行前向投影,并根据扫描倍率和微透镜后的像素点个数调整前向投影的大小,可以提高投影结果的全面性和多样化,进而提高扫描光场技术的适用性。The embodiment of the present application randomly selects the point spread function angle for forward projection, and adjusts the size of the forward projection according to the scanning magnification and the number of pixels behind the microlens, which can improve the comprehensiveness and diversity of the projection results, thereby improving the applicability of the scanning light field technology.

可选地,在本申请的一个实施例中,根据预处理后的数据和点扩散函数构建自监督重建网络,直至达到预设迭代停止条件,得到扫描光场自监督网络重建结果,还包括:计算网络输出和前向投影的对应角度之间的均方误差;计算网络输出的二阶导的二范数和网络输出的轴向连续性约束;对均方误差、二范数和轴向连续性约束进行加权,计算自监督重建网络的自监督损失函数,并回传更新网络参数,得到扫描光场自监督网络重建结果。Optionally, in one embodiment of the present application, a self-supervised reconstruction network is constructed based on the preprocessed data and the point spread function until a preset iteration stop condition is reached to obtain a scanned light field self-supervised network reconstruction result, and also includes: calculating the mean square error between the network output and the corresponding angle of the forward projection; calculating the second norm of the second-order derivative of the network output and the axial continuity constraint of the network output; weighting the mean square error, the second norm and the axial continuity constraint, calculating the self-supervised loss function of the self-supervised reconstruction network, and transmitting back to update the network parameters to obtain the scanned light field self-supervised network reconstruction result.

具体地,结合图4所示,本申请实施例可以对随机多角度前传与自监督损失函数进行计算,具体步骤如下:Specifically, in combination with FIG4 , the embodiment of the present application can calculate the random multi-angle forward transmission and self-supervised loss function, and the specific steps are as follows:

步骤S4:计算自监督损失函数。Step S4: Calculate the self-supervised loss function.

具体地,函数值可以由如下表达式决定:
L(Proj,Input,Output)=αMSE(Proj,Input)+βHess(Output)+γCont(Output)
Specifically, the function value can be determined by the following expression:
L(Proj,Input,Output)=αMSE(Proj,Input)+βHess(Output)+γCont(Output)

其中,Proj为所述调整后的前向投影,Input为所述网络输入对应角度的部分,Output为所述网络输出,L为所述自监督损失函数,MSE为所述前向投影与所述网络输入之间的均方误差,Hess为所述网络输出的二阶导的二范数,Cont为所述网络输出的轴向连续性约束,α为所述均方误差的权重,β为所述二阶导的二范数的权重,γ为所述连续性约束的权重。Among them, Proj is the adjusted forward projection, Input is the part of the network input corresponding to the angle, Output is the network output, L is the self-supervised loss function, MSE is the mean square error between the forward projection and the network input, Hess is the second norm of the second-order derivative of the network output, Cont is the axial continuity constraint of the network output, α is the weight of the mean square error, β is the weight of the second norm of the second-order derivative, and γ is the weight of the continuity constraint.

其中,网络输出的轴向连续性约束可以由以下表达式决定:
Among them, the axial continuity constraint of the network output can be determined by the following expression:

其中,Output为所述网络输出,是一个三维向量,第三维代表轴向,Cont为所述网络输出的轴向连续性约束,N为所述网络输出的轴向像素点个数,Sum(·)为对二维向量每个分量的求和。Among them, Output is the network output, which is a three-dimensional vector, the third dimension represents the axial direction, Cont is the axial continuity constraint of the network output, N is the number of axial pixels output by the network, and Sum(·) is the sum of each component of the two-dimensional vector.

本申请实施例可以通过计算网络输出和前向投影之间的均方误差,以量化网络重建结果与真实光场数据之间的误差,并通过优化自监督损失函数,可以逐步减小均方误差,从而提高重建结果的准确性,通过考虑网络输出的二阶导的二范数和轴向连续性约束,有助于抑制重建结果中的噪声和伪影,提升重建结果的平滑性和稳定性。The embodiments of the present application can quantify the error between the network reconstruction result and the real light field data by calculating the mean square error between the network output and the forward projection, and can gradually reduce the mean square error by optimizing the self-supervised loss function, thereby improving the accuracy of the reconstruction result. By considering the second norm of the second-order derivative of the network output and the axial continuity constraint, it helps to suppress noise and artifacts in the reconstruction result and improve the smoothness and stability of the reconstruction result.

根据本申请实施例提出的扫描光场自监督网络重建方法,可以通过获取扫描光场数据与拍摄扫描光场数据的光学系统的点扩散函数,对扫描光场数据进行预处理,并根据预处理后的数据和点扩散函数构建自监督重建网络,直至达到预设迭代停止条件,从而得到扫描光场自监督网络重建结果,能够进行不断地校准和调整,提高数据的准确性,进而提高 系统的稳定性和可靠性,且减少对人工干预的需求,避免主观因素和操作误差的影响。由此,解决了相关技术中,由于需要提供高分辨率的真值图像,导致成本增加,耗时较长,降低了重建效率,且由于存在原始物平面处的严重重建伪影,容易导致物体表面的细节丢失或扭曲,降低重建结果的质量和准确性等问题。According to the scanning light field self-supervised network reconstruction method proposed in the embodiment of the present application, the scanning light field data can be pre-processed by obtaining the scanning light field data and the point spread function of the optical system that shoots the scanning light field data, and a self-supervised reconstruction network is constructed according to the pre-processed data and the point spread function until a preset iteration stop condition is reached, thereby obtaining the scanning light field self-supervised network reconstruction result, which can be continuously calibrated and adjusted to improve the accuracy of the data, thereby improving The stability and reliability of the system are improved, and the need for manual intervention is reduced, avoiding the influence of subjective factors and operational errors. This solves the problems in related technologies, such as the need to provide high-resolution true-value images, which increases costs, takes a long time, reduces reconstruction efficiency, and the existence of serious reconstruction artifacts at the original object plane, which easily leads to loss or distortion of object surface details, reducing the quality and accuracy of the reconstruction results.

其次参照附图描述根据本申请实施例提出的扫描光场自监督网络重建装置。Next, the scanning light field self-supervised network reconstruction device proposed according to the embodiment of the present application is described with reference to the accompanying drawings.

图5是本申请实施例的扫描光场自监督网络重建装置的结构示意图。FIG5 is a schematic diagram of the structure of a scanning light field self-supervisory network reconstruction device according to an embodiment of the present application.

如图5所示,该扫描光场自监督网络重建装置10包括:获取模块100、处理模块200和重建模块300。As shown in FIG. 5 , the scanning light field self-supervised network reconstruction device 10 includes: an acquisition module 100 , a processing module 200 and a reconstruction module 300 .

具体地,获取模块100,用于获取扫描光场数据与拍摄扫描光场数据的光学系统的点扩散函数。Specifically, the acquisition module 100 is used to acquire the scanning light field data and the point spread function of the optical system that captures the scanning light field data.

处理模块200,用于对扫描光场数据进行预处理,得到预处理后的数据。The processing module 200 is used to pre-process the scanned light field data to obtain pre-processed data.

重建模块300,用于根据预处理后的数据和点扩散函数构建自监督重建网络,直至达到预设迭代停止条件,得到扫描光场自监督网络重建结果。The reconstruction module 300 is used to construct a self-supervised reconstruction network according to the preprocessed data and the point spread function until a preset iteration stop condition is reached to obtain a scanned light field self-supervised network reconstruction result.

可选地,在本申请的一个实施例中,处理模块200包括:生成单元。Optionally, in one embodiment of the present application, the processing module 200 includes: a generating unit.

其中,生成单元,用于将扫描光场数据的像素点基于在光学系统的微透镜后的位置重新排列,生成多角度数据,其中,所有在微透镜后同一位置的像素点按照空间顺序依次排列。The generating unit is used to rearrange the pixel points of the scanned light field data based on the position behind the microlens of the optical system to generate multi-angle data, wherein all the pixel points at the same position behind the microlens are arranged in sequence according to the spatial order.

可选地,在本申请的一个实施例中,处理模块200还包括:变换单元。Optionally, in one embodiment of the present application, the processing module 200 further includes: a transformation unit.

其中,变换单元,用于对多角度数据做线性变换,以使像素值位于预设范围之间,得到变化后的数据。The transformation unit is used to perform linear transformation on the multi-angle data so that the pixel value is within a preset range to obtain the changed data.

可选地,在本申请的一个实施例中,重建模块300包括:前传单元和投影单元。Optionally, in one embodiment of the present application, the reconstruction module 300 includes: a forward transmission unit and a projection unit.

其中,前传单元,用于选择任一图像处理网络,将多角度数据输入并前传得到网络输出;The forward transmission unit is used to select any image processing network, input multi-angle data and forward transmit it to obtain network output;

投影单元,用于随机挑选点扩散函数的几个角度,对网络输出得到前向投影。The projection unit is used to randomly select several angles of the point spread function and obtain the forward projection of the network output.

可选地,在本申请的一个实施例中,重建模块300还包括:第一计算单元、第二计算单元和加权单元。Optionally, in one embodiment of the present application, the reconstruction module 300 further includes: a first calculation unit, a second calculation unit and a weighting unit.

其中,第一计算单元,用于计算网络输出和前向投影的对应角度之间的均方误差;Wherein, the first calculation unit is used to calculate the mean square error between the network output and the corresponding angle of the forward projection;

第二计算单元,用于计算网络输出的二阶导的二范数和网络输出的轴向连续性约束;A second calculation unit is used to calculate the second norm of the second-order derivative of the network output and the axial continuity constraint of the network output;

加权单元,用于对均方误差、二范数和轴向连续性约束进行加权,计算自监督重建网络的自监督损失函数,并回传更新网络参数,得到扫描光场自监督网络重建结果。The weighting unit is used to weight the mean square error, the second norm and the axial continuity constraint, calculate the self-supervised loss function of the self-supervised reconstruction network, and transmit back to update the network parameters to obtain the scanned light field self-supervised network reconstruction result.

需要说明的是,前述对扫描光场自监督网络重建方法实施例的解释说明也适用于该实 施例的扫描光场自监督网络重建装置,此处不再赘述。It should be noted that the above explanation of the embodiment of the scanning light field self-supervised network reconstruction method is also applicable to this embodiment. The scanning light field self-supervised network reconstruction device of the embodiment will not be described in detail here.

根据本申请实施例提出的扫描光场自监督网络重建装置,可以通过获取扫描光场数据与拍摄扫描光场数据的光学系统的点扩散函数,对扫描光场数据进行预处理,并根据预处理后的数据和点扩散函数构建自监督重建网络,直至达到预设迭代停止条件,从而得到扫描光场自监督网络重建结果,能够进行不断地校准和调整,提高数据的准确性,进而提高系统的稳定性和可靠性,且减少对人工干预的需求,避免主观因素和操作误差的影响。由此,解决了相关技术中,由于需要提供高分辨率的真值图像,导致成本增加,耗时较长,降低了重建效率,且由于存在原始物平面处的严重重建伪影,容易导致物体表面的细节丢失或扭曲,降低重建结果的质量和准确性等问题。According to the scanning light field self-supervised network reconstruction device proposed in the embodiment of the present application, the scanning light field data can be pre-processed by obtaining the scanning light field data and the point spread function of the optical system that shoots the scanning light field data, and a self-supervised reconstruction network is constructed according to the pre-processed data and the point spread function until the preset iteration stop condition is reached, thereby obtaining the scanning light field self-supervised network reconstruction result, which can be continuously calibrated and adjusted to improve the accuracy of the data, thereby improving the stability and reliability of the system, and reducing the need for manual intervention, avoiding the influence of subjective factors and operational errors. Thus, the problems in the related art that the need to provide a high-resolution true value image leads to increased costs, longer time consumption, and reduced reconstruction efficiency, and that the existence of serious reconstruction artifacts at the original object plane easily leads to loss or distortion of details on the object surface, reducing the quality and accuracy of the reconstruction results, etc.

图6为本申请实施例提供的电子设备的结构示意图。该电子设备可以包括:FIG6 is a schematic diagram of the structure of an electronic device provided in an embodiment of the present application. The electronic device may include:

存储器601、处理器602及存储在存储器601上并可在处理器602上运行的计算机程序。A memory 601 , a processor 602 , and a computer program stored in the memory 601 and executable on the processor 602 .

处理器602执行程序时实现上述实施例中提供的扫描光场自监督网络重建方法。When the processor 602 executes the program, the scanning light field self-supervised network reconstruction method provided in the above embodiment is implemented.

进一步地,电子设备还包括:Furthermore, the electronic device further comprises:

通信接口603,用于存储器601和处理器602之间的通信。The communication interface 603 is used for communication between the memory 601 and the processor 602 .

存储器601,用于存放可在处理器602上运行的计算机程序。The memory 601 is used to store computer programs that can be executed on the processor 602 .

存储器601可能包含高速RAM存储器,也可能还包括非易失性存储器(non-volatile memory),例如至少一个磁盘存储器。Memory 601 may include high-speed RAM memory, and may also include non-volatile memory (non-volatile memory), such as at least one disk storage.

如果存储器601、处理器602和通信接口603独立实现,则通信接口603、存储器601和处理器602可以通过总线相互连接并完成相互间的通信。总线可以是工业标准体系结构(Industry Standard Architecture,简称为ISA)总线、外部设备互连(Peripheral Component Interconnect,简称为PCI)总线或扩展工业标准体系结构(Extended Industry Standard Architecture,简称为EISA)总线等。总线可以分为地址总线、数据总线、控制总线等。为便于表示,图6中仅用一条粗线表示,但并不表示仅有一根总线或一种类型的总线。If the memory 601, the processor 602 and the communication interface 603 are implemented independently, the communication interface 603, the memory 601 and the processor 602 can be connected to each other through a bus and communicate with each other. The bus can be an Industry Standard Architecture (ISA) bus, a Peripheral Component Interconnect (PCI) bus or an Extended Industry Standard Architecture (EISA) bus. The bus can be divided into an address bus, a data bus, a control bus, etc. For ease of representation, only one thick line is used in FIG6, but it does not mean that there is only one bus or one type of bus.

可选地,在具体实现上,如果存储器601、处理器602及通信接口603,集成在一块芯片上实现,则存储器601、处理器602及通信接口603可以通过内部接口完成相互间的通信。Optionally, in a specific implementation, if the memory 601, the processor 602 and the communication interface 603 are integrated on a chip, the memory 601, the processor 602 and the communication interface 603 can communicate with each other through an internal interface.

处理器602可能是一个中央处理器(Central Processing Unit,简称为CPU),或者是特定集成电路(Application Specific Integrated Circuit,简称为ASIC),或者是被配置成实施本申请实施例的一个或多个集成电路。Processor 602 may be a central processing unit (CPU), or an application specific integrated circuit (ASIC), or one or more integrated circuits configured to implement the embodiments of the present application.

本申请实施例还提供一种计算机可读存储介质,其上存储有计算机程序,该程序被处 理器执行时实现如上的扫描光场自监督网络重建方法。The present application also provides a computer-readable storage medium on which a computer program is stored. The above scanned light field self-supervised network reconstruction method is implemented when the processor is executed.

在本说明书的描述中,参考术语“一个实施例”、“一些实施例”、“示例”、“具体示例”、或“一些示例”等的描述意指结合该实施例或示例描述的具体特征、结构、材料或者特点包含于本申请的至少一个实施例或示例中。在本说明书中,对上述术语的示意性表述不必须针对的是相同的实施例或示例。而且,描述的具体特征、结构、材料或者特点可以在任一个或N个实施例或示例中以合适的方式结合。此外,在不相互矛盾的情况下,本领域的技术人员可以将本说明书中描述的不同实施例或示例以及不同实施例或示例的特征进行结合和组合。In the description of this specification, the description with reference to the terms "one embodiment", "some embodiments", "example", "specific example", or "some examples" etc. means that the specific features, structures, materials or characteristics described in conjunction with the embodiment or example are included in at least one embodiment or example of the present application. In this specification, the schematic representations of the above terms do not necessarily refer to the same embodiment or example. Moreover, the specific features, structures, materials or characteristics described may be combined in any one or N embodiments or examples in a suitable manner. In addition, those skilled in the art may combine and combine the different embodiments or examples described in this specification and the features of the different embodiments or examples, without contradiction.

此外,术语“第一”、“第二”仅用于描述目的,而不能理解为指示或暗示相对重要性或者隐含指明所指示的技术特征的数量。由此,限定有“第一”、“第二”的特征可以明示或者隐含地包括至少一个该特征。在本申请的描述中,“N个”的含义是至少两个,例如两个,三个等,除非另有明确具体的限定。In addition, the terms "first" and "second" are used for descriptive purposes only and should not be understood as indicating or implying relative importance or implicitly indicating the number of technical features indicated. Therefore, a feature defined as "first" or "second" may explicitly or implicitly include at least one of the features. In the description of this application, "N" means at least two, such as two, three, etc., unless otherwise clearly and specifically defined.

流程图中或在此以其他方式描述的任何过程或方法描述可以被理解为,表示包括一个或N个用于实现定制逻辑功能或过程的步骤的可执行指令的代码的模块、片段或部分,并且本申请的优选实施方式的范围包括另外的实现,其中可以不按所示出或讨论的顺序,包括根据所涉及的功能按基本同时的方式或按相反的顺序,来执行功能,这应被本申请的实施例所属技术领域的技术人员所理解。Any process or method description in a flowchart or otherwise described herein may be understood to represent a module, fragment or portion of code comprising one or N executable instructions for implementing the steps of a custom logical function or process, and the scope of the preferred embodiments of the present application includes alternative implementations in which functions may not be performed in the order shown or discussed, including performing functions in a substantially simultaneous manner or in reverse order depending on the functions involved, which should be understood by technicians in the technical field to which the embodiments of the present application belong.

在流程图中表示或在此以其他方式描述的逻辑和/或步骤,例如,可以被认为是用于实现逻辑功能的可执行指令的定序列表,可以具体实现在任何计算机可读介质中,以供指令执行系统、装置或设备(如基于计算机的系统、包括处理器的系统或其他可以从指令执行系统、装置或设备取指令并执行指令的系统)使用,或结合这些指令执行系统、装置或设备而使用。就本说明书而言,"计算机可读介质"可以是任何可以包含、存储、通信、传播或传输程序以供指令执行系统、装置或设备或结合这些指令执行系统、装置或设备而使用的装置。计算机可读介质的更具体的示例(非穷尽性列表)包括以下:具有一个或N个布线的电连接部(电子装置),便携式计算机盘盒(磁装置),随机存取存储器(RAM),只读存储器(ROM),可擦除可编辑只读存储器(EPROM或闪速存储器),光纤装置,以及便携式光盘只读存储器(CDROM)。另外,计算机可读介质甚至可以是可在其上打印所述程序的纸或其他合适的介质,因为可以通过对纸或其他介质进行光学扫描,接着进行编辑、解译或必要时以其他合适方式进行处理来以电子方式获得所述程序,然后将其存储在计算机存储器中。The logic and/or steps represented in the flowchart or otherwise described herein, for example, can be considered as an ordered list of executable instructions for implementing logical functions, and can be embodied in any computer-readable medium for use by an instruction execution system, device or apparatus (such as a computer-based system, a system including a processor, or other system that can fetch instructions from an instruction execution system, device or apparatus and execute instructions), or in combination with these instruction execution systems, devices or apparatuses. For the purpose of this specification, "computer-readable medium" can be any device that can contain, store, communicate, propagate or transmit a program for use by an instruction execution system, device or apparatus, or in combination with these instruction execution systems, devices or apparatuses. More specific examples of computer-readable media (a non-exhaustive list) include the following: an electrical connection with one or N wirings (electronic devices), a portable computer disk box (magnetic device), a random access memory (RAM), a read-only memory (ROM), an erasable and programmable read-only memory (EPROM or flash memory), a fiber optic device, and a portable compact disk read-only memory (CDROM). In addition, the computer-readable medium may even be paper or other suitable medium on which the program is printed, since the program may be obtained electronically by optically scanning the paper or other medium and then editing, interpreting or processing in other suitable ways as necessary and then storing it in a computer memory.

应当理解,本申请的各部分可以用硬件、软件、固件或它们的组合来实现。在上述实 施方式中,N个步骤或方法可以用存储在存储器中且由合适的指令执行系统执行的软件或固件来实现。如果用硬件来实现和在另一实施方式中一样,可用本领域公知的下列技术中的任一项或他们的组合来实现:具有用于对数据信号实现逻辑功能的逻辑门电路的离散逻辑电路,具有合适的组合逻辑门电路的专用集成电路,可编程门阵列(PGA),现场可编程门阵列(FPGA)等。It should be understood that various parts of the present application can be implemented by hardware, software, firmware or a combination thereof. In the embodiment, the N steps or methods may be implemented by software or firmware stored in a memory and executed by a suitable instruction execution system. If implemented by hardware, as in another embodiment, it may be implemented by any one of the following technologies known in the art or a combination thereof: a discrete logic circuit having a logic gate circuit for implementing a logic function on a data signal, a dedicated integrated circuit having a suitable combination of logic gate circuits, a programmable gate array (PGA), a field programmable gate array (FPGA), etc.

本技术领域的普通技术人员可以理解实现上述实施例方法携带的全部或部分步骤是可以通过程序来指令相关的硬件完成,所述的程序可以存储于一种计算机可读存储介质中,该程序在执行时,包括方法实施例的步骤之一或其组合。A person skilled in the art may understand that all or part of the steps in the method for implementing the above-mentioned embodiment may be completed by instructing related hardware through a program, and the program may be stored in a computer-readable storage medium, which, when executed, includes one or a combination of the steps of the method embodiment.

此外,在本申请各个实施例中的各功能单元可以集成在一个处理模块中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个模块中。上述集成的模块既可以采用硬件的形式实现,也可以采用软件功能模块的形式实现。所述集成的模块如果以软件功能模块的形式实现并作为独立的产品销售或使用时,也可以存储在一个计算机可读取存储介质中。In addition, each functional unit in each embodiment of the present application may be integrated into a processing module, or each unit may exist physically separately, or two or more units may be integrated into one module. The above-mentioned integrated module may be implemented in the form of hardware or in the form of a software functional module. If the integrated module is implemented in the form of a software functional module and sold or used as an independent product, it may also be stored in a computer-readable storage medium.

上述提到的存储介质可以是只读存储器,磁盘或光盘等。尽管上面已经示出和描述了本申请的实施例,可以理解的是,上述实施例是示例性的,不能理解为对本申请的限制,本领域的普通技术人员在本申请的范围内可以对上述实施例进行变化、修改、替换和变型。 The storage medium mentioned above may be a read-only memory, a magnetic disk or an optical disk, etc. Although the embodiments of the present application have been shown and described above, it can be understood that the above embodiments are exemplary and cannot be understood as limiting the present application. A person of ordinary skill in the art may change, modify, replace and modify the above embodiments within the scope of the present application.

Claims (12)

一种扫描光场自监督网络重建方法,其特征在于,包括以下步骤:A scanning light field self-supervised network reconstruction method, characterized by comprising the following steps: 获取扫描光场数据与拍摄所述扫描光场数据的光学系统的点扩散函数;Acquire scanning light field data and a point spread function of an optical system that photographs the scanning light field data; 对所述扫描光场数据进行预处理,得到预处理后的数据;以及Preprocessing the scanned light field data to obtain preprocessed data; and 根据所述预处理后的数据和所述点扩散函数构建自监督重建网络,直至达到预设迭代停止条件,得到扫描光场自监督网络重建结果。A self-supervised reconstruction network is constructed according to the preprocessed data and the point spread function until a preset iteration stop condition is reached to obtain a scanned light field self-supervised network reconstruction result. 根据权利要求1所述的扫描光场自监督网络重建方法,其特征在于,所述对所述扫描光场数据进行预处理,得到预处理后的数据,包括:The scanning light field self-supervised network reconstruction method according to claim 1 is characterized in that the preprocessing of the scanning light field data to obtain the preprocessed data comprises: 将所述扫描光场数据的像素点基于在所述光学系统的微透镜后的位置重新排列,生成多角度数据,其中,所有在所述微透镜后同一位置的像素点按照空间顺序依次排列。The pixels of the scanned light field data are rearranged based on the positions behind the microlens of the optical system to generate multi-angle data, wherein all the pixels at the same position behind the microlens are arranged in sequence according to a spatial order. 根据权利要求2所述的扫描光场自监督网络重建方法,其特征在于,所述对所述扫描光场数据进行预处理,得到预处理后的数据,还包括:The scanned light field self-supervised network reconstruction method according to claim 2 is characterized in that the preprocessing of the scanned light field data to obtain preprocessed data further comprises: 对所述多角度数据做线性变换,以使像素值位于预设范围之间,得到变化后的数据。A linear transformation is performed on the multi-angle data so that the pixel value is within a preset range to obtain the transformed data. 根据权利要求2所述的扫描光场自监督网络重建方法,其特征在于,所述根据所述预处理后的数据和所述点扩散函数构建自监督重建网络,直至达到预设迭代停止条件,得到扫描光场自监督网络重建结果,包括:The scanning light field self-supervised network reconstruction method according to claim 2 is characterized in that the self-supervised reconstruction network is constructed according to the preprocessed data and the point spread function until a preset iteration stop condition is reached to obtain the scanning light field self-supervised network reconstruction result, comprising: 选择任一图像处理网络,将所述多角度数据输入并前传得到网络输出;Select any image processing network, input the multi-angle data and forward it to obtain network output; 随机挑选点扩散函数的几个角度,对所述网络输出得到前向投影。Several angles of the point spread function are randomly selected and forward projected onto the network output. 根据权利要求4所述的扫描光场自监督网络重建方法,其特征在于,所述根据所述预处理后的数据和所述点扩散函数构建自监督重建网络,直至达到预设迭代停止条件,得到扫描光场自监督网络重建结果,还包括:The scanning light field self-supervised network reconstruction method according to claim 4 is characterized in that the self-supervised reconstruction network is constructed according to the preprocessed data and the point spread function until a preset iteration stop condition is reached to obtain the scanning light field self-supervised network reconstruction result, and further comprises: 计算所述网络输出和所述前向投影的对应角度之间的均方误差;Calculating the mean square error between the network output and the corresponding angle of the forward projection; 计算所述网络输出的二阶导的二范数和所述网络输出的轴向连续性约束;Calculating the second norm of the second derivative of the network output and the axial continuity constraint of the network output; 对所述均方误差、所述二范数和所述轴向连续性约束进行加权,计算所述自监督重建网络的自监督损失函数,并回传更新网络参数,得到所述扫描光场自监督网络重建结果。The mean square error, the second norm and the axial continuity constraint are weighted, the self-supervised loss function of the self-supervised reconstruction network is calculated, and the updated network parameters are fed back to obtain the scanned light field self-supervised network reconstruction result. 一种扫描光场自监督网络重建装置,其特征在于,包括:A scanning light field self-supervised network reconstruction device, characterized by comprising: 获取模块,用于获取扫描光场数据与拍摄所述扫描光场数据的光学系统的点扩散函数;An acquisition module, used for acquiring scanning light field data and a point spread function of an optical system for photographing the scanning light field data; 处理模块,用于对所述扫描光场数据进行预处理,得到预处理后的数据;以及a processing module, used for preprocessing the scanned light field data to obtain preprocessed data; and 重建模块,用于根据所述预处理后的数据和所述点扩散函数构建自监督重建网络,直至达到预设迭代停止条件,得到扫描光场自监督网络重建结果。The reconstruction module is used to construct a self-supervised reconstruction network according to the preprocessed data and the point spread function until a preset iteration stop condition is reached to obtain a scanned light field self-supervised network reconstruction result. 根据权利要求6所述的扫描光场自监督网络重建装置,其特征在于,所述处理模块包括: The scanning light field self-supervised network reconstruction device according to claim 6, characterized in that the processing module comprises: 生成单元,用于将所述扫描光场数据的像素点基于在所述光学系统的微透镜后的位置重新排列,生成多角度数据,其中,所有在所述微透镜后同一位置的像素点按照空间顺序依次排列。A generating unit is used to rearrange the pixel points of the scanned light field data based on the position behind the microlens of the optical system to generate multi-angle data, wherein all the pixel points at the same position behind the microlens are arranged in sequence according to the spatial order. 根据权利要求7所述的扫描光场自监督网络重建装置,其特征在于,所述处理模块还包括:The scanning light field self-supervised network reconstruction device according to claim 7, characterized in that the processing module further comprises: 变换单元,用于对所述多角度数据做线性变换,以使像素值位于预设范围之间,得到变化后的数据。The transformation unit is used to perform linear transformation on the multi-angle data so that the pixel value is within a preset range to obtain the transformed data. 根据权利要求7所述的扫描光场自监督网络重建装置,其特征在于,所述重建模块包括:The scanning light field self-supervised network reconstruction device according to claim 7, characterized in that the reconstruction module comprises: 前传单元,用于选择任一图像处理网络,将所述多角度数据输入并前传得到网络输出;A forward transmission unit, used for selecting any image processing network, inputting the multi-angle data and forwarding it to obtain a network output; 投影单元,用于随机挑选点扩散函数的几个角度,对所述网络输出得到前向投影。The projection unit is used to randomly select several angles of the point spread function to obtain a forward projection of the network output. 根据权利要求9所述的扫描光场自监督网络重建装置,其特征在于,所述重建模块还包括:The scanning light field self-supervised network reconstruction device according to claim 9, characterized in that the reconstruction module further comprises: 第一计算单元,用于计算所述网络输出和所述前向投影的对应角度之间的均方误差;A first calculation unit, configured to calculate a mean square error between the network output and a corresponding angle of the forward projection; 第二计算单元,用于计算所述网络输出的二阶导的二范数和所述网络输出的轴向连续性约束;A second calculation unit, used for calculating the second norm of the second-order derivative of the network output and the axial continuity constraint of the network output; 加权单元,用于对所述均方误差、所述二范数和所述轴向连续性约束进行加权,计算所述自监督重建网络的自监督损失函数,并回传更新网络参数,得到所述扫描光场自监督网络重建结果。A weighting unit is used to weight the mean square error, the second norm and the axial continuity constraint, calculate the self-supervised loss function of the self-supervised reconstruction network, and transmit back to update the network parameters to obtain the scanned light field self-supervised network reconstruction result. 一种电子设备,其特征在于,包括:存储器、处理器及存储在所述存储器上并可在所述处理器上运行的计算机程序,所述处理器执行所述程序,以实现如权利要求1-5任一项所述的扫描光场自监督网络重建方法。An electronic device, characterized in that it includes: a memory, a processor, and a computer program stored in the memory and executable on the processor, wherein the processor executes the program to implement the scanning light field self-supervised network reconstruction method as described in any one of claims 1 to 5. 一种计算机可读存储介质,其上存储有计算机程序,其特征在于,该程序被处理器执行,以用于实现如权利要求1-5任一项所述的扫描光场自监督网络重建方法。 A computer-readable storage medium having a computer program stored thereon, characterized in that the program is executed by a processor to implement the scanning light field self-supervised network reconstruction method as described in any one of claims 1 to 5.
PCT/CN2024/078130 2024-01-18 2024-02-22 Scanning light field self-supervised network reconstruction method and apparatus, electronic device, and medium Pending WO2025152229A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202410074899.5 2024-01-18
CN202410074899.5A CN118172479A (en) 2024-01-18 2024-01-18 Method, device, electronic equipment and medium for reconstructing scanning light field self-supervision network

Publications (1)

Publication Number Publication Date
WO2025152229A1 true WO2025152229A1 (en) 2025-07-24

Family

ID=91358999

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2024/078130 Pending WO2025152229A1 (en) 2024-01-18 2024-02-22 Scanning light field self-supervised network reconstruction method and apparatus, electronic device, and medium

Country Status (2)

Country Link
CN (1) CN118172479A (en)
WO (1) WO2025152229A1 (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN119444575B (en) * 2024-10-31 2025-10-03 华中科技大学 Self-supervised light field three-dimensional high-resolution reconstruction method, system and medium
CN119942006B (en) * 2025-04-08 2025-09-16 清华大学 Scanning light field multispectral reconstruction network method, device, electronic device and medium
CN120235748A (en) * 2025-05-29 2025-07-01 中交第一航务工程勘察设计院有限公司 A large-scale bathymetric topographic map interpolation method based on deep learning

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20210350590A1 (en) * 2019-01-29 2021-11-11 Korea Advanced Institute Of Science And Technology Method and device for imaging of lensless hyperspectral image
CN113870433A (en) * 2021-09-09 2021-12-31 华中科技大学 A light field super-resolution three-dimensional reconstruction method and system
CN115220211A (en) * 2022-07-29 2022-10-21 江南大学 Microscopic imaging system and method based on deep learning and light field imaging
CN117078850A (en) * 2023-08-02 2023-11-17 南京航空航天大学 Multi-modal high-resolution light field reconstruction method based on deep learning
CN117078514A (en) * 2023-07-31 2023-11-17 华中科技大学 A training method, system and product for multi-level light field super-resolution network

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20210350590A1 (en) * 2019-01-29 2021-11-11 Korea Advanced Institute Of Science And Technology Method and device for imaging of lensless hyperspectral image
CN113870433A (en) * 2021-09-09 2021-12-31 华中科技大学 A light field super-resolution three-dimensional reconstruction method and system
CN115220211A (en) * 2022-07-29 2022-10-21 江南大学 Microscopic imaging system and method based on deep learning and light field imaging
CN117078514A (en) * 2023-07-31 2023-11-17 华中科技大学 A training method, system and product for multi-level light field super-resolution network
CN117078850A (en) * 2023-08-02 2023-11-17 南京航空航天大学 Multi-modal high-resolution light field reconstruction method based on deep learning

Also Published As

Publication number Publication date
CN118172479A (en) 2024-06-11

Similar Documents

Publication Publication Date Title
WO2025152229A1 (en) Scanning light field self-supervised network reconstruction method and apparatus, electronic device, and medium
Yan et al. Uncertainty estimation in HDR imaging with Bayesian neural networks
Yan et al. Dual-attention-guided network for ghost-free high dynamic range imaging
Sen et al. Robust patch-based hdr reconstruction of dynamic scenes.
JP4159986B2 (en) Method and system for calculating an image converted from a digital image
JP5517954B2 (en) Method for fast and memory efficient implementation of conversion
US12136184B2 (en) Changing the size of images by means of a neural network
CN114494258B (en) Lens aberration prediction and image reconstruction method and device
CN117541501B (en) Scanning light field self-supervision network denoising method and device, electronic equipment and medium
CN113469880B (en) Image stitching method and device, storage medium and electronic equipment
CN112801904A (en) Hybrid degraded image enhancement method based on convolutional neural network
Conde et al. Lens-to-lens bokeh effect transformation. NTIRE 2023 challenge report
CN120092246A (en) Neural network training method and device, image processing method and device
CN114640885B (en) Video frame inserting method, training device and electronic equipment
Ceruso et al. Relative multiscale deep depth from focus
CN117218282A (en) A method and system for three-dimensional generalization reconstruction of light fields based on model migration
CN109767381A (en) A shape-optimized rectangular panoramic image construction method based on feature selection
CN116228753B (en) Tumor prognosis assessment method, device, computer equipment and storage medium
CN116071239B (en) CT image super-resolution method and device based on mixed attention model
CN114581493B (en) Bidirectional optical flow estimation method and device
JP2019023798A (en) Super-resolution device and program
CN118015179A (en) Three-dimensional light field display method, device, equipment and medium based on camera in-loop
CN118052708A (en) High-resolution image reconstruction method and system based on pathological pictures
JP7704148B2 (en) Training multiple image alignment
Aakerberg et al. Pda-rwsr: Pixel-wise degradation adaptive real-world super-resolution

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 24917847

Country of ref document: EP

Kind code of ref document: A1