CN116188881A

CN116188881A - Method, device, electronic equipment and storage medium for extracting feature elements from remote sensing images

Info

Publication number: CN116188881A
Application number: CN202310147350.XA
Authority: CN
Inventors: 吴有明; 候建龙; 闫志远; 戴威; 王佩瑾
Original assignee: Aerospace Information Research Institute of CAS
Current assignee: Aerospace Information Research Institute of CAS
Priority date: 2023-02-14
Filing date: 2023-02-14
Publication date: 2023-05-30

Abstract

The disclosure provides a method, a device, electronic equipment and a storage medium for extracting ground feature elements of a remote sensing image, which can be applied to the technical field of remote sensing image analysis. The method comprises the following steps: inputting the remote sensing image into a coding network for self-adaptive boundary extraction, extracting image features of the remote sensing image by utilizing different convolution groups, and respectively outputting a first feature image and a second feature image, wherein the convolution times in the first convolution group used for outputting the first feature image are smaller than the convolution times in the second convolution group used for outputting the second feature image, and the remote sensing image comprises boundary information of ground feature elements; pyramid pooling is carried out on the second feature map to obtain a pooled feature map; utilizing a coding network to code and enhance the boundary information of the first feature map to obtain an enhanced feature map; capturing a long-distance dependency relationship of the enhanced feature map to obtain an extracted feature map; and carrying out feature fusion on the extracted feature map and the pooled feature map to generate a target feature map.

Description

Method, device, electronic equipment and storage medium for extracting feature elements from remote sensing images

技术领域technical field

本公开涉及遥感图像解析的技术领域，更具体地，涉及一种遥感图像地物要素提取方法、装置、电子设备及存储介质和程序产品。The present disclosure relates to the technical field of remote sensing image analysis, and more specifically, to a method, device, electronic equipment, storage medium, and program product for extracting feature elements from remote sensing images.

背景技术Background technique

随着遥感技术的飞速发展，从遥感图像中可获取的地物信息越来越丰富，因此遥感图像地物要素提取在众多领域中有着广泛的应用。提取地物要素的目的是为了将遥感图像中地物目标的边界精确地勾勒出来。相关技术中，深度学习方法通过级联非线性的映射将低级别的特征转换为高级别和抽象的特征，提升了遥感图像地物要素提取的性能。With the rapid development of remote sensing technology, the ground object information that can be obtained from remote sensing images is becoming more and more abundant, so the extraction of ground object elements from remote sensing images has a wide range of applications in many fields. The purpose of extracting ground features is to accurately outline the boundaries of ground features in remote sensing images. In related technologies, the deep learning method converts low-level features into high-level and abstract features through cascading nonlinear mapping, which improves the performance of remote sensing image feature extraction.

在实现本公开构思的过程中，发明人发现相关技术中至少存在如下问题，利用现有深度学习方法在对遥感图像地物要素提取的过程中，由于遥感图像本身具有尺寸大、场景复杂、小目标众多，且物体边界处有阴影、遮挡等，导致地物要素预测准确度较低。In the process of realizing the concept of the present disclosure, the inventors found that there are at least the following problems in the related technologies. In the process of extracting ground features from remote sensing images using existing deep learning methods, the remote sensing images themselves have large size, complex scenes, small There are many targets, and there are shadows, occlusions, etc. at the boundaries of objects, resulting in low prediction accuracy of ground object elements.

发明内容Contents of the invention

有鉴于此，本公开实施例提供了一种遥感图像地物要素提取方法、装置、电子设备及存储介质和程序产品。In view of this, the embodiments of the present disclosure provide a method, device, electronic equipment, storage medium, and program product for extracting feature elements from remote sensing images.

本公开实施例的一个方面提供了一种遥感图像地物要素提取方法，包括：将遥感图像输入自适应边界提取的编码网络中，利用不同的卷积组提取上述遥感图像的图像特征，分别输出第一特征图和第二特征图，其中，输出上述第一特征图使用的第一卷积组中的卷积次数小于输出上述第二特征图使用的第二卷积组中的卷积次数，上述遥感图像中包括地物要素的边界信息；对上述第二特征图进行金字塔池化，得到池化特征图；利用上述编码网络对上述第一特征图的边界信息进行编码增强，得到增强特征图；捕获上述增强特征图的长距离依赖关系，得到提取特征图；将上述提取特征图与上述池化特征图进行特征融合，生成目标特征图。One aspect of the embodiments of the present disclosure provides a method for extracting feature elements from remote sensing images, including: inputting remote sensing images into an encoding network for adaptive boundary extraction, using different convolution groups to extract the image features of the above remote sensing images, and outputting The first feature map and the second feature map, wherein the number of convolutions in the first convolution group used to output the first feature map is smaller than the number of convolutions in the second convolution group used to output the second feature map, The above-mentioned remote sensing image includes boundary information of feature elements; pyramid pooling is performed on the above-mentioned second feature map to obtain a pooled feature map; the above-mentioned encoding network is used to encode and enhance the boundary information of the above-mentioned first feature map to obtain an enhanced feature map ; Capture the long-distance dependency of the above-mentioned enhanced feature map to obtain the extracted feature map; perform feature fusion on the above-mentioned extracted feature map and the above-mentioned pooled feature map to generate a target feature map.

根据本公开的实施例，上述将遥感图像输入自适应边界提取的编码网络中，利用不同的卷积组提取上述遥感图像的图像特征，分别输出第一特征图和第二特征图，包括：利用上述编码网络的第一卷积组对上述遥感图像进行下采样，得到上述第一特征图；利用上述编码网络的第二卷积组对上述第一特征图进行下采样，得到上述第二特征图，上述第二卷积组中的卷积次数大于第一卷积组中的卷积次数。According to an embodiment of the present disclosure, the remote sensing image is input into the encoding network for adaptive boundary extraction, and different convolution groups are used to extract the image features of the remote sensing image, and the first feature map and the second feature map are respectively output, including: using The first convolutional group of the encoding network downsamples the remote sensing image to obtain the first feature map; the second convolutional group of the encoding network downsamples the first feature map to obtain the second feature map , the number of convolutions in the second convolution group is greater than the number of convolutions in the first convolution group.

根据本公开的实施例，上述利用上述编码网络的第一卷积组对上述遥感图像进行下采样，得到上述第一特征图，包括：利用上述第一卷积组中的第一动态混合梯度卷积对上述遥感图像进行下采样，得到初始特征图；利用上述第一卷积组中的第二动态混合梯度卷积对上述初始特征图进行下采样，得到上述第一特征图。According to an embodiment of the present disclosure, using the first convolutional group of the encoding network to down-sample the remote sensing image to obtain the first feature map includes: using the first dynamic hybrid gradient volume in the first convolutional group The above remote sensing image is down-sampled to obtain an initial feature map; the above-mentioned initial feature map is down-sampled by using the second dynamic hybrid gradient convolution in the above-mentioned first convolution group to obtain the above-mentioned first feature map.

根据本公开的实施例，上述利用上述第一卷积组中的第一动态混合梯度卷积对上述遥感图像进行下采样，得到初始特征图，包括：基于上述遥感图像确定上述第一动态混合梯度卷积的多个卷积权重系数，上述第一动态混合梯度卷积包括普通卷积核、第一梯度卷积核和第二梯度卷积核；基于上述普通卷积核对应的卷积权重系数，利用上述普通卷积核对上述遥感图像进行特征提取，得到普通特征；基于上述第一梯度卷积核对应的卷积权重系数，利用上述第一梯度卷积核对上述遥感图像进行特征提取，得到第一梯度特征；基于上述第二梯度卷积核对应的卷积权重系数，利用上述第二梯度卷积核对上述遥感图像进行特征提取，得到第二梯度特征；将上述普通特征、上述第一梯度特征和上述第二梯度特征进行特征融合，生成上述初始特征图。According to an embodiment of the present disclosure, the aforementioned remote sensing image is down-sampled using the first dynamic hybrid gradient convolution in the first convolution group to obtain an initial feature map, including: determining the first dynamic hybrid gradient based on the remote sensing image A plurality of convolution weight coefficients for convolution, the above-mentioned first dynamic hybrid gradient convolution includes ordinary convolution kernels, first gradient convolution kernels and second gradient convolution kernels; based on the convolution weight coefficients corresponding to the above-mentioned ordinary convolution kernels , using the above-mentioned ordinary convolution kernel to perform feature extraction on the above-mentioned remote sensing image to obtain ordinary features; A gradient feature; based on the convolution weight coefficient corresponding to the second gradient convolution kernel, use the second gradient convolution kernel to perform feature extraction on the remote sensing image to obtain a second gradient feature; the above-mentioned common feature, the above-mentioned first gradient feature Perform feature fusion with the above-mentioned second gradient feature to generate the above-mentioned initial feature map.

根据本公开的实施例，上述利用上述编码网络的第二卷积组对上述第一特征图进行下采样，得到上述第二特征图，包括：利用上述第二卷积组中的第一动态混合梯度卷积对上述第一特征图进行下采样，得到第一卷积图；利用上述第二卷积组中的第二动态混合梯度卷积对上述第一卷积图进行编码增强，得到第二卷积图；利用上述第二卷积组中的第三动态混合梯度卷积对上述第二卷积图进行编码增强，得到上述第二特征图。According to an embodiment of the present disclosure, using the second convolution group of the encoding network to down-sample the first feature map to obtain the second feature map includes: using the first dynamic mixing in the second convolution group Gradient convolution downsamples the above first feature map to obtain the first convolution map; uses the second dynamic hybrid gradient convolution in the above second convolution group to encode and enhance the above first convolution map to obtain the second Convolution map: using the third dynamic hybrid gradient convolution in the second convolution group to encode and enhance the second convolution map to obtain the second feature map.

根据本公开的实施例，上述方法还包括：利用上述编码网络对上述目标特征图的边界信息进行编码增强，得到目标增强特征图；捕获上述目标增强特征图的长距离依赖关系，得到目标提取特征图；输出上述目标提取特征图。According to an embodiment of the present disclosure, the method further includes: using the encoding network to encode and enhance the boundary information of the target feature map to obtain the target enhanced feature map; capturing the long-distance dependency of the above target enhanced feature map to obtain the target extraction feature map; output the above target extraction feature map.

本公开实施例的另一个方面提供了一种遥感图像地物要素提取装置，包括：编码模块，用于将遥感图像输入自适应边界提取的编码网络中，利用不同的卷积组提取上述遥感图像的图像特征，分别输出第一特征图和第二特征图，其中，输出上述第一特征图使用的第一卷积组中的卷积次数小于输出上述第二特征图使用的第二卷积组中的卷积次数，上述遥感图像中包括地物要素的边界信息；池化模块，用于对上述第二特征图进行金字塔池化，得到池化特征图；增强模块，用于利用上述编码网络对上述第一特征图的边界信息进行编码增强，得到增强特征图；捕获模块，用于捕获上述增强特征图的长距离依赖关系，得到提取特征图；融合模块，用于将上述提取特征图与上述池化特征图进行特征融合，生成目标特征图。Another aspect of the embodiments of the present disclosure provides a device for extracting feature elements from remote sensing images, including: an encoding module, configured to input remote sensing images into an encoding network for adaptive boundary extraction, and use different convolution groups to extract the above remote sensing images The image features of , respectively output the first feature map and the second feature map, wherein the number of convolutions in the first convolution group used to output the first feature map is smaller than the second convolution group used to output the second feature map The number of convolutions in the above-mentioned remote sensing image includes the boundary information of the feature elements; the pooling module is used to perform pyramid pooling on the above-mentioned second feature map to obtain a pooled feature map; the enhancement module is used to use the above-mentioned encoding network Encoding and enhancing the boundary information of the above-mentioned first feature map to obtain an enhanced feature map; the capture module is used to capture the long-distance dependency of the above-mentioned enhanced feature map to obtain an extracted feature map; the fusion module is used to combine the above-mentioned extracted feature map with The above pooled feature map performs feature fusion to generate a target feature map.

本公开实施例的另一个方面提供了一种电子设备，包括：一个或多个处理器；存储器，用于存储一个或多个程序，其中，当上述一个或多个程序被上述一个或多个处理器执行时，使得上述一个或多个处理器实现如上上述的方法。Another aspect of the embodiments of the present disclosure provides an electronic device, including: one or more processors; memory for storing one or more programs, wherein, when the above one or more programs are used by the above one or more When the processor executes, the above-mentioned one or more processors are made to implement the above-mentioned method.

本公开实施例的另一个方面提供了一种计算机可读存储介质，存储有计算机可执行指令，上述指令在被执行时用于实现如上所述的方法。Another aspect of the embodiments of the present disclosure provides a computer-readable storage medium storing computer-executable instructions, and the above-mentioned instructions are used to implement the above-mentioned method when executed.

本公开实施例的另一个方面提供了一种计算机程序产品，上述计算机程序产品包括计算机可执行指令，上述指令在被执行时用于实现如上所述的方法。Another aspect of the embodiments of the present disclosure provides a computer program product, the above computer program product includes computer executable instructions, and the above instructions are used to implement the above method when executed.

根据本公开的实施例，采用将遥感图像输入自适应边界提取的编码网络中，利用不同的卷积组提取遥感图像的图像特征的技术手段，从而对遥感图像中的边界信息进行提取增强。且对第二特征图进行金字塔池化，以获取更加丰富的上下文信息。捕获增强特征图的长距离依赖关系，提取了遥感图像中边界信息的空间细节信息。所以至少部分地克服了因遥感图像因场景复杂导致对地物要素预测准确度低的技术问题，提高了遥感图像边界预测的精度。According to the embodiments of the present disclosure, the remote sensing image is input into the encoding network for adaptive boundary extraction, and different convolution groups are used to extract the image features of the remote sensing image, so as to extract and enhance the boundary information in the remote sensing image. And pyramid pooling is performed on the second feature map to obtain richer context information. The long-distance dependencies of the enhanced feature maps are captured, and the spatial details of the boundary information in remote sensing images are extracted. Therefore, it at least partially overcomes the technical problem of low prediction accuracy of ground features due to complex scenes of remote sensing images, and improves the accuracy of remote sensing image boundary prediction.

附图说明Description of drawings

通过以下参照附图对本公开实施例的描述，本公开的上述以及其他目的、特征和优点将更为清楚，在附图中：The above and other objects, features and advantages of the present disclosure will be more clearly described through the following description of the embodiments of the present disclosure with reference to the accompanying drawings, in which:

图1示意性示出了根据本公开实施例的可以应用遥感图像地物要素提取方法和装置的示例性系统架构；Fig. 1 schematically shows an exemplary system architecture in which a method and device for extracting feature elements from remote sensing images can be applied according to an embodiment of the present disclosure;

图2示意性示出了根据本公开实施例的遥感图像地物要素提取方法的流程图；FIG. 2 schematically shows a flow chart of a method for extracting feature elements from remote sensing images according to an embodiment of the present disclosure;

图3示意性示出了根据本公开实施例的遥感图像地物要素提取方法中对遥感图像下采样的示意图；Fig. 3 schematically shows a schematic diagram of downsampling a remote sensing image in a method for extracting feature elements from a remote sensing image according to an embodiment of the present disclosure;

图4示意性示出了根据本公开实施例的遥感图像地物要素提取方法中对遥感图像特征提取的流程图；Fig. 4 schematically shows a flow chart of extracting remote sensing image features in a method for extracting feature elements from remote sensing images according to an embodiment of the present disclosure;

图5示意性示出了根据本公开的实施例的遥感图像地物要素提取装置的框图；Fig. 5 schematically shows a block diagram of an apparatus for extracting feature elements from remote sensing images according to an embodiment of the present disclosure;

图6示意性示出了根据本公开实施例的适于实现用于遥感图像地物要素提取方法的电子设备的方框图。Fig. 6 schematically shows a block diagram of an electronic device adapted to implement a method for extracting feature elements from remote sensing images according to an embodiment of the present disclosure.

具体实施方式Detailed ways

以下，将参照附图来描述本公开的实施例。但是应该理解，这些描述只是示例性的，而并非要限制本公开的范围。在下面的详细描述中，为便于解释，阐述了许多具体的细节以提供对本公开实施例的全面理解。然而，明显地，一个或多个实施例在没有这些具体细节的情况下也可以被实施。此外，在以下说明中，省略了对公知结构和技术的描述，以避免不必要地混淆本公开的概念。Hereinafter, embodiments of the present disclosure will be described with reference to the drawings. It should be understood, however, that these descriptions are exemplary only, and are not intended to limit the scope of the present disclosure. In the following detailed description, for purposes of explanation, numerous specific details are set forth in order to provide a thorough understanding of the embodiments of the present disclosure. It may be evident, however, that one or more embodiments may be practiced without these specific details. Also, in the following description, descriptions of well-known structures and techniques are omitted to avoid unnecessarily obscuring the concept of the present disclosure.

在此使用的术语仅仅是为了描述具体实施例，而并非意在限制本公开。在此使用的术语“包括”、“包含”等表明了所述特征、步骤、操作和/或部件的存在，但是并不排除存在或添加一个或多个其他特征、步骤、操作或部件。The terminology used herein is for the purpose of describing particular embodiments only, and is not intended to be limiting of the present disclosure. The terms "comprising", "comprising", etc. used herein indicate the presence of stated features, steps, operations and/or components, but do not exclude the presence or addition of one or more other features, steps, operations or components.

在此使用的所有术语(包括技术和科学术语)具有本领域技术人员通常所理解的含义，除非另外定义。应注意，这里使用的术语应解释为具有与本说明书的上下文相一致的含义，而不应以理想化或过于刻板的方式来解释。All terms (including technical and scientific terms) used herein have the meaning commonly understood by one of ordinary skill in the art, unless otherwise defined. It should be noted that the terms used herein should be interpreted to have a meaning consistent with the context of this specification, and not be interpreted in an idealized or overly rigid manner.

在使用类似于“A、B和C等中至少一个”这样的表述的情况下，一般来说应该按照本领域技术人员通常理解该表述的含义来予以解释(例如，“具有A、B和C中至少一个的系统”应包括但不限于单独具有A、单独具有B、单独具有C、具有A和B、具有A和C、具有B和C、和/或具有A、B、C的系统等)。在使用类似于“A、B或C等中至少一个”这样的表述的情况下，一般来说应该按照本领域技术人员通常理解该表述的含义来予以解释(例如，“具有A、B或C中至少一个的系统”应包括但不限于单独具有A、单独具有B、单独具有C、具有A和B、具有A和C、具有B和C、和/或具有A、B、C的系统等)。Where expressions such as "at least one of A, B, and C, etc." are used, they should generally be interpreted as those skilled in the art would normally understand the expression (for example, "having A, B, and C A system of at least one of "shall include, but not be limited to, systems with A alone, B alone, C alone, A and B, A and C, B and C, and/or A, B, C, etc. ). Where expressions such as "at least one of A, B, or C, etc." are used, they should generally be interpreted as those skilled in the art would normally understand the expression (for example, "having A, B, or C A system of at least one of "shall include, but not be limited to, systems with A alone, B alone, C alone, A and B, A and C, B and C, and/or A, B, C, etc. ).

在本公开的技术方案中，所涉及的数据(如包括但不限于用户个人信息)的收集、存储、使用、加工、传输、提供、公开和应用等处理，均符合相关法律法规的规定，采取了必要保密措施，且不违背公序良俗。In the technical solution of this disclosure, the collection, storage, use, processing, transmission, provision, disclosure, and application of the data involved (including but not limited to user personal information) are all in compliance with relevant laws and regulations. Necessary confidentiality measures have been taken, and it does not violate public order and good customs.

随着遥感技术的飞速发展，光学遥感图像的获取变得容易，从遥感图像中可获取的地物信息越来越丰富，对于遥感图像精细化解译应用的需求也日益急迫。地物要素提取的目的是为遥感图像的每个像素赋予一个地物要素类别标签，它不仅可以识别出图像中有什么地物目标，还会将目标的边界精确地勾勒出来。因此，光学遥感图像地物要素提取在众多领域中有着广泛的应用。但常用的地物要素提取的方法使用手工特征，这种特征提取的方法需要丰富的先验知识和经验，而且特征的表征能力十分有限，在遥感图像这种复杂的场景中很难达到令人满意的效果。With the rapid development of remote sensing technology, the acquisition of optical remote sensing images has become easier, and the ground object information that can be obtained from remote sensing images is becoming more and more abundant. The demand for fine interpretation of remote sensing images is also becoming more and more urgent. The purpose of surface feature extraction is to assign a feature category label to each pixel of the remote sensing image, which can not only identify the target in the image, but also accurately outline the boundary of the target. Therefore, the extraction of ground object elements from optical remote sensing images has a wide range of applications in many fields. However, the commonly used method of feature extraction uses manual features. This method of feature extraction requires rich prior knowledge and experience, and the representation ability of features is very limited. It is difficult to achieve impressive results in complex scenes such as remote sensing images. satisfactory effect.

深度学习的出现给图像分割领域带来了一系列革命性的进展。深度学习方法通过级联非线性的映射将低级别的特征转换为高级别和抽象的特征，而这种高级别的特征在以前的手工特征提取方法中是不容易获得的。深度学习强大的特征学习能力提升了光学遥感图像地物要素提取的性能。The emergence of deep learning has brought a series of revolutionary progress to the field of image segmentation. Deep learning methods convert low-level features into high-level and abstract features through cascading non-linear mapping, and such high-level features are not easily obtained in previous manual feature extraction methods. The powerful feature learning ability of deep learning improves the performance of feature extraction from optical remote sensing images.

然而，现有深度学习的方法获得的预测结果仍然存在边界预测性能较差的缺点。一方面是因为遥感图像本身具有尺寸大，场景复杂，小目标众多，物体边界处有阴影、遮挡等，这导致遥感图像的边界提取难度相对于自然场景图片大大增加。另一方面，现有的普通卷积核在初始化的过程中并没有显式的梯度编码限制，使其在训练过程中很难聚焦于图像梯度信息的提取，从而影响了边界预测的精度。However, the prediction results obtained by existing deep learning methods still have the disadvantage of poor boundary prediction performance. On the one hand, the remote sensing image itself has large size, complex scene, many small targets, shadows and occlusions at the object boundary, which makes the boundary extraction of remote sensing image more difficult than that of natural scene pictures. On the other hand, the existing common convolution kernel does not have explicit gradient encoding restrictions during initialization, making it difficult to focus on the extraction of image gradient information during training, thus affecting the accuracy of boundary prediction.

相关技术中，为了解决边界模糊的问题，已经提出了多种策略，大致可以分为以下三类：跳跃连接、多尺度上下文和附加边界分支。在第一种方法中，富含边界信息的低级特征通过跳跃连接嵌入到高级特征中。第二种方法基于多尺度上下文方法在维持高分辨率的前提下扩大感受野生成具有强上下文关系的语义特征图，例如金字塔池模块(PPM)和多孔空间金字塔池。第三种方法涉及额外的边界分支，设计了多个加权边缘监督强调有用的边界信息。In related technologies, in order to solve the problem of boundary ambiguity, a variety of strategies have been proposed, which can be roughly divided into the following three categories: skip connection, multi-scale context and additional boundary branch. In the first method, low-level features rich in boundary information are embedded into high-level features via skip connections. The second method is based on the multi-scale context method to expand the receptive field to generate semantic feature maps with strong contextual relations under the premise of maintaining high resolution, such as Pyramid Pooling Module (PPM) and Porous Spatial Pyramid Pooling. The third method involves an additional boundary branch, and multiple weighted edge supervisions are designed to emphasize useful boundary information.

但是上述方法依然存在不足，边界信息很难被显式的编码。如何自适应的建模语义信息和边界信息的差异，且在保证语义信息提取的同时自适应的增强边界信息的提取也是一个挑战。However, the above methods still have shortcomings, and the boundary information is difficult to be explicitly encoded. How to adaptively model the difference between semantic information and boundary information, and how to adaptively enhance the extraction of boundary information while ensuring the extraction of semantic information is also a challenge.

有鉴于此，本公开的实施例提供了一种遥感图像地物要素提取方法、一种遥感图像地物要素提取装置、一种电子设备、一种可读存储介质和一种计算机程序产品。其中，该一种遥感图像地物要素提取方法，包括：将遥感图像输入自适应边界提取的编码网络中，利用不同的卷积组提取遥感图像的图像特征，分别输出第一特征图和第二特征图，其中，输出第一特征图使用的第一卷积组中的卷积次数小于输出第二特征图使用的第二卷积组中的卷积次数，遥感图像中包括地物要素的边界信息；对第二特征图进行金字塔池化，得到池化特征图；利用编码网络对第一特征图的边界信息进行编码增强，得到增强特征图；捕获增强特征图的长距离依赖关系，得到提取特征图；将提取特征图与池化特征图进行特征融合，生成目标特征图。In view of this, the embodiments of the present disclosure provide a method for extracting feature elements from remote sensing images, an apparatus for extracting feature elements from remote sensing images, an electronic device, a readable storage medium, and a computer program product. Wherein, the method for extracting feature elements from remote sensing images includes: inputting remote sensing images into an encoding network for adaptive boundary extraction, using different convolution groups to extract image features of remote sensing images, and outputting the first feature map and the second feature map respectively. Feature map, wherein the number of convolutions in the first convolution group used to output the first feature map is less than the number of convolutions in the second convolution group used to output the second feature map, and the remote sensing image includes the boundary of the feature element information; perform pyramid pooling on the second feature map to obtain the pooled feature map; use the coding network to encode and enhance the boundary information of the first feature map to obtain the enhanced feature map; capture the long-distance dependency of the enhanced feature map to obtain the extracted Feature map: feature fusion of the extracted feature map and the pooled feature map to generate the target feature map.

图1示意性示出了根据本公开实施例的可以应用遥感图像地物要素提取方法和装置的示例性系统架构。需要注意的是，图1所示仅为可以应用本公开实施例的系统架构的示例，以帮助本领域技术人员理解本公开的技术内容，但并不意味着本公开实施例不可以用于其他设备、系统、环境或场景。Fig. 1 schematically shows an exemplary system architecture in which the method and device for extracting feature elements from remote sensing images can be applied according to an embodiment of the present disclosure. It should be noted that, what is shown in FIG. 1 is only an example of the system architecture to which the embodiments of the present disclosure can be applied, so as to help those skilled in the art understand the technical content of the present disclosure, but it does not mean that the embodiments of the present disclosure cannot be used in other device, system, environment or scenario.

如图1所示，根据该实施例的系统架构100可以包括终端设备101、102、103，网络104和服务器105。网络104用以在终端设备101、102、103和服务器105之间提供通信链路的介质。网络104可以包括各种连接类型，例如有线和/或无线通信链路等。As shown in FIG. 1 , a system architecture 100 according to this embodiment may include terminal devices 101 , 102 , 103 , a network 104 and a server 105 . The network 104 is used as a medium for providing communication links between the terminal devices 101 , 102 , 103 and the server 105 . Network 104 may include various connection types, such as wired and/or wireless communication links, and the like.

用户可以使用终端设备101、102、103通过网络104与服务器105交互，以接收或发送消息等。终端设备101、102、103上可以安装有各种通讯客户端应用，例如购物类应用、网页浏览器应用、搜索类应用、即时通信工具、邮箱客户端和/或社交平台软件等(仅为示例)。Users can use terminal devices 101 , 102 , 103 to interact with server 105 via network 104 to receive or send messages and the like. Various communication client applications can be installed on the terminal devices 101, 102, 103, such as shopping applications, web browser applications, search applications, instant messaging tools, email clients and/or social platform software, etc. (only for example ).

终端设备101、102、103可以是具有显示屏并且支持网页浏览的各种电子设备，包括但不限于智能手机、平板电脑、膝上型便携计算机和台式计算机等。The terminal devices 101, 102, 103 may be various electronic devices with display screens and supporting web browsing, including but not limited to smart phones, tablet computers, laptop computers and desktop computers.

服务器105可以是提供各种服务的服务器，例如对用户利用终端设备101、102、103所浏览的网站提供支持的后台管理服务器(仅为示例)。后台管理服务器可以对接收到的用户请求等数据进行分析等处理，并将处理结果(例如根据用户请求获取或生成的网页、信息、或数据等)反馈给终端设备。The server 105 may be a server that provides various services, such as a background management server that provides support for websites browsed by users using the terminal devices 101 , 102 , 103 (just an example). The background management server can analyze and process received data such as user requests, and feed back processing results (such as webpages, information, or data obtained or generated according to user requests) to the terminal device.

需要说明的是，本公开实施例所提供的遥感图像地物要素提取方法一般可以由服务器105执行。相应地，本公开实施例所提供的遥感图像地物要素提取装置一般可以设置于服务器105中。本公开实施例所提供的遥感图像地物要素提取方法也可以由不同于服务器105且能够与终端设备101、102、103和/或服务器105通信的服务器或服务器集群执行。相应地，本公开实施例所提供的遥感图像地物要素提取装置也可以设置于不同于服务器105且能够与终端设备101、102、103和/或服务器105通信的服务器或服务器集群中。或者，本公开实施例所提供的遥感图像地物要素提取方法也可以由终端设备101、102、或103执行，或者也可以由不同于终端设备101、102、或103的其他终端设备执行。相应地，本公开实施例所提供的遥感图像地物要素提取装置也可以设置于终端设备101、102、或103中，或设置于不同于终端设备101、102、或103的其他终端设备中。It should be noted that the method for extracting feature elements from remote sensing images provided by the embodiments of the present disclosure can generally be executed by the server 105 . Correspondingly, the apparatus for extracting feature elements from remote sensing images provided in the embodiments of the present disclosure may generally be set in the server 105 . The method for extracting feature elements from remote sensing images provided in the embodiments of the present disclosure may also be executed by a server or server cluster that is different from the server 105 and can communicate with the terminal devices 101 , 102 , 103 and/or the server 105 . Correspondingly, the apparatus for extracting feature elements from remote sensing images provided in the embodiments of the present disclosure may also be set in a server or server cluster that is different from the server 105 and can communicate with the terminal devices 101 , 102 , 103 and/or the server 105 . Alternatively, the method for extracting feature elements from remote sensing images provided by the embodiments of the present disclosure may also be executed by the terminal device 101 , 102 , or 103 , or may also be executed by other terminal devices different from the terminal device 101 , 102 , or 103 . Correspondingly, the apparatus for extracting feature elements from remote sensing images provided by the embodiments of the present disclosure may also be set in the terminal device 101 , 102 , or 103 , or in other terminal devices different from the terminal device 101 , 102 , or 103 .

应该理解，图1中的终端设备、网络和服务器的数目仅仅是示意性的。根据实现需要，可以具有任意数目的终端设备、网络和服务器。It should be understood that the numbers of terminal devices, networks and servers in Fig. 1 are only illustrative. According to the implementation needs, there can be any number of terminal devices, networks and servers.

图2示意性示出了根据本公开实施例的遥感图像地物要素提取方法的流程图Fig. 2 schematically shows a flow chart of a method for extracting feature elements from remote sensing images according to an embodiment of the present disclosure

如图2所示，该方法包括操作S201～S205。As shown in FIG. 2, the method includes operations S201-S205.

在操作S201，将遥感图像输入自适应边界提取的编码网络中，利用不同的卷积组提取遥感图像的图像特征，分别输出第一特征图和第二特征图，其中，输出第一特征图使用的第一卷积组中的卷积次数小于输出第二特征图使用的第二卷积组中的卷积次数，遥感图像中包括地物要素的边界信息。In operation S201, the remote sensing image is input into the encoding network for adaptive boundary extraction, the image features of the remote sensing image are extracted using different convolution groups, and the first feature map and the second feature map are respectively output, wherein the first feature map is output using The number of convolutions in the first convolution group is smaller than the number of convolutions in the second convolution group used to output the second feature map, and the remote sensing image includes boundary information of feature elements.

在操作S202，对第二特征图进行金字塔池化，得到池化特征图。In operation S202, pyramid pooling is performed on the second feature map to obtain a pooled feature map.

在操作S203，利用编码网络对第一特征图的边界信息进行编码增强，得到增强特征图。In operation S203, the encoding network is used to encode and enhance the boundary information of the first feature map to obtain an enhanced feature map.

在操作S204，捕获增强特征图的长距离依赖关系，得到提取特征图。In operation S204, the long-distance dependency of the enhanced feature map is captured to obtain the extracted feature map.

在操作S205，将提取特征图与池化特征图进行特征融合，生成目标特征图。In operation S205, feature fusion is performed on the extracted feature map and the pooled feature map to generate a target feature map.

根据本公开的实施例，获取由多帧光学遥感图像构成的训练集，其中，每帧遥感图像中均包括地物要素的边界信息，且遥感图像中不同的语义类别对应有相应的密集标记。语义类别包括海水、农地、绿林地、房屋以及养殖场等类别。从该训练集中选取一帧遥感图像输入自适应边界提取的编码网络中，编码网络中可以包括第一卷积组合第二卷积组，利用不同的卷积组提取遥感图像的图像特征，分别输出第一特征图和第二特征图。According to an embodiment of the present disclosure, a training set composed of multiple frames of optical remote sensing images is obtained, wherein each frame of remote sensing images includes boundary information of feature elements, and different semantic categories in the remote sensing images correspond to corresponding dense labels. Semantic categories include seawater, agricultural land, green forest land, houses, and farms. Select a frame of remote sensing image from the training set and input it into the encoding network of adaptive boundary extraction. The encoding network can include the first convolution group and the second convolution group, and use different convolution groups to extract the image features of the remote sensing image, and output them respectively The first feature map and the second feature map.

根据本公开的实施例，编码网络中可以利用不同的卷积组对遥感图像卷积多次，以提取增强遥感图像中的边界信息，边界信息明确对应于遥感图像中剧烈的梯度变化。其中，第一卷积组的卷积次数和第二卷积组中的卷积次数可以依据训练需求进行调整。例如：编码网络中可以利用第一卷积组和第二卷积组共对遥感图像共卷积五次。其中，第一特征图使用的第一卷积组中卷积的次数可以为两次，第二特征图使用的第二卷积组中卷积的次数可以为三次。According to an embodiment of the present disclosure, different convolution groups can be used in the encoding network to convolve the remote sensing image multiple times to extract boundary information in the enhanced remote sensing image, and the boundary information clearly corresponds to the drastic gradient change in the remote sensing image. Wherein, the number of convolutions in the first convolution group and the number of convolutions in the second convolution group can be adjusted according to training requirements. For example: in the encoding network, the first convolution group and the second convolution group can be used to convolute the remote sensing image five times. Wherein, the number of convolutions in the first convolution group used by the first feature map may be two times, and the number of convolutions in the second convolution group used by the second feature map may be three times.

根据本公开的实施例，对第二特征图进行金字塔池化，池化后将其引入到解码器中进行上采样，以使得池化特征图具有丰富的上下文信息，增强其特征表达。将第一特征图输入自适应边界提取的编码网络中，对第一特征图中的边界信息进行编码增强，得到增强特征图。将增强特征图输入CSA模块(坐标注意力模块)中以捕获其长距离依赖关系，得到提取特征图。将提取特征图与池化特征图进行特征融合，进而将提取特征图中的特征引入池化特征图的特征中。也即将编码网络下采样的特征图引入到解码器上采样的特征图中，从而更准确地预测类别像素的边界。其中，融合过程中可以根据提取特征图和池化特征图的图像特征对两幅图像进行像素配准，根据池化特征图提供的各像素强度值对提取特征图各像素值进行筛选和修正，以及利用池化特征图提供的整体的图像特征对提取特征图进行扩充等，从而生成目标特征图。According to an embodiment of the present disclosure, pyramid pooling is performed on the second feature map, and after pooling, it is introduced into a decoder for upsampling, so that the pooled feature map has rich context information and enhances its feature expression. The first feature map is input into the encoding network of adaptive boundary extraction, and the boundary information in the first feature map is encoded and enhanced to obtain an enhanced feature map. The enhanced feature maps are fed into the CSA module (coordinate attention module) to capture their long-distance dependencies, resulting in extracted feature maps. The feature fusion of the extracted feature map and the pooled feature map is performed, and then the features in the extracted feature map are introduced into the features of the pooled feature map. It also introduces the feature map downsampled by the encoding network into the feature map upsampled by the decoder, so as to more accurately predict the boundary of category pixels. Among them, during the fusion process, the pixel registration of the two images can be performed according to the image features of the extracted feature map and the pooled feature map, and the pixel values of the extracted feature map can be screened and corrected according to the pixel intensity values provided by the pooled feature map. And use the overall image features provided by the pooling feature map to expand the extracted feature map, so as to generate the target feature map.

根据本公开的实施例，将遥感图像输入自适应边界提取的编码网络中，利用不同的卷积组提取遥感图像的图像特征，分别输出第一特征图和第二特征图，可以包括如下操作。According to an embodiment of the present disclosure, inputting a remote sensing image into an adaptive boundary extraction coding network, using different convolution groups to extract image features of the remote sensing image, and outputting a first feature map and a second feature map respectively may include the following operations.

利用编码网络的第一卷积组对遥感图像进行下采样，得到第一特征图；利用编码网络的第二卷积组对第一特征图进行下采样，得到第二特征图，第二卷积组中的卷积次数大于第一卷积组中的卷积次数。Use the first convolution group of the encoding network to down-sample the remote sensing image to obtain the first feature map; use the second convolution group of the encoding network to down-sample the first feature map to obtain the second feature map, and the second convolution The number of convolutions in the group is greater than the number of convolutions in the first convolution group.

根据本公开的实施例，假设遥感图像的分辨率为16×16，利用编码网络的第一卷积组对遥感图像进行下采样，生成分辨率为4×4的第一特征图。其中，第一卷积组中的卷积次数为两次。利用编码网络的第二卷积组对第一特征图进行下采样，生成分辨率为2×2的第二特征图。其中，第一卷积组中的卷积次数为三次。利用编码网络对遥感图像进行下采样，并采用不同的卷积组提取遥感图像的图像特征，有效提取增强了遥感图像中的边界信息。According to an embodiment of the present disclosure, assuming that the resolution of the remote sensing image is 16×16, the remote sensing image is down-sampled using the first convolution group of the encoding network to generate a first feature map with a resolution of 4×4. Wherein, the number of convolutions in the first convolution group is twice. The first feature map is down-sampled using the second convolutional group of the encoding network to generate a second feature map with a resolution of 2×2. Wherein, the number of convolutions in the first convolution group is three times. The coding network is used to down-sample the remote sensing image, and different convolution groups are used to extract the image features of the remote sensing image, which effectively extracts and enhances the boundary information in the remote sensing image.

图3示意性示出了根据本公开实施例的遥感图像地物要素提取方法中对遥感图像下采样的示意图。Fig. 3 schematically shows a schematic diagram of down-sampling a remote sensing image in a method for extracting feature elements from a remote sensing image according to an embodiment of the present disclosure.

根据本公开的实施例，利用编码网络的第一卷积组对遥感图像进行下采样，得到第一特征图，可以包括如下操作。According to an embodiment of the present disclosure, using the first convolution group of the encoding network to down-sample the remote sensing image to obtain the first feature map may include the following operations.

利用第一卷积组中的第一动态混合梯度卷积对遥感图像进行下采样，得到初始特征图；利用第一卷积组中的第二动态混合梯度卷积对初始特征图进行下采样，得到第一特征图。The remote sensing image is down-sampled by using the first dynamic hybrid gradient convolution in the first convolution group to obtain an initial feature map; the initial feature map is down-sampled by using the second dynamic hybrid gradient convolution in the first convolution group, Get the first feature map.

根据本公开的实施例，自适应边界提取的编码网络的网络主体选用ResNet网络(残差网络)。将ResNet中的普通卷积全部替换为动态混合梯度卷积。如图3所示，第一卷积组中包括第一动态混合梯度卷积和第二动态混合梯度卷积。利用第一卷积组中的第一动态混合梯度卷积对分辨率为16×16的遥感图像进行第一次下采样，生成分辨率为8×8的初始特征图。利用第一卷积组中的第二动态混合梯度卷积对分辨率为8×8的初始特征图进行第二次下采样，生成分辨率为4×4的第一特征图。通过将普通卷积全部替换为动态混合梯度卷积，使得编码网络对遥感图像的特征提取更加精准。According to the embodiment of the present disclosure, the main body of the encoding network for adaptive boundary extraction is a ResNet network (residual network). Replace all ordinary convolutions in ResNet with dynamic hybrid gradient convolutions. As shown in FIG. 3 , the first convolution group includes a first dynamic mixed gradient convolution and a second dynamic mixed gradient convolution. The remote sensing image with a resolution of 16×16 is down-sampled for the first time by using the first dynamic hybrid gradient convolution in the first convolution group to generate an initial feature map with a resolution of 8×8. The initial feature map with a resolution of 8×8 is downsampled a second time using the second dynamic hybrid gradient convolution in the first convolution group to generate the first feature map with a resolution of 4×4. By replacing all ordinary convolutions with dynamic hybrid gradient convolutions, the feature extraction of remote sensing images by the encoding network is more accurate.

图4示意性示出了根据本公开实施例的遥感图像地物要素提取方法中对遥感图像特征提取的流程图。Fig. 4 schematically shows a flow chart of extracting features of remote sensing images in the method for extracting feature elements from remote sensing images according to an embodiment of the present disclosure.

根据本公开的实施例，利用第一卷积组中的第一动态混合梯度卷积对遥感图像进行下采样，得到初始特征图，可以包括如下操作。According to an embodiment of the present disclosure, using the first dynamic hybrid gradient convolution in the first convolution group to down-sample the remote sensing image to obtain an initial feature map may include the following operations.

基于遥感图像确定第一动态混合梯度卷积的多个卷积权重系数，第一动态混合梯度卷积包括普通卷积核、第一梯度卷积核和第二梯度卷积核；基于普通卷积核对应的卷积权重系数，利用普通卷积核对遥感图像进行特征提取，得到普通特征；基于第一梯度卷积核对应的卷积权重系数，利用第一梯度卷积核对遥感图像进行特征提取，得到第一梯度特征；基于第二梯度卷积核对应的卷积权重系数，利用第二梯度卷积核对遥感图像进行特征提取，得到第二梯度特征；将普通特征、第一梯度特征和第二梯度特征进行特征融合，生成初始特征图。Determine a plurality of convolution weight coefficients of the first dynamic mixed gradient convolution based on the remote sensing image, the first dynamic mixed gradient convolution includes a common convolution kernel, a first gradient convolution kernel and a second gradient convolution kernel; based on the common convolution The convolution weight coefficient corresponding to the kernel, using the ordinary convolution kernel to extract the feature of the remote sensing image, and obtaining the common feature; based on the convolution weight coefficient corresponding to the first gradient convolution kernel, using the first gradient convolution kernel to extract the feature of the remote sensing image, Obtain the first gradient feature; based on the convolution weight coefficient corresponding to the second gradient convolution kernel, use the second gradient convolution kernel to perform feature extraction on the remote sensing image to obtain the second gradient feature; combine the common feature, the first gradient feature and the second The gradient feature is used for feature fusion to generate an initial feature map.

根据本公开的实施例，如图4所示，动态混合梯度卷积由系数预测模块410、混合梯度卷积模块420以及核聚合模块430三部分组成。动态混合梯度卷积由一个普通卷积核和两个梯度卷积核构成。其中，一个梯度卷积核是由另一个梯度卷积核旋转90度得到的。普通卷积核在卷积过程中对参数没有约束，但梯度卷积核在卷积过程中对参数存在约束。梯度卷积核在卷积过程中，作用在目标内部时响应为0，作用在目标边界时响应不为0。针对遥感图像中的语义信息和边界信息的自适应联合建模，受传统混合空间增强方法的启发，将普通卷积核和梯度卷积核组合成动态混合梯度卷积，有利于大量遥感图像边界信息的提取。According to an embodiment of the present disclosure, as shown in FIG. 4 , the dynamic hybrid gradient convolution consists of three parts: a coefficient prediction module 410 , a hybrid gradient convolution module 420 and a kernel aggregation module 430 . The dynamic hybrid gradient convolution consists of a common convolution kernel and two gradient convolution kernels. Among them, one gradient convolution kernel is obtained by rotating another gradient convolution kernel by 90 degrees. The ordinary convolution kernel has no constraints on the parameters during the convolution process, but the gradient convolution kernel has constraints on the parameters during the convolution process. During the convolution process, the gradient convolution kernel responds to 0 when it acts on the inside of the target, and the response is not 0 when it acts on the boundary of the target. Aiming at the adaptive joint modeling of semantic information and boundary information in remote sensing images, inspired by the traditional hybrid space enhancement method, the ordinary convolution kernel and gradient convolution kernel are combined into a dynamic hybrid gradient convolution, which is beneficial to a large number of remote sensing image boundaries extraction of information.

根据本公开的实施例，由于输入不同的遥感图像的边界信息不同，混合梯度卷积应该能够根据遥感图像内容动态调整普通卷积核和梯度的权重比。系数预测模块410根据遥感图像内容自适应的为混合梯度卷积的三个卷积核生成卷积权重系数，三个卷积权重系数的总和为1。将普通卷积核聚合成一个新的高效算子，使得动态混合梯度卷积能够自适应地增强语义分割信息流中的边界信息，有利于遥感图像边界的提取。According to the embodiments of the present disclosure, since different input remote sensing images have different boundary information, the hybrid gradient convolution should be able to dynamically adjust the weight ratio of common convolution kernels and gradients according to remote sensing image content. The coefficient prediction module 410 adaptively generates convolution weight coefficients for the three convolution kernels of the hybrid gradient convolution according to the content of the remote sensing image, and the sum of the three convolution weight coefficients is 1. Aggregating ordinary convolution kernels into a new high-efficiency operator enables dynamic hybrid gradient convolution to adaptively enhance boundary information in semantic segmentation information flow, which is beneficial to the extraction of remote sensing image boundaries.

根据本公开的实施例，在混合梯度卷积模块420中，利用普通卷积核对遥感图像进行特征提取，得到普通特征。利用第一梯度卷积核对遥感图像进行特征提取，得到第一梯度特征。利用第二梯度卷积核对遥感图像进行特征提取，得到第二梯度特征。核聚合模块430将普通特征、第一梯度特征和第二梯度特征进行特征融合，生成初始特征图。核聚合模块430采用卷积核聚合的方式来等效的替代特征图聚合的方式，从而加速网络推理过程。According to an embodiment of the present disclosure, in the hybrid gradient convolution module 420, a common convolution kernel is used to perform feature extraction on remote sensing images to obtain common features. The features of the remote sensing image are extracted by using the first gradient convolution kernel to obtain the first gradient features. Using the second gradient convolution kernel to perform feature extraction on the remote sensing image to obtain the second gradient feature. The kernel aggregation module 430 performs feature fusion of common features, first gradient features and second gradient features to generate an initial feature map. The kernel aggregation module 430 adopts the method of convolution kernel aggregation to equivalently replace the method of feature map aggregation, so as to accelerate the network reasoning process.

根据本公开的实施例，利用编码网络的第二卷积组对第一特征图进行下采样，得到第二特征图，可以包括如下操作。According to an embodiment of the present disclosure, using the second convolution group of the encoding network to down-sample the first feature map to obtain the second feature map may include the following operations.

利用第二卷积组中的第一动态混合梯度卷积对第一特征图进行下采样，得到第一卷积图；利用第二卷积组中的第二动态混合梯度卷积对第一卷积图进行编码增强，得到第二卷积图；利用第二卷积组中的第三动态混合梯度卷积对第二卷积图进行编码增强，得到第二特征图。Use the first dynamic hybrid gradient convolution in the second convolution group to downsample the first feature map to obtain the first convolution map; use the second dynamic hybrid gradient convolution in the second convolution group to the first volume Coding enhancement is performed on the product map to obtain a second convolution map; the second convolution map is coded and enhanced by using the third dynamic hybrid gradient convolution in the second convolution group to obtain a second feature map.

根据本公开的实施例，利用第二卷积组中的第一动态混合梯度卷积对分辨率为4×4的第一特征图进行下采样，得到分辨率为2×2的第一卷积图。利用第二卷积组中的第二动态混合梯度卷积对第一卷积图进行编码增强，得到边界梯度信息增强的分辨率为2×2第二卷积图。利用第二卷积组中的第三动态混合梯度卷积对第二卷积图进行编码增强，得到边界梯度信息增强的分辨率为2×2第二特征图。通过采用动态混合梯度卷积的对遥感图像边界信息的提取并增强，以使得对地物像素边界预测的更加准确。According to an embodiment of the present disclosure, the first feature map with a resolution of 4×4 is down-sampled using the first dynamic hybrid gradient convolution in the second convolution group to obtain the first convolution with a resolution of 2×2 picture. The second dynamic hybrid gradient convolution in the second convolution group is used to encode and enhance the first convolutional image to obtain a second convolutional image with a boundary gradient information enhanced resolution of 2×2. The second convolution map is coded and enhanced by using the third dynamic hybrid gradient convolution in the second convolution group to obtain a second feature map with a boundary gradient information enhanced resolution of 2×2. By using the dynamic mixed gradient convolution to extract and enhance the boundary information of the remote sensing image, the prediction of the pixel boundary of the ground object is more accurate.

根据本公开的实施例，该方法还可以包括如下操作。According to an embodiment of the present disclosure, the method may further include the following operations.

利用编码网络对目标特征图的边界信息进行编码增强，得到目标增强特征图；捕获目标增强特征图的长距离依赖关系，得到目标提取特征图；输出目标提取特征图。Use the encoding network to encode and enhance the boundary information of the target feature map to obtain the target enhanced feature map; capture the long-distance dependency of the target enhanced feature map to obtain the target extraction feature map; output the target extraction feature map.

根据本公开的实施例，利用编码网络对融合后的目标特征图进一步进行编码增强，得到目标增强特征图。将目标增强特征图输入CSA模块中以捕获其长距离依赖关系，得到目标提取特征图。其中，目标特征图为对地物像素边界的预测结果，将目标特征图输出，并用于对模型的训练，以使得预测模型对地物像素边界预测结果更加准确。According to an embodiment of the present disclosure, a coding network is used to further encode and enhance the fused target feature map to obtain a target enhanced feature map. The object-enhanced feature map is fed into the CSA module to capture its long-distance dependencies, and the object-extracted feature map is obtained. Among them, the target feature map is the prediction result of the pixel boundary of the feature, and the target feature map is output and used for training the model, so that the prediction model can predict the pixel boundary of the feature more accurately.

图5示意性示出了根据本公开的实施例的遥感图像地物要素提取装置的框图。Fig. 5 schematically shows a block diagram of an apparatus for extracting feature elements from remote sensing images according to an embodiment of the present disclosure.

如图5所示，遥感图像地物要素提取装置500包括编码模块510、池化模块520、增强模块530、捕获模块540以及融合模块550。As shown in FIG. 5 , the device 500 for extracting feature elements from remote sensing images includes an encoding module 510 , a pooling module 520 , an enhancement module 530 , a capture module 540 and a fusion module 550 .

编码模块510，用于将遥感图像输入自适应边界提取的编码网络中，利用不同的卷积组提取遥感图像的图像特征，分别输出第一特征图和第二特征图，其中，输出第一特征图使用的第一卷积组中的卷积次数小于输出第二特征图使用的第二卷积组中的卷积次数，遥感图像中包括地物要素的边界信息；The encoding module 510 is used to input the remote sensing image into the encoding network of adaptive boundary extraction, use different convolution groups to extract the image features of the remote sensing image, and output the first feature map and the second feature map respectively, wherein the output of the first feature The number of convolutions in the first convolution group used by the map is smaller than the number of convolutions in the second convolution group used to output the second feature map, and the remote sensing image includes boundary information of feature elements;

池化模块520，用于对第二特征图进行金字塔池化，得到池化特征图；A pooling module 520, configured to perform pyramid pooling on the second feature map to obtain a pooling feature map;

增强模块530，用于利用编码网络对第一特征图的边界信息进行编码增强，得到增强特征图；The enhancement module 530 is used to encode and enhance the boundary information of the first feature map by using the coding network to obtain the enhanced feature map;

捕获模块540，用于捕获增强特征图的长距离依赖关系，得到提取特征图；Capture module 540, used to capture the long-distance dependency of the enhanced feature map to obtain the extracted feature map;

融合模块550，用于将提取特征图与池化特征图进行特征融合，生成目标特征图。The fusion module 550 is configured to perform feature fusion of the extracted feature map and the pooled feature map to generate a target feature map.

根据本公开的实施例，编码模块510包括第一下采样子模块和第二下采样子模块。According to an embodiment of the present disclosure, the encoding module 510 includes a first down-sampling sub-module and a second down-sampling sub-module.

第一下采样子模块，用于利用编码网络的第一卷积组对遥感图像进行下采样，得到第一特征图。The first down-sampling sub-module is configured to use the first convolution group of the encoding network to down-sample the remote sensing image to obtain a first feature map.

第二下采样子模块，用于利用编码网络的第二卷积组对第一特征图进行下采样，得到第二特征图，第二卷积组中的卷积次数大于第一卷积组中的卷积次数。The second downsampling sub-module is used to downsample the first feature map using the second convolution group of the encoding network to obtain a second feature map, and the number of convolutions in the second convolution group is greater than that in the first convolution group the number of convolutions.

根据本公开的实施例，第一下采样子模块包括第一下采样单元和第二下采样单元。According to an embodiment of the present disclosure, the first downsampling sub-module includes a first downsampling unit and a second downsampling unit.

第一下采样单元，用于利用第一卷积组中的第一动态混合梯度卷积对遥感图像进行下采样，得到初始特征图。The first down-sampling unit is configured to use the first dynamic mixed gradient convolution in the first convolution group to down-sample the remote sensing image to obtain an initial feature map.

第二下采样单元，用于利用第一卷积组中的第二动态混合梯度卷积对初始特征图进行下采样，得到第一特征图。The second down-sampling unit is configured to use the second dynamic hybrid gradient convolution in the first convolution group to down-sample the initial feature map to obtain the first feature map.

根据本公开的实施例，第一下采样单元包括权重确定子单元、第一特征提取子单元、第二特征提取子单元、第三特征提取子单元以及特征融合子单元。According to an embodiment of the present disclosure, the first downsampling unit includes a weight determination subunit, a first feature extraction subunit, a second feature extraction subunit, a third feature extraction subunit, and a feature fusion subunit.

权重确定子单元，用于基于遥感图像确定第一动态混合梯度卷积的多个卷积权重系数，第一动态混合梯度卷积包括普通卷积核、第一梯度卷积核和第二梯度卷积核。The weight determination subunit is used to determine a plurality of convolution weight coefficients of the first dynamic mixed gradient convolution based on the remote sensing image, and the first dynamic mixed gradient convolution includes a common convolution kernel, a first gradient convolution kernel and a second gradient convolution Accumulation.

第一特征提取子单元，用于基于普通卷积核对应的卷积权重系数，利用普通卷积核对遥感图像进行特征提取，得到普通特征。The first feature extraction subunit is configured to extract features of the remote sensing image by using the common convolution kernel based on the convolution weight coefficients corresponding to the common convolution kernel to obtain common features.

第二特征提取子单元，用于基于第一梯度卷积核对应的卷积权重系数，利用第一梯度卷积核对遥感图像进行特征提取，得到第一梯度特征。The second feature extraction subunit is configured to use the first gradient convolution kernel to perform feature extraction on the remote sensing image based on the convolution weight coefficient corresponding to the first gradient convolution kernel to obtain the first gradient feature.

第三特征提取子单元，用于基于第二梯度卷积核对应的卷积权重系数，利用第二梯度卷积核对遥感图像进行特征提取，得到第二梯度特征。The third feature extraction subunit is configured to use the second gradient convolution kernel to perform feature extraction on the remote sensing image based on the convolution weight coefficient corresponding to the second gradient convolution kernel to obtain a second gradient feature.

特征融合子单元，用于将普通特征、第一梯度特征和第二梯度特征进行特征融合，生成初始特征图。The feature fusion subunit is used to perform feature fusion of ordinary features, first gradient features and second gradient features to generate an initial feature map.

根据本公开的实施例，第二下采样子模块包括第三下采样单元、第一增强单元和第二增强单元。According to an embodiment of the present disclosure, the second downsampling sub-module includes a third downsampling unit, a first enhancement unit and a second enhancement unit.

第三下采样单元，用于利用第二卷积组中的第一动态混合梯度卷积对第一特征图进行下采样，得到第一卷积图。The third down-sampling unit is configured to use the first dynamic hybrid gradient convolution in the second convolution group to down-sample the first feature map to obtain the first convolution map.

第一增强单元，用于利用第二卷积组中的第二动态混合梯度卷积对第一卷积图进行编码增强，得到第二卷积图。The first enhancement unit is configured to use the second dynamic hybrid gradient convolution in the second convolution group to perform coding enhancement on the first convolutional image to obtain a second convolutional image.

第二增强单元，用于利用第二卷积组中的第三动态混合梯度卷积对第二卷积图进行编码增强，得到第二特征图。The second enhancement unit is configured to use the third dynamic hybrid gradient convolution in the second convolution group to perform coding enhancement on the second convolution map to obtain a second feature map.

根据本公开的实施例的模块、子模块、单元、子单元中的任意多个、或其中任意多个的至少部分功能可以在一个模块中实现。根据本公开实施例的模块、子模块、单元、子单元中的任意一个或多个可以被拆分成多个模块来实现。根据本公开实施例的模块、子模块、单元、子单元中的任意一个或多个可以至少被部分地实现为硬件电路，例如现场可编程门阵列(Field Programmable Gate Array，FPGA)、可编程逻辑阵列(Programmable LogicArrays，PLA)、片上系统、基板上的系统、封装上的系统、专用集成电路(ApplicationSpecific Integrated Circuit，ASIC)，或可以通过对电路进行集成或封装的任何其他的合理方式的硬件或固件来实现，或以软件、硬件以及固件三种实现方式中任意一种或以其中任意几种的适当组合来实现。或者，根据本公开实施例的模块、子模块、单元、子单元中的一个或多个可以至少被部分地实现为计算机程序模块，当该计算机程序模块被运行时，可以执行相应的功能。Modules, sub-modules, units, any multiple of sub-units according to the embodiments of the present disclosure, or at least part of the functions of any multiple of them may be implemented in one module. Any one or more of modules, submodules, units, and subunits according to the embodiments of the present disclosure may be implemented by being divided into multiple modules. Any one or more of modules, submodules, units, and subunits according to embodiments of the present disclosure may be at least partially implemented as hardware circuits, such as Field Programmable Gate Array (Field Programmable Gate Array, FPGA), programmable logic Arrays (Programmable LogicArrays, PLA), system-on-chip, system-on-substrate, system-on-package, application-specific integrated circuit (Application Specific Integrated Circuit, ASIC), or any other reasonable means of hardware or firmware, or any one of software, hardware, and firmware, or an appropriate combination of any of them. Alternatively, one or more of the modules, submodules, units, and subunits according to the embodiments of the present disclosure may be at least partially implemented as computer program modules, and when the computer program modules are executed, corresponding functions may be performed.

例如，编码模块510、池化模块520、增强模块530、捕获模块540以及融合模块550中的任意多个可以合并在一个模块/单元/子单元中实现，或者其中的任意一个模块/单元/子单元可以被拆分成多个模块/单元/子单元。或者，这些模块/单元/子单元中的一个或多个模块/单元/子单元的至少部分功能可以与其他模块/单元/子单元的至少部分功能相结合，并在一个模块/单元/子单元中实现。根据本公开的实施例，编码模块510、池化模块520、增强模块530、捕获模块540以及融合模块550中的至少一个可以至少被部分地实现为硬件电路，例如现场可编程门阵列(FPGA)、可编程逻辑阵列(PLA)、片上系统、基板上的系统、封装上的系统、专用集成电路(ASIC)，或可以通过对电路进行集成或封装的任何其他的合理方式等硬件或固件来实现，或以软件、硬件以及固件三种实现方式中任意一种或以其中任意几种的适当组合来实现。或者，编码模块510、池化模块520、增强模块530、捕获模块540以及融合模块550中的至少一个可以至少被部分地实现为计算机程序模块，当该计算机程序模块被运行时，可以执行相应的功能。For example, any number of encoding module 510, pooling module 520, enhancement module 530, capture module 540, and fusion module 550 can be implemented in one module/unit/subunit, or any one of the modules/units/subunits Units can be split into multiple modules/units/subunits. Alternatively, at least part of the functions of one or more modules/units/subunits of these modules/units/subunits can be combined with at least part of the functions of other modules/units/subunits, and combined in one module/unit/subunit realized in. According to an embodiment of the present disclosure, at least one of the encoding module 510, pooling module 520, enhancement module 530, capture module 540, and fusion module 550 may be at least partially implemented as a hardware circuit, such as a field programmable gate array (FPGA) , programmable logic array (PLA), system-on-chip, system-on-substrate, system-on-package, application-specific integrated circuit (ASIC), or any other reasonable means of integrating or packaging circuits, such as hardware or firmware , or implemented by any one of software, hardware and firmware, or by an appropriate combination of any of them. Alternatively, at least one of the encoding module 510, the pooling module 520, the enhancement module 530, the capture module 540, and the fusion module 550 may be at least partially implemented as a computer program module, and when the computer program module is executed, the corresponding Function.

需要说明的是，本公开的实施例中遥感图像地物要素提取装置部分与本公开的实施例中遥感图像地物要素提取方法部分是相对应的，遥感图像地物要素提取装置部分的描述具体参考遥感图像地物要素提取方法部分，在此不再赘述。It should be noted that the remote sensing image feature element extraction device part in the embodiment of the present disclosure corresponds to the remote sensing image feature element extraction method part in the embodiment of the present disclosure, and the description of the remote sensing image feature element extraction device part is specific Refer to the part of extracting method of feature elements from remote sensing images, and will not go into details here.

图6示意性示出了根据本公开实施例的适于实现用于遥感图像地物要素提取方法的电子设备的方框图。图6示出的电子设备仅仅是一个示例，不应对本公开实施例的功能和使用范围带来任何限制。Fig. 6 schematically shows a block diagram of an electronic device adapted to implement a method for extracting feature elements from remote sensing images according to an embodiment of the present disclosure. The electronic device shown in FIG. 6 is only an example, and should not limit the functions and application scope of the embodiments of the present disclosure.

如图6所示，根据本公开实施例的电子设备600包括处理器601，其可以根据存储在只读存储器(Read-Only Memory，ROM)602中的程序或者从存储部分608加载到随机访问存储器(Random Access Memory，RAM)603中的程序而执行各种适当的动作和处理。处理器601例如可以包括通用微处理器(例如CPU)、指令集处理器和/或相关芯片组和/或专用微处理器(例如，专用集成电路(ASIC))，等等。处理器601还可以包括用于缓存用途的板载存储器。处理器601可以包括用于执行根据本公开实施例的方法流程的不同动作的单一处理单元或者是多个处理单元。As shown in FIG. 6, an electronic device 600 according to an embodiment of the present disclosure includes a processor 601, which can be loaded into a random access memory according to a program stored in a read-only memory (Read-Only Memory, ROM) 602 or from a storage part 608. (Random Access Memory, RAM) 603 to perform various appropriate actions and processing. Processor 601 may include, for example, a general-purpose microprocessor (eg, a CPU), an instruction set processor and/or related chipsets, and/or a special-purpose microprocessor (eg, an application-specific integrated circuit (ASIC)), and the like. Processor 601 may also include on-board memory for caching purposes. The processor 601 may include a single processing unit or multiple processing units for executing different actions of the method flow according to the embodiments of the present disclosure.

在RAM 603中，存储有电子设备600操作所需的各种程序和数据。处理器601、ROM602以及RAM 603通过总线604彼此相连。处理器601通过执行ROM 602和/或RAM 603中的程序来执行根据本公开实施例的方法流程的各种操作。需要注意，所述程序也可以存储在除ROM 602和RAM603以外的一个或多个存储器中。处理器601也可以通过执行存储在所述一个或多个存储器中的程序来执行根据本公开实施例的方法流程的各种操作。In the RAM 603, various programs and data necessary for the operation of the electronic device 600 are stored. The processor 601 , ROM 602 and RAM 603 are connected to each other through a bus 604 . The processor 601 executes various operations according to the method flow of the embodiment of the present disclosure by executing programs in the ROM 602 and/or RAM 603 . It should be noted that the program may also be stored in one or more memories other than ROM 602 and RAM 603 . The processor 601 may also perform various operations according to the method flow of the embodiments of the present disclosure by executing programs stored in the one or more memories.

根据本公开的实施例，电子设备600还可以包括输入/输出(I/O)接口605，输入/输出(I/O)接口605也连接至总线604。系统600还可以包括连接至I/O接口605的以下部件中的一项或多项：包括键盘、鼠标等的输入部分606；包括诸如阴极射线管(CRT)、液晶显示器(Liquid Crystal Display，LCD)等以及扬声器等的输出部分607；包括硬盘等的存储部分608；以及包括诸如LAN卡、调制解调器等的网络接口卡的通信部分609。通信部分609经由诸如因特网的网络执行通信处理。驱动器610也根据需要连接至I/O接口605。可拆卸介质611，诸如磁盘、光盘、磁光盘、半导体存储器等等，根据需要安装在驱动器610上，以便于从其上读出的计算机程序根据需要被安装入存储部分608。According to an embodiment of the present disclosure, the electronic device 600 may further include an input/output (I/O) interface 605 which is also connected to the bus 604 . The system 600 may also include one or more of the following components connected to the I/O interface 605: an input section 606 including a keyboard, a mouse, etc.; ) and the like and an output section 607 of speakers, etc.; a storage section 608 including a hard disk, etc.; and a communication section 609 including a network interface card such as a LAN card, a modem, and the like. The communication section 609 performs communication processing via a network such as the Internet. A drive 610 is also connected to the I/O interface 605 as needed. A removable medium 611 such as a magnetic disk, an optical disk, a magneto-optical disk, a semiconductor memory, etc. is mounted on the drive 610 as necessary so that a computer program read therefrom is installed into the storage section 608 as necessary.

根据本公开的实施例，根据本公开实施例的方法流程可以被实现为计算机软件程序。例如，本公开的实施例包括一种计算机程序产品，其包括承载在计算机可读存储介质上的计算机程序，该计算机程序包含用于执行流程图所示的方法的程序代码。在这样的实施例中，该计算机程序可以通过通信部分609从网络上被下载和安装，和/或从可拆卸介质611被安装。在该计算机程序被处理器601执行时，执行本公开实施例的系统中限定的上述功能。根据本公开的实施例，上文描述的系统、设备、装置、模块、单元等可以通过计算机程序模块来实现。According to the embodiments of the present disclosure, the method flow according to the embodiments of the present disclosure can be implemented as a computer software program. For example, the embodiments of the present disclosure include a computer program product, which includes a computer program carried on a computer-readable storage medium, where the computer program includes program codes for executing the methods shown in the flowcharts. In such an embodiment, the computer program may be downloaded and installed from a network via communication portion 609 and/or installed from removable media 611 . When the computer program is executed by the processor 601, the above-mentioned functions defined in the system of the embodiment of the present disclosure are executed. According to the embodiments of the present disclosure, the above-described systems, devices, devices, modules, units, etc. may be implemented by computer program modules.

本公开还提供了一种计算机可读存储介质，该计算机可读存储介质可以是上述实施例中描述的设备/装置/系统中所包含的；也可以是单独存在，而未装配入该设备/装置/系统中。上述计算机可读存储介质承载有一个或者多个程序，当上述一个或者多个程序被执行时，实现根据本公开实施例的方法。The present disclosure also provides a computer-readable storage medium. The computer-readable storage medium may be included in the device/apparatus/system described in the above embodiments; it may also exist independently without being assembled into the device/system device/system. The above-mentioned computer-readable storage medium carries one or more programs, and when the above-mentioned one or more programs are executed, the method according to the embodiment of the present disclosure is implemented.

根据本公开的实施例，计算机可读存储介质可以是非易失性的计算机可读存储介质。例如可以包括但不限于：便携式计算机磁盘、硬盘、随机访问存储器(RAM)、只读存储器(ROM)、可擦式可编程只读存储器(EPROM(Erasable Programmable Read Only Memory，EPROM)或闪存)、便携式紧凑磁盘只读存储器(Computer Disc Read-Only Memory，CD-ROM)、光存储器件、磁存储器件、或者上述的任意合适的组合。在本公开中，计算机可读存储介质可以是任何包含或存储程序的有形介质，该程序可以被指令执行系统、装置或者器件使用或者与其结合使用。According to an embodiment of the present disclosure, the computer-readable storage medium may be a non-volatile computer-readable storage medium. For example, it may include but not limited to: portable computer disk, hard disk, random access memory (RAM), read-only memory (ROM), erasable programmable read-only memory (EPROM (Erasable Programmable Read Only Memory, EPROM) or flash memory), Portable compact disk read-only memory (Computer Disc Read-Only Memory, CD-ROM), optical storage device, magnetic storage device, or any suitable combination of the above. In the present disclosure, a computer-readable storage medium may be any tangible medium that contains or stores a program that can be used by or in conjunction with an instruction execution system, apparatus, or device.

例如，根据本公开的实施例，计算机可读存储介质可以包括上文描述的ROM 602和/或RAM 603和/或ROM 602和RAM 603以外的一个或多个存储器。For example, according to an embodiment of the present disclosure, a computer-readable storage medium may include one or more memories other than the above-described ROM 602 and/or RAM 603 and/or ROM 602 and RAM 603 .

本公开的实施例还包括一种计算机程序产品，其包括计算机程序，该计算机程序包含用于执行本公开实施例所提供的方法的程序代码，当计算机程序产品在电子设备上运行时，该程序代码用于使电子设备实现本公开实施例所提供的遥感图像地物要素提取方法。Embodiments of the present disclosure also include a computer program product, which includes a computer program, and the computer program includes program codes for executing the method provided by the embodiments of the present disclosure. When the computer program product is run on an electronic device, the program The code is used to enable the electronic device to implement the method for extracting feature elements from remote sensing images provided by the embodiments of the present disclosure.

在该计算机程序被处理器601执行时，执行本公开实施例的系统/装置中限定的上述功能。根据本公开的实施例，上文描述的系统、装置、模块、单元等可以通过计算机程序模块来实现。When the computer program is executed by the processor 601, the above-mentioned functions defined in the system/device of the embodiment of the present disclosure are executed. According to the embodiments of the present disclosure, the above-described systems, devices, modules, units, etc. may be implemented by computer program modules.

在一种实施例中，该计算机程序可以依托于光存储器件、磁存储器件等有形存储介质。在另一种实施例中，该计算机程序也可以在网络介质上以信号的形式进行传输、分发，并通过通信部分609被下载和安装，和/或从可拆卸介质611被安装。该计算机程序包含的程序代码可以用任何适当的网络介质传输，包括但不限于：无线、有线等等，或者上述的任意合适的组合。In one embodiment, the computer program may rely on tangible storage media such as optical storage devices and magnetic storage devices. In another embodiment, the computer program can also be transmitted and distributed in the form of a signal on a network medium, downloaded and installed through the communication part 609, and/or installed from the removable medium 611. The program code contained in the computer program can be transmitted by any appropriate network medium, including but not limited to: wireless, wired, etc., or any appropriate combination of the above.

根据本公开的实施例，可以以一种或多种程序设计语言的任意组合来编写用于执行本公开实施例提供的计算机程序的程序代码，具体地，可以利用高级过程和/或面向对象的编程语言、和/或汇编/机器语言来实施这些计算程序。程序设计语言包括但不限于诸如Java，C++，python，“C”语言或类似的程序设计语言。程序代码可以完全地在用户计算设备上执行、部分地在用户设备上执行、部分在远程计算设备上执行、或者完全在远程计算设备或服务器上执行。在涉及远程计算设备的情形中，远程计算设备可以通过任意种类的网络，包括局域网(LAN)或广域网(WAN)，连接到用户计算设备，或者，可以连接到外部计算设备(例如利用因特网服务提供商来通过因特网连接)。According to the embodiments of the present disclosure, the program codes for executing the computer programs provided by the embodiments of the present disclosure can be written in any combination of one or more programming languages, specifically, high-level procedural and/or object-oriented programming language, and/or assembly/machine language to implement these computing programs. Programming languages include, but are not limited to, programming languages such as Java, C++, python, "C" or similar programming languages. The program code can execute entirely on the user computing device, partly on the user device, partly on the remote computing device, or entirely on the remote computing device or server. In cases involving a remote computing device, the remote computing device may be connected to the user computing device through any kind of network, including a local area network (LAN) or a wide area network (WAN), or may be connected to an external computing device (for example, using an Internet service provider). business to connect via the Internet).

附图中的流程图和框图，图示了按照本公开各种实施例的系统、方法和计算机程序产品的可能实现的体系架构、功能和操作。在这点上，流程图或框图中的每个方框可以代表一个模块、程序段、或代码的一部分，上述模块、程序段、或代码的一部分包含一个或多个用于实现规定的逻辑功能的可执行指令。也应当注意，在有些作为替换的实现中，方框中所标注的功能也可以以不同于附图中所标注的顺序发生。例如，两个接连地表示的方框实际上可以基本并行地执行，它们有时也可以按相反的顺序执行，这依所涉及的功能而定。也要注意的是，框图或流程图中的每个方框、以及框图或流程图中的方框的组合，可以用执行规定的功能或操作的专用的基于硬件的系统来实现，或者可以用专用硬件与计算机指令的组合来实现。本领域技术人员可以理解，本公开的各个实施例和/或权利要求中记载的特征可以进行多种组合和/或结合，即使这样的组合或结合没有明确记载于本公开中。特别地，在不脱离本公开精神和教导的情况下，本公开的各个实施例和/或权利要求中记载的特征可以进行多种组合和/或结合。所有这些组合和/或结合均落入本公开的范围。The flowchart and block diagrams in the Figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods and computer program products according to various embodiments of the present disclosure. In this regard, each block in a flowchart or block diagram may represent a module, program segment, or portion of code that includes one or more logical functions for implementing specified executable instructions. It should also be noted that, in some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or they may sometimes be executed in the reverse order, depending upon the functionality involved. It should also be noted that each block in the block diagrams or flowchart illustrations, and combinations of blocks in the block diagrams or flowchart illustrations, can be implemented by a dedicated hardware-based system that performs the specified function or operation, or can be implemented by a A combination of dedicated hardware and computer instructions. Those skilled in the art can understand that various combinations and/or combinations can be made in the various embodiments of the present disclosure and/or the features described in the claims, even if such combinations or combinations are not explicitly recorded in the present disclosure. In particular, without departing from the spirit and teaching of the present disclosure, the various embodiments of the present disclosure and/or the features described in the claims can be combined and/or combined in various ways. All such combinations and/or combinations fall within the scope of the present disclosure.

以上对本公开的实施例进行了描述。但是，这些实施例仅仅是为了说明的目的，而并非为了限制本公开的范围。尽管在以上分别描述了各实施例，但是这并不意味着各个实施例中的措施不能有利地结合使用。本公开的范围由所附权利要求及其等同物限定。不脱离本公开的范围，本领域技术人员可以做出多种替代和修改，这些替代和修改都应落在本公开的范围之内。The embodiments of the present disclosure have been described above. However, these examples are for illustrative purposes only and are not intended to limit the scope of the present disclosure. Although the various embodiments have been described separately above, this does not mean that the measures in the various embodiments cannot be advantageously used in combination. The scope of the present disclosure is defined by the appended claims and their equivalents. Various substitutions and modifications can be made by those skilled in the art without departing from the scope of the present disclosure, and these substitutions and modifications should all fall within the scope of the present disclosure.

Claims

1. A method for extracting feature elements from remote sensing images, comprising:

Input the remote sensing image into the encoding network of adaptive boundary extraction, use different convolution groups to extract the image features of the remote sensing image, and output the first feature map and the second feature map respectively, wherein the first feature map is output using The number of convolutions in the first convolution group is less than the number of convolutions in the second convolution group used to output the second feature map, and the remote sensing image includes boundary information of feature elements;

performing pyramid pooling on the second feature map to obtain a pooled feature map;

Encoding and enhancing the boundary information of the first feature map by using the encoding network to obtain an enhanced feature map;

capturing the long-distance dependency of the enhanced feature map to obtain the extracted feature map;

performing feature fusion on the extracted feature map and the pooled feature map to generate a target feature map.

2. The method according to claim 1, wherein the remote sensing image is input into the encoding network of adaptive boundary extraction, and different convolution groups are used to extract the image features of the remote sensing image, and the first feature map and the second feature map are respectively output. Feature maps, including:

using the first convolution group of the encoding network to down-sample the remote sensing image to obtain the first feature map;

Using the second convolution group of the encoding network to down-sample the first feature map to obtain the second feature map, the number of convolutions in the second convolution group is greater than that in the first convolution group number of convolutions.

3. The method according to claim 2, wherein said utilizing the first convolution group of said encoding network to down-sample said remote sensing image to obtain said first feature map, comprising:

Downsampling the remote sensing image by using the first dynamic hybrid gradient convolution in the first convolution group to obtain an initial feature map;

The second dynamic mixed gradient convolution in the first convolution group is used to downsample the initial feature map to obtain the first feature map.

4. The method according to claim 3, wherein said utilizing the first dynamic hybrid gradient convolution in said first convolution group to downsample said remote sensing image to obtain an initial feature map, comprising:

Determine a plurality of convolution weight coefficients of the first dynamic hybrid gradient convolution based on the remote sensing image, the first dynamic hybrid gradient convolution includes a common convolution kernel, a first gradient convolution kernel, and a second gradient convolution nuclear;

Based on the convolution weight coefficient corresponding to the ordinary convolution kernel, using the ordinary convolution kernel to perform feature extraction on the remote sensing image to obtain ordinary features;

Based on the convolution weight coefficient corresponding to the first gradient convolution kernel, using the first gradient convolution kernel to perform feature extraction on the remote sensing image to obtain a first gradient feature;

Based on the convolution weight coefficient corresponding to the second gradient convolution kernel, using the second gradient convolution kernel to perform feature extraction on the remote sensing image to obtain a second gradient feature;

performing feature fusion on the common feature, the first gradient feature, and the second gradient feature to generate the initial feature map.

5. The method according to claim 2, said utilizing the second convolution group of said encoding network to downsample said first feature map to obtain said second feature map, comprising:

downsampling the first feature map by using the first dynamic hybrid gradient convolution in the second convolution group to obtain a first convolution map;

Encoding and enhancing the first convolutional image by using the second dynamic hybrid gradient convolution in the second convolutional group to obtain a second convolutional image;

Encoding and enhancing the second convolution map by using the third dynamic hybrid gradient convolution in the second convolution group to obtain the second feature map.

6. The method of claim 1, further comprising:

Encoding and enhancing the boundary information of the target feature map by using the coding network to obtain a target enhanced feature map;

capturing the long-distance dependency of the target enhancement feature map to obtain the target extraction feature map;

Output the target extraction feature map.

7. A remote sensing image feature extraction device, comprising:

The encoding module is used to input the remote sensing image into the encoding network of adaptive boundary extraction, use different convolution groups to extract the image features of the remote sensing image, and output the first feature map and the second feature map respectively, wherein the output The number of convolutions in the first convolution group used by the first feature map is less than the number of convolutions in the second convolution group used to output the second feature map, and the remote sensing image includes boundary information of feature elements;

A pooling module, configured to perform pyramid pooling on the second feature map to obtain a pooling feature map;

An enhancement module, configured to use the encoding network to encode and enhance the boundary information of the first feature map to obtain an enhanced feature map;

A capturing module, configured to capture the long-distance dependency of the enhanced feature map to obtain the extracted feature map;

A fusion module, configured to perform feature fusion on the extracted feature map and the pooled feature map to generate a target feature map.

8. An electronic device comprising:

one or more processors;

memory for storing one or more programs,

Wherein, when the one or more programs are executed by the one or more processors, the one or more processors are made to implement the method according to any one of claims 1-6.

9. A computer-readable storage medium, on which executable instructions are stored, and when the instructions are executed by a processor, the processor can implement the method according to any one of claims 1-6.

10. A computer program product, the computer program product comprising a computer program, the computer program being used to implement the method according to any one of claims 1-6 when executed by a processor.