[go: up one dir, main page]

CN116684421A - Service deployment method, management platform, equipment and medium of hybrid cloud platform - Google Patents

Service deployment method, management platform, equipment and medium of hybrid cloud platform Download PDF

Info

Publication number
CN116684421A
CN116684421A CN202310787308.4A CN202310787308A CN116684421A CN 116684421 A CN116684421 A CN 116684421A CN 202310787308 A CN202310787308 A CN 202310787308A CN 116684421 A CN116684421 A CN 116684421A
Authority
CN
China
Prior art keywords
cluster
cloud platform
hybrid cloud
service deployment
model
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202310787308.4A
Other languages
Chinese (zh)
Inventor
杨腾飞
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Baidu Netcom Science and Technology Co Ltd
Original Assignee
Beijing Baidu Netcom Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Baidu Netcom Science and Technology Co Ltd filed Critical Beijing Baidu Netcom Science and Technology Co Ltd
Priority to CN202310787308.4A priority Critical patent/CN116684421A/en
Publication of CN116684421A publication Critical patent/CN116684421A/en
Pending legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/10Protocols in which an application is distributed across nodes in the network
    • H04L67/1001Protocols in which an application is distributed across nodes in the network for accessing one among a plurality of replicated servers
    • H04L67/1004Server selection for load balancing
    • H04L67/1008Server selection for load balancing based on parameters of servers, e.g. available memory or workload
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/10Protocols in which an application is distributed across nodes in the network
    • H04L67/1001Protocols in which an application is distributed across nodes in the network for accessing one among a plurality of replicated servers
    • H04L67/1004Server selection for load balancing
    • H04L67/1012Server selection for load balancing based on compliance of requirements or conditions with available server resources
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/50Network services
    • H04L67/51Discovery or management thereof, e.g. service location protocol [SLP] or web services
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/50Network services
    • H04L67/60Scheduling or organising the servicing of application requests, e.g. requests for application data transmissions using the analysis and optimisation of the required network resources
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Computer Hardware Design (AREA)
  • General Engineering & Computer Science (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The disclosure provides a service deployment method, a management platform, equipment and a medium of a hybrid cloud platform, and relates to the technical fields of cloud computing, hybrid cloud and the like. The specific implementation scheme comprises the following steps: acquiring a service deployment request of a user; the service deployment request carries functional information of a target model to be deployed and service deployment information of the target model; acquiring the residual quantity of each type of computing power resource on each node in each cluster created in advance for the user in a hybrid cloud platform; and deploying the services of the target model in the hybrid cloud platform based on the residual amounts of computing power resources of each type of each node in each cluster of the hybrid cloud platform and the service deployment information of the target model. According to the method and the device, resources in the hybrid cloud platform can be fully utilized, and the deployment efficiency of the services of the model in the hybrid cloud platform can be effectively improved.

Description

混合云平台的服务部署方法、管理平台、设备及介质Service deployment method, management platform, equipment and media of hybrid cloud platform

技术领域technical field

本公开涉及计算机技术领域,具体涉及云计算以及混合云等技术领域,尤其涉及一种混合云平台的服务部署方法、管理平台、设备及介质。The present disclosure relates to the field of computer technology, specifically to the technical fields of cloud computing and hybrid cloud, and in particular to a service deployment method, management platform, equipment and media of a hybrid cloud platform.

背景技术Background technique

混合云平台融合了公有云和私有云,是近年来云计算的主要模式和发展方向。The hybrid cloud platform integrates public cloud and private cloud, and is the main model and development direction of cloud computing in recent years.

现有技术中,混合云平台中公有云和私有云之间相互独立。在公有云和私有云中部署服务的时候,需要先分别在公有云和私有云中创建集群。然后分别在公有云或者私有云中部署服务。In the prior art, the public cloud and the private cloud in the hybrid cloud platform are independent of each other. When deploying services in the public cloud and private cloud, you need to create clusters in the public cloud and private cloud respectively. Then deploy services in public cloud or private cloud respectively.

发明内容Contents of the invention

本公开提供了一种混合云平台的服务部署方法、管理平台、设备及介质。The present disclosure provides a hybrid cloud platform service deployment method, management platform, equipment and media.

根据本公开的一方面,提供了一种混合云平台的服务部署方法,包括:According to an aspect of the present disclosure, a service deployment method of a hybrid cloud platform is provided, including:

获取用户的服务部署请求;所述服务部署请求中携带待部署的目标模型的功能信息、以及所述目标模型的服务部署信息;Obtain the user's service deployment request; the service deployment request carries the function information of the target model to be deployed and the service deployment information of the target model;

获取混合云平台中预先为所述用户创建的各集群中各节点上的各类型的算力资源的剩余量;Obtaining the remaining amount of each type of computing resources on each node in each cluster pre-created for the user in the hybrid cloud platform;

基于所述混合云平台的各集群中各节点的各类型的算力资源的剩余量、以及所述目标模型的服务部署信息,在所述混合云平台中部署所述目标模型的服务。Deploy the service of the target model on the hybrid cloud platform based on the remaining amount of each type of computing resources of each node in each cluster of the hybrid cloud platform and the service deployment information of the target model.

根据本公开的另一方面,提供了一种混合云管理平台,包括:According to another aspect of the present disclosure, a hybrid cloud management platform is provided, including:

服务部署获取模块,用于获取用户的服务部署请求;所述服务部署请求中携带待部署的目标模型的功能信息、以及所述目标模型的服务部署信息;A service deployment acquisition module, configured to acquire a user's service deployment request; the service deployment request carries the function information of the target model to be deployed and the service deployment information of the target model;

剩余资源获取模块,用于获取混合云平台中预先为所述用户创建的各集群中各节点上的各类型的算力资源的剩余量;The remaining resource obtaining module is used to obtain the remaining amount of various types of computing power resources on each node in each cluster created in advance for the user in the hybrid cloud platform;

服务部署模块,用于基于所述混合云平台的各集群中各节点的各类型的算力资源的剩余量、以及所述目标模型的服务部署信息,在所述混合云平台中部署所述目标模型的服务。A service deployment module, configured to deploy the target in the hybrid cloud platform based on the remaining amount of each type of computing resources of each node in each cluster of the hybrid cloud platform and the service deployment information of the target model Model service.

根据本公开的再一方面,提供了一种电子设备,包括:According to still another aspect of the present disclosure, an electronic device is provided, including:

至少一个处理器;以及at least one processor; and

与所述至少一个处理器通信连接的存储器;其中,a memory communicatively coupled to the at least one processor; wherein,

所述存储器存储有可被所述至少一个处理器执行的指令,所述指令被所述至少一个处理器执行,以使所述至少一个处理器能够执行如上所述的方面和任一可能的实现方式的方法。The memory stores instructions executable by the at least one processor, the instructions are executed by the at least one processor, so that the at least one processor can perform the above aspects and any possible implementation way of way.

根据本公开的又一方面,提供了一种存储有计算机指令的非瞬时计算机可读存储介质,所述计算机指令用于使所述计算机执行如上所述的方面和任一可能的实现方式的方法。According to yet another aspect of the present disclosure, there is provided a non-transitory computer-readable storage medium storing computer instructions, the computer instructions are used to make the computer execute the method of the above aspect and any possible implementation manner .

根据本公开的再另一方面,提供了一种计算机程序产品,包括计算机程序,所述计算机程序在被处理器执行时实现如上所述的方面和任一可能的实现方式的方法。According to yet another aspect of the present disclosure, a computer program product is provided, including a computer program, and when the computer program is executed by a processor, the above aspect and the method of any possible implementation manner are implemented.

根据本公开的技术,能够有效地提高混合云平台中模型的服务的部署效率。According to the technology disclosed in the present disclosure, the deployment efficiency of model services in the hybrid cloud platform can be effectively improved.

应当理解,本部分所描述的内容并非旨在标识本公开的实施例的关键或重要特征,也不用于限制本公开的范围。本公开的其它特征将通过以下的说明书而变得容易理解。It should be understood that what is described in this section is not intended to identify key or important features of the embodiments of the present disclosure, nor is it intended to limit the scope of the present disclosure. Other features of the present disclosure will be readily understood through the following description.

附图说明Description of drawings

附图用于更好地理解本方案,不构成对本公开的限定。其中:The accompanying drawings are used to better understand the present solution, and do not constitute a limitation to the present disclosure. in:

图1是根据本公开第一实施例的示意图;FIG. 1 is a schematic diagram according to a first embodiment of the present disclosure;

图2是根据本公开第二实施例的示意图;FIG. 2 is a schematic diagram according to a second embodiment of the present disclosure;

图3是根据本公开第三实施例的示意图;Fig. 3 is a schematic diagram according to a third embodiment of the present disclosure;

图4是本公开提供的一种混合云管理平台的管理示意图;FIG. 4 is a management diagram of a hybrid cloud management platform provided by the present disclosure;

图5是根据本公开第四实施例的示意图;FIG. 5 is a schematic diagram according to a fourth embodiment of the present disclosure;

图6是根据本公开第五实施例的示意图;FIG. 6 is a schematic diagram according to a fifth embodiment of the present disclosure;

图7是用来实现本公开实施例的方法的电子设备的框图。FIG. 7 is a block diagram of an electronic device used to implement the method of an embodiment of the present disclosure.

具体实施方式Detailed ways

以下结合附图对本公开的示范性实施例做出说明,其中包括本公开实施例的各种细节以助于理解,应当将它们认为仅仅是示范性的。因此,本领域普通技术人员应当认识到,可以对这里描述的实施例做出各种改变和修改,而不会背离本公开的范围和精神。同样,为了清楚和简明,以下的描述中省略了对公知功能和结构的描述。Exemplary embodiments of the present disclosure are described below in conjunction with the accompanying drawings, which include various details of the embodiments of the present disclosure to facilitate understanding, and they should be regarded as exemplary only. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the disclosure. Also, descriptions of well-known functions and constructions are omitted in the following description for clarity and conciseness.

显然,所描述的实施例是本公开一部分实施例,而不是全部的实施例。基于本公开中的实施例,本领域普通技术人员在没有作出创造性劳动前提下所获得的全部其他实施例,都属于本公开保护的范围。Apparently, the described embodiments are some of the embodiments of the present disclosure, but not all of them. Based on the embodiments in the present disclosure, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present disclosure.

需要说明的是,本公开实施例中所涉及的终端设备可以包括但不限于手机、个人数字助理(Personal Digital Assistant,PDA)、无线手持设备、平板电脑(TabletComputer)等智能设备;显示设备可以包括但不限于个人电脑、电视等具有显示功能的设备。It should be noted that the terminal devices involved in the embodiments of the present disclosure may include but not limited to mobile phones, personal digital assistants (Personal Digital Assistant, PDA), wireless handheld devices, tablet computers (TabletComputer) and other smart devices; display devices may include However, it is not limited to devices with display functions such as personal computers and televisions.

另外,本文中术语“和/或”,仅仅是一种描述关联对象的关联关系,表示可以存在三种关系,例如,A和/或B,可以表示:单独存在A,同时存在A和B,单独存在B这三种情况。另外,本文中字符“/”,一般表示前后关联对象是一种“或”的关系。In addition, the term "and/or" in this article is only an association relationship describing associated objects, which means that there may be three relationships, for example, A and/or B may mean: A exists alone, A and B exist at the same time, There are three cases of B alone. In addition, the character "/" in this article generally indicates that the contextual objects are an "or" relationship.

现有的混合云平台中,各公有云和私有云需要分别单独管理,例如,基于不同的集群创建方式,分别在对应的云中创建集群。集群创建完毕后,需要在集群的节点中独立安装相关的算力组件,以支持后续服务的部署。然后分别在公有云或者私有云的集群中部署服务,无法充分利用混合云平台的资源,导致混合云平台的服务部署效率较低。In the existing hybrid cloud platform, each public cloud and private cloud needs to be managed separately, for example, based on different cluster creation methods, clusters are created in corresponding clouds respectively. After the cluster is created, related computing power components need to be independently installed in the nodes of the cluster to support the deployment of subsequent services. Then, services are deployed in public cloud or private cloud clusters, which cannot fully utilize the resources of the hybrid cloud platform, resulting in low service deployment efficiency of the hybrid cloud platform.

图1是根据本公开第一实施例的示意图;如图1所示,本实施例提供一种混合云平台的服务部署方法,具体可以包括如下步骤:FIG. 1 is a schematic diagram according to a first embodiment of the present disclosure; as shown in FIG. 1 , this embodiment provides a service deployment method for a hybrid cloud platform, which may specifically include the following steps:

S101、获取用户的服务部署请求;服务部署请求中携带待部署的目标模型的功能信息、目标模型的服务部署信息;S101. Obtain a user's service deployment request; the service deployment request carries the function information of the target model to be deployed and the service deployment information of the target model;

本实施例中,可以提供一种混合云管理平台,用于实现对混合云平台进行管理,例如包括在混合云平台上进行服务部署。所以,本实施例的混合云平台的服务部署方法,可以应用在混合云管理平台中。该混合云管理平台可以对包括至少一个公有云和至少一个私有云的云平台进行管理。In this embodiment, a hybrid cloud management platform may be provided for implementing management on the hybrid cloud platform, for example including service deployment on the hybrid cloud platform. Therefore, the service deployment method of the hybrid cloud platform in this embodiment can be applied to the hybrid cloud management platform. The hybrid cloud management platform can manage a cloud platform including at least one public cloud and at least one private cloud.

本实施例的混合云平台中可以包括有公有云,也可以包括有私有云。该混合云平台,可以是一个基于容器集群管理系统kubernetes(简称K8s)的混合云平台,所以该混合云平台也可以称为K8s混合云平台。The hybrid cloud platform in this embodiment may include a public cloud or a private cloud. The hybrid cloud platform may be a hybrid cloud platform based on the container cluster management system kubernetes (K8s for short), so the hybrid cloud platform may also be called the K8s hybrid cloud platform.

本实施例的用户非个人用户,可以为一个商家账户,或者是基于一个指定项目所创建的账户,需要可以在混合云平台中部署该指定项目所需的各中模型的服务。The user in this embodiment is not an individual user, but can be a merchant account, or an account created based on a specified project, and needs services that can deploy various models required by the specified project on the hybrid cloud platform.

本实施例的目标模型可以为一个预先训练好的、能够实现一定功能的神经网络模型。The target model in this embodiment may be a pre-trained neural network model capable of realizing certain functions.

S102、获取混合云平台中预先为用户创建的各集群中各节点上的各类型的算力资源的剩余量;S102. Obtain the remaining amount of various types of computing power resources on each node in each cluster created in advance for the user in the hybrid cloud platform;

本实施例中,预先在混合云平台中的各云平台中创建有集群。且各集群的各节点上部署有采集器组件,能够采集节点上的各类型的算力资源的剩余量。当然,可选地,各采集器组件,还可以采集各节点上的各类型的算力资源的使用情况。In this embodiment, clusters are created in advance on each cloud platform in the hybrid cloud platform. And each node of each cluster is deployed with a collector component, which can collect the remaining amount of various types of computing power resources on the node. Of course, optionally, each collector component can also collect usage of various types of computing resources on each node.

S103、基于混合云平台的各集群中各节点的各类型的算力资源的剩余量、以及目标模型的服务部署信息,在混合云平台中部署目标模型的服务。S103. Deploy the service of the target model on the hybrid cloud platform based on the remaining amount of each type of computing resources of each node in each cluster of the hybrid cloud platform and the service deployment information of the target model.

本实施例中,可以参考混合云平台的各集群中各节点的各类型的算力资源的剩余量、目标模型的服务部署信息,在混合云平台中部署目标模型的服务。本实施例的部署方式,对于混合云管理平台而言,不用区分混合云平台中的不同云以及不同云中的集群,而是将混合云平台看作一个整体,去部署目标模型的服务。也就是说,本实施例中在混合云平台部署目标模型的服务,不限制仅在混合云平台中的一个云平台上部署该目标模型的服务;也可以同时在混合云平台中的两个或者两个以上的云平台中部署该目标模型的服务,这样可以充分利用混合云平台中的资源,提高目标模型的服务的部署效率。In this embodiment, the service of the target model can be deployed on the hybrid cloud platform by referring to the remaining amount of each type of computing resources of each node in each cluster of the hybrid cloud platform and the service deployment information of the target model. The deployment method of this embodiment, for the hybrid cloud management platform, does not need to distinguish between different clouds in the hybrid cloud platform and clusters in different clouds, but regards the hybrid cloud platform as a whole to deploy services of the target model. That is to say, in this embodiment, deploying the service of the target model on the hybrid cloud platform is not limited to only deploying the service of the target model on one cloud platform in the hybrid cloud platform; Deploy the service of the target model in more than two cloud platforms, so that the resources in the hybrid cloud platform can be fully utilized, and the deployment efficiency of the service of the target model can be improved.

本实施例的混合云平台的服务部署方法,能够基于混合云平台的各集群中各节点的各类型的算力资源的剩余量、目标模型的服务部署信息,将混合云平台看作一个整体,在混合云平台中部署目标模型的服务,而不限制仅在混合云平台中的一个云平台上部署该目标模型的服务,能够充分利用混合云平台中的资源,有效地提高混合云平台中模型的服务的部署效率。The service deployment method of the hybrid cloud platform in this embodiment can regard the hybrid cloud platform as a whole based on the remaining amount of each type of computing resources of each node in each cluster of the hybrid cloud platform and the service deployment information of the target model, Deploying the service of the target model in the hybrid cloud platform is not limited to deploying the service of the target model on only one cloud platform in the hybrid cloud platform, which can make full use of the resources in the hybrid cloud platform and effectively improve the model in the hybrid cloud platform. service deployment efficiency.

图2是根据本公开第二实施例的示意图;如图2所示,本实施例在上述图1所示实施例的技术方案的基础上,进一步更加详细地介绍本公开的技术方案,具体可以包括如下步骤:Fig. 2 is a schematic diagram according to the second embodiment of the present disclosure; as shown in Fig. 2, this embodiment further introduces the technical solution of the present disclosure in more detail on the basis of the technical solution of the embodiment shown in Fig. 1 above, specifically, Including the following steps:

S201、接收外部的用户的服务部署请求;S201. Receive a service deployment request from an external user;

S202、将用户的服务部署请求存入队列中;S202. Store the user's service deployment request into a queue;

本实施例的该服务部署请求中,携带待部署的目标模型的功能信息、以及目标模型的服务部署信息。The service deployment request in this embodiment carries the function information of the target model to be deployed and the service deployment information of the target model.

例如,本实施例中,目标模型的服务部署信息,可以包括目标模型的服务部署所依赖的第一算力资源的类型、依赖的第一算力资源的量、以及部署目标模型的总数量,该部署的总数量数量也可以认为是部署该目标模型的服务的副本的总数量,或者部署该目标模型的服务的实例的总数量。本实施例中,部署的总数量可以由用户根据需求来指定。For example, in this embodiment, the service deployment information of the target model may include the type of the first computing power resource on which the service deployment of the target model depends, the amount of the first computing power resource relied on, and the total number of deployed target models, The total number of deployments can also be considered as the total number of copies of the service deploying the target model, or the total number of instances of the service deploying the target model. In this embodiment, the total number of deployments can be specified by the user according to requirements.

本实施例的场景中,算力资源的类型可以包括中央处理器(Central ProcessingUnit;CPU)、内存、显卡等;其中显卡还可以分不同型号的显卡,不同型号的显卡具有不同的显存。进一步地,算力资源的类型还可以包括张量处理器(Tensor Processing Unit;TPU)、神经网络处理器(Neural Processing Unit;NPU);图形处理器(Graphics ProcessingUnit;GPU);智能处理器(Intelligent Processing Unit:IPU)等等XPU。In the scenario of this embodiment, the types of computing power resources may include a central processing unit (Central Processing Unit; CPU), memory, graphics card, etc.; the graphics card may be divided into different types of graphics cards, and different types of graphics cards have different video memories. Further, the types of computing power resources may also include tensor processing units (Tensor Processing Unit; TPU), neural network processors (Neural Processing Unit; NPU); graphics processing units (Graphics Processing Unit; GPU); intelligent processors (Intelligent Processing Unit: IPU) and so on XPU.

本实施例中,目标模型的服务部署依赖的第一算力资源的类型,可以包括一种、两种或者多种。例如,有的目标模型仅依赖CPU和内存,而有的目标模型还依赖指定型号的显卡;还有的目标模型需要依赖TPU、NPU、或者GPU等。In this embodiment, the type of the first computing resource that the service deployment of the target model depends on may include one type, two types, or multiple types. For example, some target models only depend on CPU and memory, while some target models also depend on a specified type of graphics card; some target models need to depend on TPU, NPU, or GPU.

本实施例的部署目标模型的服务依赖第一算力资源的量可以包括:部署该目标模型的服务依赖的CPU的核数、依赖的内存的大小(单位可以为兆M)、依赖的指定型号的显卡的块数、依赖的TPU的块数、依赖的NPU的块数、依赖的GPU的块数以及依赖的IPU的块数等中的至少一个。本实施例中的混合云管理平台部署服务时,需要一定的耗时,所以对于混合云管理平台接收到的每一个服务部署请求,并非在接收时进行实时处理。In this embodiment, the service of the deployment target model depends on the amount of the first computing power resource may include: the number of cores of the CPU on which the service of the deployment target model depends, the size of the dependent memory (the unit can be megabytes), and the specified model of the dependence At least one of the number of graphics card blocks, the number of dependent TPU blocks, the number of dependent NPU blocks, the number of dependent GPU blocks, and the number of dependent IPU blocks. The hybrid cloud management platform in this embodiment takes a certain amount of time to deploy services, so each service deployment request received by the hybrid cloud management platform is not processed in real time when it is received.

为了避免服务部署请求的丢失或者遗漏处理,本实施例中,可以设置一个队列,在混合云管理平台接收到外部发送的用户的服务部署请求之后,先将用户的服务部署请求存储在队列中。然后处理时,再从队列中读取一个用户的服务部署请求进行服务部署。具体地,存储时,存入队列的队尾。而读取时,从队列的队头读取。In order to avoid the loss or omission of service deployment requests, in this embodiment, a queue can be set up, and after the hybrid cloud management platform receives the user's service deployment requests sent from the outside, it first stores the user's service deployment requests in the queue. Then when processing, read a user's service deployment request from the queue for service deployment. Specifically, when storing, it is stored at the end of the queue. When reading, read from the head of the queue.

本实施例的外部的用户的服务部署请求可以为用户通过该混合云管理平台的应用程序接口(Application Programming Interface;API)输入的,或者也可以为用户通过混合云管理平台的界面输入的。In this embodiment, the external user's service deployment request may be input by the user through the application programming interface (Application Programming Interface; API) of the hybrid cloud management platform, or may also be input by the user through the interface of the hybrid cloud management platform.

S203、从队列中获取用户的服务部署请求;S203. Obtain the user's service deployment request from the queue;

具体地,从队列的头部获取用户的服务部署请求。Specifically, the user's service deployment request is obtained from the head of the queue.

需要说明的是,本实施例中,在步骤S201之前,还可以包括:接入目标模型,即将目标模型的参数等所有信息存入当前的混合云管理平台中,实现该目标模型的接入,使得混合云管理平台接收到关于该目标模型的服务部署请求时,能够知道是要部署哪个模型的服务。It should be noted that, in this embodiment, before step S201, it may also include: accessing the target model, that is, storing all information such as the parameters of the target model in the current hybrid cloud management platform to realize the access of the target model, This enables the hybrid cloud management platform to know which model service is to be deployed when receiving a service deployment request about the target model.

进一步地,在步骤S203之后,还可以包括:对预先接入的该目标模型进行格式转化,得到标准格式的目标模型。Further, after step S203, the method may further include: performing format conversion on the pre-accessed object model to obtain an object model in a standard format.

由于混合云管理平台在接入目标模型时,不同的目标模型采用的格式可能不同,混合云管理平台无法识别每一种格式的目标模型。所以,混合云管理平台接收到该服务部署请求时,可以先将服务部署请求中限定的目标模型转换为混合云管理平台的标准格式,方便识别该目标模型。Since different target models may adopt different formats when the hybrid cloud management platform accesses the target model, the hybrid cloud management platform cannot recognize target models in each format. Therefore, when the hybrid cloud management platform receives the service deployment request, it can first convert the target model defined in the service deployment request into the standard format of the hybrid cloud management platform, so as to facilitate the identification of the target model.

S204、从存储单元中获取各集群中各节点上报的各类型的算力资源的剩余量;S204. Obtain from the storage unit the remaining amount of each type of computing resources reported by each node in each cluster;

例如,本实施例中,各节点上报的各类型的算力资源的剩余量,可以包括CPU的剩余核数、内存剩余大小(单位可以为兆M)、显卡的剩余块数。一个节点上可以部署一块、两块或者多块指定型号的显卡。For example, in this embodiment, the remaining amounts of various types of computing power resources reported by each node may include the remaining number of cores of the CPU, the remaining size of the memory (the unit may be megabytes), and the remaining number of blocks of the graphics card. One, two or more graphics cards of specified models can be deployed on a node.

采用该方式,可以准确、高效地获取到各节点的各类型的算力资源的剩余量。By adopting this method, the remaining amount of each type of computing resources of each node can be obtained accurately and efficiently.

可选地,在该步骤S204之前,还可以包括:接收各集群中各节点上报的各类型的算力资源的剩余量,并存储在存储单元中,能够有效地确保存储单元中的各节点的各类型的算力资源的剩余量的准确性。Optionally, before this step S204, it may also include: receiving the remaining amount of each type of computing power resource reported by each node in each cluster, and storing it in the storage unit, which can effectively ensure that each node in the storage unit The accuracy of the remaining amount of various types of computing power resources.

具体地,该用户对应的每个集群创建在一个云平台上。如对于包括一个私有云平台和至少两个公有云平台的混合云平台,可以分别在私有云平台和各公有云平台上预先创建有相应的集群。创建集群的方式可以采用任意方式,在此不做限定。并且还可以各集群中各节点上部署采集器组件,能够采集各节点上的各类型的算力资源的剩余量。例如对于多核CPU的机器,采集器组件可以采集到CPU剩余几核未占用、内存剩余多少。对于指定型号的显卡的机器,采集组件可以采集到剩余显卡的块数等。Specifically, each cluster corresponding to the user is created on a cloud platform. For example, for a hybrid cloud platform including one private cloud platform and at least two public cloud platforms, corresponding clusters may be pre-created on the private cloud platform and each public cloud platform respectively. The manner of creating the cluster may be in any manner, which is not limited here. In addition, collector components can be deployed on each node in each cluster, which can collect the remaining amount of various types of computing power resources on each node. For example, for a multi-core CPU machine, the collector component can collect how many cores are not occupied by the CPU and how much memory is left. For a machine with a specified type of graphics card, the collection component can collect the number of blocks of the remaining graphics card, etc.

可选地,本实施例中,也可以从实时从各集群中获取集群中各节点上报的各类型的算力资源的剩余量。相对于上一方式,耗时更长,但获取到的数据更准确。当然,各节点上报的各类型的算力资源的剩余量的上报周期,也可以设置短一点,能够有效地提高存储单元中存储的各节点的各类型的算力资源的剩余量的准确性。Optionally, in this embodiment, the remaining amount of each type of computing resources reported by each node in the cluster may also be obtained from each cluster in real time. Compared with the previous method, it takes longer, but the obtained data is more accurate. Of course, the reporting period of the remaining amount of each type of computing power resource reported by each node can also be set shorter, which can effectively improve the accuracy of the remaining amount of each type of computing power resource of each node stored in the storage unit.

S205、基于混合云平台的各集群中各节点的各类型的算力资源的剩余资源量、目标模型依赖的第一算力资源的类型、依赖第一算力资源的量以及部署目标模型的服务的总数量,检测混合云平台中是否能够实施总数量的目标模型的服务的部署;若能,执行步骤S206;若不能,执行步骤S209;S205. The remaining resource amount of each type of computing power resource of each node in each cluster based on the hybrid cloud platform, the type of the first computing power resource that the target model depends on, the amount that depends on the first computing power resource, and the service of deploying the target model total quantity, detect whether the deployment of the service of the total quantity target model can be implemented in the hybrid cloud platform; if yes, perform step S206; if not, perform step S209;

S206、确定目标模型的目标服务部署策略;执行步骤S207;S206. Determine the target service deployment strategy of the target model; execute step S207;

该目标服务部署策略中包括至少一个第一集群的标识以及各第一集群中部署的第一数量;各第一集群部署的第一数量之和等于总数量。此时,该目标服务部署策略仅为一个模型部署的预分配方案,用于指导后续的步骤。本实施例的第一集群为混合云平台中的任一集群。The target service deployment strategy includes an identifier of at least one first cluster and a first quantity deployed in each first cluster; the sum of the first quantities deployed in each first cluster is equal to the total quantity. At this point, the target service deployment policy is only a pre-allocated solution for model deployment, which is used to guide the next steps. The first cluster in this embodiment is any cluster in the hybrid cloud platform.

本实施例中,具体可以依据混合云平台的各集群中各节点的各类型的算力资源的剩余资源空间量、以及部署一个目标模型的服务所需的资源量,实现确定混合云平台中各集群中可以部署的目标模型的服务的数量。In this embodiment, it can be specifically determined according to the amount of remaining resource space of each type of computing power resource of each node in each cluster of the hybrid cloud platform and the amount of resources required to deploy a target model service. The number of services of the target model that can be deployed in the cluster.

基于以上所述,可以得知,在混合云平台中第一算力资源的量充足时,可以直接确定目标模型的目标服务部署策略,该目标服务部署策略能够实现在混合云平台中直接部署总数量的目标模型的服务。具体部署时,总数量的目标模型的服务可以部署在一个第一集群中,也可以部署在两个或者多个第一集群中。若存在部署至两个以上第一集群时,各第一集群部署的第一数量之和必然等于部署目标模型的服务的总数量。Based on the above, it can be known that when the amount of the first computing power resources in the hybrid cloud platform is sufficient, the target service deployment strategy of the target model can be directly determined, and the target service deployment strategy can realize the direct deployment of the overall cloud platform in the hybrid cloud platform. The number of target models to serve. During specific deployment, the total number of target model services may be deployed in one first cluster, or may be deployed in two or more first clusters. If there are more than two first clusters deployed, the sum of the first numbers deployed by each first cluster must be equal to the total number of services of the deployment target model.

S207、生成目标服务部署策略对应的目标镜像文件;执行步骤S208;S207. Generate a target image file corresponding to the target service deployment strategy; execute step S208;

具体地,可以基于标准格式的目标模型和依赖的第一算力资源的类型,生成目标服务部署策略对应的目标镜像文件,使得生成的目标镜像文件可以在混合云平台中第一算力资源上运行,实现部署目标模型的服务在对应的集群的容器中。采用该方式,可以有效地确保生成的镜像文件的准确性。Specifically, the target image file corresponding to the target service deployment strategy can be generated based on the target model in the standard format and the type of the dependent first computing resource, so that the generated target image file can be used on the first computing resource in the hybrid cloud platform Running, the service that implements the deployment target model is in the container of the corresponding cluster. By adopting this method, the accuracy of the generated image file can be effectively ensured.

进一步可选地,步骤S207之后,还可以包括:将生成的目标镜像文件,存储在镜像文件仓库中。即在混合云管理平台中配置一个镜像文件仓库,用于存储生成的所有镜像文件,以对生成的镜像文件及时、有效地存储,以便于后续需要时直接获取。Further optionally, after step S207, the method may further include: storing the generated target image file in the image file warehouse. That is, an image file warehouse is configured on the hybrid cloud management platform to store all generated image files, so that the generated image files can be stored in a timely and effective manner, so that they can be directly obtained when needed later.

S208、向各第一集群部署对应第一数量的目标镜像文件,以实现在混合云平台中部署总数量的目标模型的服务,执行步骤S224;S208. Deploy the corresponding first number of target image files to each first cluster, so as to realize the service of deploying a total number of target models in the hybrid cloud platform, and perform step S224;

例如,本实施例中,可以通过调用各第一集群的API,向第一集群发送携带对应第一数量和目标镜像文件的标识服务部署命令,以供第一集群,从混合云管理平台拉取对应的目标镜像文件,并部署第一数量份,以实现在混合云平台中部署总数量的目标模型的服务。For example, in this embodiment, by calling the API of each first cluster, the first cluster can be sent to the first cluster to carry the identification service deployment command corresponding to the first quantity and the target image file, so that the first cluster can pull it from the hybrid cloud management platform. Corresponding target image files, and deploying the first number of copies, so as to implement the service of deploying a total number of target models in the hybrid cloud platform.

采用上述方式,可以在混合云平台中第一算力资源充足时,准确、高效地在混合云平台中部署总数量的目标模型的服务。By adopting the above method, when the first computing resources in the hybrid cloud platform are sufficient, the services of the total number of target models can be accurately and efficiently deployed on the hybrid cloud platform.

S209、确定混合云平台中能够部署目标模型的第一子服务部署策略;执行步骤S210;S209. Determine the first sub-service deployment strategy that can deploy the target model in the hybrid cloud platform; execute step S210;

本实施例的第一子服务部署策略中包括至少一个第二集群的标识以及各第二集群中部署的第二数量;各第二集群部署的第二数量之和小于总数量。第二集群也为混合云平台中的任一集群。The first sub-service deployment strategy in this embodiment includes an identifier of at least one second cluster and a second quantity deployed in each second cluster; the sum of the second quantities deployed in each second cluster is less than the total quantity. The second cluster is also any cluster in the hybrid cloud platform.

该第一子服务部署策略,即充分利用混合云平台中目标模型依赖的第一算力资源,尽可能地部署部分数量的目标模型的服务。即该第一子服务部署策略,仅完成部分部署任务。The first sub-service deployment strategy is to make full use of the first computing power resources that the target model depends on in the hybrid cloud platform, and deploy as many services as possible of the target model. That is, the first sub-service deployment strategy only completes part of the deployment tasks.

需要说明的是,实际应用中,若混合云平台中目标模型依赖的第一算力资源非常稀缺,可能都无法完成一个目标模型的服务部署。此时,第一子服务部署策略也可以空。即第一子服务部署策略中不包括第二集群的标识,各第二集群中部署的第二数量为空。为了便于后续统计,也可以配置第一子服务部署策略中包括混合云平台中的每个集群,但是集群中部署的第二数量为0。It should be noted that, in practical applications, if the first computing resource that the target model depends on in the hybrid cloud platform is very scarce, it may not be possible to complete the service deployment of a target model. At this time, the first sub-service deployment policy may also be empty. That is, the first sub-service deployment strategy does not include the identifier of the second cluster, and the second quantity deployed in each second cluster is empty. For the convenience of subsequent statistics, it is also possible to configure the first sub-service deployment strategy to include each cluster in the hybrid cloud platform, but the second number deployed in the cluster is 0.

S210、基于总数量和各第二集群对应的第二数量,确定部署的剩余数量;执行步骤S211;S210. Based on the total quantity and the second quantity corresponding to each second cluster, determine the remaining quantity to be deployed; perform step S211;

具体地,先取各第二集群对应的第二数量之和,作为已部署的数量。然后采用总数量减去已部署数量,便得到部署的剩余数量。Specifically, the sum of the second quantities corresponding to the second clusters is firstly taken as the deployed quantity. Then subtract the deployed quantity from the total quantity to get the remaining quantity deployed.

本实施例中,在步骤S210之后,为了实现剩余数量的目标模型的部署,可以执行如下模型转换的部署步骤:基于混合云平台的各集群中各节点的各类型的算力资源的剩余量、以及已部署的多个模型中各模型的功能信息以及各模型的服务部署信息,在混合云平台中部署剩余数量的、与目标模型具有相同功能的已部署模型的服务,实现代替剩余数量的目标模型的服务。In this embodiment, after step S210, in order to implement the deployment of the remaining number of target models, the following deployment steps of model conversion can be performed: the remaining amount of each type of computing resources of each node in each cluster based on the hybrid cloud platform, As well as the function information of each model in the deployed multiple models and the service deployment information of each model, deploy the remaining number of services of the deployed model that have the same function as the target model in the hybrid cloud platform, so as to achieve the goal of replacing the remaining number Model service.

各模型的服务部署信息可以包括对应模型的服务部署时依赖的算力资源的类型以及依赖的算力资源的量。The service deployment information of each model may include the type and amount of computing power resources that the service deployment of the corresponding model depends on.

该模型转换的部署步骤,可以在混合云平台上目标模型依赖的算力资源不足时,且混合云平台上与目标模型功能相同的已部署模型依赖的算力资源与目标模型依赖的算力资源不同、且算力资源充足的情况下,将部署剩余数量的目标模型的服务,转换为部署相同功能的已部署模型的服务,能够充分地利用混合云平台上的所有算力资源,进行模型部署,有效地提高模型的部署效率。The deployment steps of this model conversion can be implemented when the computing power resources that the target model depends on on the hybrid cloud platform are insufficient, and the computing power resources that the deployed model on the hybrid cloud platform has the same function as the target model depend on and the computing power resources that the target model depends on In the case of different and sufficient computing power resources, the service of deploying the remaining number of target models is converted into the service of deploying the deployed model with the same function, which can make full use of all computing power resources on the hybrid cloud platform for model deployment , effectively improving the deployment efficiency of the model.

例如,该模型转换的部署步骤,具体实现时,可以包括如下步骤S211-S222的全部或者部分。也就是说,模型转换的过程,可以仅进行一次模型转换以及转换后的相应模型的服务部署,也可以进行两次或者多次模型转换以及转换后的相应模型的服务部署,详细参考下述实施例的记载。For example, the deploying step of model conversion may include all or part of the following steps S211-S222 during specific implementation. That is to say, in the process of model conversion, the model conversion and the service deployment of the converted corresponding model can be performed only once, or two or more model conversions and the service deployment of the converted corresponding model can be performed. For details, refer to the following implementation Example records.

S211、基于目标模型的功能信息和已部署的多个模型中各模型的功能信息,获取与目标模型具有相同功能的第一替代模型;执行步骤S212;S211. Based on the function information of the target model and the function information of each of the deployed models, obtain a first substitute model having the same function as the target model; perform step S212;

S212、基于混合云平台的各集群中各节点的各类型的算力资源的剩余量、第一替代模型的服务部署依赖的第二算力资源的类型、依赖的第二目标算力资源的量以及部署的剩余数量,检测混合云平台中是否能够实施剩余数量的第一替代模型的服务的部署;若能,执行步骤S213;若不能,执行步骤S216;S212. The remaining amount of each type of computing power resources of each node in each cluster based on the hybrid cloud platform, the type of the second computing power resource that the service deployment of the first alternative model depends on, and the amount of the second target computing power resource that depends on and the remaining number of deployments, detecting whether the remaining number of services of the first alternative model can be deployed in the hybrid cloud platform; if yes, execute step S213; if not, execute step S216;

本实施例中的第二算力资源的类型与第一算力资源的类型不同。The type of the second computing power resource in this embodiment is different from the type of the first computing power resource.

该步骤的检测过程中,将混合云平台看作一个整体,充分考虑混合云平台中的每个集群中每个节点的每种类型的算力资源的剩余数量,能够充分利用混合云平台中的算力资源进行模型部署。In the detection process of this step, the hybrid cloud platform is regarded as a whole, and the remaining quantity of each type of computing resources of each node in each cluster in the hybrid cloud platform is fully considered, so that the hybrid cloud platform can make full use of Computing resources for model deployment.

S213、确定混合云平台中部署第一替代模型的第二子服务部署策略;执行步骤S214;S213. Determine the second sub-service deployment strategy for deploying the first alternative model in the hybrid cloud platform; execute step S214;

本实施例中的第二子服务部署策略中包括至少一个第三集群的标识以及各第三集群中部署的第三数量;各第三集群部署的第三数量之和等于剩余数量。第三集群也为混合云平台中的任一集群。The second sub-service deployment strategy in this embodiment includes an identifier of at least one third cluster and a third quantity deployed in each third cluster; the sum of the third quantities deployed in each third cluster is equal to the remaining quantity. The third cluster is also any cluster in the hybrid cloud platform.

基于以上所述,可以得知,此时,总数量的目标模型的部署服务,可以拆分采用第一子服务部署策略部署部分数量的目标模型的服务,采用第二子服务部署策略部署剩余数量的第一替代模型的服务。此时,总数量的目标模型的部署服务预分配方案完成。Based on the above, it can be known that at this time, the deployment services of the total number of target models can be split using the first sub-service deployment strategy to deploy part of the number of target model services, using the second sub-service deployment strategy to deploy the remaining number The first alternative model of service. At this point, the deployment service pre-allocation scheme for the total number of target models is completed.

S214、分别获取第一子服务部署策略对应的第一镜像文件和第二子服务部署策略对应的第二镜像文件;执行步骤S215;S214. Respectively acquire the first image file corresponding to the first sub-service deployment strategy and the second image file corresponding to the second sub-service deployment strategy; execute step S215;

其中,由于目标模型为新接入的模型,不是已部署的模型,镜像文件仓库中不会存在相应的镜像文件,所以,可以基于标准格式的目标模型和依赖的第一算力资源的类型,生成第一子服务部署策略对应的第一镜像文件。生成的第一镜像文件可以在第一算力资源上运行,实现部署目标模型的服务在对应的集群的容器中。Among them, since the target model is a newly accessed model, not a deployed model, there will be no corresponding image file in the image file warehouse. Therefore, based on the target model in the standard format and the type of the first computing power resource that depends on, A first image file corresponding to the first sub-service deployment strategy is generated. The generated first image file can be run on the first computing power resource, and the service that implements the deployment target model is in the container of the corresponding cluster.

而第二子服务部署策略对应部署的第一替代模型,是混合云平台中已经部署过的模型,此时可以先检测镜像文件仓库中是否存在第二子服务部署策略对应的第二镜像文件;若不存在,此时可以进一步基于标准格式的第一替代模型和依赖的第二算力资源的类型、生成第二子服务部署策略对应的第二镜像文件。生成的第二镜像文件可以在第二算力资源上运行,实现部署第一替代模型的服务在对应的集群的容器中。The first alternative model deployed by the second sub-service deployment strategy is a model that has already been deployed in the hybrid cloud platform. At this time, it can be detected whether there is a second image file corresponding to the second sub-service deployment strategy in the image file warehouse; If not, at this time, the second image file corresponding to the second sub-service deployment strategy may be further generated based on the first alternative model in the standard format and the type of the second computing power resource to be relied upon. The generated second image file can be run on the second computing resource, so that the service of the first alternative model can be deployed in the container of the corresponding cluster.

采用上述方式,可以高效、准确地获取到第二子服务部署策略对应的第二镜像文件。By adopting the above method, the second image file corresponding to the deployment strategy of the second sub-service can be obtained efficiently and accurately.

S215、分别向各第二集群部署对应第二数量的第一镜像文件,向各第三集群部署对应第三数量的第二镜像文件,以实现在混合云平台中部署总数量的所述目标模型的服务;执行步骤S224;S215. Deploy the first image files corresponding to the second number to each second cluster, and deploy the second image files corresponding to the third number to each third cluster, so as to implement the total number of target models deployed in the hybrid cloud platform service; execute step S224;

因为不同的第二集群对应的第二数量可能不同,不同的第三集群部署对应第三数量也可能不同。为了保证部署的准确性,要保证每个第二集群中部署的第一镜像文件的数量是当前第二集群对应的第二数量,每个第三集群中部署的第二镜像文件的数量是当前第三集群对应的第三数量。Because the second numbers corresponding to different second clusters may be different, the third numbers corresponding to different third cluster deployments may also be different. In order to ensure the accuracy of deployment, it is necessary to ensure that the number of first image files deployed in each second cluster is the second number corresponding to the current second cluster, and the number of second image files deployed in each third cluster is the current The third cluster corresponds to the third quantity.

也就是说,本实施例中,为了能够实现目标模型的服务的部署,可以在部署目标模型依赖的第一算力资源不足的情况下,而部署与目标模型具有相同功能的第一替代模型的资源充足的情况下,将部署目标模型的服务,转换为部署第一替代模型的服务,以确保部署的服务具备相同的功能和相同的服务数量和服务质量。That is to say, in this embodiment, in order to realize the deployment of the service of the target model, the deployment of the first alternative model that has the same function as the target model can be deployed when the first computing resource that the target model depends on is insufficient. In the case of sufficient resources, the services of the deployment target model are converted to the services of the first alternative model, so as to ensure that the deployed services have the same functions and the same service quantity and service quality.

采用该部署方式,能够充分地利用混合云平台中的算力资源进行服务部署,确保服务部署质量,提高服务部署效率。With this deployment method, the computing resources in the hybrid cloud platform can be fully utilized for service deployment, ensuring the quality of service deployment and improving service deployment efficiency.

S216、确定混合云平台中部署第一替代模型的第三子服务部署策略;执行步骤S217;S216. Determine the third sub-service deployment strategy for deploying the first alternative model in the hybrid cloud platform; execute step S217;

此时,该第三子服务部署策略中包括至少一个第四集群的标识以及各第四集群中部署的第四数量;各第四集群部署的第四数量之和小于剩余数量。同理,第四集群也为混合云平台中的任一集群。At this time, the third sub-service deployment strategy includes at least one fourth cluster identifier and the fourth quantity deployed in each fourth cluster; the sum of the fourth quantities deployed in each fourth cluster is less than the remaining quantity. Similarly, the fourth cluster is also any cluster in the hybrid cloud platform.

也就是说,此时可以充分利用混合云平台的资源,部署部分数量的第一替代模型的服务,代替部分数量的目标模型的服务。That is to say, resources of the hybrid cloud platform can be fully utilized at this time, and a part of the services of the first alternative model can be deployed to replace a part of the services of the target model.

同理,参考上述步骤209的记载,该第三子服务部署策略也可以空。即第三子服务部署策略中不包括第四集群的标识,各第四集群中部署的第四数量为空。同理,为了便于后续统计,也可以配置第三子服务部署策略中包括混合云平台中的每个集群,但是集群中部署的第四数量为0。Similarly, referring to the description in step 209 above, the third sub-service deployment strategy can also be empty. That is, the third sub-service deployment strategy does not include the identifier of the fourth cluster, and the fourth quantity deployed in each fourth cluster is empty. Similarly, for the convenience of subsequent statistics, it is also possible to configure the third sub-service deployment strategy to include each cluster in the hybrid cloud platform, but the fourth number deployed in the cluster is 0.

S217、基于各第四集群部署的第四数量,更新部署的剩余数量;执行步骤S218;S217. Based on the fourth quantity deployed in each fourth cluster, update the remaining quantity deployed; perform step S218;

具体地,可以取各第四集群部署的第四数量之和,作为第三子服务部署策略部署的数量;然后采用剩余数量,减去第三子服务部署策略部署的数量,作为更新后的剩余数量。Specifically, the sum of the fourth quantities deployed by each fourth cluster can be taken as the quantity deployed by the third sub-service deployment strategy; quantity.

S218、基于目标模型的功能信息和已部署的多个模型中各模型的功能信息,获取与目标模型具有相同功能的第二替代模型;执行步骤S219;S218. Based on the function information of the target model and the function information of each of the deployed models, obtain a second substitute model having the same function as the target model; perform step S219;

S219、基于混合云平台的各集群中各节点的各类型的算力资源的剩余量、第二替代模型的服务部署依赖的第三算力资源的类型、依赖的第三目标算力资源的量、以及更新的剩余数量,检测混合云平台中是否能够实施更新后的剩余数量的第二替代模型的服务的部署;若能,执行步骤S220;若不能,执行步骤S223;S219. The remaining amount of each type of computing power resources of each node in each cluster based on the hybrid cloud platform, the type of the third computing power resource that the service deployment of the second alternative model depends on, and the amount of the third target computing power resource that depends on , and the updated remaining quantity, detecting whether the hybrid cloud platform can implement the deployment of the service of the updated remaining quantity of the second alternative model; if yes, perform step S220; if not, perform step S223;

本实施例中,第三算力资源的类型、第三算力资源的类型与第一算力资源的类型不同。In this embodiment, the type of the third computing resource is different from the type of the first computing resource.

S220、确定混合云平台中部署第二替代模型的第四子服务部署策略;执行步骤S221;S220. Determine the fourth sub-service deployment strategy for deploying the second alternative model in the hybrid cloud platform; execute step S221;

本实施例中的第四子服务部署策略中包括至少一个第五集群的标识以及各第五集群中部署的第五数量;各第五集群对应的第五数量之和等于更新后的剩余数量。同理,第五集群也可以为混合云平台中的任一集群。The fourth sub-service deployment strategy in this embodiment includes an identifier of at least one fifth cluster and a fifth quantity deployed in each fifth cluster; the sum of the fifth quantities corresponding to each fifth cluster is equal to the updated remaining quantity. Similarly, the fifth cluster may also be any cluster in the hybrid cloud platform.

基于以上所述,可以得知,此时,总数量的目标模型的部署服务,可以拆分为采用第一子服务部署策略部署部分数量的目标模型的服务,采用第三子服务部署策略部署部分数量的第一替代模型的服务,采用第四子服务部署策略部署剩余数量的第二替代模型的服务。Based on the above, it can be known that at this time, the deployment services of the total number of target models can be split into services that use the first sub-service deployment strategy to deploy part of the number of target models, and use the third sub-service deployment strategy to deploy the part The number of services of the first alternative model, and the remaining number of services of the second alternative model are deployed using the fourth sub-service deployment strategy.

本实施例的各子服务部署策略可以看作是服务部署的预分配方案,用于指导后续的服务部署。Each sub-service deployment strategy in this embodiment can be regarded as a pre-allocation scheme for service deployment, which is used to guide subsequent service deployment.

S221、分别获取第一子服务部署策略对应的第一镜像文件、第三子服务部署策略对应的第三镜像文件、和第四子服务部署策略对应的第四镜像文件;执行步骤S222;S221. Obtain the first image file corresponding to the first sub-service deployment strategy, the third image file corresponding to the third sub-service deployment strategy, and the fourth image file corresponding to the fourth sub-service deployment strategy; perform step S222;

该步骤的具体实现,可以参考上述步骤S214,在此不再赘述。For the specific implementation of this step, reference may be made to the above step S214, which will not be repeated here.

S222、分别向各第二集群部署对应第二数量的第二镜像文件,向各第四集群部署对应第四数量的第三镜像文件,向各第五集群部署对应第五数量的第四镜像文件,以实现在混合云平台中部署总数量的目标模型的服务;执行步骤S224;S222. Deploy second image files corresponding to the second number to each second cluster, deploy third image files corresponding to the fourth number to each fourth cluster, and deploy fourth image files corresponding to the fifth number to each fifth cluster , to realize the service of deploying a total number of target models in the hybrid cloud platform; perform step S224;

也就是说,本实施例中,为了能够实现目标模型的服务的部署,可以在部署目标模型依赖的第一算力资源不足的情况下,而与目标模型具有相同功能的第一替代模型的还存在部分算力资源的情况下,将部署部分数量的目标模型的服务,转换为部署部分数量的第一替代模型的服务;进一步地,还可以将部署部分数量的目标模型的服务,转换为部署部分数量的与目标模型具有相同功能的第二替代模型的服务。该部署方式,可以充分利用混合云平台中各种类型的算力资源,进行服务部署,确保服务部署质量,提高服务部署效率。That is to say, in this embodiment, in order to realize the deployment of the service of the target model, the first alternative model with the same function as the target model may also In the case of partial computing resources, the service of deploying a part of the target model is converted into the service of deploying a part of the first alternative model; further, the service of deploying a part of the target model can also be converted into a deployment of A partial number of services for a second alternative model that has the same functionality as the target model. This deployment method can make full use of various types of computing resources in the hybrid cloud platform for service deployment, ensure service deployment quality, and improve service deployment efficiency.

S223、若确定已部署的多个模型中不存在与目标模型的具有相同功能的其他替代模型时,确定混合云平台中无法部署总数量的目标模型的服务;执行步骤S225;S223. If it is determined that there is no other alternative model having the same function as the target model in the deployed multiple models, it is determined that the total number of services of the target model cannot be deployed in the hybrid cloud platform; perform step S225;

需要说明的是,若确定已部署的多个模型中还存在与目标模型的具有相同功能的其他替代模型时,此时可以采用上述步骤S219-S222的方式继续进行部署。但是若部署总数量的目标模型的服务的最终预分配方案还未确定,而已部署的多个模型中又不存在相同功能的其他替代模型,此时,确定混合云平台中无法部署总数量的目标模型的服务。It should be noted that, if it is determined that there are other alternative models having the same function as the target model among the deployed models, the deployment can be continued in the manner of the above steps S219-S222. However, if the final pre-allocation plan for deploying the total number of target model services has not been determined, and there are no other alternative models with the same functions among the deployed multiple models, at this time, it is determined that the total number of targets cannot be deployed in the hybrid cloud platform Model service.

S224、向用户返回目标模型的服务部署完成,结束。S224, returning to the user that the service deployment of the target model is completed, and ends.

S225、向用户返回无法部署目标模型的服务,结束。S225. Return to the user the service that the target model cannot be deployed, and end.

本实施例的混合云平台的服务部署方法,能够基于混合云平台的各集群中各节点的各类型的算力资源的剩余资源量、目标模型的服务部署信息,将混合云平台看作为一个整体,在混合云平台中部署目标模型的服务,而不限制仅在混合云平台中的部分单个云平台上部署该目标模型的服务,能够充分利用混合云平台中的资源,有效地提高混合云平台中目标模型的服务的部署效率;The service deployment method of the hybrid cloud platform in this embodiment can regard the hybrid cloud platform as a whole based on the remaining resources of each type of computing resources of each node in each cluster of the hybrid cloud platform and the service deployment information of the target model , deploying the service of the target model in the hybrid cloud platform, without limiting the deployment of the service of the target model on some single cloud platforms in the hybrid cloud platform, can make full use of the resources in the hybrid cloud platform and effectively improve the efficiency of the hybrid cloud platform. Deployment efficiency of services in the target model;

进一步地,本实施例中,还可以在混合云平台中部署目标模型的服务时,若目标模型依赖的算力资源不足时,可以仅采用剩余的算力资源,部署部分数量的目标模型。而剩余数量的目标模型的服务的部署,可以通过模型转换,获取与目标模型具有相同功能的替代模型,并在混合云平台中具有替代模型的依赖算力资源时,将剩余数量的目标模型的服务的部署,转换为部署剩余数量的替代模型的服务。而且,在本实施例的场景中,一个目标模型的部署服务,可以依据混合云平台的剩余算力资源的情况,将部分部署后剩余数量的目标模型的部署服务转换为部署一个、两个或者两个以上的替代模型的服务,能够充分地利用混合云平台的各种类型的算力资源,有效地提高服务部署效率。Furthermore, in this embodiment, when deploying the service of the target model on the hybrid cloud platform, if the computing power resources on which the target model depends are insufficient, only a part of the target model can be deployed by using the remaining computing power resources. For the deployment of the service of the remaining number of target models, an alternative model with the same function as the target model can be obtained through model conversion, and when there are dependent computing resources of the alternative model in the hybrid cloud platform, the remaining number of target models Deployment of the service, converted to a service that deploys the remaining number of alternative models. Moreover, in the scenario of this embodiment, the deployment service of one target model can be converted into deploying one, two or The service of more than two alternative models can make full use of various types of computing resources of the hybrid cloud platform and effectively improve the efficiency of service deployment.

图3是根据本公开第二实施例的示意图;如图3所示,本实施例在上述图1或者图2所示实施例的技术方案的基础上,进一步更加详细地介绍本公开的技术方案,具体可以包括如下步骤:Fig. 3 is a schematic diagram according to the second embodiment of the present disclosure; as shown in Fig. 3, this embodiment further introduces the technical solution of the present disclosure in more detail on the basis of the technical solution of the above-mentioned embodiment shown in Fig. 1 or Fig. 2 , specifically may include the following steps:

S301、接收用户的集群创建请求;该集群创建请求中携带集群的创建平台标识、需配置的各类型的算力资源的信息;S301. Receive a cluster creation request from a user; the cluster creation request carries a cluster creation platform identifier and information of various types of computing resources to be configured;

本实施例的技术方案,主要用于提供基于混合云管理平台,实现集群的创建。该集群的创建请求可以用户通过混合云管理平台的API输入的,或者通过混合云管理平台的界面输入的。The technical solution of this embodiment is mainly used to provide a hybrid cloud-based management platform to realize cluster creation. The creation request of the cluster may be input by the user through the API of the hybrid cloud management platform, or through the interface of the hybrid cloud management platform.

具体地,本实施例的用户,与上述图1所示实施例的用户相同,可以为商家账户或者基于指定项目所创建的账户。Specifically, the user in this embodiment is the same as the user in the embodiment shown in FIG. 1 above, and may be a merchant account or an account created based on a specified item.

该集群创建请求,可以基于商家的规划或者项目的规划来生成并发起。例如,项目的规划,需要在某个指定的公有云或者私有云平台中创建具备一定算力资源的集群。The cluster creation request can be generated and initiated based on the business plan or project plan. For example, project planning requires the creation of clusters with certain computing power resources in a specified public cloud or private cloud platform.

S302、基于集群的创建平台标识和需配置的各类型的算力资源的信息,在对应的云平台中获取多个节点构建用户的集群;各节点能够提供至少一种类型的算力资源;S302. Based on the identification of the cluster creation platform and the information of various types of computing power resources to be configured, obtain multiple nodes in the corresponding cloud platform to build a user cluster; each node can provide at least one type of computing power resources;

本实施例中,一个节点可以对应一台设备,该设备能够提高至少一种类型的算力资源。例如,集群中有的节点具备多核CPU,具备一定大小的内存;而有的节点设置有指定型号的显卡等。In this embodiment, a node may correspond to a device, and the device can increase at least one type of computing resources. For example, some nodes in the cluster have multi-core CPUs and memory of a certain size; while some nodes are equipped with graphics cards of specified models.

例如,某个项目的集群创建请求,可以直接要求在私有云平台中创建一个满足如下条件的集群:5台8核CPU的设备、3台具备指定型号的显卡的设备、总内存可以达到第一预设阈值。基于上述条件,可以在对应的私有云平台中依次筛选各条件对应的节点,获取满足条件的节点,构建集群。例如,先筛选5台具有8核CPU的设备,然后检测这5台设备是否具备指定型号的显卡,若具备,再进一步5台设备的总内存是否达到第一预算阈值,若达到,确定5台具有8核CPU的设备便可以构成一个集群。否则,若检测这5台设备不具备指定型号的显卡,还需要另外选择3台具备该指定型号的显卡的设备。然后进一步检测5台具有8核CPU的设备和3台具备该指定型号的显卡的设备的总内存是否达到第一预设阈值,若达到,确定5台具有8核CPU的设备和3台具备该指定型号的显卡的设备便可以构成一个集群。若未达到,还需要再选择具有一定内存的设备,与5台具有8核CPU的设备和3台具备该指定型号的显卡的设备一起,构成集群。For example, a cluster creation request for a certain project can directly request to create a cluster in the private cloud platform that meets the following conditions: 5 devices with 8-core CPUs, 3 devices with specified graphics cards, and the total memory can reach the first preset threshold. Based on the above conditions, the nodes corresponding to each condition can be screened sequentially in the corresponding private cloud platform to obtain nodes that meet the conditions and build a cluster. For example, first screen 5 devices with 8-core CPUs, and then check whether the 5 devices have the specified graphics card. If so, check whether the total memory of the 5 devices reaches the first budget threshold. If so, determine the 5 devices. Devices with 8-core CPUs can form a cluster. Otherwise, if it is detected that these 5 devices do not have the graphics card of the specified model, another 3 devices with the graphics card of the specified model need to be selected. Then further check whether the total memory of 5 devices with 8-core CPUs and 3 devices with the specified graphics card reaches the first preset threshold, and if so, determine whether the total memory of 5 devices with 8-core CPUs and 3 devices with the Devices with specified graphics cards can form a cluster. If not, you need to select a device with a certain amount of memory to form a cluster with 5 devices with 8-core CPUs and 3 devices with the specified graphics card.

上述举例仅为一个示例,不作为本申请集群创建请求的限定。实际应用场景中,在不违背本申请的实现思想的前提下,可以基于任意的形式的集群创建请求,在相应平台创建相应的集群。The above example is only an example, and is not intended to limit the cluster creation request of this application. In an actual application scenario, on the premise of not violating the implementation idea of this application, a corresponding cluster can be created on a corresponding platform based on any form of cluster creation request.

本实施例的上述集群创建方式,可以基于混合云管理平台实现在混合云频台中的各云平台中进行集群创建,实现非常简单、方便。而且,与现有技术的人工手动分别在各云平台创建集群的方式相比,能够有效地提高集群创建效率。The above cluster creation method in this embodiment can be based on the hybrid cloud management platform to implement cluster creation in each cloud platform in the hybrid cloud platform, which is very simple and convenient to implement. Moreover, compared with the prior art of manually creating clusters on each cloud platform, the cluster creation efficiency can be effectively improved.

S303、从组件仓库中获取采集器组件;S303. Obtain the collector component from the component warehouse;

本实施例中,混合云管理平台中还配置有组件仓库,该组件仓库中配置有多个组件,例如可以包括采集器组件,还可以包括显卡服务组件等。In this embodiment, the hybrid cloud management platform is further configured with a component warehouse, and the component warehouse is configured with multiple components, for example, may include a collector component, and may also include a graphics card service component and the like.

S304、在集群的各节点中部署采集器组件,以用于采集各节点的各类型的算力资源的剩余资源量;S304. Deploy a collector component in each node of the cluster, so as to collect remaining resource amounts of various types of computing power resources of each node;

可选地,通过采用上述步骤S301和步骤S302可以实现在混合云平台中的任一云平台中实现集群的创建。Optionally, by adopting the above steps S301 and S302, the cluster can be created on any cloud platform in the hybrid cloud platform.

进一步地,为了能够更加有效地实现在集群中部署服务,可以采用上述步骤S303和步骤S304,在集群中的各节点中部署采集器组件,以实现对各节点的各类型的算力资源的剩余资源量的实时采集,并上报。可选地,各节点中的采集器组件,也可以采集各节点上各类型资源的使用情况,也上报给混合云管理平台,并存储在存储单元中。Furthermore, in order to more effectively implement the deployment of services in the cluster, the above steps S303 and S304 can be used to deploy collector components in each node in the cluster, so as to realize the remaining Real-time collection of resources and reporting. Optionally, the collector component in each node can also collect usage of various types of resources on each node, report it to the hybrid cloud management platform, and store it in the storage unit.

例如,具体实现时,步骤S302创建集群之后。混合云管理平台可以调用该集群的API,发起携带采集器组件标识的部署指令,该集群可以从混合云管理平台中拉取该采集器组件,并安装在各节点上,实现在集群中的各节点上部署该采集器组件。For example, in specific implementation, after the cluster is created in step S302. The hybrid cloud management platform can call the API of the cluster and initiate a deployment command carrying the collector component identifier. The cluster can pull the collector component from the hybrid cloud management platform and install it on each node to realize the The collector component is deployed on the node.

本实施例中,通过在创建的集群中各节点部署采集器组件,能够及时、准确地获取到各节点的各类型的算力资源的剩余量,方便后续服务的部署。In this embodiment, by deploying the collector component on each node in the created cluster, the remaining amount of each type of computing resources of each node can be obtained in a timely and accurate manner, which facilitates the deployment of subsequent services.

S305、从组件仓库中获取显卡服务组件;S305. Obtain the graphics card service component from the component warehouse;

S306、在集群的提供显卡资源的节点中部署显卡服务组件;S306. Deploy the graphics card service component in the nodes of the cluster that provide the graphics card resource;

步骤S305和步骤S306为一种补充方案,对于集群中具有显卡资源时,需要同时在具有显卡资源的设备上部署显卡服务组件,为该设备赋能,使得该设备能够提高显卡的服务。Steps S305 and S306 are a supplementary solution. When there are graphics card resources in the cluster, it is necessary to deploy the graphics card service component on the device with the graphics card resource at the same time to empower the device so that the device can improve the graphics card service.

该显卡服务组件可以为device plugin,使用时,该显卡服务组件可以将当前节点的显卡使用情况,上报给所在集群中的kubelet组件,以供集群在部署服务的时候,可以参考节点的显卡使用情况,进行更加准确地服务部署。The graphics card service component can be a device plugin. When used, the graphics card service component can report the graphics card usage of the current node to the kubelet component in the cluster, so that the cluster can refer to the graphics card usage of the node when deploying services. , for more accurate service deployment.

S307、接收用户的集群修改请求;S307. Receive a cluster modification request from a user;

本实施例的集群修改请求中可以携带要修改的集群的标识;该集群修改请求用于指示对要修改的集群进行删除、或者增加集群中节点、或者删除集群中的节点。The cluster modification request in this embodiment may carry the identifier of the cluster to be modified; the cluster modification request is used to instruct to delete the cluster to be modified, or to add nodes in the cluster, or to delete nodes in the cluster.

S308、基于用户的集群修改请求,对相应的集群进行更新。S308. Based on the user's cluster modification request, update the corresponding cluster.

例如,若集群修改请求指示删除集群,对应删除该集群即可。For example, if the cluster modification request indicates to delete the cluster, the corresponding cluster can be deleted.

若集群修改请求中指示为集群增加一个具备指定类型资源的节点,则根据集群修改请求,从该集群的标识所在的云平台中获取一台具备指定类型资源的机器,假如集群即可。另外,需要说明的是,对于新加入的节点,需要参考上述步骤S303和步骤S304,在新加入的节点中部署采集器组件。若新加入的节点具备显卡资源,需要采用步骤S305和步骤S306,在节点中部署显卡服务组件。If the cluster modification request indicates to add a node with a specified type of resource to the cluster, then according to the cluster modification request, obtain a machine with the specified type of resource from the cloud platform where the cluster ID is located, as long as the cluster is sufficient. In addition, it should be noted that, for a newly added node, it is necessary to refer to the above step S303 and step S304 to deploy the collector component in the newly added node. If the newly added node has graphics card resources, step S305 and step S306 need to be adopted to deploy the graphics card service component in the node.

若集群修改请求中指示为集群删除指定类型资源的节点,则根据集群修改请求,相应删除集群中对应的节点即可。If the cluster modification request indicates to delete a node of a specified type of resource for the cluster, the corresponding node in the cluster can be deleted according to the cluster modification request.

本实施例的上述技术方案,可以实现基于同一个混合云管理平台,在各云平台中创建集群,不用单独为每个云平台配置相应的集群创建平台,管理非常方便。The above technical solution of this embodiment can create clusters in each cloud platform based on the same hybrid cloud management platform, without configuring a corresponding cluster creation platform for each cloud platform separately, and the management is very convenient.

而且,本实施例中,还可以在集群的各节点中部署采集器组件,为集群赋能,实现集群中各节点的剩余资源监控的能力,为后续集群中服务部署提供了有效地支持。Moreover, in this embodiment, collector components can also be deployed in each node of the cluster to empower the cluster and realize the ability to monitor the remaining resources of each node in the cluster, providing effective support for subsequent service deployment in the cluster.

进一步地,本实施例中,还可以在集群中支持显卡的节点上部署显卡服务组件,为集群赋予实施显卡的能力,为后续需要显卡的服务的部署提供了有效地支持。Furthermore, in this embodiment, the graphics card service component can also be deployed on the nodes supporting the graphics card in the cluster, so as to endow the cluster with the ability to implement graphics cards, and provide effective support for the subsequent deployment of services that require graphics cards.

进一步地,本实施例中,还能够基于混合云管理平台,实现集群的修改,对集群的管理和维护非常方便。Furthermore, in this embodiment, the modification of the cluster can also be implemented based on the hybrid cloud management platform, which is very convenient for the management and maintenance of the cluster.

例如,图4是本公开提供的一种混合云管理平台的管理示意图。如图4所示,以混合云管理平台管理包括一个公有云平台和一个私有云平台的混合云平台为例,实际应用中,还可以包括更多的公有云平台。如图4所示,本实施例中,以混合云管理平台中包括调度器、计算单元、存储单元、模型转换单元、适配器、组件仓库以及镜像文件仓库等为例。实际应用中,也可以按照其他功能单元来划分,在此不做限定。For example, FIG. 4 is a management diagram of a hybrid cloud management platform provided by the present disclosure. As shown in FIG. 4 , taking a hybrid cloud platform managed by a hybrid cloud management platform including a public cloud platform and a private cloud platform as an example, more public cloud platforms may also be included in practical applications. As shown in FIG. 4 , in this embodiment, the hybrid cloud management platform includes a scheduler, a computing unit, a storage unit, a model conversion unit, an adapter, a component warehouse, and an image file warehouse as an example. In practical applications, it can also be divided according to other functional units, which is not limited here.

参考上述图3所示实施例的记载,混合云管理平台可以通过API接收外部的集群创建请求,然后基于集群创建请求,在公有云平台或者私有云平台中创建相应的集群。实际应用中,也可以采用传统方式或者其他方式,在混合云平台中的各云平台中创建集群,在此不做限定。进一步地,集群创建完后,可以参考上述图3所示实施例的记载,从组件仓库中获取采集器组件,部署在集群的各节点上。对于包括有显卡组件的节点,还需要从组件仓库中获取显卡服务组件,部署在相应的节点上。Referring to the description of the embodiment shown in FIG. 3 above, the hybrid cloud management platform can receive an external cluster creation request through an API, and then create a corresponding cluster on the public cloud platform or private cloud platform based on the cluster creation request. In practical applications, traditional methods or other methods may also be used to create clusters in each cloud platform of the hybrid cloud platform, which is not limited here. Further, after the cluster is created, the collector component can be obtained from the component warehouse and deployed on each node of the cluster by referring to the description of the above embodiment shown in FIG. 3 . For a node including a graphics card component, it is also necessary to obtain the graphics card service component from the component warehouse and deploy it on the corresponding node.

该混合云管理平台还可以通过API接收外部的服务部署请求,并存储在队列(Queue)中。然后由调度器从队列Queue中读取一个服务部署请求,如上述实施例的步骤S203。进一步地,调度器调度计算单元可以执行上述实施例的步骤S204-S207,并将生成的镜像文件存储在镜像文件仓库中。并由调度器调度适配器通过执行步骤S208,实现部署目标模型的服务。The hybrid cloud management platform can also receive external service deployment requests through the API and store them in a queue (Queue). Then the scheduler reads a service deployment request from the queue, as in step S203 of the above embodiment. Further, the scheduler schedules the computing unit to execute steps S204-S207 of the above embodiment, and store the generated image file in the image file repository. And the scheduler schedules the adapter to implement the service of the deployment target model by executing step S208.

而若计算单元在计算过程中,确定混合云平台的各集群中不能够实施总数量的目标模型的服务的部署,此时调度器调度计算单元执行步骤S209和S210;并调度模型转换单元,执行步骤S211进行模型的转化,并进一步由计算单元执行步骤S212-S213,获取各子服务部署策略对应的镜像文件,并存储在镜像文件仓库中。并由调度器调度适配器通过执行步骤S215,实现部署目标模型的服务。以此类推,还可以基于上述各单元,执行上述步骤S216-S222,直至最终向用户返回目标模型的服务部署完成,或者向用户返回无法部署目标模型的服务。And if the calculation unit determines that the service deployment of the total number of target models cannot be implemented in each cluster of the hybrid cloud platform during the calculation process, the scheduler schedules the calculation unit to perform steps S209 and S210; and schedules the model conversion unit to execute Step S211 converts the model, and further executes steps S212-S213 by the calculation unit to obtain the image files corresponding to the deployment strategies of each sub-service, and store them in the image file warehouse. And the scheduler schedules the adapter to implement the service of the deployment target model by executing step S215. By analogy, the above-mentioned steps S216-S222 can also be executed based on the above-mentioned units until the service deployment of the target model is finally returned to the user, or the service that cannot deploy the target model is returned to the user.

图5是根据本公开第四实施例的示意图;如图5所示,本实施例的混合云管理平台500,包括:FIG. 5 is a schematic diagram according to a fourth embodiment of the present disclosure; as shown in FIG. 5, the hybrid cloud management platform 500 of this embodiment includes:

服务部署获取模块501,用于获取用户的服务部署请求;所述服务部署请求中携带待部署的目标模型的功能信息、以及所述目标模型的服务部署信息;The service deployment acquiring module 501 is configured to acquire a user's service deployment request; the service deployment request carries the function information of the target model to be deployed and the service deployment information of the target model;

剩余资源获取模块502,用于剩余资源获取模块,用于获取混合云平台中预先为所述用户创建的各集群中各节点上的各类型的算力资源的剩余量;The remaining resource acquisition module 502 is used for the remaining resource acquisition module, and is used for acquiring the remaining amount of various types of computing power resources on each node in each cluster pre-created for the user in the hybrid cloud platform;

服务部署模块503,用于基于所述混合云平台的各集群中各节点的各类型的算力资源的剩余量、以及所述目标模型的服务部署信息,在所述混合云平台中部署所述目标模型的服务。The service deployment module 503 is configured to deploy said The serving of the target model.

本实施例的混合云管理平台500,通过采用上述模块实现基于混合云平台的服务部署的实现原理以及技术效果,与上述相关方法实施例的实现相同,详细可以参考上述相关方法实施例的记载,在此不再赘述。The hybrid cloud management platform 500 of this embodiment uses the above-mentioned modules to realize the implementation principle and technical effect of service deployment based on the hybrid cloud platform, which is the same as the implementation of the above-mentioned related method embodiments. For details, please refer to the records of the above-mentioned related method embodiments. I won't repeat them here.

图6是根据本公开第五实施例的示意图;如图6所示,本实施例的混合云管理平台600,包括上述图5所示的同名同功能模块:服务部署获取模块601、剩余资源获取模块602以及服务部署模块603。Fig. 6 is a schematic diagram according to the fifth embodiment of the present disclosure; as shown in Fig. 6, the hybrid cloud management platform 600 of this embodiment includes modules with the same name and the same function as shown in Fig. 5 above: service deployment acquisition module 601, remaining resource acquisition Module 602 and service deployment module 603.

本实施例中,服务部署获取模块601,用于从队列中获取所述用户的服务部署请求。In this embodiment, the service deployment obtaining module 601 is configured to obtain the user's service deployment request from the queue.

如图6所示,本实施例的混合云管理平台600中,还包括:As shown in FIG. 6, the hybrid cloud management platform 600 of this embodiment also includes:

接收模块604,用于接收外部的所述用户的服务部署请求;A receiving module 604, configured to receive an external service deployment request from the user;

存储模块605,用于将所述用户的服务部署请求存入所述队列中。The storage module 605 is configured to store the user's service deployment request into the queue.

进一步可选地,在本公开的一个实施例中,剩余资源获取模块502,用于:Further optionally, in an embodiment of the present disclosure, the remaining resource acquisition module 502 is configured to:

从存储单元中获取各所述集群中各节点上报的各类型的算力资源的剩余量。The remaining amount of each type of computing resources reported by each node in each cluster is obtained from the storage unit.

进一步可选地,在本公开的一个实施例中,接收模块604,还用于接收各所述集群中各节点上报的各类型的算力资源的剩余量;Further optionally, in one embodiment of the present disclosure, the receiving module 604 is also configured to receive the remaining amount of each type of computing resources reported by each node in each cluster;

存储模块605,用于将各所述集群中各节点上报的各类型的算力资源的剩余资源量,存储在所述存储单元中。The storage module 605 is configured to store, in the storage unit, the remaining resource amounts of each type of computing resources reported by each node in each cluster.

进一步可选地,在本公开的一个实施例中,服务部署模块603,包括:Further optionally, in an embodiment of the present disclosure, the service deployment module 603 includes:

计算单元6031,用于基于所述混合云平台的各集群中各节点的各类型的算力资源的剩余量、所述目标模型依赖的第一算力资源的类型、依赖第一算力资源的量以及部署所述目标模型的服务的总数量,检测所述混合云平台中是否能够实施所述总数量的所述目标模型的服务的部署;The calculation unit 6031 is configured to be based on the remaining amount of each type of computing power resource of each node in each cluster of the hybrid cloud platform, the type of the first computing power resource on which the target model depends, and the number of dependent first computing power resources. amount and the total number of services deploying the target model, detecting whether the deployment of the total number of services of the target model can be implemented in the hybrid cloud platform;

确定单元6032,用于若能,确定所述目标模型的目标服务部署策略;所述目标服务部署策略中包括至少一个第一集群的标识以及各第一集群中部署的第一数量;各所述第一集群部署的第一数量之和等于所述总数量部署所述目标模型的服务的第一集群以及在第一集群中部署的第一数量;The determining unit 6032 is configured to, if possible, determine a target service deployment strategy of the target model; the target service deployment strategy includes an identifier of at least one first cluster and a first number of deployments in each first cluster; each of the The sum of the first number of deployments of the first cluster is equal to the total number of first clusters on which the service of the target model is deployed and the first number of deployments in the first cluster;

生成单元6033,用于生成所述目标服务部署策略对应的目标镜像文件;A generating unit 6033, configured to generate a target image file corresponding to the target service deployment strategy;

部署单元6034,用于向各所述第一集群部署对应的第一数量的目标镜像文件,以实现在所述混合云平台中部署所述总数量的所述目标模型的服务。The deploying unit 6034 is configured to deploy a corresponding first quantity of target image files to each of the first clusters, so as to implement the service of deploying the total quantity of the target models in the hybrid cloud platform.

进一步可选地,在本公开的一个实施例中,还包括:Further optionally, in an embodiment of the present disclosure, it also includes:

格式转化模块606,用于对预先接入的所述目标模型进行格式转化,得到标准格式的目标模型;A format conversion module 606, configured to perform format conversion on the pre-accessed target model to obtain a target model in a standard format;

生成单元6033,用于:generating unit 6033 for:

基于所述标准格式的目标模型和依赖的所述第一算力资源的类型,生成所述目标服务部署策略对应的所述目标镜像文件;Generate the target image file corresponding to the target service deployment strategy based on the target model in the standard format and the type of the dependent first computing resource;

并将生成的所述目标镜像文件,存储在镜像文件仓库中。And store the generated target image file in the image file repository.

进一步可选地,在本公开的一个实施例中,部署单元6034,用于:Further optionally, in one embodiment of the present disclosure, the deployment unit 6034 is configured to:

通过调用各所述第一集群的应用程序接口,向所述第一集群发送携带对应的第一数量和目标镜像文件的标识的服务部署命令,以供对应的所述第一集群,从混合云管理平台拉取对应的目标镜像文件,并部署第一数量份,以实现在所述混合云平台中部署所述总数量的所述目标模型的服务。By invoking the application program interface of each of the first clusters, a service deployment command carrying the corresponding first quantity and the identification of the target image file is sent to the first cluster, so that the corresponding first cluster can receive from the hybrid cloud The management platform pulls the corresponding target image file, and deploys the first number of copies, so as to implement the service of deploying the total number of the target models in the hybrid cloud platform.

进一步可选地,如图6所示,在本公开的一个实施例中,服务部署模块603,还包括转换部署单元6035。Further optionally, as shown in FIG. 6 , in an embodiment of the present disclosure, the service deployment module 603 further includes a conversion deployment unit 6035 .

确定单元6032,还用于:The determination unit 6032 is also used for:

若检测并确定所述混合云平台中不能实施所述总数量的所述目标模型的服务的部署,确定所述混合云平台中能够部署所述目标模型的第一子服务部署策略;所述第一子服务部署策略中包括至少一个第二集群的标识以及各第二集群中部署的第二数量;各所述第二集群部署的第二数量之和小于所述总数量;If it is detected and determined that the deployment of the total number of services of the target model cannot be implemented in the hybrid cloud platform, determine a first sub-service deployment strategy in the hybrid cloud platform that can deploy the target model; the second A sub-service deployment policy includes an identifier of at least one second cluster and a second quantity deployed in each second cluster; the sum of the second quantities deployed in each second cluster is less than the total quantity;

基于所述总数量和各所述第二集群对应的第二数量,确定部署的剩余数量。Based on the total quantity and the second quantity corresponding to each of the second clusters, determine a remaining quantity for deployment.

转换部署单元6035,用于:Conversion Deployment Unit 6035 for:

基于所述混合云平台的各集群中各节点的各类型的算力资源的剩余量、以及已部署的多个模型中各模型的功能信息以及各模型的服务部署信息,在所述混合云平台中部署所述剩余数量的、与所述目标模型具有相同功能的已部署模型的服务。Based on the remaining amount of each type of computing resources of each node in each cluster of the hybrid cloud platform, and the function information of each model in the deployed multiple models and the service deployment information of each model, the hybrid cloud platform Deploy the services of the remaining number of deployed models that have the same functionality as the target model in .

进一步可选地,在本公开的一个实施例中,转换部署单元6035,用于:Further optionally, in an embodiment of the present disclosure, the conversion and deployment unit 6035 is configured to:

基于所述混合云平台的各集群中各节点的各类型的算力资源的剩余量、所述第一替代模型的服务部署依赖的第二算力资源的类型、依赖的第二目标算力资源的量、以及部署的剩余数量,检测所述混合云平台中是否能够实施所述剩余数量的所述第一替代模型的服务的部署;所述第二算力资源的类型与所述第一算力资源的类型不同;Based on the remaining amount of each type of computing power resource of each node in each cluster of the hybrid cloud platform, the type of the second computing power resource that the service deployment of the first alternative model depends on, and the second target computing power resource that depends on amount, and the remaining number of deployments, detecting whether the remaining number of services of the first alternative model can be deployed in the hybrid cloud platform; the type of the second computing power resource is the same as the first computing power resource different types of human resources;

若能,确定所述混合云平台中部署所述第一替代模型的第二子服务部署策略;所述第二子服务部署策略中包括至少一个第三集群的标识以及各第三集群中部署的第三数量;各所述第三集群部署的第三数量之和等于所述剩余数量;If yes, determine the second sub-service deployment strategy for deploying the first alternative model in the hybrid cloud platform; the second sub-service deployment strategy includes at least one third cluster identifier and each third cluster deployed A third quantity; the sum of the third quantities deployed by each of the third clusters is equal to the remaining quantity;

分别获取第一子服务部署策略对应的第一镜像文件和第二子服务部署策略对应的第二镜像文件;Respectively obtain a first image file corresponding to the first sub-service deployment strategy and a second image file corresponding to the second sub-service deployment strategy;

分别向各所述第二集群部署对应第二数量的所述第一镜像文件,向各所述第三集群部署对应第三数量的第二镜像文件,以实现在所述混合云平台中部署所述总数量的所述目标模型的服务。Deploying the first image files corresponding to the second number to each of the second clusters, and deploying the second image files corresponding to the third number to each of the third clusters, so as to implement the deployment of all the images on the hybrid cloud platform. The total number of services for the target model.

进一步可选地,在本公开的一个实施例中,转换部署单元6035,用于:Further optionally, in an embodiment of the present disclosure, the conversion and deployment unit 6035 is configured to:

检测并确定镜像文件仓库中存在所述第二子服务部署策略对应的第二镜像文件。Detecting and determining that the second image file corresponding to the second sub-service deployment strategy exists in the image file repository.

进一步可选地,在本公开的一个实施例中,转换部署单元6035,用于:Further optionally, in an embodiment of the present disclosure, the conversion and deployment unit 6035 is configured to:

若检测并确定镜像文件仓库中不存在所述第二子服务部署策略对应的第二镜像文件,基于标准格式的第一替代模型和依赖的第二算力资源的类型、生成所述第二子服务部署策略对应的第二镜像文件。If it is detected and determined that the second image file corresponding to the second sub-service deployment strategy does not exist in the image file warehouse, the second sub-service is generated based on the first substitution model in the standard format and the type of the second computing resource that depends on it. The second image file corresponding to the service deployment policy.

进一步可选地,在本公开的一个实施例中,转换部署单元6035,还用于:Further optionally, in one embodiment of the present disclosure, the conversion and deployment unit 6035 is also used to:

若检测并确定所述混合云平台中不能够实施所述剩余数量的所述第一替代模型的服务的部署,确定所述混合云平台中部署所述第一替代模型的第三子服务部署策略;所述第三子服务部署策略中包括至少一个第四集群的标识以及各第四集群中部署的第四数量;各所述第四集群部署的第四数量之和小于所述剩余数量;If it is detected and determined that the deployment of the remaining number of services of the first alternative model cannot be implemented in the hybrid cloud platform, determine a third sub-service deployment strategy for deploying the first alternative model in the hybrid cloud platform ; The third sub-service deployment strategy includes an identifier of at least one fourth cluster and a fourth quantity deployed in each fourth cluster; the sum of the fourth quantities deployed in each fourth cluster is less than the remaining quantity;

基于各所述第四集群部署的第四数量,更新部署的所述剩余数量;updating said remaining number of deployments based on a fourth number of deployments of each of said fourth clusters;

基于目标模型的功能信息和已部署的多个模型中各模型的功能信息,获取与所述目标模型具有相同功能的第二替代模型;Acquiring a second substitute model having the same function as the target model based on the function information of the target model and the function information of each of the deployed models;

基于所述混合云平台的各集群中各节点的各类型的算力资源的剩余量、所述第二替代模型的服务部署依赖的第三算力资源的类型、依赖的第三目标算力资源的量、以及更新的剩余数量,检测所述混合云平台中是否能够实施更新后的所述剩余数量的所述第二替代模型的服务的部署;所述第三算力资源的类型、所述第二算力资源的类型与所述第一算力资源的类型不同;Based on the remaining amount of each type of computing power resource of each node in each cluster of the hybrid cloud platform, the type of the third computing power resource that the service deployment of the second alternative model depends on, and the third target computing power resource that depends on and the updated remaining quantity, detecting whether the hybrid cloud platform can implement the deployment of the updated remaining quantity of services of the second alternative model; the type of the third computing resource, the The type of the second computing power resource is different from the type of the first computing power resource;

若能,确定所述混合云平台中部署所述第二替代模型的第四子服务部署策略;所述第四子服务部署策略中包括至少一个第五集群的标识以及各第五集群中部署的第五数量;各所述第五集群对应的第五数量之和等于更新后的所述剩余数量;If yes, determine the fourth sub-service deployment strategy for deploying the second alternative model in the hybrid cloud platform; the fourth sub-service deployment strategy includes at least one fifth cluster identifier and each fifth cluster deployment strategy The fifth quantity; the sum of the fifth quantities corresponding to each of the fifth clusters is equal to the updated remaining quantity;

分别获取第一子服务部署策略对应的第一镜像文件、第三子服务部署策略对应的第三镜像文件、和第四子服务部署策略对应的第四镜像文件;Respectively obtain the first image file corresponding to the first sub-service deployment strategy, the third image file corresponding to the third sub-service deployment strategy, and the fourth image file corresponding to the fourth sub-service deployment strategy;

分别向各所述第二集群部署对应第二数量的所述第二镜像文件,向各所述第四集群部署对应第四数量的第三镜像文件,向各所述第五集群部署对应第五量的第四镜像文件,以实现在所述混合云平台中部署所述总数量的所述目标模型的服务。Deploy the second image files corresponding to the second quantity to each of the second clusters, deploy the third image files corresponding to the fourth quantity to each of the fourth clusters, and deploy the fifth image files corresponding to the fifth to each of the fifth clusters. A quantity of fourth image files, so as to implement the deployment of the total quantity of services of the target model in the hybrid cloud platform.

进一步可选地,如图6所示,在本公开的一个实施例中,在本公开的一个实施例中,混合云管理平台600还包括:Further optionally, as shown in FIG. 6, in an embodiment of the present disclosure, in an embodiment of the present disclosure, the hybrid cloud management platform 600 further includes:

确定模块607,用于若检测并确定所述混合云平台中不能够实施更新后的所述剩余数量的所述第二替代模型的服务的部署时,且确定已部署的多个模型中不存在与所述目标模型的具有相同功能的其他替代模型时,确定所述混合云平台中无法部署所述总数量的所述目标模型的服务。The determination module 607 is configured to detect and determine that the hybrid cloud platform cannot implement the deployment of the updated remaining number of services of the second alternative model, and it is determined that there is no such thing among the deployed multiple models When using other alternative models of the target model that have the same function, it is determined that the total number of services of the target model cannot be deployed on the hybrid cloud platform.

进一步可选地,如图6所示,在本公开的一个实施例中,混合云管理平台600还包括:Further optionally, as shown in FIG. 6, in an embodiment of the present disclosure, the hybrid cloud management platform 600 further includes:

集群请求接收模块608,用于接收所述用户的集群创建请求;所述集群创建请求中携带集群的创建平台标识、需配置的各类型的算力资源的信息;The cluster request receiving module 608 is configured to receive the user's cluster creation request; the cluster creation request carries the cluster creation platform identifier and information of various types of computing resources to be configured;

集群创建模块609,用于基于所述集群的创建平台标识和需配置的各类型的算力资源的信息,在对应的云平台中获取多个节点构建所述用户的集群;各所述节点能够提供至少一种类型的算力资源。The cluster creation module 609 is configured to acquire multiple nodes in the corresponding cloud platform to build the user's cluster based on the creation platform identifier of the cluster and the information of various types of computing resources to be configured; each of the nodes can Provide at least one type of computing resources.

进一步可选地,如图6所示,在本公开的一个实施例中,混合云管理平台600还包括:Further optionally, as shown in FIG. 6, in an embodiment of the present disclosure, the hybrid cloud management platform 600 further includes:

组件获取模块610,用于从组件仓库中获取采集器组件;A component acquisition module 610, configured to acquire the collector component from the component warehouse;

组件部署模块611,用于在所述集群的各节点中部署采集器组件,以用于采集各节点的各类型的算力资源的剩余资源量。The component deployment module 611 is configured to deploy collector components in each node of the cluster, so as to collect remaining resource amounts of various types of computing power resources of each node.

进一步可选地,在本公开的一个实施例中,组件获取模块610,还用于从组件仓库中获取显卡服务组件;Further optionally, in an embodiment of the present disclosure, the component acquisition module 610 is also configured to acquire the graphics card service component from the component warehouse;

组件部署模块611,还用于在所述集群的提供显卡资源的节点中部署所述显卡服务组件。The component deployment module 611 is further configured to deploy the graphics card service component in the nodes of the cluster that provide graphics card resources.

进一步可选地,如图6所示,在本公开的一个实施例中,混合云管理平台600还包括:Further optionally, as shown in FIG. 6, in an embodiment of the present disclosure, the hybrid cloud management platform 600 further includes:

集群请求接收模块608,还用于接收所述用户的集群修改请求;The cluster request receiving module 608 is also configured to receive the user's cluster modification request;

集群更新模块612,用于基于所述用户的集群修改请求,对相应的集群进行更新。The cluster update module 612 is configured to update the corresponding cluster based on the user's cluster modification request.

本实施例的混合云管理平台600,通过采用上述模块实现基于混合云平台的服务部署的实现原理以及技术效果,与上述相关方法实施例的实现相同,详细可以参考上述相关方法实施例的记载,在此不再赘述。The hybrid cloud management platform 600 of this embodiment uses the above-mentioned modules to realize the implementation principle and technical effect of service deployment based on the hybrid cloud platform, which is the same as the implementation of the above-mentioned related method embodiments. For details, please refer to the records of the above-mentioned related method embodiments. I won't repeat them here.

本公开的技术方案中,所涉及的用户个人信息的获取,存储和应用等,均符合相关法律法规的规定,且不违背公序良俗。In the technical solution of the present disclosure, the acquisition, storage and application of the user's personal information involved are in compliance with relevant laws and regulations, and do not violate public order and good customs.

根据本公开的实施例,本公开还提供了一种电子设备、一种可读存储介质和一种计算机程序产品。According to the embodiments of the present disclosure, the present disclosure also provides an electronic device, a readable storage medium, and a computer program product.

图7示出了可以用来实施本公开的实施例的示例电子设备700的示意性框图。电子设备旨在表示各种形式的数字计算机,诸如,膝上型计算机、台式计算机、工作台、个人数字助理、服务器、刀片式服务器、大型计算机、和其它适合的计算机。电子设备还可以表示各种形式的移动装置,诸如,个人数字处理、蜂窝电话、智能电话、可穿戴设备和其它类似的计算装置。本文所示的部件、它们的连接和关系、以及它们的功能仅仅作为示例,并且不意在限制本文中描述的和/或者要求的本公开的实现。FIG. 7 shows a schematic block diagram of an example electronic device 700 that may be used to implement embodiments of the present disclosure. Electronic device is intended to represent various forms of digital computers, such as laptops, desktops, workstations, personal digital assistants, servers, blade servers, mainframes, and other suitable computers. Electronic devices may also represent various forms of mobile devices, such as personal digital processing, cellular telephones, smart phones, wearable devices, and other similar computing devices. The components shown herein, their connections and relationships, and their functions, are by way of example only, and are not intended to limit implementations of the disclosure described and/or claimed herein.

如图7所示,设备700包括计算单元701,其可以根据存储在只读存储器(ROM)702中的计算机程序或者从存储单元708加载到随机访问存储器(RAM)703中的计算机程序,来执行各种适当的动作和处理。在RAM 703中,还可存储设备700操作所需的各种程序和数据。计算单元701、ROM 702以及RAM 703通过总线704彼此相连。输入/输出(I/O)接口705也连接至总线704。As shown in FIG. 7, the device 700 includes a computing unit 701 that can execute according to a computer program stored in a read-only memory (ROM) 702 or loaded from a storage unit 708 into a random-access memory (RAM) 703. Various appropriate actions and treatments. In the RAM 703, various programs and data necessary for the operation of the device 700 can also be stored. The computing unit 701 , ROM 702 , and RAM 703 are connected to each other through a bus 704 . An input/output (I/O) interface 705 is also connected to the bus 704 .

设备700中的多个部件连接至I/O接口705,包括:输入单元706,例如键盘、鼠标等;输出单元707,例如各种类型的显示器、扬声器等;存储单元708,例如磁盘、光盘等;以及通信单元709,例如网卡、调制解调器、无线通信收发机等。通信单元709允许设备700通过诸如因特网的计算机网络和/或各种电信网络与其他设备交换信息/数据。Multiple components in the device 700 are connected to the I/O interface 705, including: an input unit 706, such as a keyboard, a mouse, etc.; an output unit 707, such as various types of displays, speakers, etc.; a storage unit 708, such as a magnetic disk, an optical disk, etc. ; and a communication unit 709, such as a network card, a modem, a wireless communication transceiver, and the like. The communication unit 709 allows the device 700 to exchange information/data with other devices over a computer network such as the Internet and/or various telecommunication networks.

计算单元701可以是各种具有处理和计算能力的通用和/或专用处理组件。计算单元701的一些示例包括但不限于中央处理单元(CPU)、图形处理单元(GPU)、各种专用的人工智能(AI)计算芯片、各种运行机器学习模型算法的计算单元、数字信号处理器(DSP)、以及任何适当的处理器、控制器、微控制器等。计算单元701执行上文所描述的各个方法和处理,例如本公开的上述方法。例如,在一些实施例中,本公开的上述方法可被实现为计算机软件程序,其被有形地包含于机器可读介质,例如存储单元708。在一些实施例中,计算机程序的部分或者全部可以经由ROM 702和/或通信单元709而被载入和/或安装到设备700上。当计算机程序加载到RAM 703并由计算单元701执行时,可以执行上文描述的本公开的上述方法的一个或多个步骤。备选地,在其他实施例中,计算单元701可以通过其他任何适当的方式(例如,借助于固件)而被配置为执行本公开的上述方法。The computing unit 701 may be various general-purpose and/or special-purpose processing components having processing and computing capabilities. Some examples of computing units 701 include, but are not limited to, central processing units (CPUs), graphics processing units (GPUs), various dedicated artificial intelligence (AI) computing chips, various computing units that run machine learning model algorithms, digital signal processing processor (DSP), and any suitable processor, controller, microcontroller, etc. The computing unit 701 executes the various methods and processes described above, such as the above-mentioned methods of the present disclosure. For example, in some embodiments, the above-described methods of the present disclosure may be implemented as a computer software program tangibly embodied on a machine-readable medium, such as the storage unit 708 . In some embodiments, part or all of the computer program may be loaded and/or installed on the device 700 via the ROM 702 and/or the communication unit 709 . When the computer program is loaded into the RAM 703 and executed by the computing unit 701, one or more steps of the above-mentioned method of the present disclosure described above may be performed. Alternatively, in other embodiments, the computing unit 701 may be configured in any other appropriate way (for example, by means of firmware) to execute the above-mentioned method of the present disclosure.

本文中以上描述的系统和技术的各种实施方式可以在数字电子电路系统、集成电路系统、场可编程门阵列(FPGA)、专用集成电路(ASIC)、专用标准产品(ASSP)、芯片上系统的系统(SOC)、复杂可编程逻辑设备(CPLD)、计算机硬件、固件、软件、和/或它们的组合中实现。这些各种实施方式可以包括:实施在一个或者多个计算机程序中,该一个或者多个计算机程序可在包括至少一个可编程处理器的可编程系统上执行和/或解释,该可编程处理器可以是专用或者通用可编程处理器,可以从存储系统、至少一个输入装置、和至少一个输出装置接收数据和指令,并且将数据和指令传输至该存储系统、该至少一个输入装置、和该至少一个输出装置。Various implementations of the systems and techniques described above herein can be implemented in digital electronic circuit systems, integrated circuit systems, field programmable gate arrays (FPGAs), application specific integrated circuits (ASICs), application specific standard products (ASSPs), systems on chips Implemented in a system of systems (SOC), complex programmable logic device (CPLD), computer hardware, firmware, software, and/or combinations thereof. These various embodiments may include being implemented in one or more computer programs executable and/or interpreted on a programmable system including at least one programmable processor, the programmable processor Can be special-purpose or general-purpose programmable processor, can receive data and instruction from storage system, at least one input device, and at least one output device, and transmit data and instruction to this storage system, this at least one input device, and this at least one output device an output device.

用于实施本公开的方法的程序代码可以采用一个或多个编程语言的任何组合来编写。这些程序代码可以提供给通用计算机、专用计算机或其他可编程数据处理装置的处理器或控制器,使得程序代码当由处理器或控制器执行时使流程图和/或框图中所规定的功能/操作被实施。程序代码可以完全在机器上执行、部分地在机器上执行,作为独立软件包部分地在机器上执行且部分地在远程机器上执行或完全在远程机器或服务器上执行。Program codes for implementing the methods of the present disclosure may be written in any combination of one or more programming languages. These program codes may be provided to a processor or controller of a general-purpose computer, a special purpose computer, or other programmable data processing devices, so that the program codes, when executed by the processor or controller, make the functions/functions specified in the flow diagrams and/or block diagrams Action is implemented. The program code may execute entirely on the machine, partly on the machine, as a stand-alone software package partly on the machine and partly on a remote machine or entirely on the remote machine or server.

在本公开的上下文中,机器可读介质可以是有形的介质,其可以包含或存储以供指令执行系统、装置或设备使用或与指令执行系统、装置或设备结合地使用的程序。机器可读介质可以是机器可读信号介质或机器可读储存介质。机器可读介质可以包括但不限于电子的、磁性的、光学的、电磁的、红外的、或半导体系统、装置或设备,或者上述内容的任何合适组合。机器可读存储介质的更具体示例会包括基于一个或多个线的电气连接、便携式计算机盘、硬盘、随机存取存储器(RAM)、只读存储器(ROM)、可擦除可编程只读存储器(EPROM或快闪存储器)、光纤、便捷式紧凑盘只读存储器(CD-ROM)、光学储存设备、磁储存设备、或上述内容的任何合适组合。In the context of the present disclosure, a machine-readable medium may be a tangible medium that may contain or store a program for use by or in conjunction with an instruction execution system, apparatus, or device. A machine-readable medium may be a machine-readable signal medium or a machine-readable storage medium. A machine-readable medium may include, but is not limited to, electronic, magnetic, optical, electromagnetic, infrared, or semiconductor systems, apparatus, or devices, or any suitable combination of the foregoing. More specific examples of machine-readable storage media would include one or more wire-based electrical connections, portable computer discs, hard drives, random access memory (RAM), read only memory (ROM), erasable programmable read only memory (EPROM or flash memory), optical fiber, compact disk read only memory (CD-ROM), optical storage, magnetic storage, or any suitable combination of the foregoing.

为了提供与用户的交互,可以在计算机上实施此处描述的系统和技术,该计算机具有:用于向用户显示信息的显示装置(例如,CRT(阴极射线管)或者LCD(液晶显示器)监视器);以及键盘和指向装置(例如,鼠标或者轨迹球),用户可以通过该键盘和该指向装置来将输入提供给计算机。其它种类的装置还可以用于提供与用户的交互;例如,提供给用户的反馈可以是任何形式的传感反馈(例如,视觉反馈、听觉反馈、或者触觉反馈);并且可以用任何形式(包括声输入、语音输入或者、触觉输入)来接收来自用户的输入。To provide for interaction with the user, the systems and techniques described herein can be implemented on a computer having a display device (e.g., a CRT (cathode ray tube) or LCD (liquid crystal display) monitor) for displaying information to the user. ); and a keyboard and pointing device (eg, a mouse or a trackball) through which a user can provide input to the computer. Other kinds of devices can also be used to provide interaction with the user; for example, the feedback provided to the user can be any form of sensory feedback (e.g., visual feedback, auditory feedback, or tactile feedback); and can be in any form (including Acoustic input, speech input or, tactile input) to receive input from the user.

可以将此处描述的系统和技术实施在包括后台部件的计算系统(例如,作为数据服务器)、或者包括中间件部件的计算系统(例如,应用服务器)、或者包括前端部件的计算系统(例如,具有图形用户界面或者网络浏览器的用户计算机,用户可以通过该图形用户界面或者该网络浏览器来与此处描述的系统和技术的实施方式交互)、或者包括这种后台部件、中间件部件、或者前端部件的任何组合的计算系统中。可以通过任何形式或者介质的数字数据通信(例如,通信网络)来将系统的部件相互连接。通信网络的示例包括:局域网(LAN)、广域网(WAN)和互联网。The systems and techniques described herein can be implemented in a computing system that includes back-end components (e.g., as a data server), or a computing system that includes middleware components (e.g., an application server), or a computing system that includes front-end components (e.g., as a a user computer having a graphical user interface or web browser through which a user can interact with embodiments of the systems and techniques described herein), or including such backend components, middleware components, Or any combination of front-end components in a computing system. The components of the system can be interconnected by any form or medium of digital data communication, eg, a communication network. Examples of communication networks include: Local Area Network (LAN), Wide Area Network (WAN) and the Internet.

计算机系统可以包括客户端和服务器。客户端和服务器一般远离彼此并且通常通过通信网络进行交互。通过在相应的计算机上运行并且彼此具有客户端-服务器关系的计算机程序来产生客户端和服务器的关系。服务器可以是云服务器,也可以为分布式系统的服务器,或者是结合了区块链的服务器。A computer system may include clients and servers. Clients and servers are generally remote from each other and typically interact through a communication network. The relationship of client and server arises by computer programs running on the respective computers and having a client-server relationship to each other. The server can be a cloud server, a server of a distributed system, or a server combined with a blockchain.

应该理解,可以使用上面所示的各种形式的流程,重新排序、增加或删除步骤。例如,本发公开中记载的各步骤可以并行地执行也可以顺序地执行也可以不同的次序执行,只要能够实现本公开公开的技术方案所期望的结果,本文在此不进行限制。It should be understood that steps may be reordered, added or deleted using the various forms of flow shown above. For example, each step described in the present disclosure may be executed in parallel, sequentially, or in a different order, as long as the desired result of the technical solution disclosed in the present disclosure can be achieved, no limitation is imposed herein.

上述具体实施方式,并不构成对本公开保护范围的限制。本领域技术人员应该明白的是,根据设计要求和其他因素,可以进行各种修改、组合、子组合和替代。任何在本公开的精神和原则之内所作的修改、等同替换和改进等,均应包含在本公开保护范围之内。The specific implementation manners described above do not limit the protection scope of the present disclosure. It should be apparent to those skilled in the art that various modifications, combinations, sub-combinations and substitutions may be made depending on design requirements and other factors. Any modifications, equivalent replacements and improvements made within the spirit and principles of the present disclosure shall be included within the protection scope of the present disclosure.

Claims (22)

1. A service deployment method of a hybrid cloud platform comprises the following steps:
acquiring a service deployment request of a user; the service deployment request carries functional information of a target model to be deployed and service deployment information of the target model;
acquiring the residual quantity of each type of computing power resource on each node in each cluster created in advance for the user in a hybrid cloud platform;
and deploying the services of the target model in the hybrid cloud platform based on the residual amounts of computing power resources of each type of each node in each cluster of the hybrid cloud platform and the service deployment information of the target model.
2. The method of claim 1, wherein obtaining a service deployment request for a user comprises:
and acquiring the service deployment request of the user from a queue.
3. The method of claim 2, wherein prior to retrieving the user's service deployment request from a queue, the method further comprises:
receiving an external service deployment request of the user;
and storing the service deployment request of the user into the queue.
4. The method of claim 1, wherein obtaining the remaining amount of each type of computing power resource on each node in each cluster created in advance for the user in the hybrid cloud platform comprises:
and acquiring the residual quantity of each type of computing power resource reported by each node in each cluster from a storage unit.
5. The method of claim 4, wherein before obtaining, from a storage unit, a remaining amount of each type of computing power resource reported by each node in each cluster, the method further comprises:
and receiving the residual quantity of each type of computing power resource reported by each node in each cluster and storing the residual quantity in the storage unit.
6. The method of claim 1, wherein deploying the services of the target model in the hybrid cloud platform based on the remaining amount of each type of computing power resource for each node in each cluster of the hybrid cloud platform and the service deployment information of the target model comprises:
Detecting whether deployment of services of the target model of the total number can be implemented in the hybrid cloud platform based on a remaining amount of each type of computing power resource of each node in each cluster of the hybrid cloud platform, a type of first computing power resource on which the target model depends, an amount of first computing power resource on which the target model depends, and a total number of services of the target model are deployed;
if yes, determining a target service deployment strategy of the target model; the target service deployment strategy comprises at least one first cluster identifier and a first number deployed in each first cluster; the sum of the first number of deployments of each of the first clusters is equal to the total number;
generating a target image file corresponding to the target service deployment strategy;
and deploying a corresponding first number of target image files to each first cluster so as to realize the service of deploying the total number of target models in the hybrid cloud platform.
7. The method of claim 6, wherein after obtaining the service deployment request of the user, the method further comprises:
performing format conversion on the target model which is accessed in advance to obtain a target model with a standard format;
The generating the target image file corresponding to the target service deployment strategy comprises the following steps:
generating the target image file corresponding to the target service deployment strategy based on the target model in the standard format and the type of the first computing power resource depended on;
after generating the target image file corresponding to the target service deployment policy, the method further includes:
and storing the generated target image file in an image file warehouse.
8. The method of claim 6, wherein deploying a corresponding first number of target image files to each of the first clusters to enable deployment of the total number of services of the target model in the hybrid cloud platform comprises:
and sending a service deployment command carrying the corresponding first number and the identification of the target image files to the first clusters by calling the application program interfaces of the first clusters so as to enable the corresponding first clusters to pull the corresponding target image files from a hybrid cloud management platform and deploy the first number so as to realize the deployment of the total number of services of the target model in the hybrid cloud platform.
9. The method of claim 6, wherein the method further comprises:
if the deployment of the total number of services of the target model which cannot be implemented in the hybrid cloud platform is detected and determined, determining a first sub-service deployment strategy capable of deploying the target model in the hybrid cloud platform; the first sub-service deployment strategy comprises at least one identifier of a second cluster and a second number deployed in each second cluster; the sum of the second number of deployments of each of the second clusters is less than the total number;
determining the residual quantity of deployment based on the total quantity and the second quantity corresponding to each second cluster; and deploying the services of the residual quantity of deployed models with the same function as the target model in the hybrid cloud platform based on the residual quantity of computing power resources of each type of each node in each cluster of the hybrid cloud platform, the function information of each model in the deployed models and the service deployment information of each model.
10. The method of claim 9, wherein deploying the remaining number of services of the deployed models having the same function as the target model in the hybrid cloud platform based on the remaining amount of each type of computing power resource of each node in each cluster of the hybrid cloud platform, and the function information of each model of the deployed plurality of models, and the service deployment information of each model, comprises:
Acquiring a first alternative model with the same function as the target model based on the function information of the target model and the function information of each model in the deployed multiple models;
detecting whether deployment of services of the first surrogate model of the remaining number can be implemented in the hybrid cloud platform based on a remaining amount of each type of computing power resource of each node in each cluster of the hybrid cloud platform, a type of second computing power resource on which service deployment of the first surrogate model depends, an amount of second target computing power resource on which service deployment of the first surrogate model depends, and a remaining number of deployments; the type of the second computing power resource is different from the type of the first computing power resource;
if yes, determining a second sub-service deployment strategy for deploying the first alternative model in the hybrid cloud platform; the second sub-service deployment strategy comprises at least one identifier of a third cluster and a third number deployed in each third cluster; the sum of the third numbers of the third cluster deployments is equal to the remaining number;
respectively acquiring a first image file corresponding to a first sub-service deployment strategy and a second image file corresponding to a second sub-service deployment strategy;
and respectively deploying the first image files corresponding to the second quantity to each second cluster, and deploying the second image files corresponding to the third quantity to each third cluster so as to realize the service of deploying the total quantity of the target models in the hybrid cloud platform.
11. The method of claim 10, wherein obtaining a second image corresponding to a second sub-service deployment policy comprises:
and detecting and determining that a second image file corresponding to the second sub-service deployment strategy exists in the image file warehouse.
12. The method of claim 11, wherein obtaining a second image corresponding to a second sub-service deployment policy further comprises:
if the fact that the second image file corresponding to the second sub-service deployment strategy does not exist in the image file warehouse is detected and determined, and the second image file corresponding to the second sub-service deployment strategy is generated based on the first alternative model in the standard format and the type of the second computing power resource which is depended on.
13. The method of claim 10, wherein deploying the remaining number of services of the deployed models having the same function as the target model in the hybrid cloud platform based on the remaining amount of each type of computing power resource of each node in each cluster of the hybrid cloud platform, and the function information of each model of the deployed plurality of models, and the service deployment information of each model, further comprises:
if the deployment of the residual quantity of the services of the first alternative model in the hybrid cloud platform can not be implemented is detected and determined, determining a third sub-service deployment strategy for deploying the first alternative model in the hybrid cloud platform; the third sub-service deployment strategy comprises at least one identifier of a fourth cluster and a fourth number deployed in each fourth cluster; the sum of the fourth numbers of the fourth cluster deployments is smaller than the remaining number;
Updating the remaining number of deployments based on a fourth number of deployments of each of the fourth clusters;
acquiring a second alternative model with the same function as the target model based on the function information of the target model and the function information of each model in the deployed multiple models;
detecting whether deployment of services of the second surrogate model of the updated remaining number can be implemented in the hybrid cloud platform based on a remaining amount of each type of computing power resource of each node in each cluster of the hybrid cloud platform, a type of third computing power resource on which service deployment of the second surrogate model depends, an amount of third target computing power resource on which service deployment of the second surrogate model depends, and the updated remaining number; the type of the third computing power resource, the type of the second computing power resource and the type of the first computing power resource are different;
if yes, determining a fourth sub-service deployment strategy for deploying the second alternative model in the hybrid cloud platform; the fourth sub-service deployment strategy comprises at least one identifier of a fifth cluster and a fifth number deployed in each fifth cluster; the sum of the fifth numbers corresponding to the fifth clusters is equal to the updated residual number;
Respectively acquiring a first image file corresponding to a first sub-service deployment strategy, a third image file corresponding to a third sub-service deployment strategy and a fourth image file corresponding to a fourth sub-service deployment strategy;
and respectively deploying a second quantity of second image files corresponding to each second cluster, deploying a fourth quantity of third image files corresponding to each fourth cluster, and deploying a fifth quantity of fourth image files corresponding to each fifth cluster so as to realize the deployment of the total quantity of services of the target model in the hybrid cloud platform.
14. The method of claim 13, wherein the method further comprises:
and if the deployment of the updated residual number of services of the second alternative model cannot be implemented in the hybrid cloud platform is detected and determined, and if other alternative models with the same function as the target model do not exist in the deployed multiple models, determining that the total number of services of the target model cannot be deployed in the hybrid cloud platform.
15. The method of any of claims 1-14, wherein prior to obtaining the remaining amount of each type of computing power resource on each node in each cluster created in advance for the user in a hybrid cloud platform, the method further comprises:
Receiving a cluster creation request of the user; the cluster creation request carries information of a cluster creation platform identifier and various computing power resources to be configured;
based on the creation platform identification of the cluster and the information of various types of computing power resources to be configured, acquiring a plurality of nodes from the corresponding cloud platform to construct the cluster of the user; each of the nodes is capable of providing at least one type of computational power resource.
16. The method of claim 15, wherein the cluster of the user is constructed by acquiring a plurality of nodes in the corresponding cloud platform based on the creation platform identification of the cluster and the information of various types of computing power resources to be configured; after each node is capable of providing at least one type of computational power resource, the method further comprises:
acquiring a collector component from a component warehouse;
a collector component is deployed in each node of the cluster for collecting the remaining amount of each type of computational power resource for each node.
17. The method of claim 15, wherein the cluster of the user is constructed by acquiring a plurality of nodes in the corresponding cloud platform based on the creation platform identification of the cluster and the information of various types of computing power resources to be configured; after each node is capable of providing at least one type of computational power resource, the method further comprises:
Acquiring a display card service component from a component warehouse;
and deploying the display card service component in a node of the cluster providing display card resources.
18. The method of claim 15, wherein the cluster of the user is constructed by acquiring a plurality of nodes in the corresponding cloud platform based on the creation platform identification of the cluster and the information of various types of computing power resources to be configured; after each node is capable of providing at least one type of computational power resource, the method further comprises:
receiving a cluster modification request of the user;
and updating the corresponding clusters based on the cluster modification request of the user.
19. A hybrid cloud management platform, comprising:
the service deployment acquisition module is used for acquiring a service deployment request of a user; the service deployment request carries functional information of a target model to be deployed and service deployment information of the target model;
the residual resource acquisition module is used for acquiring the residual quantity of each type of computing power resource on each node in each cluster created in advance for the user in the hybrid cloud platform;
and the service deployment module is used for deploying the services of the target model in the hybrid cloud platform based on the residual quantity of the computing power resources of each type of each node in each cluster of the hybrid cloud platform and the service deployment information of the target model.
20. An electronic device, comprising:
at least one processor; and
a memory communicatively coupled to the at least one processor; wherein,,
the memory stores instructions executable by the at least one processor to enable the at least one processor to perform the method of any one of claims 1-18.
21. A non-transitory computer readable storage medium storing computer instructions for causing the computer to perform the method of any one of claims 1-18.
22. A computer program product comprising a computer program which, when executed by a processor, implements the method according to any of claims 1-18.
CN202310787308.4A 2023-06-29 2023-06-29 Service deployment method, management platform, equipment and medium of hybrid cloud platform Pending CN116684421A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310787308.4A CN116684421A (en) 2023-06-29 2023-06-29 Service deployment method, management platform, equipment and medium of hybrid cloud platform

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310787308.4A CN116684421A (en) 2023-06-29 2023-06-29 Service deployment method, management platform, equipment and medium of hybrid cloud platform

Publications (1)

Publication Number Publication Date
CN116684421A true CN116684421A (en) 2023-09-01

Family

ID=87790953

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310787308.4A Pending CN116684421A (en) 2023-06-29 2023-06-29 Service deployment method, management platform, equipment and medium of hybrid cloud platform

Country Status (1)

Country Link
CN (1) CN116684421A (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN119155113A (en) * 2024-11-14 2024-12-17 赛服(上海)网络科技有限公司 Integrated safe hosting operation platform, method and related equipment based on hybrid cloud
CN119396413A (en) * 2024-09-18 2025-02-07 中科加禾(北京)科技有限公司 Model deployment scheme generation, model processing method, device and electronic equipment
CN119396419A (en) * 2024-09-25 2025-02-07 浪潮通信信息系统有限公司 Application deployment method, device, equipment and storage medium
WO2025102729A1 (en) * 2023-11-13 2025-05-22 北京百度网讯科技有限公司 Algorithm service deployment method and apparatus, electronic device, and readable storage medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104657220A (en) * 2015-03-12 2015-05-27 广东石油化工学院 Model and method for scheduling for mixed cloud based on deadline and cost constraints
CN110908795A (en) * 2019-11-04 2020-03-24 深圳先进技术研究院 Cloud computing cluster mixed job scheduling method, device, server and storage device
US20210397466A1 (en) * 2020-06-22 2021-12-23 Hewlett Packard Enterprise Development Lp Container-as-a-service (caas) controller for selecting a bare-metal machine of a private cloud for a cluster of a managed container service
CN115562855A (en) * 2022-09-23 2023-01-03 阿里巴巴(中国)有限公司 Resource allocation method and device, electronic equipment and readable storage medium

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104657220A (en) * 2015-03-12 2015-05-27 广东石油化工学院 Model and method for scheduling for mixed cloud based on deadline and cost constraints
CN110908795A (en) * 2019-11-04 2020-03-24 深圳先进技术研究院 Cloud computing cluster mixed job scheduling method, device, server and storage device
WO2021088207A1 (en) * 2019-11-04 2021-05-14 深圳先进技术研究院 Mixed deployment-based job scheduling method and apparatus for cloud computing cluster, server and storage device
US20210397466A1 (en) * 2020-06-22 2021-12-23 Hewlett Packard Enterprise Development Lp Container-as-a-service (caas) controller for selecting a bare-metal machine of a private cloud for a cluster of a managed container service
CN115562855A (en) * 2022-09-23 2023-01-03 阿里巴巴(中国)有限公司 Resource allocation method and device, electronic equipment and readable storage medium

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2025102729A1 (en) * 2023-11-13 2025-05-22 北京百度网讯科技有限公司 Algorithm service deployment method and apparatus, electronic device, and readable storage medium
CN119396413A (en) * 2024-09-18 2025-02-07 中科加禾(北京)科技有限公司 Model deployment scheme generation, model processing method, device and electronic equipment
CN119396413B (en) * 2024-09-18 2025-07-15 中科加禾(北京)科技有限公司 Model deployment scheme generation method, model processing method, model deployment scheme generation device, model processing device and electronic equipment
CN119396419A (en) * 2024-09-25 2025-02-07 浪潮通信信息系统有限公司 Application deployment method, device, equipment and storage medium
CN119155113A (en) * 2024-11-14 2024-12-17 赛服(上海)网络科技有限公司 Integrated safe hosting operation platform, method and related equipment based on hybrid cloud

Similar Documents

Publication Publication Date Title
CN116684421A (en) Service deployment method, management platform, equipment and medium of hybrid cloud platform
CN112015402B (en) Method, device and electronic device for quickly establishing business scenarios
CN111427971B (en) Business modeling method, device, system and medium for computer system
CN111159897B (en) Target optimization method and device based on system modeling application
WO2022088082A1 (en) Task processing method, apparatus and device based on defect detection, and storage medium
CN113360672B (en) Method, device, device, medium and product for generating knowledge map
CN114911598A (en) Task scheduling method, device, equipment and storage medium
CN114816393A (en) Information generation method, apparatus, device, and storage medium
CN118394592A (en) A Paas platform based on cloud computing
CN115295164A (en) Medical insurance data processing method and device, electronic equipment and storage medium
CN115037655B (en) Pressure measurement methods and systems
CN114841267B (en) Real-time prediction method, device, electronic equipment and computer program product
CN114757214B (en) Method and device for selecting sample corpus for optimizing translation model
CN114723455A (en) Service processing method and device, electronic equipment and storage medium
CN114416357A (en) Method and device for creating container group, electronic equipment and medium
CN114816758B (en) Resource allocation method and device
CN115629939A (en) Data acquisition method and device for edge device, electronic device and medium
CN118502956A (en) Cloud storage system-based resource scheduling method and device, electronic equipment, storage medium and program product
EP4089592A1 (en) Method for determining annotation capability information, related apparatus and computer program product
CN116149875A (en) Scheduling method and device of cloud terminal, electronic equipment, storage medium and product
CN116932147A (en) Streaming job processing method and device, electronic equipment and medium
CN116383284A (en) Data access method, device, equipment and storage medium
CN115858921A (en) Model processing method, device, equipment and storage medium
CN115860121A (en) Text reasoning method, device, equipment and storage medium
CN114070889A (en) Configuration method, traffic forwarding method, device, storage medium and program product

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination