[go: up one dir, main page]

CN120868635A - A data-driven method for temperature control in data centers using carbon dioxide refrigeration - Google Patents

A data-driven method for temperature control in data centers using carbon dioxide refrigeration

Info

Publication number
CN120868635A
CN120868635A CN202511409326.4A CN202511409326A CN120868635A CN 120868635 A CN120868635 A CN 120868635A CN 202511409326 A CN202511409326 A CN 202511409326A CN 120868635 A CN120868635 A CN 120868635A
Authority
CN
China
Prior art keywords
augmented
performance function
carbon dioxide
optimal
control strategy
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202511409326.4A
Other languages
Chinese (zh)
Inventor
扈晓翔
董克君
郭亮
杨蓉
杨晨
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Northwestern Polytechnical University
Original Assignee
Northwestern Polytechnical University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Northwestern Polytechnical University filed Critical Northwestern Polytechnical University
Priority to CN202511409326.4A priority Critical patent/CN120868635A/en
Publication of CN120868635A publication Critical patent/CN120868635A/en
Pending legal-status Critical Current

Links

Landscapes

  • Feedback Control In General (AREA)

Abstract

The application discloses a data-driven carbon dioxide refrigeration data center temperature control method, which belongs to the technical field of carbon dioxide refrigeration data center temperature control and comprises the steps of combining control targets of a carbon dioxide refrigeration and temperature control system of a data center, establishing an augmentation system, establishing a performance function, an HJB equation and an integral Belman equation, approximating the performance function and the control strategy through an evaluation network and an action network to obtain an approximation value, replacing the performance function and the control strategy of the integral Belman equation, defining a residual error to obtain a weight update law, giving an input signal, collecting state information and input information of the augmentation system, iteratively solving an optimal weight based on the weight update law, obtaining an optimal control strategy by utilizing the optimal weight, and applying the optimal control strategy to the carbon dioxide refrigeration and temperature control system of the data center. The method does not depend on a dynamic model of a carbon dioxide refrigeration system, and the practicability and the adaptability of a control scheme are obviously improved.

Description

Data-driven carbon dioxide refrigeration data center temperature control method
Technical Field
The application relates to the technical field of temperature control of carbon dioxide refrigeration data centers, in particular to a data-driven temperature control method of a carbon dioxide refrigeration data center.
Background
With the continuous development of modern information technology, the scale and number of data centers are continuously increasing, and the cooling demands are also rapidly increasing, so that carbon dioxide has been gradually applied to the refrigeration systems of the data centers by virtue of the remarkable advantages of no toxicity, no corrosiveness, good phase change heat transfer and the like.
However, the carbon dioxide refrigeration system of the data center is extremely complex in construction and includes numerous components including compressors, gas coolers, expansion valves, evaporators, internal heat exchangers, and connecting piping. When modeling a system, not only the physical characteristics of each component are needed to be deeply understood, but also the parameter identification is needed, so that an accurate model of the system is difficult to obtain, and model parameter deviation caused by inaccurate modeling can influence the temperature control precision, and further, a control method based on the model has obvious limitations.
Therefore, a control scheme for getting rid of the complex modeling process of the carbon dioxide refrigeration system is needed, and the accurate control of the temperature of the data center is realized in a data-driven manner by only using the input information and the state information of the carbon dioxide refrigeration system.
Disclosure of Invention
Aiming at the defects in the prior art, the temperature control method for the data-driven carbon dioxide refrigeration data center provides a model-free method for precisely controlling the temperature of a carbon dioxide refrigeration system of the data center, solves the problem that the existing control method based on a model can influence the temperature control precision, and provides a certain technical support for the fields of energy efficiency improvement, green refrigeration and the like of the data center.
In order to achieve the aim of the application, the application adopts the following technical scheme:
the application provides a data-driven carbon dioxide refrigeration data center temperature control method, which comprises the following steps:
s1, combining control targets of a carbon dioxide refrigeration and temperature control system of a data center, establishing an augmentation system about state errors and target states, and establishing a performance function, an HJB equation and an integral Belman equation of the augmentation system;
S2, approximating the performance function and the control strategy of the augmentation system through an evaluation network and an action network to obtain the approximation value of the performance function and the control strategy, replacing the performance function and the control strategy of an integral Belman equation based on the approximation value of the performance function and the control strategy, defining a residual error, and obtaining a weight update law of the augmentation weight based on the residual error;
S3, given an input signal, collecting state information and input information of an augmentation system, and iteratively solving an optimal augmentation weight based on a weight update law of the augmentation weight;
And S4, calculating to obtain an optimal control strategy by using the solved optimal augmentation weight, and applying the optimal control strategy as an input signal to a carbon dioxide refrigeration and temperature control system of the data center.
Further, the step S1 includes:
s101, combining control targets of a carbon dioxide refrigeration and temperature control system of a data center, constructing a target state and tracking error, and constructing a performance function;
s102, constructing an augmentation vector of a target state and a tracking error, constructing an augmentation system based on the augmentation vector, and constructing a performance function of the augmentation system based on the performance function;
S103, constructing an optimal performance function and an HJB equation based on the performance functions of the augmentation system and the augmentation system, and obtaining an optimal control strategy based on the HJB equation;
And S104, rewriting the augmentation system based on the optimal control strategy, and constructing an integrated Belman equation based on the optimal performance function, the optimal control strategy and the rewritten augmentation system.
Further, the method for constructing the target state and tracking error and constructing the performance function by combining the control targets of the carbon dioxide refrigeration and temperature control system of the data center comprises the following steps:
a1, modeling a carbon dioxide refrigeration and temperature control system of a data center as a general nonlinear system:
In the formula, As an actual state vector of the system,AndThe system drift dynamics and the input dynamics are respectively,For the input to the system,For the current momentIs used for the purpose of determining the derivative of (c),Is the number of the states of the system,For the number of inputs to the system,For the current moment of time,Representing a real set;
A2, setting target superheat degree of the outlet of the evaporator and target temperature of the internal environment of the data center as target states ,The dynamic characteristics of (a) are:
In the formula, As a function of the target state generator,For the current time target stateIs a derivative of (2);
a3, constructing tracking error based on the actual state and the target state :
Wherein the tracking errorThe kinetic equation of (2) is:
In the formula, Tracking error for current timeIs a derivative of (2);
A4 based on tracking error System inputConstructing a performance function:
In the formula, For states at any instant in the integration interval,For input at any time in the integration interval, superscriptIs the transpose of the vector or matrix,As a discount factor, the number of times the discount is calculated,AndThe matrix is positively defined for symmetry,Is an integral variable.
Further, the augmentation vectorThe method comprises the following steps:
The augmentation system is as follows:
wherein, the Augmenting vectors for current timeIs used for the purpose of determining the derivative of (c),In order to augment the drift dynamics of the system,In order to augment the input dynamics of the system,Is a system input;
the performance function of the augmentation system is:
wherein, the For an augmented state at any time within the integration interval,For system inputs at any time within the integration interval,Is a symmetric matrix.
Further, the optimal performance functionThe method comprises the following steps:
wherein, the Is a real number setThe set of allowable controls on the upper part,Representing a set of admission controlsThe upper minimum;
The HJB equation is:
wherein, the For the optimal control strategy to be used,Representing an optimal performance functionFor a pair ofIs used for the purpose of determining the derivative of (c),To derive an operator;
the optimal control strategy The method comprises the following steps:
Wherein, superscript Representing the inverse of the matrix,Representing taking the minimum value.
Further, the post-rewrite augmentation system is:
wherein, the Iteration of the control strategyThe control strategy obtained by the secondary calculation is used for controlling the control strategy,Is the number of iterations.
Further, the constructing an integrated bellman equation based on the optimal performance function, the optimal control strategy and the rewritten augmentation system includes:
combining the optimal performance function and the optimal control strategy, iterating the control strategy Performance function obtained by secondary calculationTime-dependent along an overlaid system dynamic trajectoryDeriving to obtainRelative toIs the derivative of:
wherein, the Represents the firstPerformance function obtained by secondary calculationTo the stateIs used for the purpose of determining the derivative of (c),Iteration of the control strategyA control strategy obtained by secondary calculation;
In the time interval And (5) carrying out internal integration to obtain an integral Belman equation:
wherein, the The integration time is indicated as such,Representation ofAt the lower part of the timeThe performance function of the number of iterations,AndIs the firstSecondary and tertiaryThe control strategy for the number of iterations,Is the corresponding firstPerformance function of the multiple iterations.
Further, the step S2 includes:
S201, utilizing the universal approximation of the neural network, respectively approximating the control strategy iteration through the evaluation network and the action network Performance function obtained by secondary calculationControl strategy:
Wherein, the AndRespectively isAndIs used to determine the degree of approximation of (c),AndWeights for the evaluation network and the action network respectively,AndAs a linear uncorrelated neural network basis function,AndThe numbers of hidden layer neurons of the two networks respectively;
S202, integrating in the Belman equation AndRespectively using their approximationsAndReplace and make,Obtaining residual error:
Wherein, the AndRespectively representTime of day and time of dayAt the moment of time of day,For the indexing of the collected data sets,Is thatIs used to determine the degree of approximation of (c),AndRespectively isAndAn activation function of the moment of time,AndThe activation functions of the action network and the evaluation network at any time in the integration interval,AndRespectively the firstThe weights of the evaluation network and the action network of the secondary iteration;
s203, setting intermediate parameters AndThe expression for the reduced residual:
wherein, the ,,In order to augment the weight vector,Represents straightening the matrix in the order of columns;
S204, using least square method to minimize Obtaining a weight update law of the augmentation weight:
wherein, the Is the size of the training data set collected.
Further, the step S3 includes:
S301, setting the opening degree of the expansion valve and the rotation speed of the compressor as given input signals In the time intervalCollecting the state and input data of the established augmentation system, thereby obtainingAnd;
Wherein, the AndThe sizes of the training data sets collected are the initial time and the final time of data collection respectivelyTo satisfy the condition that there are two positive real numbersAndSo that for any oneAll have:
In the formula, Is thatA unit matrix of dimensions;
s302, setting Using an initial control strategyCalculating the weight of the corresponding action networkCalculation of the weight update law by increasing the weightAugmentation weights for multiple iterations;
S303, orderIterative updatingUp to ifWhen the iteration is stopped, and thenAs the optimal weight for increasing, otherwise, letContinuing the iteration;
Wherein, the For a preset termination iteration threshold,Is the firstThe weight of the iteration is increased for a number of times,Representing the euclidean norms of the vectors.
Further, the step S4 includes:
S401, solving an optimal control strategy by utilizing optimal augmentation weight of iterative solution And an optimal performance function;
S402, will beAnd the carbon dioxide refrigeration and temperature control system is applied to the data center as an input signal.
The beneficial effects of the application are as follows:
according to the data-driven carbon dioxide refrigeration data center temperature control method, the method is independent of a dynamic model of a carbon dioxide refrigeration system, the practicability and the adaptability of a control scheme are remarkably improved, the performance index of the refrigeration system is optimized, the accurate control of refrigeration energy consumption is realized through a data-driven dynamic regulation mechanism, the temperature control of the data center is realized, the energy saving efficiency of the system is improved, and the development requirement of green low carbon is met.
Drawings
In order to more clearly illustrate the embodiments of the application or the technical solutions in the prior art, the drawings used in the embodiments or the description of the prior art will be briefly described below, and it is obvious that the drawings in the following description are only some embodiments of the application, and other embodiments may be obtained according to these drawings to those skilled in the art.
Fig. 1 is a schematic flow chart of a temperature control method of a data-driven carbon dioxide refrigeration data center according to an embodiment of the present application.
Detailed Description
The following description of the embodiments of the present application will be made clearly and completely with reference to the accompanying drawings, in which it is apparent that the embodiments described are only some embodiments of the present application, but not all embodiments. Based on the embodiments of the present application, all other embodiments obtained by the person skilled in the art based on the present application are included in the scope of protection of the present application.
The embodiment of the application provides a data-driven carbon dioxide refrigeration data center temperature control method, which can be seen in fig. 1, and fig. 1 is a flow diagram of the data-driven carbon dioxide refrigeration data center temperature control method provided by the embodiment of the application, and the method comprises the following steps:
S1, combining control targets of a carbon dioxide refrigeration and temperature control system of a data center, establishing an augmentation system about state errors and target states, and establishing a performance function, an HJB equation and an integral Belman equation of the augmentation system.
In one embodiment of the present application, first, a data center carbon dioxide refrigeration and temperature control system is modeled as a generic nonlinear system:
wherein, the As an actual state vector of the system,AndThe system drift dynamics and the input dynamics are respectively,For the input to the system,For the current momentIs used for the purpose of determining the derivative of (c),Is the number of the states of the system,For the number of inputs to the system,For the current moment of time,Representing a set of real numbers.
Setting target superheat degree of evaporator outlet and target temperature of internal environment of data center as target state,The dynamic characteristics of (a) are:
wherein, the As a function of the target state generator,For the current time target stateIs a derivative of (a).
Tracking errorIs the difference between the actual state and the target state, namely:
Tracking error The kinetic equation of (2) is:
In the formula, Tracking error for current timeIs a derivative of (a).
Based on tracking errorsSystem inputThe following performance functions were constructed:
wherein, the For states at any instant in the integration interval,For input at any time in the integration interval, superscriptIs the transpose of the vector or matrix,As a discount factor, the number of times the discount is calculated,AndThe matrix is positively defined for symmetry,Is an integral variable.
Then, an augmented vector of tracking error and target state is constructedThe kinetic equation of the augmentation system is:
wherein, the Augmenting vectors for current timeIs used for the purpose of determining the derivative of (c),In order to augment the drift dynamics of the system,In order to augment the input dynamics of the system,Is a system input.
The performance function of the augmentation system is then:
wherein, the For an augmented state at any time within the integration interval,For system inputs at any time within the integration interval,Is a symmetric matrix.
The constructed HJB equation is:
wherein, the For the optimal control strategy to be used,Representing an optimal performance functionFor a pair ofIs used for the purpose of determining the derivative of (c),In order to take the partial derivative operator,The form of (2) is:
wherein, the Is a real number setThe set of allowable controls on the upper part,Representing a set of admission controlsThe upper minimum
Optimal control strategy can be obtained by HJB equationThe method comprises the following steps:
Wherein, superscript Representing the inverse of the matrix,Representing taking the minimum value.
Finally, the augmentation system is rewritten as:
wherein, the In order to allow for the initial control to be allowed,Iteration of the control strategyThe control strategy obtained by the secondary calculation is used for controlling the control strategy,Is the number of iterations.
Combining the optimal performance function and the optimal control strategy, iterating the control strategyPerformance function obtained by secondary calculationTime-dependent along an overlaid system dynamic trajectoryDeriving to obtainRelative toIs the derivative of:
wherein, the Represents the firstPerformance function obtained by secondary calculationTo the stateIs used for the purpose of determining the derivative of (c),Iteration of the control strategyAnd calculating the obtained control strategy once.
Then in the time intervalAnd (5) carrying out internal integration to obtain an integral Belman equation:
wherein, the The integration time is indicated as such,Representation ofAt the lower part of the timeThe performance function of the number of iterations,AndIs the firstSecondary and tertiaryThe control strategy for the number of iterations,Is the corresponding firstPerformance function of the multiple iterations.
S2, approximating the performance function and the control strategy of the augmentation system through the evaluation network and the action network to obtain the approximation value of the performance function and the control strategy, replacing the performance function and the control strategy of the integral Belman equation based on the approximation value of the performance function and the control strategy, defining residual errors, and obtaining the weight update law of the augmentation weight based on the residual errors.
In one embodiment of the application, first, the control strategy is iterated by approximating the evaluation network and the action network, respectively, using the universal approximation of the neural networkPerformance function obtained by secondary calculationControl strategy:
Wherein, the AndRespectively isAndIs used to determine the degree of approximation of (c),AndWeights for the evaluation network and the action network respectively,AndAs a linear uncorrelated neural network basis function,AndThe numbers of hidden layer neurons of the two networks respectively.
Will integrate in the Belman equationAndRespectively using their approximation valuesAndReplace and make,Obtaining residual error:
Wherein, among them,AndRespectively representTime of day and time of dayAt the moment of time of day,For the indexing of the collected data sets,Is thatIs used to determine the degree of approximation of (c),AndRespectively isAndAn activation function of the moment of time,AndThe activation functions of the action network and the evaluation network at any time in the integration interval,AndRespectively the firstThe weights of the evaluation network and the action network of the secondary iteration.
Setting intermediate parametersAnd:
Simplified residualIs represented by the formula:
wherein, the In order to augment the weight vector,Representing straightening the matrix in column order.
Then, the least square method is used to minimizeThe weight update law of the augmentation weight can be obtained:
wherein, the Is the size of the training data set collected.
S3, given input signalAnd iteratively solving the optimal augmentation weight based on the weight updating law designed by S2.
In one embodiment of the present application, first, the set expansion valve opening and the set compressor rotation speed are used as the given input signalsIn the time intervalAnd collecting the state and input data of the established augmentation system, namely the superheat degree of the outlet of the evaporator, the internal environment temperature of the data center, the opening degree of the expansion valve and the rotating speed of the compressor. These states and input information of the augmentation system are obtained, and thenAnd
Wherein, the AndThe sizes of the training data sets collected are the initial time and the final time of data collection respectivelyTo satisfy the condition that there are two positive real numbersAndSo that for any oneAll have:
wherein, the Is thatAn identity matrix of dimensions.
Secondly, let theUsing an initial control strategyCalculating the weight of the action network at the momentCalculation of the weight update law by increasing the weightAugmentation weights for multiple iterations
Then, let theIterative updating. Up to the end ifStopping the iteration and thenAs the optimal augmentation weight, whereinFor a preset termination iteration threshold,Is the firstThe weight of the iteration is increased for a number of times,Representing the Euclidean norms of the vectors, otherwise, lettingContinuing the iteration
And S4, constructing an optimal control strategy by utilizing the optimal augmentation weight value solved in the step S3, and applying the optimal control strategy to a carbon dioxide refrigeration and temperature control system of the data center as an input signal of the carbon dioxide refrigeration and temperature control system.
In one embodiment of the present application, first, an optimal control strategy is solved using the optimal augmentation weights iterated at S3And an optimal performance function
Then, willAnd the carbon dioxide refrigeration and temperature control system is applied to the data center as an input signal.
By executing the steps, the temperature control of the data-driven carbon dioxide refrigeration data center can be realized.
According to the data-driven carbon dioxide refrigeration data center temperature control method, the method is independent of a dynamic model of a carbon dioxide refrigeration system, the practicability and the adaptability of a control scheme are remarkably improved, the performance index of the refrigeration system is optimized, the accurate control of refrigeration energy consumption is realized through a data-driven dynamic regulation mechanism, the temperature control of the data center is realized, the energy saving efficiency of the system is improved, and the development requirement of green low carbon is met.
It should be noted that the embodiments described herein are for the purpose of aiding the reader in understanding the principles of the present application, and should be understood as not limiting the scope of the application to such specific statements and embodiments. Those of ordinary skill in the art can make various other specific modifications and combinations from the teachings of the present disclosure without departing from the spirit thereof, and such modifications and combinations remain within the scope of the present disclosure.

Claims (10)

1.一种数据驱动的二氧化碳制冷数据中心温度控制方法,其特征在于,包括:1. A data-driven method for temperature control in a carbon dioxide-cooled data center, characterized in that it comprises: S1:结合数据中心的二氧化碳制冷及温度控制系统的控制目标,建立关于状态误差与目标状态的增广系统,并建立增广系统的性能函数、HJB方程和积分贝尔曼方程;S1: Based on the control objectives of the carbon dioxide cooling and temperature control system of the data center, establish an augmented system concerning the state error and the target state, and establish the performance function, HJB equation and integral Bellman equation of the augmented system. S2:通过评价网络和动作网络逼近增广系统的性能函数和控制策略,得到性能函数和控制策略的近似值,基于性能函数和控制策略的近似值,替换积分贝尔曼方程的性能函数和控制策略,并定义残差,基于残差,得到增广权值的权值更新律;S2: By approximating the performance function and control strategy of the augmented system through the evaluation network and action network, approximate values of the performance function and control strategy are obtained. Based on the approximate values of the performance function and control strategy, the performance function and control strategy of the integral Bellman equation are replaced, and the residuals are defined. Based on the residuals, the weight update law of the augmented weights is obtained. S3:给定输入信号,收集增广系统的状态信息和输入信息,并基于增广权值的权值更新律,迭代求解最优增广权值;S3: Given an input signal, collect the state information and input information of the augmented system, and iteratively solve for the optimal augmented weights based on the weight update law of the augmented weights; S4:利用求解出的最优增广权值计算得到最优控制策略,并将最优控制策略作为输入信号应用到数据中心的二氧化碳制冷及温度控制系统中。S4: The optimal control strategy is calculated using the solved optimal augmented weights, and the optimal control strategy is used as the input signal to apply to the carbon dioxide cooling and temperature control system of the data center. 2.根据权利要求1所述的数据驱动的二氧化碳制冷数据中心温度控制方法,其特征在于,所述S1,包括:2. The data-driven carbon dioxide refrigerated data center temperature control method according to claim 1, characterized in that step S1 includes: S101:结合数据中心的二氧化碳制冷及温度控制系统的控制目标,构建目标状态和跟踪误差,并构建性能函数;S101: Combining the control objectives of the data center's carbon dioxide cooling and temperature control system, construct the target state and tracking error, and build the performance function; S102:构建目标状态和跟踪误差的增广向量,基于增广向量构建增广系统,并基于性能函数,构建增广系统的性能函数;S102: Construct augmented vectors for the target state and tracking error, build an augmented system based on the augmented vectors, and construct the performance function of the augmented system based on the performance function; S103:基于增广系统和增广系统的性能函数,构建最优性能函数和HJB方程,并基于HJB方程,获得最优控制策略;S103: Based on the augmented system and its performance function, construct the optimal performance function and HJB equation, and obtain the optimal control strategy based on the HJB equation; S104:基于最优控制策略,改写增广系统,并基于最优性能函数、最优控制策略和改写后的增广系统,构造积分贝尔曼方程。S104: Based on the optimal control strategy, rewrite the augmented system, and construct the integral Bellman equation based on the optimal performance function, the optimal control strategy, and the rewritten augmented system. 3.根据权利要求2所述的数据驱动的二氧化碳制冷数据中心温度控制方法,其特征在于,所述结合数据中心的二氧化碳制冷及温度控制系统的控制目标,构建目标状态和跟踪误差,并构建性能函数,包括:3. The data-driven carbon dioxide refrigeration data center temperature control method according to claim 2, characterized in that, the step of constructing a target state and tracking error by combining the control objective of the data center's carbon dioxide refrigeration and temperature control system, and constructing a performance function, includes: A1:将数据中心的二氧化碳制冷及温度控制系统建模为一般的非线性系统:A1: Model the carbon dioxide cooling and temperature control system of the data center as a general nonlinear system: 式中,为系统的实际状态向量,分别为系统漂移动态和输入动态,为系统输入,为当前时刻的导数,为系统状态个数,为系统输入的个数,为当前时刻,表示实数集合;In the formula, Let be the actual state vector of the system. and These are the system drift dynamics and the input dynamics, respectively. For system input, For the current moment The derivative, The number of system states. The number of inputs from the system. For the current moment, Represents the set of real numbers; A2:设定蒸发器出口的目标过热度和数据中心内部环境目标温度为目标状态的动态特性为:A2: Set the target superheat at the evaporator outlet and the target temperature inside the data center as the target state. , The dynamic characteristics are: 式中,为目标状态生成器函数,为当前时刻目标状态的导数;In the formula, For the target state generator function, The target state at the current moment The derivative; A3:基于实际状态与目标状态,构建跟踪误差A3: Construct tracking error based on actual state and target state. : 其中,跟踪误差的动力学方程为:Among them, tracking error The dynamic equation is: 式中,为当前时刻跟踪误差的导数;In the formula, The tracking error at the current moment The derivative; A4:基于跟踪误差和系统输入,构建性能函数:A4: Based on tracking error and system input Construct the performance function: 式中,为积分区间内任意时刻的状态,为积分区间内任意时刻的输入,上标为向量或矩阵的转置,为折扣因子,为对称正定矩阵,为积分变量。In the formula, Let be the state at any time within the integration interval. For any time interval within the integration interval, the superscript is used as the input. For the transpose of a vector or matrix, As a discount factor, and It is a symmetric positive definite matrix. It is the integral variable. 4.根据权利要求3所述的数据驱动的二氧化碳制冷数据中心温度控制方法,其特征在于,所述增广向量为:4. The data-driven carbon dioxide refrigeration data center temperature control method according to claim 3, characterized in that the augmented vector... for: 所述增广系统为:The augmentation system is: 其中,为当前时刻增广向量的导数,为增广系统的漂移动态,为增广系统的输入动态,为系统输入;in, Augmented vector at the current time step The derivative, To augment the drift state of the system, To enhance the input dynamics of the system, Input for the system; 所述增广系统的性能函数为:The performance function of the augmented system is: 其中,为积分区间内任意时刻的增广状态,为积分区间内任意时刻的系统输入,为对称矩阵。in, Let be the augmented state at any time within the integration interval. For any time interval within the integration interval, the system input is given. It is a symmetric matrix. 5.根据权利要求4所述的数据驱动的二氧化碳制冷数据中心温度控制方法,其特征在于,所述最优性能函数为:5. The data-driven carbon dioxide refrigerated data center temperature control method according to claim 4, characterized in that the optimal performance function... for: 其中,为实数集合上的容许控制集,表示容许控制集上最小的in, The set of real numbers The allowable control set on, Indicates allowable control set The smallest ; 所述HJB方程为:The HJB equation is: 其中,为最优控制策略,代表最优性能函数的导数,为求偏导运算符;in, This is the optimal control strategy. Represents the optimal performance function right The derivative, To find the partial derivative operator; 所述最优控制策略为:The optimal control strategy for: 其中,上标表示逆矩阵,表示取最小值。Among them, superscript Represents the inverse matrix. This indicates taking the minimum value. 6.根据权利要求5所述的数据驱动的二氧化碳制冷数据中心温度控制方法,其特征在于,所述改写后的增广系统为:6. The data-driven carbon dioxide refrigerated data center temperature control method according to claim 5, characterized in that the rewritten augmented system is: 其中,为控制策略迭代第次计算得到的控制策略,为迭代次数。in, For the control strategy iteration The control strategy obtained from this calculation. This represents the number of iterations. 7.根据权利要求6所述的数据驱动的二氧化碳制冷数据中心温度控制方法,其特征在于,所述基于最优性能函数、最优控制策略和改写后的增广系统,构造积分贝尔曼方程,包括:7. The data-driven carbon dioxide refrigerated data center temperature control method according to claim 6, characterized in that the construction of the integral Bellman equation based on the optimal performance function, optimal control strategy, and the rewritten augmented system includes: 结合最优性能函数和最优控制策略,对控制策略迭代第次计算得到的性能函数沿着改写的增广系统动态轨迹关于时间求导,得到相对于的导数:Combining the optimal performance function and the optimal control strategy, the control strategy is iterated in the 1st... The performance function obtained from the calculation Along the rewritten augmented system dynamic trajectory with respect to time Taking the derivative, we get Compared to The derivative: 其中,代表第次计算得到的性能函数对状态的导数,为控制策略迭代第次计算得到的控制策略;in, Representing the The performance function obtained from the calculation State The derivative, For the control strategy iteration The control strategy obtained from this calculation; 在时间区间内求积分,得到积分贝尔曼方程:In time interval Integrating internally, we obtain the integral Bellman equation: 其中,表示积分时间,表示时刻下第次迭代的性能函数,为第次和第次迭代的控制策略,为对应的第次迭代的性能函数。in, Indicates the integration time. express At this moment The performance function of the next iteration and For the first Second and third Control strategy for the next iteration For the corresponding number The performance function for each iteration. 8.根据权利要求7所述的数据驱动的二氧化碳制冷数据中心温度控制方法,其特征在于,所述S2,包括:8. The data-driven carbon dioxide refrigerated data center temperature control method according to claim 7, characterized in that step S2 includes: S201:利用神经网络的万能逼近性,通过评价网络和动作网络分别近似控制策略迭代第次计算得到的性能函数和控制策略S201: Utilizing the universal approximation property of neural networks, the control policy is iterated through the evaluation network and the action network respectively. The performance function obtained from the calculation and control strategies : 其中,分别为的近似值,分别为评价网络和动作网络的权值,为线性不相关的神经网络基函数,分别为两个网络的隐层神经元个数;in, and They are respectively and Approximate value, and These are the weights for the evaluation network and the action network, respectively. and These are the basis functions for linearly uncorrelated neural networks. and These represent the number of hidden neurons in the two networks, respectively. S202:将积分贝尔曼方程中的分别用它们的近似值来代替,并令,得到残差S202: Integrating the Bellman equations , , and Use their approximations respectively , , and To replace, and make , To obtain the residual : 其中,分别代表时刻和时刻,为收集的数据集的索引,的近似值,分别为时刻的激活函数,分别是积分区间内任意时刻的动作网络和评价网络的激活函数,分别为第次迭代的评价网络和动作网络的权值;in, and Represent Time and time, For the index of the collected dataset, for Approximate value, and They are respectively and Activation function at time step and These are the activation functions of the action network and the evaluation network at any time within the integration interval, respectively. and The first The weights of the evaluation network and action network in the next iteration; S203:设置中间参数,简化残差的表达式:S203: Setting intermediate parameters and Simplify the expression for the residual: 其中,为增广权值向量,代表将矩阵按列的顺序进行拉直;in, , , For augmented weight vectors, This means straightening the matrix according to the column order; S204:使用最小二乘法来最小化,得到增广权值的权值更新律:S204: Use least squares to minimize The weight update law for augmented weights is obtained as follows: 其中,为收集的训练数据集的大小。in, The size of the collected training dataset. 9.根据权利要求8所述的数据驱动的二氧化碳制冷数据中心温度控制方法,其特征在于,所述S3,包括:9. The data-driven carbon dioxide refrigerated data center temperature control method according to claim 8, characterized in that step S3 includes: S301:以设定的膨胀阀开度、压缩机转速作为给定输入信号,在时间区间上收集建立的增广系统的状态和输入数据,进而得到S301: The set expansion valve opening and compressor speed are used as the given input signals. In the time interval The state and input data of the augmented system are collected and then obtained. and ; 其中,分别为收集数据的初始时刻和终止时刻,收集的训练数据集的大小要满足条件:存在两个正实数,使得对于任意的,都有:in, and These represent the initial and final times of data collection, and the size of the collected training dataset. To satisfy the condition: there exist two positive real numbers. and , such that for any All of them have: 式中,维的单位矩阵;In the formula, yes An identity matrix of 3D; S302:设定,利用初始控制策略,计算对应动作网络的权值,通过增广权值的权值更新律计算第次迭代的增广权值S302: Setting Using the initial control strategy Calculate the weights of the corresponding action network. The weight update law of augmented weights is used to calculate the first... Augmented weights in the next iteration ; S303:令,迭代更新,直到如果时,停止迭代,并将此时的作为最优增广权值;否则,令,继续迭代S303: Order Iterative updates Until if When the iteration stops, and the value at this point is... As the optimal augmented weight; otherwise, let Continue iterating ; 其中,为预先设定的终止迭代阈值,为第次迭代的增广权值,代表向量的欧几里德范数。in, The pre-set termination iteration threshold, For the first Augmented weights for the next iteration The Euclidean norm represents the vector. 10.根据权利要求9所述的数据驱动的二氧化碳制冷数据中心温度控制方法,其特征在于,所述S4,包括:10. The data-driven carbon dioxide refrigerated data center temperature control method according to claim 9, characterized in that step S4 includes: S401:利用迭代求解的最优增广权重解出最优控制策略和最优性能函数S401: Solve for the optimal control strategy using the optimal augmented weights obtained through iterative solutions. and optimal performance function ; S402:将作为输入信号应用到数据中心的二氧化碳制冷及温度控制系统。S402: Will Carbon dioxide refrigeration and temperature control systems used as input signals in data centers.
CN202511409326.4A 2025-09-29 2025-09-29 A data-driven method for temperature control in data centers using carbon dioxide refrigeration Pending CN120868635A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202511409326.4A CN120868635A (en) 2025-09-29 2025-09-29 A data-driven method for temperature control in data centers using carbon dioxide refrigeration

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202511409326.4A CN120868635A (en) 2025-09-29 2025-09-29 A data-driven method for temperature control in data centers using carbon dioxide refrigeration

Publications (1)

Publication Number Publication Date
CN120868635A true CN120868635A (en) 2025-10-31

Family

ID=97465590

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202511409326.4A Pending CN120868635A (en) 2025-09-29 2025-09-29 A data-driven method for temperature control in data centers using carbon dioxide refrigeration

Country Status (1)

Country Link
CN (1) CN120868635A (en)

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107783423A (en) * 2017-10-25 2018-03-09 珠海格力电器股份有限公司 PID parameter self-tuning method and device based on machine learning
CN110209130A (en) * 2019-05-07 2019-09-06 重庆科技学院 A kind of supply of cooling, heating and electrical powers type micro-grid system optimizing operation method based on ADHDP
US20190384237A1 (en) * 2018-06-13 2019-12-19 Mitsubishi Electric Research Laboratories, Inc. System and Method for Data-Driven Output Feedback Control
CN112346342A (en) * 2020-11-03 2021-02-09 中国人民解放军空军工程大学 A Single-Network Adaptive Evaluation and Design Method for Non-affine Dynamic Systems
US20210291826A1 (en) * 2020-03-20 2021-09-23 Mitsubishi Electric Research Laboratories, Inc. Apparatus and Method for Control with Data-Driven Model Adaptation
CN115962561A (en) * 2023-01-04 2023-04-14 山东大学 Central air conditioner multi-zone comfort control method and system
CN117687300A (en) * 2023-12-14 2024-03-12 东北大学 Optimal control method of three-phase electric arc furnace electrode adjustment system based on hierarchical Stackelberg-Nash game

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107783423A (en) * 2017-10-25 2018-03-09 珠海格力电器股份有限公司 PID parameter self-tuning method and device based on machine learning
US20190384237A1 (en) * 2018-06-13 2019-12-19 Mitsubishi Electric Research Laboratories, Inc. System and Method for Data-Driven Output Feedback Control
CN110209130A (en) * 2019-05-07 2019-09-06 重庆科技学院 A kind of supply of cooling, heating and electrical powers type micro-grid system optimizing operation method based on ADHDP
US20210291826A1 (en) * 2020-03-20 2021-09-23 Mitsubishi Electric Research Laboratories, Inc. Apparatus and Method for Control with Data-Driven Model Adaptation
CN112346342A (en) * 2020-11-03 2021-02-09 中国人民解放军空军工程大学 A Single-Network Adaptive Evaluation and Design Method for Non-affine Dynamic Systems
CN115962561A (en) * 2023-01-04 2023-04-14 山东大学 Central air conditioner multi-zone comfort control method and system
CN117687300A (en) * 2023-12-14 2024-03-12 东北大学 Optimal control method of three-phase electric arc furnace electrode adjustment system based on hierarchical Stackelberg-Nash game

Similar Documents

Publication Publication Date Title
US20190293314A1 (en) System and Method for Controlling Operations of Air-Conditioning System
CN104019520B (en) Data drive control method for minimum energy consumption of refrigerating system on basis of SPSA
CN108444110A (en) Air source heat pump hot water system design method based on BP neural network forecasts
CN101498534A (en) Multi-target intelligent control method for electronic expansion valve of refrigeration air conditioner heat pump system
CN105843037A (en) Q-learning based control method for temperatures of smart buildings
CN107263483A (en) The control method for coordinating of two degrees of freedom articulated robot track
CN110262238A (en) A kind of learning feed-forward control device, vapour compression refrigeration control system and control method
CN110470039B (en) Air conditioner water valve adjusting method based on optimal control theory
CN120868635A (en) A data-driven method for temperature control in data centers using carbon dioxide refrigeration
JP2025530555A (en) System and method for data-driven control of an air conditioning system
Yin et al. Model predictive control for vapor compression cycle of refrigeration process
CN111258212B (en) An iterative learning refrigeration control system and method based on fractional order
CN112327628A (en) Improved self-adaptive generalized predictive control method for data-driven time-lag system
CN115978856B (en) Whale optimized disturbance compensation Smith pre-estimated control method in air wave refrigeration process
CN115877714B (en) Control method and device for refrigerating system, electronic equipment and storage medium
WO2024075436A1 (en) System and method for data-driven control of an air-conditioning system
Cui et al. Real-time multi-variable optimization of the interstage subcooling vapor-injection based transcritical CO2 HPWH: An integrated model predictive control approach
CN118168276A (en) A closed grain drying system and design method thereof
CN117636001A (en) Online fault diagnosis and monitoring method and monitoring system for nuclear power secondary circuit system
CN110543702B (en) A Modeling Method for Internal Combustion Engine-LiBr Combined Cooling, Heating and Power System
CN110806693A (en) A grey wolf predictive control method for plate heat exchanger time delay
CN114739229A (en) Heat exchange process important parameter control method based on reinforcement learning
US20250190656A1 (en) Polytopic Reduced-Order Model for Prediction, Estimation and Control of Partial Differential Equations
CN119668124B (en) ADRC (automatic dependent variable rate) optimization compensation control method and system based on partial differential flatness characteristics of controlled object
CN116182333B (en) Energy-saving prediction method and device for ice storage air conditioner

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination