CN111601490B - Reinforced learning control method for data center active ventilation floor - Google Patents
Reinforced learning control method for data center active ventilation floor Download PDFInfo
- Publication number
- CN111601490B CN111601490B CN202010456237.6A CN202010456237A CN111601490B CN 111601490 B CN111601490 B CN 111601490B CN 202010456237 A CN202010456237 A CN 202010456237A CN 111601490 B CN111601490 B CN 111601490B
- Authority
- CN
- China
- Prior art keywords
- rack
- time
- value
- active ventilation
- function
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000009423 ventilation Methods 0.000 title claims abstract description 40
- 238000000034 method Methods 0.000 title claims abstract description 27
- 238000004422 calculation algorithm Methods 0.000 claims abstract description 21
- 230000002787 reinforcement Effects 0.000 claims abstract description 21
- 238000005265 energy consumption Methods 0.000 claims abstract description 12
- 230000006870 function Effects 0.000 claims description 28
- 230000006399 behavior Effects 0.000 claims description 27
- 238000004364 calculation method Methods 0.000 claims description 5
- 238000011156 evaluation Methods 0.000 claims description 4
- 238000012423 maintenance Methods 0.000 claims description 3
- 238000012544 monitoring process Methods 0.000 claims description 3
- 238000007726 management method Methods 0.000 abstract description 2
- 238000004378 air conditioning Methods 0.000 abstract 1
- 238000005057 refrigeration Methods 0.000 abstract 1
- 238000001816 cooling Methods 0.000 description 11
- 230000001186 cumulative effect Effects 0.000 description 5
- XEEYBQQBJWHFJM-UHFFFAOYSA-N Iron Chemical compound [Fe] XEEYBQQBJWHFJM-UHFFFAOYSA-N 0.000 description 2
- 230000007423 decrease Effects 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- PCTMTFRHKVHKIS-BMFZQQSSSA-N (1s,3r,4e,6e,8e,10e,12e,14e,16e,18s,19r,20r,21s,25r,27r,30r,31r,33s,35r,37s,38r)-3-[(2r,3s,4s,5s,6r)-4-amino-3,5-dihydroxy-6-methyloxan-2-yl]oxy-19,25,27,30,31,33,35,37-octahydroxy-18,20,21-trimethyl-23-oxo-22,39-dioxabicyclo[33.3.1]nonatriaconta-4,6,8,10 Chemical compound C1C=C2C[C@@H](OS(O)(=O)=O)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H]([C@H](C)CCCC(C)C)[C@@]1(C)CC2.O[C@H]1[C@@H](N)[C@H](O)[C@@H](C)O[C@H]1O[C@H]1/C=C/C=C/C=C/C=C/C=C/C=C/C=C/[C@H](C)[C@@H](O)[C@@H](C)[C@H](C)OC(=O)C[C@H](O)C[C@H](O)CC[C@@H](O)[C@H](O)C[C@H](O)C[C@](O)(C[C@H](O)[C@H]2C(O)=O)O[C@H]2C1 PCTMTFRHKVHKIS-BMFZQQSSSA-N 0.000 description 1
- 230000003044 adaptive effect Effects 0.000 description 1
- 230000003542 behavioural effect Effects 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000033228 biological regulation Effects 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 229910052742 iron Inorganic materials 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 238000013486 operation strategy Methods 0.000 description 1
- 238000007789 sealing Methods 0.000 description 1
- 238000004088 simulation Methods 0.000 description 1
- 239000002699 waste material Substances 0.000 description 1
Images
Classifications
-
- H—ELECTRICITY
- H05—ELECTRIC TECHNIQUES NOT OTHERWISE PROVIDED FOR
- H05K—PRINTED CIRCUITS; CASINGS OR CONSTRUCTIONAL DETAILS OF ELECTRIC APPARATUS; MANUFACTURE OF ASSEMBLAGES OF ELECTRICAL COMPONENTS
- H05K7/00—Constructional details common to different types of electric apparatus
- H05K7/20—Modifications to facilitate cooling, ventilating, or heating
- H05K7/20709—Modifications to facilitate cooling, ventilating, or heating for server racks or cabinets; for data centers, e.g. 19-inch computer racks
- H05K7/20836—Thermal management, e.g. server temperature control
-
- H—ELECTRICITY
- H05—ELECTRIC TECHNIQUES NOT OTHERWISE PROVIDED FOR
- H05K—PRINTED CIRCUITS; CASINGS OR CONSTRUCTIONAL DETAILS OF ELECTRIC APPARATUS; MANUFACTURE OF ASSEMBLAGES OF ELECTRICAL COMPONENTS
- H05K7/00—Constructional details common to different types of electric apparatus
- H05K7/20—Modifications to facilitate cooling, ventilating, or heating
- H05K7/20709—Modifications to facilitate cooling, ventilating, or heating for server racks or cabinets; for data centers, e.g. 19-inch computer racks
Landscapes
- Engineering & Computer Science (AREA)
- Computer Hardware Design (AREA)
- General Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Thermal Sciences (AREA)
- Microelectronics & Electronic Packaging (AREA)
- Air Conditioning Control Device (AREA)
Abstract
Description
技术领域technical field
本发明属于自动控制技术领域,特别涉及数据中心主动通风地板的强化学习控制方法。The invention belongs to the technical field of automatic control, and particularly relates to a reinforcement learning control method for an active ventilation floor of a data center.
背景技术Background technique
机架热点,即数据中心机房机架某一个或几个位置,温度明显高于其他位置温度的高温点。过高的温度会导致数据中心某些服务器工作效率降低,进而降低其整体功率密度,同时也会降低其可靠性,这显然与数据中心的需求相悖。Rack hotspots are high-temperature spots where the temperature of one or several locations on the data center rack is significantly higher than that of other locations. Excessive temperatures can cause some servers in the data center to work less efficiently, thereby reducing their overall power density and reducing their reliability, which is obviously contrary to the needs of the data center.
采用全局调控的方式进行缓解或消除机架热点,例如提升机房空调功率以提供足量冷气,必然会导致大部分机架区域处于过度制冷状态,在造成制冷资源浪费的同时,使得数据中心总能耗中占比近半的制冷能耗更加巨大。因此,机架级制冷方案更适合于缓解机架热点问题。Using global regulation to alleviate or eliminate rack hot spots, such as increasing the power of the air conditioner in the equipment room to provide sufficient cooling air, will inevitably lead to excessive cooling in most of the rack areas, resulting in waste of cooling resources and at the same time making the data center always available. The cooling energy consumption, which accounts for nearly half of the consumption, is even more huge. Therefore, rack-level cooling solutions are more suitable for alleviating rack hotspot issues.
目前已有机架级制冷方案,例如安装自适应通风地板、安装挡板、封闭单个机架并为其设置通风管等。但这些方案皆为“被动式”制冷方案,不能主动为机架提供冷气流,当冷气供应不足时,这些方案都无能为力。Rack-level cooling solutions exist, such as installing adaptive ventilation floors, installing baffles, and enclosing and ducting individual racks. However, these solutions are all "passive" cooling solutions, which cannot actively provide cold airflow to the racks. When the cooling air supply is insufficient, these solutions are powerless.
主动通风地板作为另一种机架级制冷方案,通过主动输送冷气的方式缓解机架热点问题,相较于上述方案更容易部署,更具成本效益,但其控制的难点主要在于其放置环境的多样性与动态性,例如机房空调、机架相对位置以及机架内部服务器分布不同;冷、热通道封闭状态不同,服务器机架标准和密封情况不同;机房空调功率、不同机架服务器的热负载不同,等等。因此,数据中心的热能效与气流模型,一般难以用解析模型进行描述。As another rack-level cooling solution, active ventilation floors can alleviate the problem of rack hot spots by actively transporting cold air. Compared with the above solutions, it is easier to deploy and more cost-effective, but the difficulty of its control mainly lies in the placement environment. Diversity and dynamism, such as different computer room air conditioners, relative positions of racks, and server distribution inside the rack; different closed states of cold and hot aisles, different server rack standards and sealing conditions; computer room air conditioner power, heat load of servers in different racks different, wait. Therefore, the thermal energy efficiency and airflow models of data centers are generally difficult to describe with analytical models.
现有的主动通风地板相关研究大多是基于测量或仿真的性能建模和评估,目前还没有主动通风地板控制问题的研究文献。Most of the existing active ventilation floor related research is based on measurement or simulation performance modeling and evaluation, and there is no research literature on the control problem of active ventilation floor.
发明内容SUMMARY OF THE INVENTION
为了克服上述现有技术的缺点,本发明的目的在于提供一种数据中心主动通风地板的强化学习控制方法,在不提升机房空调功率的前提下,自动学习最优运行策略,规划机架气流,使机架温度分布均匀化,缓解机架热点问题。且不必建立和校准复杂气流和热交换模型,从而提高主动通风地板的普适性。In order to overcome the above-mentioned shortcomings of the prior art, the purpose of the present invention is to provide a reinforcement learning control method for an active ventilation floor of a data center, which can automatically learn the optimal operation strategy and plan the rack airflow without increasing the power of the air conditioner in the computer room. Uniform rack temperature distribution and alleviate rack hot spots. And there is no need to build and calibrate complex airflow and heat exchange models, thereby improving the universality of active ventilation floors.
为了实现上述目的,本发明采用的技术方案是:In order to achieve the above object, the technical scheme adopted in the present invention is:
一种数据中心主动通风地板的强化学习控制方法,对抬升地板结构数据中心的机架热点问题建立马尔可夫决策过程模型,并提供一种强化学习模型求解算法,阵列式算法,作为强化学习控制算法的核心。所述模型由系统状态、行为、奖励和价值函数四部分组成,所述模型的解为,在一系列系统状态下不断选择最优行为,使得系统累计奖励最大化,所述强化学习控制算法,利用机架入风口温度分布是否均匀以及主动通风地板能耗是否较低作为评价标准,通过不断探索和学习PWM信号占空比值与该值升高、降低或者维持不变之间的复杂关系,调节主动通风地板风扇转速,使得机架入风口温度分布均匀化,缓解机架热点问题。A reinforcement learning control method for active ventilation floors of data centers, establishing a Markov decision process model for the rack hotspot problem of a data center with a raised floor structure, and providing a reinforcement learning model solving algorithm, an array algorithm, as reinforcement learning control. The heart of the algorithm. The model consists of four parts: system state, behavior, reward and value function. The solution of the model is to continuously select the optimal behavior under a series of system states to maximize the cumulative reward of the system. The reinforcement learning control algorithm , using whether the temperature distribution of the air inlet of the rack is uniform and whether the energy consumption of the active ventilation floor is low as the evaluation criteria, by continuously exploring and learning the complex relationship between the PWM signal duty cycle value and the increase, decrease or maintenance of this value, Adjust the fan speed of the active ventilation floor to make the temperature distribution of the air inlet of the rack uniform and alleviate the problem of rack hot spots.
与现有技术相比,本发明的有益效果是:Compared with the prior art, the beneficial effects of the present invention are:
本发明不必建立和校准复杂的气流和热交换模型,使用阵列式控制算法,克服主动通风地板放置环境的多样性和动态性,根据机架入风口温度分布是否均匀以及主动通风地板能耗,自动匹配PWM信号占空比值与该值升高、降低或维持不变之间的关系,只需要将原普通通风地板置换为运行本发明的主动通风地板,本发明即可自主运行,找到最优PWM信号占空比值,调节主动通风地板转速,改善机架入风口温度分布,缓解机架热点问题,相比其他方案,本发明普适性更高,更易部署,更具成本效益。The present invention does not need to establish and calibrate complex air flow and heat exchange models, uses an array control algorithm, overcomes the diversity and dynamics of the placement environment of the active ventilation floor, automatically To match the relationship between the duty cycle value of the PWM signal and the value of increasing, decreasing or maintaining the same value, it is only necessary to replace the original ordinary ventilated floor with the active ventilated floor running the present invention, and the present invention can operate autonomously and find the optimal PWM Compared with other solutions, the present invention is more universal, easier to deploy, and more cost-effective than other solutions.
相较于使用三种智能算法的智能控制方法,使用阵列式算法的强化学习控制方法更加简单,所需计算资源开销较小。Compared with the intelligent control method using three intelligent algorithms, the reinforcement learning control method using the array algorithm is simpler and requires less computational resources.
相较于使用阵列式算法的强化学习控制方法,使用所述三种智能算法的智能控制方法关于状态和行为的定义对解决热点问题更加直接有效,且非离散化的状态定义以及对Q函数的近似,强化了智能控制方法的普适性。Compared with the reinforcement learning control method using the array algorithm, the intelligent control method using the three intelligent algorithms has a more direct and effective definition of states and behaviors for solving hotspot problems, and the non-discrete state definition and the Q-function definition are more direct and effective. The approximation strengthens the universality of the intelligent control method.
附图说明Description of drawings
图1为主动通风地板设计及部署图。图中标号1为温度传感器,2为机架,3为微控制器,4为驱动板,5为开关电源,6为PC,7为主动通风地板。Figure 1 is an active ventilation floor design and deployment diagram.
具体实施方式Detailed ways
下面结合附图和实施例详细说明本发明的实施方式。The embodiments of the present invention will be described in detail below with reference to the accompanying drawings and examples.
图1为本发明的详细部署实施示意图,一定数量的温度传感器一1均匀分布在机架2入风口处,监测机架2入风口温度分布,同时在主动通风地板下另设一个温度传感器二,监测主动通风地板下送风温度。1 is a schematic diagram of the detailed deployment implementation of the present invention. A certain number of
本领域中,机架2是一个长方体铁盒子,里面放一定数量的服务器,许多机架一排一排摆放。在某一排机架中,一般某一机架左右面板与其他机架紧贴,机架前面板即为入风口,用来吸冷气制冷服务器,机架后面板为出风口,用来排出制冷后的热气,监测机架入风口温度分布即监测机架前面板某些位置的温度,这些位置的温度组成了机架入风口温度分布,因此温度传感器一1的个数取决于这些位置的数量。In the art, the
本发明主动通风地板强化学习控制方法运行于PC端,PC6与微控制器3连接,微控制器3连接驱动板4,驱动板4在连接开关电源5(12V,20A)后与主动通风地板风扇7连接。根据温度传感器一1传回的温度分布,产生PWM信号的占空比值,并传给微控制器3,微控制器3据此占空比值,产生相应PWM信号,传输给驱动板4,驱动板4根据PWM信号控制开关电源5提供给主动通风地板风扇7的电压,通过控制风扇供电电压,达到调节风扇转速的目的。The active ventilation floor reinforcement learning control method of the present invention runs on the PC side, the PC6 is connected to the
控制方法包括以下部分:The control method includes the following parts:
1、对抬升地板结构(数据中心的送风结构,数据中心机房地板被架高,留出60-100cm高的地板下空间用于机房空调输送冷气,这种结构即为抬升地板结构,目前国内大部分数据中心均采用这种构造)数据中心的机架热点问题建立马尔可夫决策过程模型,由以下ABCD四部分组成:1. For the raised floor structure (the air supply structure of the data center, the floor of the data center computer room is raised, leaving a 60-100cm high under-floor space for the air conditioner of the computer room to deliver cold air. This structure is the raised floor structure. At present, domestic Most data centers use this structure) The rack hotspot problem of the data center establishes a Markov decision process model, which consists of the following four parts: ABCD:
A系统状态st,定义为离散化的PWM信号方波占空比,公式如下:A system state s t is defined as the duty cycle of the discretized PWM signal square wave, the formula is as follows:
st为t时刻系统状态,为状态空间,s为中的某一系统状态,DC为PWM信号方波占空比数值,max(DC)为DC最大值,DTQ为DC离散化等分比,k表示某个状态中DTQ的个数。s t is the system state at time t, is the state space, and s is For a certain system state in , DC is the duty cycle value of the square wave of the PWM signal, max(DC) is the maximum value of DC, D TQ is the equal division ratio of DC discretization, and k represents the number of D TQ in a certain state.
B系统行为空间定义为主动通风地板风扇转速的变化,即 B system behavior space Defined as the change in the fan speed of the active ventilation floor, i.e.
C奖励Rt+1,由机架入风口温度分布均匀程度的量化指标及主动通风地板风扇能耗两部分构成,其公式为:The C reward R t+1 is composed of the quantitative index of the uniformity of the temperature distribution of the air inlet of the rack and the energy consumption of the active ventilation floor fan. The formula is:
其中Rt+1为t时刻系统采取某行为后所得的奖励,表示机架入风口温度分布均匀程度,该式值全为负,越接近0,表明机架入风口温度分布越均匀,Tt,i为t时刻编号为i的温度传感器一的温度读数,为t时刻机架参考温度,Tt,under为t时刻温度传感器二的读数,ΔT为根据主动通风地板上下冷热气流混合程度设置的固定温度差,为正数,为温度传感器一的集合,为温度传感器一的总数;-(Aref×DCt)3表示主动通风地板风扇能耗,该式的值全为负,越接近0,表明风扇能耗越低,其中Aref为保持与机架入风口温度分布均匀程度同一量级的参考行为值,DCt为t时刻PWM信号方波占空比。where R t+1 is the reward obtained by the system after taking a certain behavior at time t, Indicates the uniformity of the temperature distribution of the air inlet of the rack. The value of this formula is all negative. The closer to 0, the more uniform the temperature distribution of the air inlet of the rack is. T t,i is the temperature reading of the temperature sensor number i at time t. is the rack reference temperature at time t, T t,under is the reading of
D价值函数Q(st,at),为行为价值函数,其公式为:D value function Q(s t , at t ) is a behavioral value function, and its formula is:
其中价值函数Q(s,a)称为Q函数,为t时刻系统采取的行为,为期望函数,y为相对于t时刻的未来时刻,Rt+y+1表示系统在t+y时刻采取行为后获得的奖励,γ表示衰减因子,表示模型对未来奖励(环境影响)的重视程度,0≤γ<1,γy为γ的y次方,是t+y时刻Rt+y+1的衰减因子。where the value function Q(s, a) is called the Q function, is the action taken by the system at time t, is the expectation function, y is the future time relative to time t, R t+y+1 represents the reward obtained by the system after taking action at time t+y, γ represents the decay factor, which represents the importance of the model to the future reward (environmental impact) degree, 0≤γ<1, γy is the y power of γ, which is the attenuation factor of R t+y+1 at time t+y.
E马尔可夫决策过程模型可以被总结为,在任意t时刻系统状态下,通过选择最优行为,使得累计奖励最大化,其模型公式为:The E-Markov decision process model can be summarized as, in the system state at any time t, by selecting the optimal behavior to maximize the cumulative reward, the model formula is:
约束于bound to
γt是t时刻系Rt+1的衰减因子。γ t is the decay factor of the system R t+1 at time t.
2、模型的解及求解算法2. Model solution and solution algorithm
a模型的解,计算得到最优Q函数,即可根据最优Q函数在任意t时刻系统状态下选择最优行为,使累计奖励最大化,最优Q函数计算公式为:The solution of model a can be calculated to obtain the optimal Q function, and then the optimal behavior can be selected according to the optimal Q function in the system state at any time t to maximize the cumulative reward. The calculation formula of the optimal Q function is:
在任意t时刻,最优行为选择公式为:At any time t, the optimal behavior selection formula is:
其中Q*(st,at)表示最优Q函数,st+1表示t+1时刻系统状态,a表示在t+1时刻系统可能采取的所有行为中的任一行为,亦即行为空间中的某一行为,表示在st+1状态下,系统采取任意一个中的行为,能得到的最大的最优Q函数值。where Q * (s t , at ) represents the optimal Q function, s t +1 represents the state of the system at
b求解算法即为,计算得到最优Q函数并在决策中选择选择最优行为,使得累计奖励最大化。强化学习模型求解算法为阵列式算法,采用二维阵列(行索引为状态,列索引为行为)存储所述Q函数,通过计算Q样本值Qt+1,target与Q查询值Qt(st,at)之差δt+1,迭代更新阵列中的Q值,计算最优Q函数,进而通过查询阵列选择最优行为,使得所述模型的累计奖励最大化。其中Q样本值根据最优Q函数计算公式,以及实时系统所得Rt+1和st+1计算得到,Q查询值为根据系统实时所得st和at,到二维阵列中对应行列查询所得值。The b solution algorithm is to calculate the optimal Q function and select the optimal behavior in the decision-making, so as to maximize the cumulative reward. The reinforcement learning model solving algorithm is an array algorithm, using a two-dimensional array (row index is the state, column index is the behavior) to store the Q function, by calculating the Q sample value Q t+1, target and Q query value Q t (s t , at t ) difference δ t+1 , iteratively update the Q value in the array, calculate the optimal Q function, and then select the optimal behavior by querying the array to maximize the cumulative reward of the model. The Q sample value is calculated according to the optimal Q function calculation formula and R t+1 and s t+1 obtained by the real-time system, and the Q query value is obtained according to the real-time s t and at of the system, and the corresponding row and column query in the two-dimensional array obtained value.
Q样本值计算公式如下:The formula for calculating the sample value of Q is as follows:
其中为t时刻所述二维阵列st+1对应行中最大的Q查询值,阵列更新方式为:in is the largest Q query value in the row corresponding to the two-dimensional array s t+1 at time t, and the array update method is:
其中Qt(st,at)为t时刻二维阵列中st和at对应的Q查询值,Qt+1(st,at)为t+1时刻二维阵列中st和at对应的Q查询值,β(st,at)∈[0,1]为阵列中每个状态-行为对对应的学习步长。where Q t (s t , at t ) is the Q query value corresponding to s t and at t in the two-dimensional array at time t, and Q t+1 (s t , at t ) is the s t in the two-dimensional array at time t+1 Q query value corresponding to a t , β(s t , at t )∈[0,1] is the learning step size corresponding to each state-action pair in the array.
3,采用强化学习模型求解算法对所述模型求解,利用机架入风口温度分布是否均匀以及主动通风地板能耗是否较低作为评价标准,通过不断探索和学习PWM信号占空比值与该值升高、降低或者维持不变之间的复杂关系,调节主动通风地板风扇转速,使得机架入风口温度分布均匀化,缓解机架热点问题。其在PC端的运行逻辑如下:3. Use the reinforcement learning model solving algorithm to solve the model, using whether the temperature distribution of the air inlet of the rack is uniform and whether the energy consumption of the active ventilation floor is low as the evaluation criteria, through continuous exploration and learning of the PWM signal duty cycle value and this value increase. The complex relationship between high, low or unchanged, adjust the active ventilation floor fan speed, make the temperature distribution of the rack air inlet uniform, and alleviate the problem of rack hot spots. Its operation logic on the PC side is as follows:
1:设置参考温度初始化β(st,at);初始化所述阵列;1: Set the reference temperature initialize β( s t , at ); initialize the array;
2:设置初始时刻t=0;探索概率变化区间random_slots;初始行为探索概率ε,探索率随t减少量Δε,最小探索概率εmin;2: Set the initial time t=0; explore the probability change interval random_slots; initial behavior exploration probability ε, the exploration rate decreases with t Δ ε , the minimum exploration probability ε min ;
3:选取初始状态s0=max(DC);3: Select the initial state s 0 =max(DC);
4:循环体开始4: The loop body starts
5:若t小于random_slots,行为从行为空间随机选择并转7,否则转6;5: If t is less than random_slots, the behavior is randomly selected from the behavior space and turned to 7, otherwise it is turned to 6;
6:探索概率ε取ε-Δε和εmin中的最小值,并根据以下公式选择行为:6: The exploration probability ε takes the minimum of ε- Δε and ε min and chooses the behavior according to the following formula:
7:执行at(PC发送占空比指令到微控制器),并获得系统下一状态st+1(PC发送温度请求指令获得机架温度分布),根据奖励公式计算Rt+1;7: Execute at (the PC sends the duty cycle command to the microcontroller), and obtain the next state of the system s t +1 (the PC sends the temperature request command to obtain the rack temperature distribution), and calculate R t+1 according to the reward formula;
8:根据公式更新阵列中对应值;8: Update the corresponding value in the array according to the formula;
9:时刻t增加1;9: time t increases by 1;
10:循环体结束。10: The loop body ends.
综上,本发明对抬升地板结构数据中心的机架热点问题建立马尔可夫决策过程模型,并提供一种强化学习模型求解算法,作为强化学习控制算法的核心,在不提升机房空调功率的前提下,根据当前机架温度分布,智能控制主动通风地板(在普通通风地板背部附装风扇的地板)风扇转速,通过这种主动输送足量冷气的方式,使得机架入风口温度分布均匀化,缓解抬升地板结构的数据中心普遍存在的机架热点问题,从而节约制冷能耗,保证服务器的安全性和稳定性。与现有的数据中心机架级气流管理方法相比,本发明更容易部署,更具成本效益,普适性更强。To sum up, the present invention establishes a Markov decision process model for the rack hotspot problem of a data center with a raised floor structure, and provides a reinforcement learning model solving algorithm, which is the core of the reinforcement learning control algorithm without increasing the power of the computer room air conditioner. According to the current rack temperature distribution, the fan speed of the active ventilation floor (the floor with fans attached to the back of the ordinary ventilation floor) is intelligently controlled, and through this method of actively delivering sufficient cold air, the temperature distribution of the air inlet of the rack is evenly distributed. Alleviate the rack hotspot problem common in data centers with raised floor structures, thereby saving cooling energy consumption and ensuring the security and stability of servers. Compared with existing data center rack-level airflow management methods, the present invention is easier to deploy, more cost-effective, and more universal.
Claims (3)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202010456237.6A CN111601490B (en) | 2020-05-26 | 2020-05-26 | Reinforced learning control method for data center active ventilation floor |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202010456237.6A CN111601490B (en) | 2020-05-26 | 2020-05-26 | Reinforced learning control method for data center active ventilation floor |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN111601490A CN111601490A (en) | 2020-08-28 |
| CN111601490B true CN111601490B (en) | 2022-08-02 |
Family
ID=72186518
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN202010456237.6A Active CN111601490B (en) | 2020-05-26 | 2020-05-26 | Reinforced learning control method for data center active ventilation floor |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN111601490B (en) |
Families Citing this family (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN114020079B (en) * | 2021-11-03 | 2022-09-16 | 北京邮电大学 | Indoor space temperature and humidity regulation and control method and device |
| CN120596339B (en) * | 2025-08-07 | 2025-10-28 | 苏州元脑智能科技有限公司 | Temperature control method, device, electronic device, storage medium and product |
Citations (8)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JPH05159075A (en) * | 1991-12-03 | 1993-06-25 | Nippon Telegr & Teleph Corp <Ntt> | Interpolation method based on Markov random field with continuous values |
| CN103473613A (en) * | 2013-09-09 | 2013-12-25 | 武汉理工大学 | Landscape structure-surface temperature-electricity consumption coupling model and application thereof |
| JP2015082224A (en) * | 2013-10-23 | 2015-04-27 | 日本電信電話株式会社 | Probabilistic server load estimation method and server load estimation apparatus |
| CN106528941A (en) * | 2016-10-13 | 2017-03-22 | 内蒙古工业大学 | Data center energy consumption optimization resource control algorithm under server average temperature constraint |
| CN108446783A (en) * | 2018-01-29 | 2018-08-24 | 杭州电子科技大学 | A kind of prediction of new fan operation power and monitoring method |
| WO2019154739A1 (en) * | 2018-02-07 | 2019-08-15 | Abb Schweiz Ag | Method and system for controlling power consumption of a data center based on load allocation and temperature measurements |
| CN110322977A (en) * | 2019-07-10 | 2019-10-11 | 河北工业大学 | A kind of analysis method for reliability of nuclear power reactor core water level monitoring system |
| CN111144793A (en) * | 2020-01-03 | 2020-05-12 | 南京邮电大学 | Commercial building HVAC control method based on multi-agent deep reinforcement learning |
Family Cites Families (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US8478451B2 (en) * | 2009-12-14 | 2013-07-02 | Intel Corporation | Method and apparatus for dynamically allocating power in a data center |
| US20130226501A1 (en) * | 2012-02-23 | 2013-08-29 | Infosys Limited | Systems and methods for predicting abnormal temperature of a server room using hidden markov model |
| US20140324240A1 (en) * | 2012-12-14 | 2014-10-30 | Alcatel-Lucent Usa Inc. | Method And System For Disaggregating Thermostatically Controlled Appliance Energy Usage From Other Energy Usage |
| JP7134949B2 (en) * | 2016-09-26 | 2022-09-12 | ディー-ウェイブ システムズ インコーポレイテッド | Systems, methods, and apparatus for sampling from a sampling server |
| US20180100662A1 (en) * | 2016-10-11 | 2018-04-12 | Mitsubishi Electric Research Laboratories, Inc. | Method for Data-Driven Learning-based Control of HVAC Systems using High-Dimensional Sensory Observations |
-
2020
- 2020-05-26 CN CN202010456237.6A patent/CN111601490B/en active Active
Patent Citations (8)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JPH05159075A (en) * | 1991-12-03 | 1993-06-25 | Nippon Telegr & Teleph Corp <Ntt> | Interpolation method based on Markov random field with continuous values |
| CN103473613A (en) * | 2013-09-09 | 2013-12-25 | 武汉理工大学 | Landscape structure-surface temperature-electricity consumption coupling model and application thereof |
| JP2015082224A (en) * | 2013-10-23 | 2015-04-27 | 日本電信電話株式会社 | Probabilistic server load estimation method and server load estimation apparatus |
| CN106528941A (en) * | 2016-10-13 | 2017-03-22 | 内蒙古工业大学 | Data center energy consumption optimization resource control algorithm under server average temperature constraint |
| CN108446783A (en) * | 2018-01-29 | 2018-08-24 | 杭州电子科技大学 | A kind of prediction of new fan operation power and monitoring method |
| WO2019154739A1 (en) * | 2018-02-07 | 2019-08-15 | Abb Schweiz Ag | Method and system for controlling power consumption of a data center based on load allocation and temperature measurements |
| CN110322977A (en) * | 2019-07-10 | 2019-10-11 | 河北工业大学 | A kind of analysis method for reliability of nuclear power reactor core water level monitoring system |
| CN111144793A (en) * | 2020-01-03 | 2020-05-12 | 南京邮电大学 | Commercial building HVAC control method based on multi-agent deep reinforcement learning |
Also Published As
| Publication number | Publication date |
|---|---|
| CN111601490A (en) | 2020-08-28 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN104698843B (en) | A kind of data center's energy-saving control method based on Model Predictive Control | |
| US8594857B2 (en) | Modulized heat-dissipation control method for datacenter | |
| US10888028B2 (en) | Chassis intelligent airflow control and cooling regulation mechanism | |
| WO2024113906A1 (en) | Server cluster temperature adjustment method and device | |
| CN111601490B (en) | Reinforced learning control method for data center active ventilation floor | |
| Wan et al. | Intelligent rack-level cooling management in data centers with active ventilation tiles: A deep reinforcement learning approach | |
| CN104728997A (en) | Air conditioner for constant temperature control, constant temperature control system and constant temperature control method | |
| CN114511208A (en) | Optimal control method of data center energy consumption based on deep reinforcement learning | |
| US20140206272A1 (en) | Container-type data center and method for controlling container-type data center | |
| CN117519980A (en) | Energy-saving data center | |
| CN111836524A (en) | IT load change-based method for regulating and controlling variable air volume of precision air conditioner between data center columns | |
| WO2025152564A1 (en) | Control method for thermal management system, electronic device, computer readable storage medium, and vehicle | |
| CN110263974B (en) | Regional energy management system and management method based on distributed optimization algorithm | |
| CN118151689A (en) | Control system and method for adjusting temperature of temperature control equipment in machine room | |
| CN119247786B (en) | Thermal-electric integrated optimization control method for intelligent connected fuel cell vehicles | |
| CN115793751A (en) | Battery cabinet heat management method and device, battery cabinet and readable storage medium | |
| CN111637614A (en) | Intelligent control method for active ventilation floor in data center | |
| Pramanik | Electronic Cooler Technologies and Superior Data Center Cooling Techniques | |
| CN117608331A (en) | Temperature control methods, devices, equipment and storage media for data center computer rooms | |
| CN120645625A (en) | Thermal management system for a vehicle | |
| CN119451038A (en) | Micro-module data center air conditioning group control method and system | |
| CN111787768A (en) | A data center fan and its control method | |
| CN119374202A (en) | A method and system for multi-brand central air conditioning certification and adjustment based on mathematical hybrid model driving | |
| CN114368279A (en) | Vehicle thermal management control system and method | |
| CN110595008A (en) | A multi-equipment collaborative optimization method and system for a ground source heat pump air conditioning system |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| GR01 | Patent grant | ||
| GR01 | Patent grant |