WO2023207803A1 - Multi-agent navigation control method and device based on gene regulatory network, and medium - Google Patents
Multi-agent navigation control method and device based on gene regulatory network, and medium Download PDFInfo
- Publication number
- WO2023207803A1 WO2023207803A1 PCT/CN2023/089865 CN2023089865W WO2023207803A1 WO 2023207803 A1 WO2023207803 A1 WO 2023207803A1 CN 2023089865 W CN2023089865 W CN 2023089865W WO 2023207803 A1 WO2023207803 A1 WO 2023207803A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- grid
- agent
- state
- concentration
- map
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Classifications
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05D—SYSTEMS FOR CONTROLLING OR REGULATING NON-ELECTRIC VARIABLES
- G05D1/00—Control of position, course, altitude or attitude of land, water, air or space vehicles, e.g. using automatic pilots
- G05D1/02—Control of position or course in two dimensions
- G05D1/021—Control of position or course in two dimensions specially adapted to land vehicles
- G05D1/0212—Control of position or course in two dimensions specially adapted to land vehicles with means for defining a desired trajectory
- G05D1/0217—Control of position or course in two dimensions specially adapted to land vehicles with means for defining a desired trajectory in accordance with energy consumption, time reduction or distance reduction criteria
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01C—MEASURING DISTANCES, LEVELS OR BEARINGS; SURVEYING; NAVIGATION; GYROSCOPIC INSTRUMENTS; PHOTOGRAMMETRY OR VIDEOGRAMMETRY
- G01C21/00—Navigation; Navigational instruments not provided for in groups G01C1/00 - G01C19/00
- G01C21/20—Instruments for performing navigational calculations
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02P—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
- Y02P90/00—Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
- Y02P90/02—Total factory control, e.g. smart factories, flexible manufacturing systems [FMS] or integrated manufacturing systems [IMS]
Definitions
- the present invention relates to but is not limited to the field of navigation technology, and in particular, to a multi-agent navigation control method, equipment and medium based on a gene regulatory network.
- the A* algorithm is widely used in path planning tasks involved in intelligent navigation systems.
- the operation process of the A* algorithm requires the cost calculation of all grids on the map in sequence, that is, the movement costs of all possible routes need to be calculated. The calculation cost is high.
- the obstacles or target points may fall on points that have been traversed, and the A* algorithm will no longer be applicable.
- Embodiments of the present invention provide a multi-agent navigation control method, equipment and medium based on a gene regulatory network, which can adaptively find the optimal navigation path in real time.
- embodiments of the present invention provide a multi-agent navigation control method based on a gene regulatory network, including:
- Step S100 obtain the plane map where multiple agents are located
- Step S200 Convert the flat map into a grid map, and determine the concentration information corresponding to each grid in the grid map; wherein the concentration information includes the concentration information field of the target position and the concentration information of the obstacle position. field;
- Step S300 Determine the current grid and target grid of the agent to be navigated, and determine the optimal path for the agent from the current grid to the target grid based on the concentration information corresponding to each grid in the grid map. ;
- the current grid is the grid where the agent is currently located, and the target grid is the grid where the target position is located in the grid map;
- Step S400 While the agent is moving along the optimal path, vacant grids are screened out from the eight neighborhood grids adjacent to the current grid, and the concentration evaluation value of each vacant grid is determined. , the first distance evaluation value and the second distance evaluation value; wherein, the vacant grid is a grid passable by the agent, and the concentration evaluation value is the grid where the agent is currently located and the vacant grid.
- Step S500 Determine the cost evaluation value of each vacant grid according to the concentration evaluation value, the first distance evaluation value and the second distance evaluation value, and use the vacant grid with the smallest cost evaluation value as the grid to be entered;
- Step S600 Determine the obstacle distance between the grid to be entered and the grid where the obstacle is located, determine the operating state of the intelligent agent based on the obstacle distance, and control the intelligent agent to move toward the target according to the operating state of the intelligent agent. Position forward.
- converting the flat map into a grid map and determining the concentration information corresponding to each grid in the grid map includes:
- each grid in the target concentration map contains the concentration information field of the target location, and the target concentration map contains Each grid contains the concentration information field of the obstacle location;
- the target concentration map and the obstacle concentration map are coupled according to corresponding grids to obtain concentration information corresponding to each grid in the grid map.
- the calculation formula relied upon to generate the target concentration map is:
- p 1 is the concentration information field generated by the target
- b 1 is the adjustable parameter that affects the concentration value of each grid in the target concentration map
- v 1 is the concentration diffusion factor, which is used to adjust the mapping relationship between distance parameters and concentration parameters.
- r 1 is the relative distance between the center of each grid in the target concentration map and the location of the target;
- p 2 is the concentration information field generated by the obstacle
- b 2 is the adjustable parameter that affects the concentration value of each grid in the obstacle concentration map
- v 2 is the concentration diffusion factor
- r 2 is each grid in the target concentration map. The relative distance between the grid center and the location of the obstacle;
- the calculation formula for the concentration information corresponding to each grid in the grid map is:
- g 1 is the morphological gradient space presented by the target concentration map
- g 2 is the morphological gradient space presented by the obstacle concentration map
- g 3 is the morphology presented by coupling the concentration information field of the target position and the concentration information field of the obstacle position.
- ⁇ 1 is an adjustable parameter that affects the concentration value range of the target concentration map
- k 1 is an adjustable parameter that affects the concentration difference between adjacent grids of the target concentration map
- ⁇ 2 is the concentration that affects the obstacle concentration map
- the adjustable parameter of the value range k 2 is the adjustable parameter that affects the concentration difference between adjacent grids in the obstacle concentration map
- ⁇ 3 is the adjustable parameter that affects the concentration value range of the grid in the grid map
- k 3 It is an adjustable parameter that affects the concentration difference between adjacent grids in the raster map.
- the calculation formula of the cost evaluation value is:
- P e is the cost evaluation value for estimating that the agent to be navigated can reach the grid at the next moment
- C e is the concentration evaluation value for estimating that the agent to be navigated can reach the grid at the next moment
- D p In order to evaluate the first distance evaluation value that the agent to be navigated can reach to the grid at the next moment, D t is to evaluate the second distance evaluation value that the agent to be navigated can reach the grid at the next moment, a, b and c are both weight parameters
- C N is the concentration of the grid that the agent to be navigated can reach at the next moment.
- C M is the concentration value of the grid where the agent to be navigated is currently located
- C min is the preset minimum optional grid concentration
- C max is the preset maximum optional grid concentration
- N x is the abscissa coordinate of the grid that the agent to be navigated can reach at the next moment
- N y is the ordinate coordinate of the grid that the agent to be navigated can reach at the next moment
- P x is the coordinate of the grid that the agent to be navigated can reach at the next moment.
- P y is the ordinate coordinate of the grid where the agent to be navigated was located at the last moment
- T x is the abscissa coordinate of the target location
- T y is the target location. ordinate of .
- determining the obstacle distance between the grid to be entered and the grid where the obstacle is located, and determining the operating status of the agent based on the relationship between the distance and a preset safety distance includes:
- Step S610 Determine whether the obstacle distance is greater than the safety distance. If the obstacle distance is greater than the safety distance, execute step S620; otherwise, execute step S660;
- Step S620 Control the intelligent agent to move forward toward the target position in a first state; wherein the first state is to move forward while avoiding obstacles;
- Step S630 During the process of the intelligent agent moving forward in the first state, if the trigger points marked by other intelligent agents are detected, the operating state of the intelligent agent is changed from the first state to the fourth state, and the intelligent agent is controlled. The agent moves toward the target position in the fourth state; wherein the fourth state is to move along the shortcut;
- Step S640 During the process of the agent moving from the first state to the fourth state, if the end points marked by other agents are detected, the running state of the agent is changed from the fourth state to the fourth state.
- the intelligent agent In one state, the intelligent agent is controlled to move toward the target position in the first state; if the obstacle is detected to be within a safe distance, the operating state of the intelligent agent is transferred from the fourth state to the second state. , controlling the intelligent agent to move forward toward the target position in the second state; wherein the second state is to move along the edge of the obstacle;
- Step S650 During the process of the intelligent agent moving from the fourth state to the second state, if it is detected that the obstacle distance is greater than the safe distance, the operating state of the intelligent agent is converted from the second state to the second state. In the fourth state, the intelligent agent is controlled to move toward the target position in the fourth state;
- Step S660 Change the running state of the intelligent agent from the first state to the second state, and control the intelligent agent to move toward the target position in the second state;
- Step S670 During the process of the intelligent agent changing from the first state to moving forward in the second state, if it is detected that the obstacle distance is greater than the safe distance, the operating state of the intelligent agent is changed from the second state to the second state.
- the intelligent agent In the first state, the intelligent agent is controlled to move forward toward the target position in the first state; if the obstacle distance is detected to be less than the field boundary distance, the location where the obstacle distance is detected to be less than the field boundary distance is marked as a trigger.
- the running state of the intelligent agent is changed from the second state to the third state, and the intelligent agent is controlled to move toward the target position in the third state; wherein the third state is to follow the original path from the trigger point return.
- the method further includes:
- step S500 is executed
- embodiments of the present invention also provide an electronic device, including: a memory, a processor, and a computer program stored in the memory and executable on the processor.
- the processor executes the computer program, the following is implemented: The multi-agent navigation control method based on the gene regulatory network described in the first aspect.
- embodiments of the present invention also provide a computer-readable storage medium that stores computer-executable instructions, and the computer-executable instructions are used to execute the multi-agent method based on the gene regulation network as described in the first aspect. Navigation control methods.
- Embodiments of the present invention include: obtaining a plane map where multiple agents are located; converting the plane map into a grid map, and determining concentration information corresponding to each grid in the grid map; wherein the concentration information includes The concentration information field of the target position and the concentration information field of the obstacle position; determine the current grid and target grid of the agent to be navigated, and determine the agent according to the concentration information corresponding to each grid in the grid map
- the optimal path from the current grid to the target grid wherein, the current grid is the grid where the agent is currently located, and the target grid is the grid where the target position is located in the grid map; in While the agent is moving along the optimal path, it selects vacant grids from the eight neighborhood grids adjacent to the current grid, and determines the concentration evaluation value and first distance of each vacant grid.
- the difference between the information, the first distance evaluation value is the distance difference between the agent's previous grid and the vacant grid, and the second distance evaluation value is the distance between the vacant grid and the vacant grid.
- the distance difference between the target grids determine the cost evaluation value of each vacant grid according to the concentration evaluation value, the first distance evaluation value and the second distance evaluation value, and use the vacant grid with the smallest cost evaluation value as the candidate grid. Enter the grid; determine the obstacle distance between the grid to be entered and the grid where the obstacle is located, determine the operating state of the intelligent agent based on the obstacle distance, and control the direction of the intelligent agent according to the operating state of the intelligent agent. Move forward to the target location.
- Figure 1 is a schematic flow chart of a multi-agent navigation control method based on a gene regulatory network in an embodiment of the present invention
- Figure 2 is a specific schematic diagram of the target concentration map applied in the first scenario according to the embodiment of the present invention.
- Figure 3 is a specific schematic diagram of the obstacle concentration map applied in the first scenario according to the embodiment of the present invention.
- Figure 4 is a specific schematic diagram of the overall concentration map applied in the first scenario according to the embodiment of the present invention.
- Figure 5 is a specific schematic diagram of multi-agent navigation applied in the first scenario according to the embodiment of the present invention.
- Figure 6 is a specific schematic diagram of the target concentration map applied in the second scenario according to the embodiment of the present invention.
- Figure 7 is a specific schematic diagram of the obstacle concentration map applied in the second scenario according to the embodiment of the present invention.
- Figure 8 is a specific schematic diagram of the overall concentration map applied in the second scenario according to the embodiment of the present invention.
- Figure 9 is a specific schematic diagram of multi-agent navigation applied in the second scenario according to the embodiment of the present invention.
- Figure 10 is a specific schematic diagram of the target concentration map applied in the third scenario according to the embodiment of the present invention.
- Figure 11 is a specific schematic diagram of the obstacle concentration map applied in the third scenario according to the embodiment of the present invention.
- Figure 12 is a specific schematic diagram of the overall concentration map applied in the third scenario according to the embodiment of the present invention.
- Figure 13 is a specific schematic diagram of multi-agent navigation applied in the third scenario according to the embodiment of the present invention.
- Figure 14 is a specific illustration of the application of the obstacle avoidance control strategy in the fourth scenario of single agent navigation in the embodiment of the present invention. intention;
- Figure 15 is a specific schematic diagram of the obstacle avoidance control strategy in the embodiment of the present invention applied to single agent navigation in the fifth scenario;
- Figure 16 is a specific schematic diagram of the obstacle avoidance control strategy in the embodiment of the present invention applied to multi-agent navigation in the sixth scenario;
- Figure 17 is a specific schematic diagram of the obstacle avoidance control strategy in the embodiment of the present invention applied to multi-agent navigation in the seventh scenario;
- Figure 18 is a structural diagram of an electronic device provided by another embodiment of the present invention.
- Figure 1 is a flow chart of a multi-agent navigation control method based on a gene regulatory network provided by one embodiment of the present invention. The method includes but is not limited to the following steps:
- Step S100 obtain the plane map where multiple agents are located
- Step S200 Convert the flat map into a grid map, and determine the concentration information corresponding to each grid in the grid map; wherein the concentration information includes the concentration information field of the target position and the concentration information of the obstacle position. field;
- the grid in the grid map is the smallest square that allows a single agent to rotate once;
- Step S300 Determine the current grid and target grid of the agent to be navigated, and determine the optimal path for the agent from the current grid to the target grid based on the concentration information corresponding to each grid in the grid map. ;
- the current grid is the grid where the agent is currently located, and the target grid is the grid where the target position is located in the grid map;
- Step S400 While the agent is moving along the optimal path, vacant grids are screened out from the eight neighborhood grids adjacent to the current grid, and the concentration evaluation value of each vacant grid is determined. , the first distance evaluation value and the second distance evaluation value; wherein, the vacant grid is a grid passable by the agent, and the concentration evaluation value is the grid where the agent is currently located and the vacant grid.
- the agent to be navigated can obtain the current location information of each other agent in the overall concentration map and the locations of the remaining eight grids. The area ranges are compared one by one, and grids occupied by other agents are obtained and eliminated, thereby preventing the agent to be navigated from colliding with any other agents.
- the agent to be navigated determines that the remaining 8 grids have been occupied by other agents, the The agent to be navigated regards the current grid elimination process as an abnormal event, and at the same time enters the cyclic position query and occupancy judgment operations to wait for the agent in any one or more grids to start moving until it leaves the association.
- the agent to be navigated regards the current grid elimination process as an abnormal event, and at the same time enters the cyclic position query and occupancy judgment operations to wait for the agent in any one or more grids to start moving until it leaves the association.
- Step S500 Determine the cost evaluation value of each vacant grid according to the concentration evaluation value, the first distance evaluation value and the second distance evaluation value, and use the vacant grid with the smallest cost evaluation value as the grid to be entered;
- Step S600 Determine the obstacle distance between the grid to be entered and the grid where the obstacle is located, determine the operating state of the intelligent agent based on the obstacle distance, and control the intelligent agent to move toward the target according to the operating state of the intelligent agent. Position forward.
- the optimal path for the agent to move to the target is adaptively generated based on the concentration intensity in the grid map. It is not necessary to calculate all feasible paths on the map in order to determine the optimal path for the agent. This can reduce computing costs and make the implementation process simpler.
- the agent uses the cost evaluation value determined by the concentration information and distance information to select and formulate the next moving position, which has strong real-time performance.
- the operating status of the agent is determined based on the relationship between the distance and the preset safety distance, which can quickly navigate to the target in a complex environment and solve the problem of collision avoidance between group robots and collision avoidance between robots and obstacles. Problems, as well as problems such as adaptively finding the optimal navigation path, are suitable for scenarios where obstacles or target point positions change.
- step S200 in the embodiment shown in FIG. 1 converting the flat map into a grid map and determining the concentration information corresponding to each grid in the grid map includes: :
- Step S210 Determine the grid where the target position is located, the grid where the obstacle is located, and the movable range boundary of the agent in the grid map;
- Step S220 Import the grid map into the gene regulation network model to generate a target concentration map and an obstacle concentration map; wherein each grid in the target concentration map contains the concentration information field of the target location, and the target concentration Each grid in the map contains a concentration information field for the location of obstacles;
- Step S230 Couple the target concentration map and the obstacle concentration map according to corresponding grids to obtain concentration information corresponding to each grid in the grid map.
- a gene regulation network is constructed.
- the gene regulation network used in this embodiment is the gene regulation network disclosed in Publication No. CN112684700A; by obtaining multiple basic elements in the basic element library, according to the multiple The topological structure obtained by combining the basic elements forms a gene regulatory network model; then, the grid map is imported into the gene regulatory network model to generate a target concentration map and an obstacle concentration map; finally, the target concentration map and the obstacle concentration map are The object concentration maps are coupled according to corresponding grids, and the coupled grid map contains the target information and the obstacle information.
- the grid map is marked with multiple points around the target, multiple points around obstacles, and multiple points on the map boundary.
- N points around the target that is, the edge of the area where the target is located
- the protein concentration of the grid occupied by a point in the grid map is then superimposed to form a target concentration map; similarly, all obstacles surrounding all obstacles (i.e. all obstacles) are obtained in the flat map M points at the edge of the area where each obstacle in the object is located) and K points at the boundary of the movable range, calculate the protein concentration of the grid occupied by each point in the flat map, and then add M+
- the protein concentrations of K points are superimposed to form an obstacle concentration map.
- the target concentration map shown in Figure 2 and the obstacle concentration map shown in Figure 3 are generated, and then coupled to obtain the overall concentration map shown in Figure 4.
- the embodiment of the present invention is applied to the first scene
- the navigation diagram shown in Figure 5 is obtained;
- the target concentration map shown in Figure 6 and the obstacle concentration map shown in Figure 7 are generated, and then coupled
- the overall concentration map is obtained as shown in Figure 8.
- a navigation schematic diagram as shown in Figure 9 is obtained;
- the target concentration map shown in Figure 10 and the obstacle concentration map shown in Figure 11 are generated, and then coupled to obtain the overall concentration map shown in Figure 12.
- a navigation schematic diagram as shown in Figure 13 is obtained, thereby verifying the feasibility of the embodiment of the present invention.
- the calculation formula relied upon to generate the target concentration map is:
- p 1 is the concentration information field generated by the target
- b 1 is the adjustable parameter that affects the concentration value of each grid in the target concentration map
- v 1 is the concentration diffusion factor, which is used to adjust the mapping relationship between distance parameters and concentration parameters.
- r 1 is the relative distance between the center of each grid in the target concentration map and the location of the target;
- p 2 is the concentration information field generated by the obstacle
- b 2 is the adjustable parameter that affects the concentration value of each grid in the obstacle concentration map
- v 2 is the concentration diffusion factor
- r 2 is each grid in the target concentration map. The relative distance between the grid center and the location of the obstacle;
- the calculation formula for the concentration information corresponding to each grid in the grid map is:
- g 1 is the morphological gradient space presented by the target concentration map
- g 2 is the morphological gradient space presented by the obstacle concentration map
- g 3 is the morphology presented by coupling the concentration information field of the target position and the concentration information field of the obstacle position.
- ⁇ 1 is an adjustable parameter that affects the concentration value range of the target concentration map
- k 1 is an adjustable parameter that affects the concentration difference between adjacent grids of the target concentration map
- ⁇ 2 is the concentration that affects the obstacle concentration map
- the adjustable parameter of the value range k 2 is the adjustable parameter that affects the concentration difference between adjacent grids in the obstacle concentration map
- ⁇ 3 is the adjustable parameter that affects the concentration value range of the grid in the grid map
- k 3 It is an adjustable parameter that affects the concentration difference between adjacent grids in the raster map.
- parameter b 1 is set to 1
- parameter b 2 is set to 1.2
- parameters v 1 and v 2 are both set to 1
- parameters ⁇ 1 , ⁇ 2 , and ⁇ 3 are all set to 1.
- the value is 0, the value ranges of ⁇ 1 , ⁇ 2 and ⁇ 3 are all [0, 1]
- the parameters k 1 , k 2 and k 3 are all set to 1
- the values of k 1 , k 2 and k 3 are The value range is [0, 2].
- the calculation formula of the cost evaluation value is:
- P e is the cost evaluation value for estimating that the agent to be navigated can reach the grid at the next moment
- C e is the concentration evaluation value for estimating that the agent to be navigated can reach the grid at the next moment
- D p In order to evaluate the first distance evaluation value that the agent to be navigated can reach to the grid at the next moment, D t is to evaluate the second distance evaluation value that the agent to be navigated can reach the grid at the next moment
- a, b and c are both weight parameters
- C N is the concentration value of the grid that the agent to be navigated can reach at the next moment
- C M is the concentration value of the grid where the agent to be navigated is at the current moment
- C min is the preset minimum value of the optional grid concentration
- C max is the preset maximum value of the optional grid concentration
- N x is the abscissa coordinate of the grid that the agent to be navigated can reach at the next moment
- N y is the ordinate of the grid where the agent to be navigated can be evaluated
- T x is the abscissa of the target location
- T y is the ordinate of the target location.
- parameter a is set to 0.3
- parameter b is set to 0.35
- parameter c is set to 0.35.
- the purpose of selecting the concentration evaluation value in the embodiment of the present invention is to enable the agent to be navigated to move along the concentration gradient
- the purpose of selecting the first distance evaluation value is to enable the agent to be navigated to move as effectively as possible. That is to ensure that the grid that the agent to be navigated at the next moment can reach is farther from the grid at the previous moment.
- the purpose of selecting the second distance evaluation value is to enable the agent to be navigated to move in the direction of the target location. move.
- the grid screening process is explained as follows: first, calculate grid A and the intermediate grid according to the above formula
- the cost evaluation value between grid B and the intermediate grid is P e,A
- the cost assessment value between grid B and the intermediate grid is P e,B
- step S600 in the embodiment shown in Figure 1 the determination of the obstacle distance between the grid to be entered and the grid where the obstacle is located is based on the distance and the preset safety distance. Relationships determine the operating status of the agent, including:
- Step S610 Determine whether the obstacle distance is greater than the safety distance. If the obstacle distance is greater than the safety distance, execute step S620; otherwise, execute step S660;
- Step S620 Control the intelligent agent to move forward toward the target position in a first state; wherein the first state is to move forward while avoiding obstacles;
- Step S630 During the process of the intelligent agent moving forward in the first state, if the trigger points marked by other intelligent agents are detected, the operating state of the intelligent agent is changed from the first state to the fourth state, and the intelligent agent is controlled. The agent moves toward the target position in the fourth state; wherein the fourth state is to move along the shortcut;
- Step S640 During the process of the agent moving from the first state to the fourth state, if the end points marked by other agents are detected, the running state of the agent is changed from the fourth state to the fourth state.
- the intelligent agent In one state, the intelligent agent is controlled to move toward the target position in the first state; if the obstacle is detected to be within a safe distance, the operating state of the intelligent agent is transferred from the fourth state to the second state. , controlling the agent to move forward toward the target position in the second state; wherein the second state is to move along the edge of the obstacle; the end point is the exit of the agent;
- Step S650 During the process of the intelligent agent moving from the fourth state to the second state, if it is detected that the obstacle distance is greater than the safe distance, the operating state of the intelligent agent is converted from the second state to the second state. In the fourth state, the intelligent agent is controlled to move toward the target position in the fourth state;
- Step S660 Change the running state of the intelligent agent from the first state to the second state, and control the intelligent agent to move toward the target position in the second state;
- Step S670 During the process of the agent moving from the first state to the second state, if the obstacle is detected If the distance is greater than the safe distance, then the operating state of the intelligent agent is changed from the second state to the first state, and the intelligent agent is controlled to move toward the target position in the first state; if the distance to the obstacle is detected to be less than the field boundary distance, then mark the location where the obstacle distance is less than the field boundary distance as the trigger point, change the running state of the intelligent agent from the second state to the third state, and control the intelligent agent to press the third state Move forward toward the target position; wherein the third state is to return from the trigger point along the original path.
- the obstacle distance is obtained through real-time detection and is a variable; the safety distance and field boundary distance are preset quantifications.
- the safety distance is greater than the field boundary distance.
- the intelligent agent has four operating states. When encountering different situations, it will switch back and forth between various operating states. In all running states, the agent cannot select the grid where other agents are located when selecting the next position. If there are other agents in the selected grid, the agent abandons the optimal choice and selects a feasible grid near the optimal grid. The agent calculates and selects the vacant grid with the largest cost evaluation value for navigation. A safe distance is set between the agent and obstacles. Set the field boundary distance from the agent to the boundary. The agent sensor has a certain detection range, and the agent can only detect obstacles or boundaries within the detection range. In the first state, the agent realizes basic navigation by evaluating the cost evaluation value of the surrounding grid, and in the second state, it is used to instruct the agent to walk along the edge of the obstacle.
- the agent When the agent enters this state, it leaves a "trigger point" when faced with a decision, randomly choosing possible directions to move along the edge of the obstacle. If the agent finds the exit, the agent will set the "end point” to remind other agents that they can move directly to the "end point”; if the agent does not find the exit but detects the field boundary, it means that the agent is at the "trigger point” No optimal direction was chosen. At this time, when other agents are near the "trigger point", the agent will directly communicate with other agents and inform other agents of the optimal direction, that is, the opposite direction of the wrong direction selected by the agent at the beginning, and provide other agents with The forward movement of the agent provides a feasible reference and saves the path selection time of other agents.
- the method further includes:
- step S500 is executed
- step S500 is executed; that is, the agent that marks the trigger point If the agent neither knows the field boundary nor detects the field boundary, the agent (the agent moving forward according to the fourth state) needs to calculate the concentration evaluation value, the first distance evaluation value and the second distance evaluation value by itself. Determine the cost evaluation value of each vacant raster, and use the vacant raster with the smallest cost evaluation value as the raster to be entered.
- the agent that marks the trigger point knows the field boundary and determines that its forward direction is blocked. Then the agent that sets the trigger point chooses to move in the opposite direction to the agent that marks the trigger point, thereby avoiding the inability to move forward. line path.
- the agent that marked the trigger point has found the correct path forward, the agent will directly follow the path of the agent that marked the trigger point and move forward.
- the agent that marked the trigger point has found the correct way forward and also found the exit, then the agent will directly follow the path of the agent that marked the trigger point and move toward the end point.
- one embodiment of the present invention also provides an electronic device 10.
- the electronic device 10 includes: a memory 11, a processor 12, and a computer program stored on the memory 11 and executable on the processor 12. .
- the processor 12 and the memory 11 may be connected through a bus or other means.
- the non-transient software programs and instructions required to implement the gene regulation network-based multi-agent navigation control method in the above embodiment are stored in the memory 11.
- the gene regulation network-based method in the above embodiment is executed. Multi-agent navigation control method.
- an embodiment of the present invention also provides a computer-readable storage medium that stores computer-executable instructions, and the computer-executable instructions are executed by a processor or controller, for example, by the above-mentioned Execution by a processor in the electronic device embodiment can cause the processor to execute the multi-agent navigation control method based on the gene regulatory network in the above embodiment.
- Computer storage media includes, but is not limited to, RAM, ROM, EEPROM, flash memory or other memory technology, CD-ROM, Digital Versatile Disk (DVD) or other optical disk storage, magnetic cassettes, tapes, disk storage or other magnetic storage devices, or may Any other medium used to store the desired information and that can be accessed by a computer.
- communication media typically embodies computer readable instructions, data structures, program modules or other data in a modulated data signal such as a carrier wave or other transport mechanism, and may include any information delivery media .
Landscapes
- Engineering & Computer Science (AREA)
- Radar, Positioning & Navigation (AREA)
- Remote Sensing (AREA)
- Automation & Control Theory (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Aviation & Aerospace Engineering (AREA)
- Traffic Control Systems (AREA)
- Control Of Position, Course, Altitude, Or Attitude Of Moving Bodies (AREA)
Abstract
Description
本发明涉及但不限于导航技术领域,尤其涉及一种基于基因调控网络的多智能体导航控制方法、设备及介质。The present invention relates to but is not limited to the field of navigation technology, and in particular, to a multi-agent navigation control method, equipment and medium based on a gene regulatory network.
目前智能体导航系统所涉及到的路径规划任务应用较为广泛的是A*算法,A*算法的操作过程需要依次对地图上的所有栅格进行代价值计算,即需要对所有可能路线的移动代价进行计算,耗费的运算成本高。此外,当地图中的障碍物或者目标点位置发生变化时,障碍物或者目标点可能落在已经遍历过的点位上,A*算法将不再适用。At present, the A* algorithm is widely used in path planning tasks involved in intelligent navigation systems. The operation process of the A* algorithm requires the cost calculation of all grids on the map in sequence, that is, the movement costs of all possible routes need to be calculated. The calculation cost is high. In addition, when the position of obstacles or target points in the map changes, the obstacles or target points may fall on points that have been traversed, and the A* algorithm will no longer be applicable.
因此,有必要对现有的智能体导航系统进行改进,能够实时、自适应寻找最优导航路径。Therefore, it is necessary to improve the existing intelligent navigation system to find the optimal navigation path in real time and adaptively.
发明内容Contents of the invention
以下是对本文详细描述的主题的概述。本概述并非是为了限制权利要求的保护范围。The following is an overview of the topics described in detail in this article. This summary is not intended to limit the scope of the claims.
本发明实施例提供了一种基于基因调控网络的多智能体导航控制方法、设备及介质,能够实时、自适应寻找最优导航路径。Embodiments of the present invention provide a multi-agent navigation control method, equipment and medium based on a gene regulatory network, which can adaptively find the optimal navigation path in real time.
第一方面,本发明实施例提供了一种基于基因调控网络的多智能体导航控制方法,包括:In a first aspect, embodiments of the present invention provide a multi-agent navigation control method based on a gene regulatory network, including:
步骤S100,获取多个智能体所在的平面地图;Step S100, obtain the plane map where multiple agents are located;
步骤S200、将所述平面地图转换为栅格地图,确定所述栅格地图中的各个栅格对应的浓度信息;其中,所述浓度信息包含目标位置的浓度信息场和障碍物位置的浓度信息场;Step S200: Convert the flat map into a grid map, and determine the concentration information corresponding to each grid in the grid map; wherein the concentration information includes the concentration information field of the target position and the concentration information of the obstacle position. field;
步骤S300、确定待导航的智能体的当前栅格和目标栅格,根据所述栅格地图中的各个栅格对应的浓度信息确定所述智能体从当前栅格到目标栅格的最优路径;其中,所述当前栅格为所述智能体当前所在的栅格,所述目标栅格为目标位置在栅格地图中所在的栅格;Step S300: Determine the current grid and target grid of the agent to be navigated, and determine the optimal path for the agent from the current grid to the target grid based on the concentration information corresponding to each grid in the grid map. ; Wherein, the current grid is the grid where the agent is currently located, and the target grid is the grid where the target position is located in the grid map;
步骤S400、在所述智能体沿最优路径前行过程中,从与所述当前栅格相邻的8个邻域栅格中筛选出空置栅格,确定每个空置栅格的浓度评估值、第一距离评估值和第二距离评估值;其中,所述空置栅格为所述智能体可通行的栅格,所述浓度评估值为所述智能体当前时刻所在栅格和所述空置栅格的浓度信息之差,所述第一距离评估值为所述智能体的上一栅格和所述空置栅格之间的距离差值,所述第二距离评估值为所述空置栅格和所述目标栅格之间的距离差值;Step S400: While the agent is moving along the optimal path, vacant grids are screened out from the eight neighborhood grids adjacent to the current grid, and the concentration evaluation value of each vacant grid is determined. , the first distance evaluation value and the second distance evaluation value; wherein, the vacant grid is a grid passable by the agent, and the concentration evaluation value is the grid where the agent is currently located and the vacant grid. The difference between the concentration information of the grids, the first distance evaluation value is the distance difference between the agent's previous grid and the vacant grid, the second distance evaluation value is the vacant grid The distance difference between the grid and the target grid;
步骤S500、根据所述浓度评估值、第一距离评估值和第二距离评估值确定每个空置栅格的代价评估值,将代价评估值最小的空置栅格作为待进入栅格;Step S500: Determine the cost evaluation value of each vacant grid according to the concentration evaluation value, the first distance evaluation value and the second distance evaluation value, and use the vacant grid with the smallest cost evaluation value as the grid to be entered;
步骤S600、确定待进入栅格距离障碍物位置所在栅格的障碍物距离,基于所述障碍物距离确定所述智能体的运行状态,根据所述智能体的运行状态控制所述智能体朝目标位置前行。Step S600: Determine the obstacle distance between the grid to be entered and the grid where the obstacle is located, determine the operating state of the intelligent agent based on the obstacle distance, and control the intelligent agent to move toward the target according to the operating state of the intelligent agent. Position forward.
在一些实施例中,所述将所述平面地图转换为栅格地图,确定所述栅格地图中的各个栅格对应的浓度信息,包括:In some embodiments, converting the flat map into a grid map and determining the concentration information corresponding to each grid in the grid map includes:
确定所述栅格地图中目标位置所在栅格、障碍物位置所在栅格以及智能体的可移动范围边界; Determine the grid where the target position is located in the grid map, the grid where the obstacle is located, and the movable range boundary of the agent;
将所述栅格地图导入基因调控网络模型中,生成目标浓度地图和障碍物浓度地图;其中,所述目标浓度地图中的各个栅格包含目标位置的浓度信息场,所述目标浓度地图中的各个栅格包含障碍物位置的浓度信息场;Import the grid map into the gene regulation network model to generate a target concentration map and an obstacle concentration map; wherein each grid in the target concentration map contains the concentration information field of the target location, and the target concentration map contains Each grid contains the concentration information field of the obstacle location;
将所述目标浓度地图和所述障碍物浓度地图按对应栅格进行耦合,得到所述栅格地图中各个栅格对应的浓度信息。The target concentration map and the obstacle concentration map are coupled according to corresponding grids to obtain concentration information corresponding to each grid in the grid map.
在一些实施例中,生成目标浓度地图所依赖的计算公式为:
In some embodiments, the calculation formula relied upon to generate the target concentration map is:
其中,p1为目标生成的浓度信息场,b1为影响目标浓度地图中的各个栅格浓度值的可调参数,v1为浓度扩散因子,用于调整距离参数与浓度参数之间映射关系,r1为目标浓度地图中的各个栅格中心离目标所在位置的相对距离;Among them, p 1 is the concentration information field generated by the target, b 1 is the adjustable parameter that affects the concentration value of each grid in the target concentration map, and v 1 is the concentration diffusion factor, which is used to adjust the mapping relationship between distance parameters and concentration parameters. , r 1 is the relative distance between the center of each grid in the target concentration map and the location of the target;
生成障碍物浓度地图所依赖的计算公式为:
The calculation formula relied on to generate the obstacle concentration map is:
其中,p2为障碍物生成的浓度信息场,b2为影响障碍物浓度地图中的各个栅格浓度值的可调参数,v2为浓度扩散因子,r2为目标浓度地图中的各个栅格中心离障碍物所在位置的相对距离;Among them, p 2 is the concentration information field generated by the obstacle, b 2 is the adjustable parameter that affects the concentration value of each grid in the obstacle concentration map, v 2 is the concentration diffusion factor, and r 2 is each grid in the target concentration map. The relative distance between the grid center and the location of the obstacle;
所述栅格地图中各个栅格对应的浓度信息的计算公式为:
The calculation formula for the concentration information corresponding to each grid in the grid map is:
其中,g1为目标浓度地图所呈现的形态梯度空间,g2为障碍物浓度地图所呈现的形态梯度空间,g3为耦合目标位置的浓度信息场和障碍物位置的浓度信息场所呈现的形态梯度空间,θ1为影响目标浓度地图的浓度值范围的可调参量,k1为影响目标浓度地图的相邻栅格间浓度差值的可调参量,θ2为影响障碍物浓度地图的浓度值范围的可调参量,k2为影响障碍物浓度地图的相邻栅格间浓度差值的可调参量,θ3为影响栅格地图中栅格的浓度值范围的可调参量,k3为影响栅格地图中相邻栅格间浓度差值的可调参量。Among them, g 1 is the morphological gradient space presented by the target concentration map, g 2 is the morphological gradient space presented by the obstacle concentration map, g 3 is the morphology presented by coupling the concentration information field of the target position and the concentration information field of the obstacle position. Gradient space, θ 1 is an adjustable parameter that affects the concentration value range of the target concentration map, k 1 is an adjustable parameter that affects the concentration difference between adjacent grids of the target concentration map, θ 2 is the concentration that affects the obstacle concentration map The adjustable parameter of the value range, k 2 is the adjustable parameter that affects the concentration difference between adjacent grids in the obstacle concentration map, θ 3 is the adjustable parameter that affects the concentration value range of the grid in the grid map, k 3 It is an adjustable parameter that affects the concentration difference between adjacent grids in the raster map.
在一些实施例中,所述代价评估值的计算公式为:
In some embodiments, the calculation formula of the cost evaluation value is:
其中,Pe为评估所述待导航的智能体在下一时刻可到栅格的代价评估值,Ce为评估所述待导航的智能体在下一时刻可到栅格的浓度评估值,Dp为评估所述待导航的智能体在下一时刻可到栅格的第一距离评估值,Dt为评估所述待导航的智能体在下一时刻可到栅格的第二距离评估值,a、b、c均为权重参数,CN为评估所述待导航的智能体在下一时刻可到栅格的浓 度值,CM为所述待导航的智能体当前时刻所在栅格的浓度值,Cmin为预设的可选栅格浓度最小值,Cmax为预设的可选栅格浓度最大值,Nx为评估所述待导航的智能体在下一时刻可到栅格的横坐标,Ny为评估所述待导航的智能体在下一时刻可到栅格的纵坐标,Px为所述待导航的智能体上一时刻所在栅格的横坐标,Py为所述待导航的智能体上一时刻所在栅格的纵坐标,Tx为目标所在位置的横坐标,Ty为目标所在位置的纵坐标。Among them, P e is the cost evaluation value for estimating that the agent to be navigated can reach the grid at the next moment, C e is the concentration evaluation value for estimating that the agent to be navigated can reach the grid at the next moment, D p In order to evaluate the first distance evaluation value that the agent to be navigated can reach to the grid at the next moment, D t is to evaluate the second distance evaluation value that the agent to be navigated can reach the grid at the next moment, a, b and c are both weight parameters, C N is the concentration of the grid that the agent to be navigated can reach at the next moment. Degree value, C M is the concentration value of the grid where the agent to be navigated is currently located, C min is the preset minimum optional grid concentration, C max is the preset maximum optional grid concentration, N x is the abscissa coordinate of the grid that the agent to be navigated can reach at the next moment, N y is the ordinate coordinate of the grid that the agent to be navigated can reach at the next moment, and P x is the coordinate of the grid that the agent to be navigated can reach at the next moment. The abscissa coordinate of the grid where the navigation agent was located at the last moment, P y is the ordinate coordinate of the grid where the agent to be navigated was located at the last moment, T x is the abscissa coordinate of the target location, and T y is the target location. ordinate of .
在一些实施例中,所述确定待进入栅格距离障碍物位置所在栅格的障碍物距离,基于所述距离和预先设置的安全距离的关系确定所述智能体的运行状态,包括:In some embodiments, determining the obstacle distance between the grid to be entered and the grid where the obstacle is located, and determining the operating status of the agent based on the relationship between the distance and a preset safety distance includes:
步骤S610,确定所述障碍物距离是否大于安全距离,若所述障碍物距离大于安全距离,则执行步骤S620,否则执行步骤S660;Step S610: Determine whether the obstacle distance is greater than the safety distance. If the obstacle distance is greater than the safety distance, execute step S620; otherwise, execute step S660;
步骤S620,控制所述智能体按第一状态朝目标位置前行;其中,所述第一状态为在避开障碍物的前提下前行;Step S620: Control the intelligent agent to move forward toward the target position in a first state; wherein the first state is to move forward while avoiding obstacles;
步骤S630,在所述智能体按第一状态前行过程中,若检测到其他智能体标记的触发点,则将所述智能体的运行状态从第一状态转为第四状态,控制所述智能体按第四状态朝目标位置前行;其中,所述第四状态为沿近路前行;Step S630: During the process of the intelligent agent moving forward in the first state, if the trigger points marked by other intelligent agents are detected, the operating state of the intelligent agent is changed from the first state to the fourth state, and the intelligent agent is controlled. The agent moves toward the target position in the fourth state; wherein the fourth state is to move along the shortcut;
步骤S640,在所述智能体从第一状态转为按第四状态前行过程中,若检测到其他智能体标记的终止点,则将所述智能体的运行状态从第四状态转为第一状态,控制所述智能体按第一状态朝目标位置前行;若检测到所述障碍物距离在安全距离范围内,则将所述智能体的运行状态从第四状态转为第二状态,控制所述智能体按第二状态朝目标位置前行;其中,所述第二状态为沿着障碍物的边缘前行;Step S640: During the process of the agent moving from the first state to the fourth state, if the end points marked by other agents are detected, the running state of the agent is changed from the fourth state to the fourth state. In one state, the intelligent agent is controlled to move toward the target position in the first state; if the obstacle is detected to be within a safe distance, the operating state of the intelligent agent is transferred from the fourth state to the second state. , controlling the intelligent agent to move forward toward the target position in the second state; wherein the second state is to move along the edge of the obstacle;
步骤S650,在所述智能体从第四状态转为按第二状态前行过程中,若检测到所述障碍物距离大于安全距离,则将所述智能体的运行状态从第二状态转为第四状态,控制所述智能体按第四状态朝目标位置前行;Step S650: During the process of the intelligent agent moving from the fourth state to the second state, if it is detected that the obstacle distance is greater than the safe distance, the operating state of the intelligent agent is converted from the second state to the second state. In the fourth state, the intelligent agent is controlled to move toward the target position in the fourth state;
步骤S660,将所述智能体的运行状态从第一状态转为第二状态,控制所述智能体按第二状态朝目标位置前行;Step S660: Change the running state of the intelligent agent from the first state to the second state, and control the intelligent agent to move toward the target position in the second state;
步骤S670,在所述智能体从第一状态转为按第二状态前行过程中,若检测到所述障碍物距离大于安全距离,则将所述智能体的运行状态从第二状态转为第一状态,控制所述智能体按第一状态朝目标位置前行;若检测到所述障碍物距离小于场边界距离,则将检测到所述障碍物距离小于场边界距离所在位置标记为触发点后,将所述智能体的运行状态从第二状态转为第三状态,控制所述智能体按第三状态朝目标位置前行;其中,所述第三状态为从触发点沿原路返回。Step S670: During the process of the intelligent agent changing from the first state to moving forward in the second state, if it is detected that the obstacle distance is greater than the safe distance, the operating state of the intelligent agent is changed from the second state to the second state. In the first state, the intelligent agent is controlled to move forward toward the target position in the first state; if the obstacle distance is detected to be less than the field boundary distance, the location where the obstacle distance is detected to be less than the field boundary distance is marked as a trigger. After clicking, the running state of the intelligent agent is changed from the second state to the third state, and the intelligent agent is controlled to move toward the target position in the third state; wherein the third state is to follow the original path from the trigger point return.
在一些实施例中,所述方法还包括:In some embodiments, the method further includes:
在所述智能体按第四状态前行过程中,若检测到触发点,则确定标记该触发点的智能体在当前的运行状态和之前的运行状态;During the process of the agent moving forward in the fourth state, if a trigger point is detected, the current operating state and the previous operating state of the agent marking the trigger point are determined;
若标记该触发点的智能体在当前的运行状态和之前的运行状态中均未按第三状态前行,则执行步骤S500;If the agent marking the trigger point does not move forward according to the third state in the current running state and the previous running state, step S500 is executed;
若标记该触发点的智能体在当前的运行状态为第三状态,则所述智能体按与当前前行路径相反的方向前行;If the current running state of the agent marking the trigger point is the third state, the agent will move forward in the opposite direction to the current forward path;
若标记该触发点的智能体在之前的运行状态为第三状态,且当前未按第三状态前行,则所述智能体跟随标记该触发点的智能体前行;If the previous operating state of the agent that marked the trigger point was the third state and is not currently moving forward in the third state, then the agent will follow the agent that marked the trigger point;
确定标记该触发点的智能体是否也标记过终止点,若标记该触发点的智能体也标记过终止点,则所述智能体跟随标记该触发点的智能体朝所述终止点前行。 Determine whether the agent that marked the trigger point has also marked the end point. If the agent that marked the trigger point has also marked the end point, then the agent follows the agent that marked the trigger point and moves toward the end point.
第二方面,本发明实施例还提供了一种电子设备,包括:存储器、处理器及存储在存储器上并可在处理器上运行的计算机程序,所述处理器执行所述计算机程序时实现如第一方面所述的基于基因调控网络的多智能体导航控制方法。In a second aspect, embodiments of the present invention also provide an electronic device, including: a memory, a processor, and a computer program stored in the memory and executable on the processor. When the processor executes the computer program, the following is implemented: The multi-agent navigation control method based on the gene regulatory network described in the first aspect.
第三方面,本发明实施例还提供了一种计算机可读存储介质,存储有计算机可执行指令,所述计算机可执行指令用于执行如第一方面所述的基于基因调控网络的多智能体导航控制方法。In a third aspect, embodiments of the present invention also provide a computer-readable storage medium that stores computer-executable instructions, and the computer-executable instructions are used to execute the multi-agent method based on the gene regulation network as described in the first aspect. Navigation control methods.
本发明实施例包括:获取多个智能体所在的平面地图;将所述平面地图转换为栅格地图,确定所述栅格地图中的各个栅格对应的浓度信息;其中,所述浓度信息包含目标位置的浓度信息场和障碍物位置的浓度信息场;确定待导航的智能体的当前栅格和目标栅格,根据所述栅格地图中的各个栅格对应的浓度信息确定所述智能体从当前栅格到目标栅格的最优路径;其中,所述当前栅格为所述智能体当前所在的栅格,所述目标栅格为目标位置在栅格地图中所在的栅格;在所述智能体沿最优路径前行过程中,从与所述当前栅格相邻的8个邻域栅格中筛选出空置栅格,确定每个空置栅格的浓度评估值、第一距离评估值和第二距离评估值;其中,所述空置栅格为所述智能体可通行的栅格,所述浓度评估值为所述智能体当前时刻所在栅格和所述空置栅格的浓度信息之差,所述第一距离评估值为所述智能体的上一栅格和所述空置栅格之间的距离差值,所述第二距离评估值为所述空置栅格和所述目标栅格之间的距离差值;根据所述浓度评估值、第一距离评估值和第二距离评估值确定每个空置栅格的代价评估值,将代价评估值最小的空置栅格作为待进入栅格;确定待进入栅格距离障碍物位置所在栅格的障碍物距离,基于所述障碍物距离确定所述智能体的运行状态,根据所述智能体的运行状态控制所述智能体朝目标位置前行。Embodiments of the present invention include: obtaining a plane map where multiple agents are located; converting the plane map into a grid map, and determining concentration information corresponding to each grid in the grid map; wherein the concentration information includes The concentration information field of the target position and the concentration information field of the obstacle position; determine the current grid and target grid of the agent to be navigated, and determine the agent according to the concentration information corresponding to each grid in the grid map The optimal path from the current grid to the target grid; wherein, the current grid is the grid where the agent is currently located, and the target grid is the grid where the target position is located in the grid map; in While the agent is moving along the optimal path, it selects vacant grids from the eight neighborhood grids adjacent to the current grid, and determines the concentration evaluation value and first distance of each vacant grid. The evaluation value and the second distance evaluation value; wherein, the vacant grid is a grid passable by the agent, and the concentration evaluation value is the concentration of the grid where the agent is currently located and the vacant grid. The difference between the information, the first distance evaluation value is the distance difference between the agent's previous grid and the vacant grid, and the second distance evaluation value is the distance between the vacant grid and the vacant grid. The distance difference between the target grids; determine the cost evaluation value of each vacant grid according to the concentration evaluation value, the first distance evaluation value and the second distance evaluation value, and use the vacant grid with the smallest cost evaluation value as the candidate grid. Enter the grid; determine the obstacle distance between the grid to be entered and the grid where the obstacle is located, determine the operating state of the intelligent agent based on the obstacle distance, and control the direction of the intelligent agent according to the operating state of the intelligent agent. Move forward to the target location.
本发明的其它特征和优点将在随后的说明书中阐述,并且,部分地从说明书中变得显而易见,或者通过实施本发明而了解。本发明的目的和其他优点可通过在说明书、权利要求书以及附图中所特别指出的结构来实现和获得。Additional features and advantages of the invention will be set forth in the description which follows, and in part will be apparent from the description, or may be learned by practice of the invention. The objectives and other advantages of the invention may be realized and obtained by the structure particularly pointed out in the written description, claims and appended drawings.
附图用来提供对本发明技术方案的进一步理解,并且构成说明书的一部分,与本发明的实施例一起用于解释本发明的技术方案,并不构成对本发明技术方案的限制。The accompanying drawings are used to provide a further understanding of the technical solutions of the present invention, and constitute a part of the description. Together with the embodiments of the present invention, they are used to explain the technical solutions of the present invention, and do not constitute a limitation of the technical solutions of the present invention.
图1是本发明实施例中的一种基于基因调控网络的多智能体导航控制方法的流程示意图;Figure 1 is a schematic flow chart of a multi-agent navigation control method based on a gene regulatory network in an embodiment of the present invention;
图2是本发明实施例应用在第一场景中的目标浓度地图的具体示意图;Figure 2 is a specific schematic diagram of the target concentration map applied in the first scenario according to the embodiment of the present invention;
图3是本发明实施例应用在第一场景中的障碍物浓度地图的具体示意图;Figure 3 is a specific schematic diagram of the obstacle concentration map applied in the first scenario according to the embodiment of the present invention;
图4是本发明实施例应用在第一场景中的整体浓度地图的具体示意图;Figure 4 is a specific schematic diagram of the overall concentration map applied in the first scenario according to the embodiment of the present invention;
图5是本发明实施例应用在第一场景中的多智能体导航的具体示意图;Figure 5 is a specific schematic diagram of multi-agent navigation applied in the first scenario according to the embodiment of the present invention;
图6是本发明实施例应用在第二场景中的目标浓度地图的具体示意图;Figure 6 is a specific schematic diagram of the target concentration map applied in the second scenario according to the embodiment of the present invention;
图7是本发明实施例应用在第二场景中的障碍物浓度地图的具体示意图;Figure 7 is a specific schematic diagram of the obstacle concentration map applied in the second scenario according to the embodiment of the present invention;
图8是本发明实施例应用在第二场景中的整体浓度地图的具体示意图;Figure 8 is a specific schematic diagram of the overall concentration map applied in the second scenario according to the embodiment of the present invention;
图9是本发明实施例应用在第二场景中的多智能体导航的具体示意图;Figure 9 is a specific schematic diagram of multi-agent navigation applied in the second scenario according to the embodiment of the present invention;
图10是本发明实施例应用在第三场景中的目标浓度地图的具体示意图;Figure 10 is a specific schematic diagram of the target concentration map applied in the third scenario according to the embodiment of the present invention;
图11是本发明实施例应用在第三场景中的障碍物浓度地图的具体示意图;Figure 11 is a specific schematic diagram of the obstacle concentration map applied in the third scenario according to the embodiment of the present invention;
图12是本发明实施例应用在第三场景中的整体浓度地图的具体示意图;Figure 12 is a specific schematic diagram of the overall concentration map applied in the third scenario according to the embodiment of the present invention;
图13是本发明实施例应用在第三场景中的多智能体导航的具体示意图;Figure 13 is a specific schematic diagram of multi-agent navigation applied in the third scenario according to the embodiment of the present invention;
图14是本发明实施例中的避障控制策略应用在第四场景中的单个智能体导航的具体示 意图;Figure 14 is a specific illustration of the application of the obstacle avoidance control strategy in the fourth scenario of single agent navigation in the embodiment of the present invention. intention;
图15是本发明实施例中的避障控制策略应用在第五场景中的单个智能体导航的具体示意图;Figure 15 is a specific schematic diagram of the obstacle avoidance control strategy in the embodiment of the present invention applied to single agent navigation in the fifth scenario;
图16是本发明实施例中的避障控制策略应用在第六场景中的多智能体导航的具体示意图;Figure 16 is a specific schematic diagram of the obstacle avoidance control strategy in the embodiment of the present invention applied to multi-agent navigation in the sixth scenario;
图17是本发明实施例中的避障控制策略应用在第七场景中的多智能体导航的具体示意图;Figure 17 is a specific schematic diagram of the obstacle avoidance control strategy in the embodiment of the present invention applied to multi-agent navigation in the seventh scenario;
图18是本发明另一个实施例提供的电子设备的结构图。Figure 18 is a structural diagram of an electronic device provided by another embodiment of the present invention.
为了使本发明的目的、技术方案及优点更加清楚明白,以下结合附图及实施例,对本发明进行进一步详细说明。应当理解,此处所描述的具体实施例仅用以解释本发明,并不用于限定本发明。In order to make the purpose, technical solutions and advantages of the present invention more clear, the present invention will be further described in detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present invention and are not intended to limit the present invention.
需要说明的是,虽然在流程图中示出了逻辑顺序,但是在某些情况下,可以以不同于流程图中的顺序执行所示出或描述的步骤。说明书、权利要求书或上述附图中的术语“第一”、“第二”等是用于区别类似的对象,而不必用于描述特定的顺序或先后次序。It should be noted that although a logical sequence is shown in the flowchart, in some cases, the steps shown or described may be performed in an order different from that in the flowchart. The terms "first", "second", etc. in the description, claims or the above-mentioned drawings are used to distinguish similar objects and are not necessarily used to describe a specific order or sequence.
下面结合附图,对本发明实施例作进一步阐述。The embodiments of the present invention will be further described below with reference to the accompanying drawings.
如图1所示,图1是本发明一个实施例提供的一种基于基因调控网络的多智能体导航控制方法的流程图,在该方法中,包括但不限于有以下步骤:As shown in Figure 1, Figure 1 is a flow chart of a multi-agent navigation control method based on a gene regulatory network provided by one embodiment of the present invention. The method includes but is not limited to the following steps:
步骤S100,获取多个智能体所在的平面地图;Step S100, obtain the plane map where multiple agents are located;
步骤S200、将所述平面地图转换为栅格地图,确定所述栅格地图中的各个栅格对应的浓度信息;其中,所述浓度信息包含目标位置的浓度信息场和障碍物位置的浓度信息场;Step S200: Convert the flat map into a grid map, and determine the concentration information corresponding to each grid in the grid map; wherein the concentration information includes the concentration information field of the target position and the concentration information of the obstacle position. field;
需要说明的是,所述栅格地图中的栅格为允许单个智能体自转一圈的最小正方形;It should be noted that the grid in the grid map is the smallest square that allows a single agent to rotate once;
步骤S300、确定待导航的智能体的当前栅格和目标栅格,根据所述栅格地图中的各个栅格对应的浓度信息确定所述智能体从当前栅格到目标栅格的最优路径;其中,所述当前栅格为所述智能体当前所在的栅格,所述目标栅格为目标位置在栅格地图中所在的栅格;Step S300: Determine the current grid and target grid of the agent to be navigated, and determine the optimal path for the agent from the current grid to the target grid based on the concentration information corresponding to each grid in the grid map. ; Wherein, the current grid is the grid where the agent is currently located, and the target grid is the grid where the target position is located in the grid map;
步骤S400、在所述智能体沿最优路径前行过程中,从与所述当前栅格相邻的8个邻域栅格中筛选出空置栅格,确定每个空置栅格的浓度评估值、第一距离评估值和第二距离评估值;其中,所述空置栅格为所述智能体可通行的栅格,所述浓度评估值为所述智能体当前时刻所在栅格和所述空置栅格的浓度信息之差,所述第一距离评估值为所述智能体的上一栅格和所述空置栅格之间的距离差值,所述第二距离评估值为所述空置栅格和所述目标栅格之间的距离差值;Step S400: While the agent is moving along the optimal path, vacant grids are screened out from the eight neighborhood grids adjacent to the current grid, and the concentration evaluation value of each vacant grid is determined. , the first distance evaluation value and the second distance evaluation value; wherein, the vacant grid is a grid passable by the agent, and the concentration evaluation value is the grid where the agent is currently located and the vacant grid. The difference between the concentration information of the grids, the first distance evaluation value is the distance difference between the agent's previous grid and the vacant grid, the second distance evaluation value is the vacant grid The distance difference between the grid and the target grid;
需要说明的是,在栅格地图中,以待导航的智能体所在栅格为中心,可以建立一个关联九宫格,所述智能体所在栅格存在8个相邻的栅格(邻域栅格);再确认这8个邻域栅格是否被占用,即,所述空置栅格上不存在障碍物、其他智能体,将未被占用的领域栅格作为空置栅格,得到所述智能体可通行的栅格。所述智能体的上一栅格是指所述智能体在当前栅格之前刚经历的栅格。在一实施例中,基于多个智能体之间的相互通信关系,所述待导航的智能体可通过获取其他各个智能体在所述整体浓度地图中的当前位置信息与剩余8个栅格所在区域范围进行逐个比对,进而得到被其他智能体占领的栅格并对其进行剔除处理,由此可防止所述待导航的智能体与其他任何智能体发生碰撞。It should be noted that in the grid map, with the grid where the agent to be navigated is located as the center, an associated nine-square grid can be established. The grid where the agent is located has 8 adjacent grids (neighborhood grids) ; Confirm again whether these 8 neighborhood grids are occupied, that is, there are no obstacles or other agents on the vacant grids. Use the unoccupied domain grids as vacant grids to get the result that the agent can Passage grid. The previous grid of the agent refers to the grid that the agent has just experienced before the current grid. In one embodiment, based on the mutual communication relationship between multiple agents, the agent to be navigated can obtain the current location information of each other agent in the overall concentration map and the locations of the remaining eight grids. The area ranges are compared one by one, and grids occupied by other agents are obtained and eliminated, thereby preventing the agent to be navigated from colliding with any other agents.
此外,若发生所述待导航的智能体判断出剩余8个栅格均已被其他智能体占领时,所述 待导航的智能体将当前栅格剔除过程视为异常事件,同时进入循环的位置查询与占位判断这一操作,以等待任意一个或者多个栅格内的智能体开始移动直至离开所述关联九宫格,再重新执行栅格剔除任务。In addition, if the agent to be navigated determines that the remaining 8 grids have been occupied by other agents, the The agent to be navigated regards the current grid elimination process as an abnormal event, and at the same time enters the cyclic position query and occupancy judgment operations to wait for the agent in any one or more grids to start moving until it leaves the association. Nine-square grid, and then re-execute the grid elimination task.
步骤S500、根据所述浓度评估值、第一距离评估值和第二距离评估值确定每个空置栅格的代价评估值,将代价评估值最小的空置栅格作为待进入栅格;Step S500: Determine the cost evaluation value of each vacant grid according to the concentration evaluation value, the first distance evaluation value and the second distance evaluation value, and use the vacant grid with the smallest cost evaluation value as the grid to be entered;
步骤S600、确定待进入栅格距离障碍物位置所在栅格的障碍物距离,基于所述障碍物距离确定所述智能体的运行状态,根据所述智能体的运行状态控制所述智能体朝目标位置前行。Step S600: Determine the obstacle distance between the grid to be entered and the grid where the obstacle is located, determine the operating state of the intelligent agent based on the obstacle distance, and control the intelligent agent to move toward the target according to the operating state of the intelligent agent. Position forward.
本发明提供的实施例中,根据栅格地图中的浓度强度自适应生成智能体移动至目标的最优路径,不需要依次计算出地图上的所有可行路径来确定智能体的最优路径,由此可以减少运算成本,实施过程更为简便。智能体在移动过程中借助浓度信息和距离信息所决定的代价评价值来筛选拟定下一步的移动位置,具有较强的实时性。基于所述距离和预先设置的安全距离的关系确定所述智能体的运行状态,能够在复杂环境中快速导航到目标,解决了群体机器人之间的避碰、机器人与障碍物之间的避碰问题,以及自适应寻找最优导航路径等问题,适用于障碍物或者目标点位置发生变化的场景。In the embodiment provided by the present invention, the optimal path for the agent to move to the target is adaptively generated based on the concentration intensity in the grid map. It is not necessary to calculate all feasible paths on the map in order to determine the optimal path for the agent. This can reduce computing costs and make the implementation process simpler. During the movement, the agent uses the cost evaluation value determined by the concentration information and distance information to select and formulate the next moving position, which has strong real-time performance. The operating status of the agent is determined based on the relationship between the distance and the preset safety distance, which can quickly navigate to the target in a complex environment and solve the problem of collision avoidance between group robots and collision avoidance between robots and obstacles. Problems, as well as problems such as adaptively finding the optimal navigation path, are suitable for scenarios where obstacles or target point positions change.
另外,在一实施例中,图1所示实施例中的步骤S200中,所述将所述平面地图转换为栅格地图,确定所述栅格地图中的各个栅格对应的浓度信息,包括:In addition, in one embodiment, in step S200 in the embodiment shown in FIG. 1 , converting the flat map into a grid map and determining the concentration information corresponding to each grid in the grid map includes: :
步骤S210、确定所述栅格地图中目标位置所在栅格、障碍物位置所在栅格以及智能体的可移动范围边界;Step S210: Determine the grid where the target position is located, the grid where the obstacle is located, and the movable range boundary of the agent in the grid map;
步骤S220、将所述栅格地图导入基因调控网络模型中,生成目标浓度地图和障碍物浓度地图;其中,所述目标浓度地图中的各个栅格包含目标位置的浓度信息场,所述目标浓度地图中的各个栅格包含障碍物位置的浓度信息场;Step S220: Import the grid map into the gene regulation network model to generate a target concentration map and an obstacle concentration map; wherein each grid in the target concentration map contains the concentration information field of the target location, and the target concentration Each grid in the map contains a concentration information field for the location of obstacles;
步骤S230、将所述目标浓度地图和所述障碍物浓度地图按对应栅格进行耦合,得到所述栅格地图中各个栅格对应的浓度信息。Step S230: Couple the target concentration map and the obstacle concentration map according to corresponding grids to obtain concentration information corresponding to each grid in the grid map.
在一些实施例中,首先,构建基因调控网络,本实施例所采用的基因调控网络为公开号CN112684700A所公开的基因调控网络;通过获取基本元件库中的多个基本元件,根据所述多个基本元件组合得到的拓扑结构形成基因调控网络模型;接着,将所述栅格地图导入基因调控网络模型中,生成目标浓度地图和障碍物浓度地图;最后,将所述目标浓度地图和所述障碍物浓度地图按对应栅格进行耦合,耦合得到的栅格地图包含所述目标信息和所述障碍物信息。In some embodiments, first, a gene regulation network is constructed. The gene regulation network used in this embodiment is the gene regulation network disclosed in Publication No. CN112684700A; by obtaining multiple basic elements in the basic element library, according to the multiple The topological structure obtained by combining the basic elements forms a gene regulatory network model; then, the grid map is imported into the gene regulatory network model to generate a target concentration map and an obstacle concentration map; finally, the target concentration map and the obstacle concentration map are The object concentration maps are coupled according to corresponding grids, and the coupled grid map contains the target information and the obstacle information.
具体的,在所述栅格地图中标记有围绕目标的多个点、围绕障碍物的多个点以及地图边界的多个点,围绕目标(即目标所在区域边缘)的N个点,计算每一个点在所述栅格地图中所占据栅格的蛋白质浓度,进而将N个点的蛋白质浓度进行叠加形成目标浓度地图;同理,在所述平面地图中获取围绕所有障碍物(即所有障碍物中的每一个障碍物所在区域边缘)的M个点以及所述可移动范围边界的K个点,计算其中每一个点在所述平面地图中所占据栅格的蛋白质浓度,进而将M+K个点的蛋白质浓度进行叠加形成障碍物浓度地图。Specifically, the grid map is marked with multiple points around the target, multiple points around obstacles, and multiple points on the map boundary. N points around the target (that is, the edge of the area where the target is located) are calculated. The protein concentration of the grid occupied by a point in the grid map is then superimposed to form a target concentration map; similarly, all obstacles surrounding all obstacles (i.e. all obstacles) are obtained in the flat map M points at the edge of the area where each obstacle in the object is located) and K points at the boundary of the movable range, calculate the protein concentration of the grid occupied by each point in the flat map, and then add M+ The protein concentrations of K points are superimposed to form an obstacle concentration map.
基于所述目标信息和所述障碍物信息是处于实时可变化的,当本发明实施例应用到不同变化场景下时可生成对应多组不同的浓度地图,分别如下:Based on the fact that the target information and the obstacle information are changeable in real time, when the embodiment of the present invention is applied to different changing scenarios, multiple sets of corresponding different concentration maps can be generated, as follows:
在第一场景中生成如图2所示的目标浓度地图和如图3所示的障碍物浓度地图,进而耦合得到如图4所示的整体浓度地图,当本发明实施例应用到第一场景中实现多个智能体导航至目标所在位置时,得到如图5所示的导航示意图;In the first scene, the target concentration map shown in Figure 2 and the obstacle concentration map shown in Figure 3 are generated, and then coupled to obtain the overall concentration map shown in Figure 4. When the embodiment of the present invention is applied to the first scene When multiple agents are implemented to navigate to the target location, the navigation diagram shown in Figure 5 is obtained;
在第二场景中生成如图6所示的目标浓度地图和如图7所示的障碍物浓度地图,进而耦 合得到如图8所示的整体浓度地图,当本发明实施例应用到第二场景中实现多个智能体导航至目标所在位置时,得到如图9所示的导航示意图;In the second scene, the target concentration map shown in Figure 6 and the obstacle concentration map shown in Figure 7 are generated, and then coupled The overall concentration map is obtained as shown in Figure 8. When the embodiment of the present invention is applied to the second scenario to realize multiple agents navigating to the target location, a navigation schematic diagram as shown in Figure 9 is obtained;
在第三场景中生成如图10所示的目标浓度地图和如图11所示的障碍物浓度地图,进而耦合得到如图12所示的整体浓度地图,当本发明实施例应用到第三场景中实现多个智能体导航至目标所在位置时,得到如图13所示的导航示意图,由此可以验证本发明实施例的可行性。In the third scene, the target concentration map shown in Figure 10 and the obstacle concentration map shown in Figure 11 are generated, and then coupled to obtain the overall concentration map shown in Figure 12. When the embodiment of the present invention is applied to the third scene When multiple agents are implemented to navigate to the target location, a navigation schematic diagram as shown in Figure 13 is obtained, thereby verifying the feasibility of the embodiment of the present invention.
在一实施例中,生成目标浓度地图所依赖的计算公式为:
In one embodiment, the calculation formula relied upon to generate the target concentration map is:
其中,p1为目标生成的浓度信息场,b1为影响目标浓度地图中的各个栅格浓度值的可调参数,v1为浓度扩散因子,用于调整距离参数与浓度参数之间映射关系,r1为目标浓度地图中的各个栅格中心离目标所在位置的相对距离;Among them, p 1 is the concentration information field generated by the target, b 1 is the adjustable parameter that affects the concentration value of each grid in the target concentration map, and v 1 is the concentration diffusion factor, which is used to adjust the mapping relationship between distance parameters and concentration parameters. , r 1 is the relative distance between the center of each grid in the target concentration map and the location of the target;
生成障碍物浓度地图所依赖的计算公式为:
The calculation formula relied on to generate the obstacle concentration map is:
其中,p2为障碍物生成的浓度信息场,b2为影响障碍物浓度地图中的各个栅格浓度值的可调参数,v2为浓度扩散因子,r2为目标浓度地图中的各个栅格中心离障碍物所在位置的相对距离;Among them, p 2 is the concentration information field generated by the obstacle, b 2 is the adjustable parameter that affects the concentration value of each grid in the obstacle concentration map, v 2 is the concentration diffusion factor, and r 2 is each grid in the target concentration map. The relative distance between the grid center and the location of the obstacle;
所述栅格地图中各个栅格对应的浓度信息的计算公式为:
The calculation formula for the concentration information corresponding to each grid in the grid map is:
其中,g1为目标浓度地图所呈现的形态梯度空间,g2为障碍物浓度地图所呈现的形态梯度空间,g3为耦合目标位置的浓度信息场和障碍物位置的浓度信息场所呈现的形态梯度空间,θ1为影响目标浓度地图的浓度值范围的可调参量,k1为影响目标浓度地图的相邻栅格间浓度差值的可调参量,θ2为影响障碍物浓度地图的浓度值范围的可调参量,k2为影响障碍物浓度地图的相邻栅格间浓度差值的可调参量,θ3为影响栅格地图中栅格的浓度值范围的可调参量,k3为影响栅格地图中相邻栅格间浓度差值的可调参量。作为优选,本发明实施例将参数b1取值为1,将参数b2取值为1.2,将参数v1、v2均取值为1,将参数θ1、θ2、θ3均取值为0,θ1、θ2和θ3的取值范围均为[0,1],将参数k1、k2、k3均取值为1,k1、k2和k3的取值范围均为[0,2]。Among them, g 1 is the morphological gradient space presented by the target concentration map, g 2 is the morphological gradient space presented by the obstacle concentration map, g 3 is the morphology presented by coupling the concentration information field of the target position and the concentration information field of the obstacle position. Gradient space, θ 1 is an adjustable parameter that affects the concentration value range of the target concentration map, k 1 is an adjustable parameter that affects the concentration difference between adjacent grids of the target concentration map, θ 2 is the concentration that affects the obstacle concentration map The adjustable parameter of the value range, k 2 is the adjustable parameter that affects the concentration difference between adjacent grids in the obstacle concentration map, θ 3 is the adjustable parameter that affects the concentration value range of the grid in the grid map, k 3 It is an adjustable parameter that affects the concentration difference between adjacent grids in the raster map. Preferably, in the embodiment of the present invention, parameter b 1 is set to 1, parameter b 2 is set to 1.2, parameters v 1 and v 2 are both set to 1, and parameters θ 1 , θ 2 , and θ 3 are all set to 1. The value is 0, the value ranges of θ 1 , θ 2 and θ 3 are all [0, 1], the parameters k 1 , k 2 and k 3 are all set to 1, and the values of k 1 , k 2 and k 3 are The value range is [0, 2].
在一些实施例中,所述代价评估值的计算公式为:
In some embodiments, the calculation formula of the cost evaluation value is:
其中,Pe为评估所述待导航的智能体在下一时刻可到栅格的代价评估值,Ce为评估所述待导航的智能体在下一时刻可到栅格的浓度评估值,Dp为评估所述待导航的智能体在下一时刻可到栅格的第一距离评估值,Dt为评估所述待导航的智能体在下一时刻可到栅格的第二距离评估值,a、b、c均为权重参数,CN为评估所述待导航的智能体在下一时刻可到栅格的浓度值,CM为所述待导航的智能体当前时刻所在栅格的浓度值,Cmin为预设的可选栅格浓度最小值,Cmax为预设的可选栅格浓度最大值,Nx为评估所述待导航的智能体在下一时刻可到栅格的横坐标,Ny为评估所述待导航的智能体在下一时刻可到栅格的纵坐标,Px为所述待导航的智能体上一时刻所在栅格的横坐标,Py为所述待导航的智能体上一时刻所在栅格的纵坐标,Tx为目标所在位置的横坐标,Ty为目标所在位置的纵坐标。优选地,本发明实施例将参数a取值为0.3,将参数b取值为0.35,将参数c取值为0.35。Among them, P e is the cost evaluation value for estimating that the agent to be navigated can reach the grid at the next moment, C e is the concentration evaluation value for estimating that the agent to be navigated can reach the grid at the next moment, D p In order to evaluate the first distance evaluation value that the agent to be navigated can reach to the grid at the next moment, D t is to evaluate the second distance evaluation value that the agent to be navigated can reach the grid at the next moment, a, b and c are both weight parameters, C N is the concentration value of the grid that the agent to be navigated can reach at the next moment, C M is the concentration value of the grid where the agent to be navigated is at the current moment, C min is the preset minimum value of the optional grid concentration, C max is the preset maximum value of the optional grid concentration, N x is the abscissa coordinate of the grid that the agent to be navigated can reach at the next moment, N y is the ordinate of the grid where the agent to be navigated can be evaluated at the next moment, P x is the abscissa of the grid where the agent to be navigated was at the previous moment, and P y is the intelligence to be navigated. The ordinate of the grid where the body is located at the moment, T x is the abscissa of the target location, and T y is the ordinate of the target location. Preferably, in this embodiment of the present invention, parameter a is set to 0.3, parameter b is set to 0.35, and parameter c is set to 0.35.
其中,本发明实施例选用浓度评估值的目的在于使得所述待导航的智能体可以顺浓度梯度移动,选用第一距离评估值的目的在于使得所述待导航的智能体尽可能的有效移动,即确保所述待导航的智能体下一时刻可到栅格距离上一时刻所在栅格更远,选用第二距离评估值的目的在于使得所述待导航的智能体可以朝向目标所在位置的方向移动。Among them, the purpose of selecting the concentration evaluation value in the embodiment of the present invention is to enable the agent to be navigated to move along the concentration gradient, and the purpose of selecting the first distance evaluation value is to enable the agent to be navigated to move as effectively as possible. That is to ensure that the grid that the agent to be navigated at the next moment can reach is farther from the grid at the previous moment. The purpose of selecting the second distance evaluation value is to enable the agent to be navigated to move in the direction of the target location. move.
以所述关联九宫格中仅存在栅格A、栅格B和栅格C未被任何智能体占领为例,对栅格筛选过程进行说明为:首先按照上述公式计算出栅格A与中间栅格之间的代价评估值为Pe,A、栅格B与中间栅格之间的代价评估值为Pe,B以及栅格C与中间栅格之间的代价评估值为Pe,C,若判断出Pe,A<Pe,B<Pe,C,则将栅格A的中心点指定为所述待导航的智能体的移动位置,若判断出Pe,A=Pe,B<Pe,C,则从栅格A和栅格B中随机选取一个栅格并将其中心点指定为所述待导航的智能体的移动位置。Taking as an example that there are only grid A, grid B and grid C in the associated nine-square grid that are not occupied by any agent, the grid screening process is explained as follows: first, calculate grid A and the intermediate grid according to the above formula The cost evaluation value between grid B and the intermediate grid is P e,A , the cost assessment value between grid B and the intermediate grid is P e,B , and the cost assessment value between grid C and the intermediate grid is P e,C , If it is determined that P e,A <P e,B <P e,C , then the center point of the grid A is designated as the movement position of the agent to be navigated. If it is determined that P e,A =P e, B <P e,C , then randomly select a grid from grid A and grid B and designate its center point as the movement position of the agent to be navigated.
另外,在一实施例中,图1所示实施例中的步骤S600中,所述确定待进入栅格距离障碍物位置所在栅格的障碍物距离,基于所述距离和预先设置的安全距离的关系确定所述智能体的运行状态,包括:In addition, in one embodiment, in step S600 in the embodiment shown in Figure 1, the determination of the obstacle distance between the grid to be entered and the grid where the obstacle is located is based on the distance and the preset safety distance. Relationships determine the operating status of the agent, including:
步骤S610,确定所述障碍物距离是否大于安全距离,若所述障碍物距离大于安全距离,则执行步骤S620,否则执行步骤S660;Step S610: Determine whether the obstacle distance is greater than the safety distance. If the obstacle distance is greater than the safety distance, execute step S620; otherwise, execute step S660;
步骤S620,控制所述智能体按第一状态朝目标位置前行;其中,所述第一状态为在避开障碍物的前提下前行;Step S620: Control the intelligent agent to move forward toward the target position in a first state; wherein the first state is to move forward while avoiding obstacles;
步骤S630,在所述智能体按第一状态前行过程中,若检测到其他智能体标记的触发点,则将所述智能体的运行状态从第一状态转为第四状态,控制所述智能体按第四状态朝目标位置前行;其中,所述第四状态为沿近路前行;Step S630: During the process of the intelligent agent moving forward in the first state, if the trigger points marked by other intelligent agents are detected, the operating state of the intelligent agent is changed from the first state to the fourth state, and the intelligent agent is controlled. The agent moves toward the target position in the fourth state; wherein the fourth state is to move along the shortcut;
步骤S640,在所述智能体从第一状态转为按第四状态前行过程中,若检测到其他智能体标记的终止点,则将所述智能体的运行状态从第四状态转为第一状态,控制所述智能体按第一状态朝目标位置前行;若检测到所述障碍物距离在安全距离范围内,则将所述智能体的运行状态从第四状态转为第二状态,控制所述智能体按第二状态朝目标位置前行;其中,所述第二状态为沿着障碍物的边缘前行;所述终止点为智能体的出口;Step S640: During the process of the agent moving from the first state to the fourth state, if the end points marked by other agents are detected, the running state of the agent is changed from the fourth state to the fourth state. In one state, the intelligent agent is controlled to move toward the target position in the first state; if the obstacle is detected to be within a safe distance, the operating state of the intelligent agent is transferred from the fourth state to the second state. , controlling the agent to move forward toward the target position in the second state; wherein the second state is to move along the edge of the obstacle; the end point is the exit of the agent;
步骤S650,在所述智能体从第四状态转为按第二状态前行过程中,若检测到所述障碍物距离大于安全距离,则将所述智能体的运行状态从第二状态转为第四状态,控制所述智能体按第四状态朝目标位置前行;Step S650: During the process of the intelligent agent moving from the fourth state to the second state, if it is detected that the obstacle distance is greater than the safe distance, the operating state of the intelligent agent is converted from the second state to the second state. In the fourth state, the intelligent agent is controlled to move toward the target position in the fourth state;
步骤S660,将所述智能体的运行状态从第一状态转为第二状态,控制所述智能体按第二状态朝目标位置前行;Step S660: Change the running state of the intelligent agent from the first state to the second state, and control the intelligent agent to move toward the target position in the second state;
步骤S670,在所述智能体从第一状态转为按第二状态前行过程中,若检测到所述障碍物 距离大于安全距离,则将所述智能体的运行状态从第二状态转为第一状态,控制所述智能体按第一状态朝目标位置前行;若检测到所述障碍物距离小于场边界距离,则将检测到所述障碍物距离小于场边界距离所在位置标记为触发点后,将所述智能体的运行状态从第二状态转为第三状态,控制所述智能体按第三状态朝目标位置前行;其中,所述第三状态为从触发点沿原路返回。Step S670: During the process of the agent moving from the first state to the second state, if the obstacle is detected If the distance is greater than the safe distance, then the operating state of the intelligent agent is changed from the second state to the first state, and the intelligent agent is controlled to move toward the target position in the first state; if the distance to the obstacle is detected to be less than the field boundary distance, then mark the location where the obstacle distance is less than the field boundary distance as the trigger point, change the running state of the intelligent agent from the second state to the third state, and control the intelligent agent to press the third state Move forward toward the target position; wherein the third state is to return from the trigger point along the original path.
需要说明的是,障碍物距离通过实时检测得到,是一个变量;安全距离和场边界距离是预先设置的定量,安全距离大于场边界距离,通过结合障碍物距离、安全距离和场边界距离这三者的关系,用来决定智能体的运行状态,保证智能体的避障的前提下以最快效率抵达目标位置。It should be noted that the obstacle distance is obtained through real-time detection and is a variable; the safety distance and field boundary distance are preset quantifications. The safety distance is greater than the field boundary distance. By combining the obstacle distance, safety distance and field boundary distance, The relationship between the agents is used to determine the operating status of the agent and ensure that the agent reaches the target location as quickly as possible while avoiding obstacles.
本实施例中,智能体有四种运行状态,当遇到不同的情况时,会在各种运行状态之间来回切换。在所有运行状态下,智能体在选择下一个位置时不能选择其他智能体所在栅格。如果选择的栅格中还有其他智能体,智能体放弃最优选择,选择最优栅格附近可行的栅格。智能体计算并选择代价评估值最大的空置栅格进行导航。智能体与障碍物之间设置有安全距离。设置智能体到边界距离的场边界距离,智能体传感器具有一定的检测范围,智能体只能在检测范围内探测到障碍物或边界。第一状态中智能体通过评估周围栅格的代价评估值实现基本导航,第二状态用于指示智能体沿障碍物边缘行走。当智能体进入这种状态时,它将在面临决定时留下一个“触发点”,随机选择可能的方向,沿着障碍物的边缘移动。如果智能体找到出口,智能体将设置“终止点”,提醒其他智能体可以直接向“终止点”移动;如果智能体没有找到出口,但检测到场边界,则意味着智能体在“触发点”没有选择最优方向。此时,当其他智能体在“触发点”附近时,智能体会直接与其他智能体进行通讯,告知其他智能体最优的方向,即一开始该智能体所选错误方向的相反方向,为其他智能体的前行提供可行性参考,节省其他智能体的路径选择时间。当其他智能体检测范围内存在“触发点”,但离开“触发点”的智能体不确定最佳方向时,其他智能体以一定概率自行随机选择方向,例如,在存在两个方向的情况下,两个方向被选择的概率各为0.5。当智能体到达“触发点”的时候,发现留下“触发点”的智能体进入过第三状态,并当前已经离开第三状态,说明智能体经进入了此时通过其自身经验得到的最佳方向,其他智能体可以直接跟随该智能体移动。在本发明实施例中,图14至图17示出所述避障控制策略在不同场景下的应用情况。In this embodiment, the intelligent agent has four operating states. When encountering different situations, it will switch back and forth between various operating states. In all running states, the agent cannot select the grid where other agents are located when selecting the next position. If there are other agents in the selected grid, the agent abandons the optimal choice and selects a feasible grid near the optimal grid. The agent calculates and selects the vacant grid with the largest cost evaluation value for navigation. A safe distance is set between the agent and obstacles. Set the field boundary distance from the agent to the boundary. The agent sensor has a certain detection range, and the agent can only detect obstacles or boundaries within the detection range. In the first state, the agent realizes basic navigation by evaluating the cost evaluation value of the surrounding grid, and in the second state, it is used to instruct the agent to walk along the edge of the obstacle. When the agent enters this state, it leaves a "trigger point" when faced with a decision, randomly choosing possible directions to move along the edge of the obstacle. If the agent finds the exit, the agent will set the "end point" to remind other agents that they can move directly to the "end point"; if the agent does not find the exit but detects the field boundary, it means that the agent is at the "trigger point" No optimal direction was chosen. At this time, when other agents are near the "trigger point", the agent will directly communicate with other agents and inform other agents of the optimal direction, that is, the opposite direction of the wrong direction selected by the agent at the beginning, and provide other agents with The forward movement of the agent provides a feasible reference and saves the path selection time of other agents. When there is a "trigger point" within the detection range of other agents, but the agent leaving the "trigger point" is not sure of the best direction, the other agents randomly select the direction on their own with a certain probability, for example, when there are two directions , the probability of each of the two directions being selected is 0.5. When the agent reaches the "trigger point", it is found that the agent that left the "trigger point" has entered the third state and has currently left the third state, indicating that the agent has entered the final state obtained through its own experience at this time. In the optimal direction, other agents can directly follow the agent to move. In this embodiment of the present invention, Figures 14 to 17 illustrate the application of the obstacle avoidance control strategy in different scenarios.
另外,在一实施例中,所述方法还包括:In addition, in one embodiment, the method further includes:
在所述智能体按第四状态前行过程中,若检测到触发点,则确定标记该触发点的智能体在当前的运行状态和之前的运行状态;During the process of the agent moving forward in the fourth state, if a trigger point is detected, the current operating state and the previous operating state of the agent marking the trigger point are determined;
若标记该触发点的智能体在当前的运行状态和之前的运行状态中均未按第三状态前行,则执行步骤S500;If the agent marking the trigger point does not move forward according to the third state in the current running state and the previous running state, step S500 is executed;
也就是说,所述智能体检测到了触发点,但是标记该触发点的智能体在当前的运行状态和之前的运行状态中不存在第三状态,则执行步骤S500;即,标记该触发点的智能体既不知道场边界,也没探测到场边界,则需要所述智能体(按第四状态前行的智能体)自行根据所述浓度评估值、第一距离评估值和第二距离评估值确定每个空置栅格的代价评估值,将代价评估值最小的空置栅格作为待进入栅格。That is to say, the agent detects the trigger point, but the agent that marks the trigger point does not have a third state in the current operating state and the previous operating state, then step S500 is executed; that is, the agent that marks the trigger point If the agent neither knows the field boundary nor detects the field boundary, the agent (the agent moving forward according to the fourth state) needs to calculate the concentration evaluation value, the first distance evaluation value and the second distance evaluation value by itself. Determine the cost evaluation value of each vacant raster, and use the vacant raster with the smallest cost evaluation value as the raster to be entered.
若标记该触发点的智能体在当前的运行状态为第三状态,则所述智能体按与当前前行路径相反的方向前行;If the current running state of the agent marking the trigger point is the third state, the agent will move forward in the opposite direction to the current forward path;
也就是说,标记该触发点的智能体知道场边界,确定其前行的方向不通,则设置触发点的智能体选择与标记该触发点的智能体的相反方向前行,从而避开无法前行的路径。 That is to say, the agent that marks the trigger point knows the field boundary and determines that its forward direction is blocked. Then the agent that sets the trigger point chooses to move in the opposite direction to the agent that marks the trigger point, thereby avoiding the inability to move forward. line path.
若标记该触发点的智能体在之前的运行状态为第三状态,且当前未按第三状态前行,则所述智能体跟随标记该触发点的智能体前行;If the previous operating state of the agent that marked the trigger point was the third state and is not currently moving forward in the third state, then the agent will follow the agent that marked the trigger point;
也就是说,标记该触发点的智能体已找到正确的前行,则所述智能体直接遵循标记该触发点的智能体的路径前行。That is to say, if the agent that marked the trigger point has found the correct path forward, the agent will directly follow the path of the agent that marked the trigger point and move forward.
确定标记该触发点的智能体是否也标记过终止点,若标记该触发点的智能体也标记过终止点,则所述智能体跟随标记该触发点的智能体朝所述终止点前行。Determine whether the agent that marked the trigger point has also marked the end point. If the agent that marked the trigger point has also marked the end point, then the agent follows the agent that marked the trigger point and moves toward the end point.
也就是说,标记该触发点的智能体已找到正确的前行,也找到出口,则所述智能体直接遵循标记该触发点的智能体的路径朝所述终止点前行。That is to say, if the agent that marked the trigger point has found the correct way forward and also found the exit, then the agent will directly follow the path of the agent that marked the trigger point and move toward the end point.
另外,参照图18,本发明的一个实施例还提供了一种电子设备10,该电子设备10包括:存储器11、处理器12及存储在存储器11上并可在处理器12上运行的计算机程序。In addition, referring to Figure 18, one embodiment of the present invention also provides an electronic device 10. The electronic device 10 includes: a memory 11, a processor 12, and a computer program stored on the memory 11 and executable on the processor 12. .
处理器12和存储器11可以通过总线或者其他方式连接。The processor 12 and the memory 11 may be connected through a bus or other means.
实现上述实施例的基于基因调控网络的多智能体导航控制方法所需的非暂态软件程序以及指令存储在存储器11中,当被处理器12执行时,执行上述实施例中的基于基因调控网络的多智能体导航控制方法。The non-transient software programs and instructions required to implement the gene regulation network-based multi-agent navigation control method in the above embodiment are stored in the memory 11. When executed by the processor 12, the gene regulation network-based method in the above embodiment is executed. Multi-agent navigation control method.
此外,本发明的一个实施例还提供了一种计算机可读存储介质,该计算机可读存储介质存储有计算机可执行指令,该计算机可执行指令被一个处理器或控制器执行,例如,被上述电子设备实施例中的一个处理器执行,可使得上述处理器执行上述实施例中的基于基因调控网络的多智能体导航控制方法。In addition, an embodiment of the present invention also provides a computer-readable storage medium that stores computer-executable instructions, and the computer-executable instructions are executed by a processor or controller, for example, by the above-mentioned Execution by a processor in the electronic device embodiment can cause the processor to execute the multi-agent navigation control method based on the gene regulatory network in the above embodiment.
本领域普通技术人员可以理解,上文中所公开方法中的全部或某些步骤、系统可以被实施为软件、固件、硬件及其适当的组合。某些物理组件或所有物理组件可以被实施为由处理器,如中央处理器、数字信号处理器或微处理器执行的软件,或者被实施为硬件,或者被实施为集成电路,如专用集成电路。这样的软件可以分布在计算机可读介质上,计算机可读介质可以包括计算机存储介质(或非暂时性介质)和通信介质(或暂时性介质)。如本领域普通技术人员公知的,术语计算机存储介质包括在用于存储信息(诸如计算机可读指令、数据结构、程序模块或其他数据)的任何方法或技术中实施的易失性和非易失性、可移除和不可移除介质。计算机存储介质包括但不限于RAM、ROM、EEPROM、闪存或其他存储器技术、CD-ROM、数字多功能盘(DVD)或其他光盘存储、磁盒、磁带、磁盘存储或其他磁存储装置、或者可以用于存储期望的信息并且可以被计算机访问的任何其他的介质。此外,本领域普通技术人员公知的是,通信介质通常包含计算机可读指令、数据结构、程序模块或者诸如载波或其他传输机制之类的调制数据信号中的其他数据,并且可包括任何信息递送介质。Those of ordinary skill in the art can understand that all or some steps and systems in the methods disclosed above can be implemented as software, firmware, hardware, and appropriate combinations thereof. Some or all of the physical components may be implemented as software executed by a processor, such as a central processing unit, a digital signal processor, or a microprocessor, or as hardware, or as an integrated circuit, such as an application specific integrated circuit . Such software may be distributed on computer-readable media, which may include computer storage media (or non-transitory media) and communication media (or transitory media). As is known to those of ordinary skill in the art, the term computer storage media includes volatile and nonvolatile media implemented in any method or technology for storage of information such as computer readable instructions, data structures, program modules or other data. removable, removable and non-removable media. Computer storage media includes, but is not limited to, RAM, ROM, EEPROM, flash memory or other memory technology, CD-ROM, Digital Versatile Disk (DVD) or other optical disk storage, magnetic cassettes, tapes, disk storage or other magnetic storage devices, or may Any other medium used to store the desired information and that can be accessed by a computer. Additionally, it is known to those of ordinary skill in the art that communication media typically embodies computer readable instructions, data structures, program modules or other data in a modulated data signal such as a carrier wave or other transport mechanism, and may include any information delivery media .
以上是对本发明的较佳实施进行了具体说明,但本发明并不局限于上述实施方式,熟悉本领域的技术人员在不违背本发明精神的前提下还可作出种种的等同变形或替换,这些等同的变形或替换均包含在本发明权利要求所限定的范围内。 The above is a detailed description of the preferred implementation of the present invention, but the present invention is not limited to the above-mentioned embodiments. Those skilled in the art can also make various equivalent modifications or substitutions without violating the spirit of the present invention. Equivalent modifications or substitutions are included within the scope of the claims of the present invention.
Claims (8)
The multi-agent navigation control method based on gene regulation network according to claim 2, characterized in that the calculation formula relied on to generate the target concentration map is:
The calculation formula relied on to generate the obstacle concentration map is:
The calculation formula for the concentration information corresponding to each grid in the grid map is:
The multi-agent navigation control method based on gene regulation network according to claim 1, characterized in that the calculation formula of the cost evaluation value is:
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202210436264.6 | 2022-04-25 | ||
| CN202210436264.6A CN114543815B (en) | 2022-04-25 | 2022-04-25 | Multi-agent navigation control method, equipment and medium based on gene regulatory network |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2023207803A1 true WO2023207803A1 (en) | 2023-11-02 |
Family
ID=81667170
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/CN2023/089865 Ceased WO2023207803A1 (en) | 2022-04-25 | 2023-04-21 | Multi-agent navigation control method and device based on gene regulatory network, and medium |
Country Status (2)
| Country | Link |
|---|---|
| CN (1) | CN114543815B (en) |
| WO (1) | WO2023207803A1 (en) |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN119717797A (en) * | 2024-11-29 | 2025-03-28 | 汕头大学 | Unmanned cluster target search control method, equipment and medium based on local information |
Families Citing this family (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN114543815B (en) * | 2022-04-25 | 2022-07-19 | 汕头大学 | Multi-agent navigation control method, equipment and medium based on gene regulatory network |
| CN115145289A (en) * | 2022-09-02 | 2022-10-04 | 汕头大学 | Multi-agent cooperative trapping method, system, equipment and storage medium |
| CN115390589B (en) * | 2022-10-27 | 2023-02-28 | 汕头大学 | Unmanned aerial vehicle cluster control method and device, electronic equipment and storage medium |
| CN116578092B (en) * | 2023-05-29 | 2023-11-14 | 汕头大学 | An intelligent agent cluster motion control method and system based on gene regulation network |
Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN108415425A (en) * | 2018-02-08 | 2018-08-17 | 东华大学 | It is a kind of that swarm algorithm is cooperateed with based on the Distributed Cluster robot for improving gene regulatory network |
| US20190051198A1 (en) * | 2018-09-28 | 2019-02-14 | Intel Corporation | Method of generating a collision free path of travel and computing system |
| CN111006666A (en) * | 2019-11-21 | 2020-04-14 | 深圳市优必选科技股份有限公司 | Robot path planning method and device, storage medium and robot |
| CN112684700A (en) * | 2020-11-30 | 2021-04-20 | 汕头大学 | Multi-target searching and trapping control method and system for swarm robots |
| CN114543815A (en) * | 2022-04-25 | 2022-05-27 | 汕头大学 | Multi-agent navigation control method, equipment and medium based on gene regulatory network |
Family Cites Families (9)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN103439972B (en) * | 2013-08-06 | 2016-06-29 | 重庆邮电大学 | A kind of method for planning path for mobile robot under DYNAMIC COMPLEX environment |
| US9446512B1 (en) * | 2014-03-26 | 2016-09-20 | Stc.Unm | iAnt swarm robotic platform and evolutionary algorithms |
| CN109164810B (en) * | 2018-09-28 | 2021-08-10 | 昆明理工大学 | Robot self-adaptive dynamic path planning method based on ant colony-clustering algorithm |
| US20210103286A1 (en) * | 2019-10-04 | 2021-04-08 | Hong Kong Applied Science And Technology Research Institute Co., Ltd. | Systems and methods for adaptive path planning |
| CN111694364A (en) * | 2020-06-30 | 2020-09-22 | 山东交通学院 | Hybrid algorithm based on improved ant colony algorithm and dynamic window method and applied to intelligent vehicle path planning |
| CN112631294B (en) * | 2020-12-16 | 2022-06-28 | 上海应用技术大学 | A mobile robot intelligent path planning method |
| CN112612275A (en) * | 2020-12-23 | 2021-04-06 | 南京市晨枭软件技术有限公司 | Complex path planning system and method for database machine room |
| CN112947591A (en) * | 2021-03-19 | 2021-06-11 | 北京航空航天大学 | Path planning method, device, medium and unmanned aerial vehicle based on improved ant colony algorithm |
| CN113325839B (en) * | 2021-05-08 | 2023-02-03 | 江苏科技大学 | A Path Planning Method for Intelligent Storage Robot Based on Improved Ant Colony Algorithm |
-
2022
- 2022-04-25 CN CN202210436264.6A patent/CN114543815B/en active Active
-
2023
- 2023-04-21 WO PCT/CN2023/089865 patent/WO2023207803A1/en not_active Ceased
Patent Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN108415425A (en) * | 2018-02-08 | 2018-08-17 | 东华大学 | It is a kind of that swarm algorithm is cooperateed with based on the Distributed Cluster robot for improving gene regulatory network |
| US20190051198A1 (en) * | 2018-09-28 | 2019-02-14 | Intel Corporation | Method of generating a collision free path of travel and computing system |
| CN111006666A (en) * | 2019-11-21 | 2020-04-14 | 深圳市优必选科技股份有限公司 | Robot path planning method and device, storage medium and robot |
| CN112684700A (en) * | 2020-11-30 | 2021-04-20 | 汕头大学 | Multi-target searching and trapping control method and system for swarm robots |
| CN114543815A (en) * | 2022-04-25 | 2022-05-27 | 汕头大学 | Multi-agent navigation control method, equipment and medium based on gene regulatory network |
Non-Patent Citations (1)
| Title |
|---|
| WANG CHEN; FAN ZHUN; LUO WEICHENG; GU MINQIANG; WANG DONGLIANG; SHI ZHAOHUI: "Collaborative Navigation Based on Gene Regulatory Networks and Finite State Machine", 2022 8TH INTERNATIONAL CONFERENCE ON BIG DATA AND INFORMATION ANALYTICS (BIGDIA), IEEE, 24 August 2022 (2022-08-24), pages 12 - 19, XP034183787, DOI: 10.1109/BigDIA56350.2022.9874179 * |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN119717797A (en) * | 2024-11-29 | 2025-03-28 | 汕头大学 | Unmanned cluster target search control method, equipment and medium based on local information |
Also Published As
| Publication number | Publication date |
|---|---|
| CN114543815A (en) | 2022-05-27 |
| CN114543815B (en) | 2022-07-19 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| WO2023207803A1 (en) | Multi-agent navigation control method and device based on gene regulatory network, and medium | |
| CN110858076B (en) | Equipment positioning and grid map construction method and mobile robot | |
| EP3828587B1 (en) | Method for determining the position of a vehicle | |
| CN111694356B (en) | Driving control method and device, electronic equipment and storage medium | |
| CN111830979B (en) | Track optimization method and device | |
| CN110749901B (en) | Autonomous mobile robot and its map mosaic method, device and readable storage medium | |
| CN113805569B (en) | Countermeasure system, method, terminal and storage medium based on multi-agent technology | |
| US11131997B2 (en) | Route generation apparatus | |
| CN119376384A (en) | Robot motion control method, device, storage medium and robot | |
| CN108628318B (en) | Congested environment detection method, device, robot and storage medium | |
| CN111665868B (en) | Unmanned ship return method, device, equipment and storage medium based on virtual channel | |
| CN118226860B (en) | Robot motion control method, device, robot and storage medium | |
| CN114012732B (en) | Robot control method, device, equipment and medium | |
| JP2019191498A (en) | Map creation device | |
| CN111045433B (en) | A robot obstacle avoidance method, robot and computer-readable storage medium | |
| CN116009543A (en) | A method of escape for a robot | |
| CN114594772A (en) | Robot, path planning method, device and storage medium | |
| CN119270837B (en) | Mobile robot autonomous environment exploration method and system under unknown environment | |
| CN112904855B (en) | Follow-up robot local path planning method based on improved dynamic window | |
| CN118608559B (en) | Distributed-inspired unmanned swarm brain-like intelligence fusion search and capture method | |
| CN115562296B (en) | A robot scheduling method, system and device based on a hybrid control strategy | |
| CN116068574A (en) | Method, device, equipment and storage medium for creating and deleting multi-target tracking tracks | |
| CN115047871A (en) | Multi-unmanned vehicle collaborative search method, device, equipment and medium for dynamic target | |
| CN116048086B (en) | A method for autonomous switching control of static paths in ship navigation | |
| JP7762043B2 (en) | Position estimation device, robot system including same, and position estimation method |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 23795236 Country of ref document: EP Kind code of ref document: A1 |
|
| NENP | Non-entry into the national phase |
Ref country code: DE |
|
| 122 | Ep: pct application non-entry in european phase |
Ref document number: 23795236 Country of ref document: EP Kind code of ref document: A1 |