[go: up one dir, main page]

WO2023003979A3 - Optimal data-driven decision-making in multi-agent systems - Google Patents

Optimal data-driven decision-making in multi-agent systems Download PDF

Info

Publication number
WO2023003979A3
WO2023003979A3 PCT/US2022/037763 US2022037763W WO2023003979A3 WO 2023003979 A3 WO2023003979 A3 WO 2023003979A3 US 2022037763 W US2022037763 W US 2022037763W WO 2023003979 A3 WO2023003979 A3 WO 2023003979A3
Authority
WO
WIPO (PCT)
Prior art keywords
driven
making
data
agent
systems
Prior art date
Application number
PCT/US2022/037763
Other languages
French (fr)
Other versions
WO2023003979A2 (en
Inventor
Benjamin CHASNOV
Lillian RATLIFF
Samuel A. BURDEN
Amy ORSBORN
Tanner FIEZ
Manchali Maneeshika MADDURI
Joseph G SULLIVAN
Momona YAMAGAMI
Original Assignee
University Of Washington
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by University Of Washington filed Critical University Of Washington
Priority to US18/290,707 priority Critical patent/US20250094855A1/en
Publication of WO2023003979A2 publication Critical patent/WO2023003979A2/en
Publication of WO2023003979A3 publication Critical patent/WO2023003979A3/en
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computing arrangements using knowledge-based models
    • G06N5/04Inference or reasoning models
    • G06N5/043Distributed expert systems; Blackboards
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/011Arrangements for interaction with the human body, e.g. for user immersion in virtual reality
    • G06F3/015Input arrangements based on nervous system activity detection, e.g. brain waves [EEG] detection, electromyograms [EMG] detection, electrodermal response detection
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/004Artificial life, i.e. computing arrangements simulating life
    • G06N3/006Artificial life, i.e. computing arrangements simulating life based on simulated virtual individual or collective life forms, e.g. social simulations or particle swarm optimisation [PSO]

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Computing Systems (AREA)
  • Artificial Intelligence (AREA)
  • Mathematical Physics (AREA)
  • Evolutionary Computation (AREA)
  • Data Mining & Analysis (AREA)
  • Biomedical Technology (AREA)
  • Health & Medical Sciences (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • General Health & Medical Sciences (AREA)
  • Medical Informatics (AREA)
  • Computational Linguistics (AREA)
  • Neurology (AREA)
  • Neurosurgery (AREA)
  • Human Computer Interaction (AREA)
  • Dermatology (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Biophysics (AREA)
  • Molecular Biology (AREA)
  • Feedback Control In General (AREA)
  • Electrically Operated Instructional Devices (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

Systems and methods for optimizing data-driven decision-making in multi-agent systems are described. The system may construct an equilibrium concept to capture multi-layer and/or k-level depth reasoning by agents. The system may determine best-response type conjectures for agents to interact with one another. In some examples, the system may include a machine or an algorithm interacting with a strategic agent (e.g., a human or an entity). The system may include methods for: (1) data-driven estimation and/or learning of conjectures and associated depth; and (2) data-driven design of algorithmic mechanisms for exploring the conjectural equilibrium (CE) space by influencing the strategic agent(s) behaviors through adaptively adjusting and estimating deployed strategies.
PCT/US2022/037763 2021-07-21 2022-07-20 Optimal data-driven decision-making in multi-agent systems Ceased WO2023003979A2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US18/290,707 US20250094855A1 (en) 2021-07-21 2022-07-20 Optimal data-driven decision-making in multi-agent systems

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US202163224325P 2021-07-21 2021-07-21
US63/224,325 2021-07-21

Publications (2)

Publication Number Publication Date
WO2023003979A2 WO2023003979A2 (en) 2023-01-26
WO2023003979A3 true WO2023003979A3 (en) 2023-02-23

Family

ID=84978747

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2022/037763 Ceased WO2023003979A2 (en) 2021-07-21 2022-07-20 Optimal data-driven decision-making in multi-agent systems

Country Status (2)

Country Link
US (1) US20250094855A1 (en)
WO (1) WO2023003979A2 (en)

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20240070523A1 (en) * 2022-08-23 2024-02-29 Microsoft Technology Licensing, Llc Machine learning training content delivery
CN116843198B (en) * 2023-07-04 2025-08-12 西北工业大学 A dynamic human-machine function allocation method for manned submersibles based on non-cooperative game
WO2025043050A1 (en) * 2023-08-24 2025-02-27 Carnegie Mellon University Training and use of a posture invariant brain-computer interface
CN117518833B (en) * 2023-12-20 2024-05-31 哈尔滨工业大学 Improved high-order multi-autonomous cluster distributed non-cooperative game method and system
CN117521716B (en) * 2024-01-02 2024-03-19 山东大学 Collaborative decision-making methods and media for massive unknown options and limited memory space
WO2025166403A1 (en) * 2024-02-05 2025-08-14 Strategex Pty Ltd Multi-agent computer control system with iterative optimisation for autonomous agent coordination
CN118538362B (en) * 2024-04-08 2024-12-27 南京信息工程大学 Somatosensory-based interactive virtual rehabilitation training method and system

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20170334066A1 (en) * 2016-05-20 2017-11-23 Google Inc. Machine learning methods and apparatus related to predicting motion(s) of object(s) in a robot's environment based on image(s) capturing the object(s) and based on parameter(s) for future robot movement in the environment
US20180183827A1 (en) * 2016-12-28 2018-06-28 Palantir Technologies Inc. Resource-centric network cyber attack warning system
US20190102692A1 (en) * 2017-09-29 2019-04-04 Here Global B.V. Method, apparatus, and system for quantifying a diversity in a machine learning training data set
US20190310636A1 (en) * 2018-04-09 2019-10-10 SafeAl, Inc. Dynamically controlling sensor behavior
US20200298100A1 (en) * 2019-03-21 2020-09-24 Valve Corporation Brain-computer interfaces for computing systems

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20170334066A1 (en) * 2016-05-20 2017-11-23 Google Inc. Machine learning methods and apparatus related to predicting motion(s) of object(s) in a robot's environment based on image(s) capturing the object(s) and based on parameter(s) for future robot movement in the environment
US20180183827A1 (en) * 2016-12-28 2018-06-28 Palantir Technologies Inc. Resource-centric network cyber attack warning system
US20190102692A1 (en) * 2017-09-29 2019-04-04 Here Global B.V. Method, apparatus, and system for quantifying a diversity in a machine learning training data set
US20190310636A1 (en) * 2018-04-09 2019-10-10 SafeAl, Inc. Dynamically controlling sensor behavior
US20200298100A1 (en) * 2019-03-21 2020-09-24 Valve Corporation Brain-computer interfaces for computing systems

Also Published As

Publication number Publication date
US20250094855A1 (en) 2025-03-20
WO2023003979A2 (en) 2023-01-26

Similar Documents

Publication Publication Date Title
WO2023003979A3 (en) Optimal data-driven decision-making in multi-agent systems
Schmickl et al. CoCoRo--The self-aware underwater swarm
Hirose et al. Deep visual mpc-policy learning for navigation
Kaplan et al. Maximizing learning progress: an internal reward system for development
Lin Self-improving reactive agents based on reinforcement learning, planning and teaching
JP7695836B2 (en) Method and device for learning a strategy and method and device for operating a strategy - Patents.com
Ender et al. Cognitive radar-enabling techniques for next generation radar systems
US20190384317A1 (en) Method for driving robot based on external image, and robot and server implementing the same
KR102794622B1 (en) Series elastic actuator
Braik et al. Applications of dynamic feature selection based on augmented white shark optimizer for medical diagnosis
Chen et al. LJIR: Learning Joint-Action Intrinsic Reward in cooperative multi-agent reinforcement learning
Deka et al. Hiding Leader’s identity in leader-follower navigation through multi-agent reinforcement learning
Shi et al. Laser range data based semantic labeling of places
Paul Dean et al. The cerebellum as an adaptive filter: a general model?
Schultz et al. Continuous and embedded learning in autonomous vehicles: Adapting to sensor failures
Fink et al. Meeting in the dark room: Bayesian rational analysis and hierarchical predictive coding
Moulin-Frier et al. Autonomous development of turn-taking behaviors in agent populations: A computational study
Hausknecht et al. Grounded semantic networks for learning shared communication protocols
Blasch et al. Physio-associative temporal sensor integration
Lee et al. Clever: Stream-based active learning for robust semantic perception from human instructions
Shan Theory of Learning in Neural Networks with Small Weight Perturbations
Salomon Evolving receptive-field controllers for mobile robots
Singh et al. Enhancing multi-agent reinforcement learning: set function approximation and dynamic policy adaptation
Han et al. Informative Path Planning for Multi-UUV Cooperative Search with Distributed Q-Learning
Weingaertner et al. Evaluation of decision aiding in submarine emergency decision making

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22846578

Country of ref document: EP

Kind code of ref document: A2

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 22846578

Country of ref document: EP

Kind code of ref document: A2

WWP Wipo information: published in national office

Ref document number: 18290707

Country of ref document: US