
Slide 1

Multiagent Teamwork: Analyzing the Optimality and Complexity of Key Theories and Models

David V. Pynadath and Milind Tambe

Information Sciences Institute and

Department of Computer Science

University of Southern California

Slide 2

Agent Teamwork

Agents, robots, sensors, spacecraft, etc.
Performing a common task
Operating in an uncertain environment
Distinct, uncertain observations
Distinct actions with uncertain effects
Limited, costly communication

Example domains: battlefield simulation, satellite clusters, disaster rescue

Slide 3

Motivation

[Figure: plot of performance vs. complexity. Theoretical approaches (e.g., no communication) are marked optimal but high-complexity; practical systems are lower-complexity but of unknown optimality (marked "?"); a new algorithm aims to be optimal at lower complexity.]

Outline of Results

1) Unified teamwork framework

2) Complexity of optimal teamwork

3) New coordination algorithm

4) Optimality-Complexity evaluation of existing methods

Slide 4

Example Domain: Helicopter Team

[Figure: two helicopters flying toward a goal past an enemy radar. One pilot reports, "I destroyed the enemy radar"; the other asks, "Did they see that?"]

Slide 5

Communicative Multiagent Team Decision Problem (COM-MTDP)

S: states of the world, e.g., position of the helicopters, position of the enemy

A: domain-level actions, e.g., fly below radar, fly at normal altitude

P: transition probability function, e.g., world dynamics, effects of actions

Σ: communication capabilities, possible "speech acts", e.g., "I have destroyed the enemy radar."

Slide 6

COM-MTDPs (cont'd)

Ω: observations, e.g., enemy radar, position of the other helicopter

O: observation probability function (for each agent); maps state and actions into a distribution over observations (e.g., a sensor noise model)

R: reward (over states, actions, and messages), e.g., good if we reach the destination, better if we reach it earlier; saying "I have destroyed the enemy" has a cost

Teamwork definition: all members share the same preferences (i.e., the same R)
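The tuple defined on these two slides can be sketched as a small data structure. This is a minimal illustration of the model's components, not the authors' source code; all field names and types are assumptions.

```python
from dataclasses import dataclass
from typing import Callable

# A minimal sketch of the COM-MTDP tuple <S, A, Sigma, P, Omega, O, R>.
# Field names and function signatures are illustrative assumptions.
@dataclass
class COMMTDP:
    states: list          # S: world states
    actions: list         # A: domain-level actions
    messages: list        # Sigma: communication acts ("speech acts")
    transition: Callable  # P(s, a) -> distribution over next states
    observations: list    # Omega: possible observations
    observe: Callable     # O(s, a, agent) -> distribution over observations
    reward: Callable      # R(s, a, msgs) -> shared team reward

# Teamwork means all members share the same preferences:
# there is a single reward function R for the whole team,
# rather than one reward per agent.
```

Because R is shared, any policy comparison (communicate vs. stay silent, act vs. wait) is evaluated against the one team-level reward.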

Slide 7

Problem Complexity

[Table: complexity of COM-MTDPs under varying communication assumptions (free communication vs. no communication) and observability assumptions (individually observable vs. collectively observable).]

Slide 8

To Communicate or Not To Communicate

Local decision of one agent at a single point in time: "I have achieved a joint goal. Should I tell my teammate?"

Joint intentions theory: "I must attain mutual belief." Always communicate. [Jennings]

STEAM: "I must communicate if the expected cost of miscoordination outweighs the cost of communication." [Tambe] Each cost is a fixed parameter specified by the designer.
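The STEAM rule described above reduces to a fixed-threshold comparison. The sketch below illustrates that comparison; the parameter names are assumptions, and real STEAM is embedded in a larger rule-based architecture.

```python
def steam_should_communicate(p_miscoordination: float,
                             cost_miscoordination: float,
                             cost_communication: float) -> bool:
    """STEAM-style decision sketch: communicate when the expected cost of
    miscoordination (probability times cost, both fixed designer-supplied
    parameters) outweighs the fixed cost of communicating.
    Parameter names are illustrative, not from the STEAM implementation."""
    return p_miscoordination * cost_miscoordination > cost_communication
```

Because the costs are fixed parameters rather than expectations over the agent's current beliefs, the rule is cheap to evaluate but cannot adapt to the situation at hand, which is the gap the locally optimal criterion on the next slide addresses.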

Slide 9

Locally Optimal Criterion for Communication

Communicate if and only if:

E[R | communicate] − (expected cost of communicating) ≥ E[R | do not communicate]

where the expectation is taken over the possible histories of states and beliefs up to the current time, E[R | communicate] is the expected reward over future trajectories of states and beliefs WITH communication, and E[R | do not communicate] is the expected reward over future trajectories of states and beliefs WITHOUT communication.
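The criterion above can be sketched as a comparison of two expectations over possible histories. The representation of histories as an enumerable set with explicit probabilities is an illustrative assumption; in general these expectations are what make the rule expensive to evaluate.

```python
def locally_optimal_communicate(histories, p_history,
                                value_with_comm, value_without_comm,
                                comm_cost):
    """Locally optimal rule sketch: communicate iff the expected future
    reward with communication, minus the expected cost of communicating,
    is at least the expected future reward when staying silent.
    The expectation runs over the possible state/belief histories
    consistent with what the agent has observed so far.
    All parameter names are illustrative assumptions."""
    ev_comm = sum(p_history[h] * value_with_comm(h) for h in histories)
    ev_silent = sum(p_history[h] * value_without_comm(h) for h in histories)
    return ev_comm - comm_cost >= ev_silent
```

Unlike STEAM's fixed parameters, every quantity here depends on the agent's current belief state, which is why this criterion is locally optimal but more costly to compute.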

Slide 10

Empirical Results

[Figure: plot of V_opt − V (loss relative to the optimal policy) as a function of communication cost and observability.]

Slide 11

Empirical Results

[Figure-only slide; plot not recoverable from this transcript.]

Slide 12

Empirical Results

[Figure-only slide; plot not recoverable from this transcript.]

Slide 13

Optimality vs. Complexity

[Figure: expected reward E[R] vs. computation time in seconds (log scale, 0.1 to 10,000) for the Silent, Jennings, STEAM, Locally Optimal, and Globally Optimal policies, with Observability = 0.2 and Communication Cost = 0.7. The E[R] values shown range from 1.40 to 1.46.]

Slide 14

Optimality vs. Complexity

[Figure: expected reward E[R] vs. computation time in seconds (log scale, 0.1 to 10,000) for the Jennings, Silent, STEAM, Locally Optimal, and Globally Optimal policies, with Observability = 0.2 and Communication Cost = 0.3. The E[R] values shown range from 1.43 to 1.80.]

Slide 15

Summary

COM-MTDPs provide a unified framework for agent teamwork: the representation subsumes many existing agent models, and the policy space subsumes many existing prescriptive theories.

The framework supports deeper analyses of teamwork problems: a quantitative characterization of the optimality-efficiency tradeoff for different policies in different domains, and the derivation of novel coordination algorithms.

http://www.isi.edu/teamcore/Teamwork — detailed proofs, source code, JAIR article
