AI Meets Markets: Trading Agents & Strategic Reasoning Michael P. Wellman University of Michigan

Michael P. Wellman, University of Michigan. A software program that makes decisions autonomously about bidding in electronic markets.




AI Meets Markets: Trading Agents & Strategic Reasoning
Michael P. Wellman
University of Michigan


Trading Agent

A software program that makes decisions autonomously about bidding in electronic markets


Research In Trading Agents

Trading domains
  Financial markets
  B2B marketplaces
  Specialized: spectrum, electricity, advertising, prediction markets, …
  Simulations and abstract mechanisms
Research issues
  Agent architecture
  Techniques: forecasting, learning, optimization, …
  Strategic analysis
  Design methodology
  Effect of automated trading on markets


A Trading Agent Competition

An open-invitation market game
Why hold a competition?
  Do research on trading agents, but get others to write the agents
  Focus effort on a common problem
  Measure progress as experience accumulates


Trading Agent Competition (TAC)

Open-invitation international tournaments, featuring market games
18–43 entrants/year, worldwide
Travel-shopping scenario (2000–06)
  Designed at U Michigan
  Focus: interdependent markets, multiple market types
Supply Chain Mgmt (SCM) game (2003–)
  Designed at CMU, SICS
  Focus: multi-tier, multi-attribute negotiation, intertemporal
CAT Market Design Game (2007–)
  Designed at U Liverpool, U Southampton, Brooklyn College


TAC Travel: The Book

Autonomous Bidding Agents: Strategies and Lessons from the Trading Agent Competition

Michael P. Wellman, Amy Greenwald, and Peter Stone

(MIT Press, July 2007)


TAC/SCM Configuration

[Figure: TAC/SCM configuration. Suppliers: Pintel, IMD (CPU); Basus, Macrostar (motherboard); Mec, Queenmax (memory); Watergate, Mintor (hard disk). Manufacturers 1–6 sit between suppliers and the customer. Supplier side: component RFQs, supplier offers, component orders. Customer side: PC RFQs, PC bids, PC orders.]

10 component types
16 PC types
220 simulation days
15 seconds per day


The Coordination Problem

Decisions interact across markets and time
  If PC demand is rising, build extra component inventory now
  If capacity is constrained, raise selling price and pay premiums for critical components

Formulating the entire game as a single optimization is daunting, both computationally and conceptually


Value-Based Decomposition

Decomposition simplifies the problem
Modularity yields substantial advantages
  Computation
  Software development
Natural subproblems with local constraints
Need a way to coordinate separate decisions
Solution: Explicit values for resources
  PCs and components


Deep Maize Design

[Figure: Deep Maize design. Three modules: Market Prediction (State Estimation, Customer Market Predictions, Supplier Market Predictions); Coarse Decisions (Projected Manufacturing Schedule, PC Values, Component Values); Specific Decisions (Customer Bidding, Supply Purchasing).]


Market Prediction

[Figure: timeline over days d, d+1, d+2, d+3, … showing Procurement, Factory Planning, and Sales between the Supply Market and the Sales Market.]


Coarse Decisions

[Figure: timeline over days d, d+1, d+2, d+3, … showing Procurement, Factory Planning, and Sales between the Supply Market and the Sales Market.]


Specific Decisions

[Figure: timeline over days d, d+1, d+2, d+3, … showing Procurement, Factory Planning, and Sales between the Supply Market and the Sales Market.]


Market Prediction

Market predictions: prices, availability
  Depend on strategic behavior of agents
State estimates: demand, supplier capacities, inventories
Employ a variety of machine learning and modeling techniques

[Figure: Market Prediction module: State Estimation, Customer Market Predictions, Supplier Market Predictions]


Prediction Challenge

Side competition to compare performance on prediction subtask

[Figure: Predicting current computer prices. Panels: average over all games; single game.]

(Pardoe & Stone, 2008)


Coarse Decisions

Comprehensive production plan
  All levels of the supply chain
  Future decisions (out to a horizon)
Approximate optimization, subject to cross-cutting constraints:
  Factory capacity
  Inventory availability
Coarse decisions about sales and procurement:
  Sell a PC of type T for delivery on day D
  Buy a component of type T before day D

[Figure: Coarse Decisions module: PC Values, Component Values, Projected Manufacturing Schedule]


Specific Decisions

Deal with intricacies of supplier/customer interactions
  Specific customer bids to generate desired sales
  Which supplier to buy from
  Lead times for component orders
  Limits on the number of RFQs
Very large action spaces
Anytime search algorithms

[Figure: Specific Decisions module: Customer Bidding, Supply Purchasing]


Values

Derive marginal values for components and PCs
  Given type and day
  Relative to projected manufacturing schedule
Purchasing and sales account for these values
Represent information about global conditions
  If PC selling prices are low, component values fall
  If factory capacity is constrained, PC and component values rise

[Figure: Coarse Decisions module: PC Values, Component Values, Projected Manufacturing Schedule]
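A minimal sketch of the marginal-value idea, under assumptions that are not in the slides: a toy factory with two PC types, made-up prices and bills of materials, a brute-force planner standing in for the projected manufacturing schedule, and no day index. The names `PC_TYPES`, `plan_value`, and `marginal_component_value` are hypothetical. The value of one more component is taken as the change in the best plan's revenue when that unit is added to inventory.

```python
from itertools import product

# Hypothetical PC catalogue: price, factory cycles, and parts per unit (all made up).
PC_TYPES = {
    "basic":   {"price": 800.0,  "cycles": 4, "parts": {"cpu": 1, "memory": 1}},
    "premium": {"price": 1500.0, "cycles": 5, "parts": {"cpu": 1, "memory": 2}},
}

def plan_value(inventory, capacity, prices=None):
    """Best revenue over all feasible build plans (brute force, toy sizes only)."""
    prices = prices or {name: spec["price"] for name, spec in PC_TYPES.items()}
    names = list(PC_TYPES)
    limit = capacity // min(spec["cycles"] for spec in PC_TYPES.values())
    best = 0.0
    for qty in product(range(limit + 1), repeat=len(names)):
        # Cross-cutting constraint 1: factory capacity.
        cycles = sum(q * PC_TYPES[n]["cycles"] for q, n in zip(qty, names))
        if cycles > capacity:
            continue
        # Cross-cutting constraint 2: component inventory.
        used = {}
        for q, n in zip(qty, names):
            for part, per_unit in PC_TYPES[n]["parts"].items():
                used[part] = used.get(part, 0) + q * per_unit
        if all(used[p] <= inventory.get(p, 0) for p in used):
            best = max(best, sum(q * prices[n] for q, n in zip(qty, names)))
    return best

def marginal_component_value(part, inventory, capacity, prices=None):
    """Revenue gained by the best plan if one more unit of `part` were on hand."""
    more = dict(inventory)
    more[part] = more.get(part, 0) + 1
    return plan_value(more, capacity, prices) - plan_value(inventory, capacity, prices)

inventory = {"cpu": 3, "memory": 4}
print(marginal_component_value("cpu", inventory, capacity=20))     # 100.0
print(marginal_component_value("memory", inventory, capacity=20))  # 700.0
low_prices = {"basic": 800.0, "premium": 1000.0}
print(marginal_component_value("memory", inventory, 20, low_prices))  # 200.0
```

In this toy setting memory is the scarcer part, so it carries the higher value, and cutting the premium PC price lowers the memory value from 700 to 200, in line with the first bullet on global conditions. A purchasing module could compare such values against supplier offers without knowing how the plan was computed.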


Discussion: Value-Based Decomposition

Accounts for (many) global interactions, yet retains modularity
Values serve as an interface, allowing different techniques in each sub-problem and prediction task
Deep Maize design makes many approximations for simplicity and speed
  No iteration of solutions
  Most logical sub-problems are never solved
  Approximate optimization at all stages

See Kiekintveld et al., ACM Conference on Electronic Commerce, 2006.


TAC/SCM 2008

Held at AAAI-08, Chicago, 15–16 July
Final round results:
  1. Deep Maize        $5.3M    U Michigan
  2. TacTex             2.3M    U Texas
  3. CMieux            –0.8M    Carnegie Mellon U
  4. CrocodileAgent    –5.4M    U Zagreb
  5. Botticelli        –5.4M    Brown U
  6. Merlion          –11.5M    Singapore Mgmt U


Strategic Analysis

Tournament result tells only part of the story
  Agent performance depends on other strategies
  Ranking sensitive to tournament structure
Game-theoretic analysis
  Evaluate stability of strategy profiles
  Search profile space for plausible agent configurations
  Problems: Game procedurally defined, too large for standard analysis


Empirical Game-Theoretic Analysis

Experimental methodology for MAS, addressing
  Exploration of large strategy spaces
  Use of simulation in lieu of analytic models
Employing game-theoretic criteria for assessing strategies and guiding search


TAC/SCM Analysis

Strategy space
  Select TAC agents from online repository: Deep Maize, GoBlueOval, Mertacor, MinneTAC, PhantAgent, TacTex, many others
Profile space
  In general, S^N profiles; with symmetry, “only” choose(N+S–1, N)
  With N = S = 6, there are 462 distinct symmetric profiles
Reduced games
  Consider a reduced 3-player version of the 6-player TAC/SCM game
  Each player controls 2 agents
  Less fidelity, more tractable
  With S = 6, 56 profiles
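The profile counts above can be checked with a few lines of Python. This is only an arithmetic check; the function names are illustrative and not part of any TAC tooling.

```python
from math import comb

def num_profiles(n_players: int, n_strategies: int) -> int:
    """Profiles when players are distinguishable: S^N."""
    return n_strategies ** n_players

def num_symmetric_profiles(n_players: int, n_strategies: int) -> int:
    """Profiles up to permutation of players: choose(N + S - 1, N)."""
    return comb(n_players + n_strategies - 1, n_players)

print(num_profiles(6, 6))            # 46656 ordered profiles
print(num_symmetric_profiles(6, 6))  # 462 for the full 6-player game
print(num_symmetric_profiles(3, 6))  # 56 for the reduced 3-player game
```

Each profile needs many simulated games to estimate payoffs, so shrinking the space from 462 to 56 profiles is what makes the reduced game tractable.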


Deviation Graph


Deviation Edge Explained

Best deviation from the all-DM (Deep Maize) profile is to Mr (Mertacor)

Value of deviation represented in concentric levels
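A deviation edge can be computed directly from an empirical payoff table. The sketch below assumes a symmetric 2-player game with three placeholder strategies A, B, C and made-up payoffs; the real analysis uses TAC/SCM agents and simulation-estimated payoffs, which are not reproduced here. `PAYOFFS`, `deviation_gain`, and `best_deviation` are hypothetical names.

```python
# Hypothetical payoff table for a symmetric 2-player game (numbers are made up).
# Key: profile as a sorted tuple; value: payoff to each strategy in that profile.
PAYOFFS = {
    ("A", "A"): {"A": 3.0},
    ("A", "B"): {"A": 1.0, "B": 4.0},
    ("A", "C"): {"A": 2.0, "C": 3.5},
    ("B", "B"): {"B": 2.0},
    ("B", "C"): {"B": 2.5, "C": 1.0},
    ("C", "C"): {"C": 1.5},
}
STRATEGIES = ["A", "B", "C"]

def deviation_gain(profile, current, new):
    """Payoff change for one player who switches from `current` to `new`."""
    others = list(profile)
    others.remove(current)  # remove one copy of the deviator's strategy
    deviated = tuple(sorted(others + [new]))
    return PAYOFFS[deviated][new] - PAYOFFS[tuple(sorted(profile))][current]

def best_deviation(profile):
    """Return (gain, current, new) for the most beneficial unilateral deviation."""
    options = [
        (deviation_gain(profile, cur, new), cur, new)
        for cur in set(profile)
        for new in STRATEGIES
        if new != cur
    ]
    return max(options)

print(best_deviation(("A", "A")))  # (1.0, 'A', 'B'): edge from all-A toward B
print(best_deviation(("B", "B")))  # best gain is negative, so all-B is stable
```

In a deviation graph, each profile would get one outgoing edge to the profile reached by its best deviation, and profiles whose best gain is not positive have no outgoing edge.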


2006 Deviation Graph


Ranking Analysis

Select a background “other-agent” context
  Criterion: stability
  Evolutionary or game-theoretic
Evaluate payoff of each agent in this context
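One way to read this procedure: fix a background mixture, compute each agent's expected payoff against it, and rank agents by gain relative to the mixture's own payoff. The sketch below assumes a symmetric 2-player game with placeholder strategies and made-up numbers, which is much simpler than the 6-player TAC/SCM setting. `PAYOFF`, `BACKGROUND`, and both function names are hypothetical.

```python
# PAYOFF[mine][theirs]: made-up payoffs for a symmetric 2-player game.
PAYOFF = {
    "A": {"A": 1.0, "B": 3.0, "C": 2.0},
    "B": {"A": 3.0, "B": 1.0, "C": 0.0},
    "C": {"A": 2.0, "B": 1.0, "C": 1.0},
}
# Assumed background context: an equal mixture of A and B.
BACKGROUND = {"A": 0.5, "B": 0.5}

def payoff_vs_mixture(strategy, mixture):
    """Expected payoff of `strategy` when the opponent is drawn from `mixture`."""
    return sum(p * PAYOFF[strategy][other] for other, p in mixture.items())

def ranking(mixture):
    """Rank every strategy by deviation gain relative to the background's own payoff."""
    baseline = sum(p * payoff_vs_mixture(s, mixture) for s, p in mixture.items())
    gains = {s: payoff_vs_mixture(s, mixture) - baseline for s in PAYOFF}
    return sorted(gains.items(), key=lambda kv: kv[1], reverse=True)

print(ranking(BACKGROUND))  # [('A', 0.0), ('B', 0.0), ('C', -0.5)]
```

Here the background happens to be an equilibrium of the toy game, so the strategies in its support have zero gain and every other strategy has a non-positive gain.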


Comparing 05 and 06

Background context: 05 agent mixture
  Deep Maize     8.3%
  Mertacor      43.1%
  PhantAgent    31.4%
  TacTex        17.2%

Deviation gain ($M)
  06 Agent         2005 Server    2006 Server
  PhantAgent       5.55           6.57
  TacTex           5.07           4.73
  Deep Maize SF    4.22           4.56

Conclusion: substantial gains in 06


NE-Response Ranking

Agent               Deviation Gain    Finals 05    Finals 06
TacTex 06            0                n/a           5.85
PhantAgent 06        0                n/a           4.15
Deep Maize 06 SF     0                n/a           n/a
Mertacor 05         -0.57             0.55          n/a
Deep Maize 06 F     -0.95             n/a           3.58
Maxon 06 S          -1.03             n/a           n/a
MinneTAC 05         -1.23            -0.31          n/a
PhantAgent 05       -1.51             n/a           n/a
Deep Maize 05       -3.18            -0.22          n/a
MinneTAC 06         -3.48             n/a          -2.70
TacTex 05           -5.96             4.74          n/a


Searching For Deep Maize 2008

Find equilibrium among all identified strategies
Search for new strategy with beneficial deviation
Add new strategy to mix, derive new equilibrium
Repeat until tournament deadline
Select strategy from final equilibrium
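The loop above can be sketched on a toy problem. Everything specific here is an assumption for illustration: strategies are integer settings 0..10 of a single hypothetical parameter, `payoff` is a made-up function rather than a simulation result, the equilibrium step only looks for pure symmetric equilibria (a real analysis would also consider mixed ones), and `max_rounds` stands in for the tournament deadline.

```python
# Hypothetical strategy space: integer settings 0..10 of a single parameter.
CANDIDATES = range(11)

def payoff(x: int, y: int) -> float:
    """Made-up payoff to a player using x when the opponent uses y."""
    return -float((3 * x - y - 12) ** 2)

def gain(new: int, base: int) -> float:
    """Deviation gain for one player who leaves the all-`base` profile for `new`."""
    return payoff(new, base) - payoff(base, base)

def pure_symmetric_equilibrium(strategies):
    """Return s such that no strategy in the set gains by deviating from all-s."""
    for s in strategies:
        if all(gain(t, s) <= 0 for t in strategies):
            return s
    return None  # simplification: mixed equilibria are not searched

def search(initial, max_rounds=20):
    strategies = list(initial)
    for _ in range(max_rounds):  # stand-in for "until tournament deadline"
        eq = pure_symmetric_equilibrium(strategies)
        if eq is None:
            break
        # Search step: exhaustive here; a real search would explore agent parameters.
        best = max(CANDIDATES, key=lambda t: gain(t, eq))
        if gain(best, eq) <= 0:
            break  # no beneficial deviation found
        strategies.append(best)  # add to mix, then re-derive the equilibrium
    return pure_symmetric_equilibrium(strategies), strategies

print(search([0]))  # (6, [0, 4, 5, 6])
```

Starting from the single strategy 0, the loop adds 4, then 5, then 6, and stops at 6 because no candidate gains by deviating from the all-6 profile.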


Other Market Domains

TAC Travel (multiple interdependent auctions)
Abstract market models
  Simultaneous ascending auctions
  Continuous double auctions
Keyword Ad Auctions for Sponsored Search


Keyword Ad Auctions

Markets in targeted advertising
Complex strategic issues for advertiser bidders


Announcing: TAC Ad Auction Game

Coming Summer 2009


Microsoft Research Faculty Summit 2008