11
CS-E4800 Artificial Intelligence Jussi Rintanen Department of Computer Science Aalto University March 16, 2017 Games game method status Sudoku Constraints straightforward to solve optimally Minesweeper Constraints straightforward to solve well Tic-Tac-Toe Minimax straightforward to solve optimally Rush Hour A * , IDA * etc. straightforward to solve optimally Chess Minimax Deep Blue beats best humans in 1997 Checkers Minimax solved in 2007 (J. Schaeffer, Chinook) Bridge Monte Carlo GIB, WBridge5 close to humans 1998- Go Monte Carlo AlphaGo beats best humans in 2016 Poker Reinforcement L Liberatus beats professionals in 2017 Industrial Applications Examples of existing and emerging applications in: 1 Commerce 2 Autonomous Vehicles 3 Distributed Systems (Power, Telecom) 4 Software Industry Combinatorial Auctions in Procurement Sandholm, Expressive Commerce and Its Application to Sourcing, AI Mag 2007 Company buys goods from many suppliers: multi-round negotiations agreements on small details of the deals dependencies: contracts with supplier 1 and 2 must be compatible ordering of the negotiations problematic coordination of the whole process difficult optimality of the results questionable

Games CS-E4800 Arti cial Intelligence...2 Medium-term and long-termtask planning (This course: planning, scheduling, constraint solving) Motion planning and execution intersection

  • Upload
    others

  • View
    2

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Games CS-E4800 Arti cial Intelligence...2 Medium-term and long-termtask planning (This course: planning, scheduling, constraint solving) Motion planning and execution intersection

CS-E4800 Artificial Intelligence

Jussi Rintanen

Department of Computer ScienceAalto University

March 16, 2017

Games

game method status

Sudoku Constraints straightforward to solve optimallyMinesweeper Constraints straightforward to solve wellTic-Tac-Toe Minimax straightforward to solve optimallyRush Hour A∗, IDA∗ etc. straightforward to solve optimallyChess Minimax Deep Blue beats best humans in 1997Checkers Minimax solved in 2007 (J. Schaeffer, Chinook)Bridge Monte Carlo GIB, WBridge5 close to humans 1998-Go Monte Carlo AlphaGo beats best humans in 2016Poker Reinforcement L Liberatus beats professionals in 2017

Industrial Applications

Examples of existing and emerging applications in:

1 Commerce2 Autonomous Vehicles3 Distributed Systems (Power, Telecom)4 Software Industry

Combinatorial Auctions in ProcurementSandholm, Expressive Commerce and Its Application to Sourcing, AI Mag 2007

Company buys goods from many suppliers:

multi-round negotiations

agreements on small details of the deals

dependencies: contracts with supplier 1 and 2must be compatible

ordering of the negotiations problematic

coordination of the whole process difficult

optimality of the results questionable

Page 2: Games CS-E4800 Arti cial Intelligence...2 Medium-term and long-termtask planning (This course: planning, scheduling, constraint solving) Motion planning and execution intersection

Combinatorial Auctions in Procurement

Early electronic sourcing systems:

Goods grouped into lots; lots auctioned separatelyBidding difficult because of dependencies:

seller: “I could sell lot A, or lot B, not both.”seller: “My price for A&B is lower than A + B.”

If dependency cannot be expressed, how to bid?Item 1: transport X from Helsinki to Oulu

Item 2: transport Y from Oulu to Helsinki

=⇒ Accurate bidding impossible!

Combinatorial Auctions in Procurement

Solution: Combinatorial auction

Bids for combinations of itemsBoth synergy and conflict expressible

A costs AC100, B costs AC100, A+B costs AC150A costs AC100, B costs AC100, A+B costs AC250

Critical for real-world application: Rich set oflanguage features:

Large number of related bids can be expressedcompactlyExpression of discounts of various formsSide constraints

Combinatorial Auctions in Procurement

(Images: Sandholm 2007)

Combinatorial Auctions in Procurement

Benefits to sellers:

Risk of winning conflicting items eliminated

Negotiation process far simpler

Benefits to buyer:

Sellers bid lower, due to eliminated risks ⇒cost-savings

Negotiation process weeks instead of months

Impact of changing constraints can be evaluatedeasily

Page 3: Games CS-E4800 Arti cial Intelligence...2 Medium-term and long-termtask planning (This course: planning, scheduling, constraint solving) Motion planning and execution intersection

Combinatorial Auctions in Procurement

Real-world impact:Very large sourcing events, up to

160000 items (of varying numbers)2.6 million bids300000 side constraints

In 2000 to 2006, sourcing for total of USD 35billion

Largest sourcing events USD 1.6 billion

(There are now several competing products.)

Key Technical Features

expressive modeling language enabling detailedexpression of bids

scalable search algorithms for solving very largecombinatorial auctions

Where is the A.I. in this?

Replace human negotiations with automated ones

Negotiations very complex, clearly requiresintelligence from humans

Clearing the auction optimally is NP-complete:requires specialized Mixed Integer LinearProgramming methods tailored for combinatorialauctions

Control Rooms

Page 4: Games CS-E4800 Arti cial Intelligence...2 Medium-term and long-termtask planning (This course: planning, scheduling, constraint solving) Motion planning and execution intersection

Control Rooms

Many industrial facilities employ few humans only, in acontrol room

Connection often through a SCADA system(Supervisory Control and Data Acquisition)

Controls operated through SCADA

Alarms and other messages come through SCADA

Earlier: control panels with buttons, levers, etc. &lamps and other electrical and mechanical displays

Now: computer displays, keyboards (sometimesjoysticks, levers)

Why Control Room?

Automation already covers much of systemfunctionality (Control Theory, basic computerautomation)

What remains is complex cognitive tasks, requiringdeeper and broader expertise (and intelligence).

Situational awareness (What is going on?)

Detect (non-routine) fault situations

Diagnose fault situations (What is wrong?)

Recover from fault situations (How to fix it?):Take control actions (through computer interface)Deploy human crew(s) to fix the problem

Control Rooms

Manufacturing:

many types of industrial plants, manufacturing

process industries (metal, oil, chemical)

Energy, communications and transportation:

power stations (nuclear, coal)

electricity networks (distribution, transmission)

cellular communication networks

rail networks (local, underground, long-distance)

metropolitan road-traffic (traffic light control)

Military, Space, ...

Control Rooms

One could also view the following as control rooms:

command deck of a ship

cockpit of an airplane

driver in a motor vehicle

Page 5: Games CS-E4800 Arti cial Intelligence...2 Medium-term and long-termtask planning (This course: planning, scheduling, constraint solving) Motion planning and execution intersection

General Approach: Models

Model each component typediscrete + continuous behaviorAlso faulty behavior must be modeled!

Model interaction between components ofdifferent types

Model how components connected (graph)More complex versions of languages like NDL

notion of component (or module)time (real, rational or integer)concurrency of events and actionsdiscrete and continuous change (hybrid systems)

General Approach: Reason about Models

Achieve model-based intelligence by

state-space search (BFS, Greedy Best First, A∗, ..)

constraint programming (IP, MILP, SAT, SMT, ...)

in order to

Analyze system: What behaviors are possible?

State estimation: Explain observations

Control: Which control actions to take?

The Smart Grid (Electricity)

Generation Transmission & Distribution Consumersmore renewables more proactive distributed generationless predictable better network utilization controllable demand

Future Distribution Networks

Future Distribution NetworksAutonomous, “self-aware”More advanced control (active, continuous)Why?• Multiple sources of power• Generation less predictable• More variation in consumption (electric cars!)

Advanced MeteringDemand Response

communications

co-generationplug-in electric vehicles

Page 6: Games CS-E4800 Arti cial Intelligence...2 Medium-term and long-termtask planning (This course: planning, scheduling, constraint solving) Motion planning and execution intersection

Future Distribution Networks

Problem:

More control is needed, in far smaller scale

Use of human operators not feasible (costs!)Control tasks are complicated:

Network topologies and devices very heterogenousControl heavily depends on larger context

Power Outages

Existing automation circuit-breakers:

Protective device opens switch upon detectingfault current

Recloser closes switch e.g. after 1 and 10 seconds

These handle faults without human intervention

This failing, control room operators take over:

Operate remote-controlled switches (to reduceoutage area)

Dispatch human crews to fix

Distribution Network Reconfiguration

Open and close (remotely-operated) switches tochange active network topology

Outage recovery: re-supply part of outage area

Loss reduction and load balancingObjective:

Radial/tree-like configuration maintained (no cycles!)Objective: maximize area supplied with powerObjective: minimize losses, balance loadsObjective: minimize switching-actions (wear and tear)

Distribution Network Reconfiguration

FAULT

Default configurationFault: Switch opened by protective deviceOutage!Re-supply part of network

Real-world problem complicated:

Complex urban network topologies

No-shed loads (hospitals!)

Capacity limits → transfer loadsbetween substations after initialreconfiguration

Page 7: Games CS-E4800 Arti cial Intelligence...2 Medium-term and long-termtask planning (This course: planning, scheduling, constraint solving) Motion planning and execution intersection

Distribution Network Reconfiguration

state-space search (A∗ etc.)constraint programming, including

Integer ProgrammingMixed-Integer Linear ProgrammingSAT (propositional logic)

genetic algorithms

ant colony optimization

neural networks

etc etc

Situational Awareness

Maintain an understanding of the state of thesystem, at an abstract level

Control room views and interprets event logs: allalarms and other messages from the devices withcommunications to the control room

Manufacturing plants, power stations, electricitynetworks, traffic networks, railway networks, airlineoperations, ...

Situational Awareness

Accidents caused/worsened by poor situationalawareness

airplane crashes (e.g. Air France AF447 in 2009)

nuclear power stations (Three Mile Island 1979)

A.I. to create, or assist in creating better situationalawareness

Situational Awareness

00:00:00 CB 1B A-B OPEN

00:00:00 CB 2B A-B OPEN

00:00:00 CB 2A A-B OPEN

00:00:00 CB 1A A-B OPEN

00:00:01 Line A-B KV LIMIT LOW

00:00:04 Line C-D KV LIMIT NORMAL

00:00:15 CB 1A A-B CLOSED

00:00:17 Line A-B KV LIMIT NORMAL

00:00:17 CB 1B A-B CLOSED

00:00:17 CB 2B A-B CLOSED

00:00:20 CB 2A A-B CLOSED

00:00:20 Line C-D KV LIMIT HIGH

Whole event log potentially useful

Many log entries can arise inshort time: difficult to understand

Need to focus on important parts

Causal connections between logentries

Model-based approach: interpretevent logs against a systemmodel, generate conciseexplanation of what is going on

Page 8: Games CS-E4800 Arti cial Intelligence...2 Medium-term and long-termtask planning (This course: planning, scheduling, constraint solving) Motion planning and execution intersection

Adaptive Management of Mobile Networks

Location and activity of mobile terminals (phones)change hour to hour, month to month

Need to adjust antenna: power, tilt (electric)

Future 5G networks have too many base stations→ need to automate

Adaptive Management of Mobile Networks

Solution:

Reinforcement learningLearn to adapt to traffic situation

Collect data on Quality of ServiceExplore antenna parameter valuesChoose best ones based ontime-of-day and variables

Need for Autonomy: Space

Cost of carrying crew is highfood, water, air, living quarters (weight, space)

Crew limits mission duration, reach

Ground-control limiting for distant spacecraft(light to Moon 1 sec, Mars 4 to 20 min)

Need for Autonomy: MilitaryDisadvantages of pilot (cost, functionality):

Pilot is not payloadLimits aircraft design (cockpit, catapult seat, oxygen)Maneuverability (acceleration limits)

Risk to pilot life eliminated by eliminating pilot

Remote control susceptible to jamming

“Autonomy is the biggest thing in military technology sincenuclear weapons”“Some experts say autonomous weapons are potentially weaponsof mass destruction”

“The availability on the black market of mass quantities of low-cost,

anti-personnel micro-robots that can be deployed by one person to

anonymously kill thousands or millions of people who meet the users targeting

criteria. Autonomous weapons are potentially weapons of mass destruction.”

Page 9: Games CS-E4800 Arti cial Intelligence...2 Medium-term and long-termtask planning (This course: planning, scheduling, constraint solving) Motion planning and execution intersection

Need for Autonomy: Ships

Crew salary costs

“75 to 96 per cent of marine accidents result fromhuman error, often because of fatigue”

piracy: no hostages to take!energy efficiency and construction costs:

wind resistance (command deck, living quarters)ventilation, heating, water, sewage

Remote-control feasible, but mostly unnecessary

Autonomous Systems: Implementation

Vehicles (air, space, land, water), robots, everything inbetween, have similar requirements for control:

1 Short-term motion planning and execution(Robotics courses)

2 Medium-term and long-term task planning(This course: planning, scheduling, constraint solving)

Motion planning and execution

intersection of Computer Science, Control Theory

Physical models of the device + environment

Search for best motion plan, based on the modelManaging uncertainty: discrepancy between

intended (predicted) behavior, andobservations.

Both include imprecision and uncertainty!Fusion of sense data and predicted behavior:

Particle filterKalman filterRecursive Bayesian estimation (Bayes filter)

Covered in robotics courses

Autonomous Vehicles

Waymo / Google Self-Driving Car

Control based on detailed 3D models created fromdetailed 3D model of all roads (acquired earlier!)LIDAR: Velodyne 64 beam laser HDL-64E (range: 120m; resolution: 2 cm)radars (range: 200 m; resolution: low)cameras (for traffic lights, signs)GPS

likely behaviors of pedestrians, bicyclists, othervehicles is learned

Humans responsible for 13 out of 14 crashes withGoogle Car; 14th was a minor collision with a bus

Page 10: Games CS-E4800 Arti cial Intelligence...2 Medium-term and long-termtask planning (This course: planning, scheduling, constraint solving) Motion planning and execution intersection

Autonomous Vehicles

Tesla’s Autopilot (semi-autonomous driving)

Cameras + radar for sensing

No detailed 3D model

$50000+ LIDAR too expensive; Tesla: “overkill”

Requires constant driver supervisionFatal Autopilot accident in 2016:

1 Large white truck+trailer turns, blocking the lane2 Tesla does not “see” it3 Driver distracted (watching DVD?), doesn’t intervene4 Car crashes under the truck’s trailer

Autonomous Spacecraft

Deep Space 1 probe 1997EUROPA planner (constraint-based temporal planner)EXEC plan-execution systemLivingstone diagnostics systemfull autonomy demonstratedDeep Space 2 crashed before deployment (mission: todrill to Mars soil, 1999)

Mars roversEarly rovers remote-control only (long comms delay)Curiosity rover has (partial) autonomy (2013-)

Autonomous Military Aircraft

Predator MQ-1 drone (1994-): remote-controlled

Reaper MQ-9 drone (2001-): can flypre-programmed routes; no combat autonomyPerdix drone (experimental 2014-):

purpose: surveillanceswarms of dozens of droneswingspan 30 cm, weight 290 g, speed 70 mph,flighttime 20 mindeveloped by MIT studentsless vulnerable than large drones (Reaper costs $18M)too expensive to control by pilot: full autonomy

Very active topic in military

Applications

Autonomous car: thousands of applications

Autonomous aircraft: thousands of applications

Human-shaped robot: thousands of applications

Compare: microprocessor

Given autonomous X,applications are at abstract level→ discrete problems→ state-space search, scheduling, constraints, ...

Page 11: Games CS-E4800 Arti cial Intelligence...2 Medium-term and long-termtask planning (This course: planning, scheduling, constraint solving) Motion planning and execution intersection

Systems that are Fleets of Vehicles

Fleets ofrobotsautonomous carsautonomous trucksshipsaircraftspacecraft

Needed intransportation and shippingmining, construction, agriculturemilitary

Large-scale planning, scheduling (state-spacesearch, constraint programming)

A.I. in Software Industry before 2010

Few, mostly minor and marginal examples:

Business rule systems, e.g. IBM ILOG jRulesBased on 1970ies OPS-5 rule systemsNot much intelligenceSolve a narrow problem

Business Process ManagementState-space models as in this courseIntegration with other software shallowNot widely adopted in the industry

Web-Service Composition and OrchestrationAutomatically combine services offered on InternetState-space search (AND-OR trees, conditional plans)Little industry adoption so far

Model-Based Software Development

1 Develop modeling language for product category2 Automate the use of the modeling language3 Software not programmed, but modeled

Advantages:

programming modelingcost expensive cheaptime slow fasterrors lots fewermodifiability bad goodextendibility bad good

Model-Based Software Development

Potential software categories:

Control software for physical systemsComputer games

3D physical world (single-player, multi-agent)Strategy

Model agents’ etc. behaviors, goals, ...

Information systems (my current project)logic-based representation of complex datasystem’s objectivessystem’s and users’ actionshigh-level software synthesis