73
© Fraunhofer 2021 Dr. Adam Porter SHOULD YOU RELY ON THAT AI? Session 2: Moving to a Full-lifetime Testing Approach

SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

  • Upload
    others

  • View
    2

  • Download
    0

Embed Size (px)

Citation preview

Page 1: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

© Fraunhofer 2021Dr. Adam Porter

SHOULD YOU RELY ON THAT AI?

Session 2: Moving to a Full-lifetime Testing Approach

Page 2: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

R 23G 156B 125

R 242G 148B 0

R 31G 130B 192

R 226G 0B 26

R 177G 200B 0

R 254G 239B 214

R 225G 227B 227

Panel Members

Dr. Mikael Lindvall Dr. Mikael Lindvall, is the Technology Director of Fraunhofer USA, Center Mid-Atlantic (CMA) and an adjunct professor at UMCP for more than 10 years. He recently created and delivered one of the first courses on Software Engineering for AI-based systems.

Dr. Jeffrey W. Herrmann is a professor at the University of Maryland, where he holds a joint appointment with the Department of Mechanical Engineering and the Institute for Systems Research.  He is the director of the Reliability Engineering Graduate Program and the director of the Systems Engineering Graduate Program at the University of Maryland.

Dr. Laura Freeman is a Research Associate Professor of Statistics and the Director of the Intelligent Systems Lab at the Virginia Tech Hume Center. Previously, she was the Assistant Director of the Operational Evaluation Division at the Institute for Defense Analyses.

Dr. Douglas C. Schmidt is the Cornelius Vanderbilt Professor of Computer Science, Associate Provost for Research Development and Technologies, Co-Chair of the Data Science Institute, and a Senior Researcher at the Institute for Software Integrated Systems, all at Vanderbilt University.

Page 3: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

© Fraunhofer 2021

No One Should Rely on Artificial Intelligence

Software safety is hard to achieve, even for systems that don’t use AIAI is used to manage dynamism, complexity, and uncertaintyThere’s no reason to believe that AI makes the safety problem easier

Page 4: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

© Fraunhofer 2021

Rely on the System, not on System Components

AI is a system component Don’t rely on AI; try to build a system you can rely onRequires that systems be architected to include:

Effective safeguards and harm mitigation, Monitoring and testing throughout the full life cycle, and Human accountability

Page 5: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

© Fraunhofer 2021

Eternal Vigiliance is the Price of Liberty

AI can learn over time; data sets grow and change rapidlyNo point in time where we can safely turn off monitoring and surveillanceFull lifecycle monitoring and testing is a fundamental requirementThis is a big data problem, not a pass/fail testing problem

Page 6: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

© Fraunhofer 2021

The System Includes People, Organizations, and Missions

AI systems make decisions that affect people Poor AI systems risks:

MalpracticeUnfairnessDiminishing agencyRejection by humans

Page 7: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

R 23G 156B 125

R 242G 148B 0

R 31G 130B 192

R 226G 0B 26

R 177G 200B 0

R 254G 239B 214

R 225G 227B 227

Panel Members

Dr. Mikael Lindvall Dr. Mikael Lindvall, is the Technology Director of Fraunhofer USA, Center Mid-Atlantic (CMA) and an adjunct professor at UMCP for more than 10 years. He recently created and delivered one of the first courses on Software Engineering for AI-based systems.

Dr. Jeffrey W. Herrmann is a professor at the University of Maryland, where he holds a joint appointment with the Department of Mechanical Engineering and the Institute for Systems Research.  He is the director of the Reliability Engineering Graduate Program and the director of the Systems Engineering Graduate Program at the University of Maryland.

Dr. Laura Freeman is a Research Associate Professor of Statistics and the Director of the Intelligent Systems Lab at the Virginia Tech Hume Center. Previously, she was the Assistant Director of the Operational Evaluation Division at the Institute for Defense Analyses.

Dr. Douglas C. Schmidt is the Cornelius Vanderbilt Professor of Computer Science, Associate Provost for Research Development and Technologies, Co-Chair of the Data Science Institute, and a Senior Researcher at the Institute for Software Integrated Systems, all at Vanderbilt University.

Page 8: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

Challenges of Certifying Adaptive Dynamic Computing Environments

Douglas C. Schmidt [email protected]

www.dre.vanderbilt.edu/~schmidt

Institute for Software Integrated Systems

Vanderbilt University Nashville, Tennessee, USA

Page 9: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

Motivating Example: DDG 1000

Page 10: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

Motivating Example: DDG 1000

“The right answer delivered too late becomes the wrong answer”

Page 11: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

Motivating Example: DDG 1000

“Captain James Kirk Takes Command of the Navy’s New $4 Billion Destroyer”

www.thefiscaltimes.com/2016/05/20/Real-Captain-Kirk-Takes-Command-Navy-s-New-4-Billion-Destroyer

Page 12: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

DDG 1000 has a “Total Ship Computing Environment”

www.raytheon.com/capabilities/products/zumwalt

Page 13: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

DDG 1000 has a “Total Ship Computing Environment”

•A single, encrypted network of 500+ multi-core computers that controls all shipboard computing applications

www.raytheon.com/capabilities/products/zumwalt

Page 14: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

DDG 1000 has a “Total Ship Computing Environment”

•A single, encrypted network of 500+ multi-core computers that controls all shipboard computing applications• e.g., ranging from the ship’s lights & machinery control to its radars, weapon systems, & crew entertainment

www.raytheon.com/capabilities/products/zumwalt

Page 15: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

DDG 1000 has a “Total Ship Computing Environment”

•A single, encrypted network of 500+ multi-core computers that controls all shipboard computing applications

•The TSCE's high degree of automation enables the ship to run more effectively & efficiently & dramatically reduces manning requirements

www.raytheon.com/capabilities/products/zumwalt

Page 16: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

The DDG 1000 TSCE Enables Dynamic Resource Management

80% 90%

High Low MediumInfo Warfare Land AttackAir Defense

Computing Network

Resources

Mission Priority

Resource allocation can be optimized dynamically across the TSCE

Page 17: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

The DDG 1000 TSCE Enables Dynamic Resource Management

80% 90%

Low High Medium

Computing Network

Resources

Mission Priority

Resource allocation can be optimized dynamically across the TSCE

Info Warfare Land AttackAir Defense

Page 18: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

• Proprietary UYK-43s computers

TSCE is a Major Improvement on Legacy Combat Systems

100%

Page 19: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

• Proprietary UYK-43s computers• Point-to-point interconnects

TSCE is a Major Improvement on Legacy Combat Systems

100%

Page 20: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

• Proprietary UYK-43s computers• Point-to-point interconnects• Limited growth capability

TSCE is a Major Improvement on Legacy Combat Systems

100%

Page 21: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

• Proprietary UYK-43s computers• Point-to-point interconnects• Limited growth capability• Tightly constrained by stove-pipe subsystems

TSCE is a Major Improvement on Legacy Combat Systems

Page 22: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

• Proprietary UYK-43s computers• Point-to-point interconnects• Limited growth capability• Tightly constrained by stove-pipe subsystems• Highly vulnerable to damage

TSCE is a Major Improvement on Legacy Combat Systems

100%

Page 23: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

• Proprietary UYK-43s computers• Point-to-point interconnects• Limited growth capability• Tightly constrained by stove-pipe subsystems• Highly vulnerable to damage

Resources allocated statically at initial system configuration!

TSCE is a Major Improvement on Legacy Combat Systems

100% 100%

Low Medium Medium

Computing Network

Resources

Mission PriorityInfo Warfare Land AttackAir Defense

100%

Page 24: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

• Static allocation is problematic in several scenarios

TSCE is a Major Improvement on Legacy Combat Systems

100%

Low Medium Medium

Computing Network

Resources

Mission PriorityInfo Warfare Land AttackAir Defense

100%

Page 25: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

• Static allocation is problematic in several scenarios• When # of threats exceed design parameters

TSCE is a Major Improvement on Legacy Combat Systems

“> N” Air Threats

100%

Low Medium Medium

Computing Network

Resources

Mission PriorityInfo Warfare Land AttackAir Defense

100%

Page 26: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

• Static allocation is problematic in several scenarios• When # of threats exceed design parameters• When resources are damaged/degraded

TSCE is a Major Improvement on Legacy Combat Systems

Low Medium Medium

Computing Network

Resources

Mission PriorityInfo Warfare Land AttackAir Defense “> N” Air

Threats

70% 60%

Page 27: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

• Static allocation is problematic in several scenarios• When # of threats exceed design parameters• When resources are damaged/degraded

Dynamic resource management (DRM) can help both these scenarios

TSCE is a Major Improvement on Legacy Combat Systems

Low High Low

Computing Network

Resources

Mission PriorityInfo Warfare Land AttackAir Defense “> N” Air

Threats

70% 60%

Page 28: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

InfoWarfare

LandAttack

DARPA ARMS Program Created DRM for DDG 1000 TSCE

www.atl.external.lmco.com/programs/arms.php

Page 29: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

InfoWarfare

LandAttack

High High

Computing Network

Resources

Mission PriorityInfo Warfare Land AttackAir Defense QoS

QoS

Info Warfare Application

Land Attack Application

Requested QoS

Network latency & bandwidth

Workload & Replicas

CPU & memory

Connections & priority bands

Network latency & bandwidth

Workload & Replicas

CPU & memory

Connections & priority bands

Control Vars.

Control

Algorithm

Control

Algorithm

Control

Algorithm

Control

Algorithm

Control

Algorithm

Control

AlgorithmMeasured QoS

Network latency & bandwidth

Workload & Replicas

CPU & memory

Connections & priority bands

DARPA ARMS Program Created DRM for DDG 1000 TSCE

Low

100% 100%

• ARMS created ensemble-based bin-packing DRM middleware

See www.dre.vanderbilt.edu/~schmidt/PDF/JSS-2006.pdf

Page 30: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

InfoWarfare

LandAttack

High High

Computing Network

Resources

Mission PriorityInfo Warfare Land AttackAir Defense QoS

QoS

Info Warfare Application

Land Attack Application

Requested QoS

Network latency & bandwidth

Workload & Replicas

CPU & memory

Connections & priority bands

Network latency & bandwidth

Workload & Replicas

CPU & memory

Connections & priority bands

Control Vars.

Control

Algorithm

Control

Algorithm

Control

Algorithm

Control

Algorithm

Control

Algorithm

Control

AlgorithmMeasured QoS

Network latency & bandwidth

Workload & Replicas

CPU & memory

Connections & priority bands

DARPA ARMS Program Created DRM for DDG 1000 TSCE

Low

70% 60%

• ARMS created ensemble-based bin-packing DRM middleware • Automatically adapts to changes in mission conditions

See www.dre.vanderbilt.edu/~schmidt/PDF/autonomic-journal.pdf

Page 31: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

QoS

QoS

Air Defense Application

Land Attack Application

Requested QoS

Network latency & bandwidth

Workload & Replicas

CPU & memory

Connections & priority bands

Network latency & bandwidth

Workload & Replicas

CPU & memory

Connections & priority bands

Control Vars.

Control

Algorithm

Control

Algorithm

Control

Algorithm

Control

Algorithm

Control

Algorithm

Control

AlgorithmMeasured QoS

Network latency & bandwidth

Workload & Replicas

CPU & memory

Connections & priority bands

DARPA ARMS Program Created DRM for DDG 1000 TSCE

InfoWarfare

Air Threat

Low Medium

Computing Network

Resources

Mission PriorityInfo Warfare Land AttackAir Defense

100% 100%

• ARMS created ensemble-based bin-packing DRM middleware • Automatically adapts to changes in mission conditions• Ensures the allocation of computing & network resources accurately matches changing priorities of mission requirements

High

Page 32: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

Contributions of the DARPA ARMS Program

• Proved that dynamic resource allocation delivers significantly greater survivability of combat system functionality

Tactical Load & Available Resources

Dynamic:

Survivable

performance

Static: Inadequate

performance

Per

form

ance

Page 33: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

Contributions of the DARPA ARMS Program

• Proved that dynamic resource allocation delivers significantly greater survivability of combat system functionality

• Demonstrated effective & reliable dynamic allocation of processes/tasks to processors

QoS

QoS

Air Defense Application

Land Attack Application

Requested QoS

Network latency & bandwidth

Workload & Replicas

CPU & memory

Connections & priority bands

Network latency & bandwidth

Workload & Replicas

CPU & memory

Connections & priority bands

Control Vars.

Control

Algorithm

Control

Algorithm

Control

Algorithm

Control

Algorithm

Control

Algorithm

Control

AlgorithmMeasured QoS

Network latency & bandwidth

Workload & Replicas

CPU & memory

Connections & priority bands

Page 34: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

Contributions of the DARPA ARMS Program

• Proved that dynamic resource allocation delivers significantly greater survivability of combat system functionality

• Demonstrated effective & reliable dynamic allocation of processes/tasks to processors

• Demonstrated survivability of dynamic resource allocation system itself

QoS

QoS

Air Defense Application

Land Attack Application

Requested QoS

Network latency & bandwidth

Workload & Replicas

CPU & memory

Connections & priority bands

Network latency & bandwidth

Workload & Replicas

CPU & memory

Connections & priority bands

Control Vars.

Control

Algorithm

Control

Algorithm

Control

Algorithm

Control

Algorithm

Control

Algorithm

Control

AlgorithmMeasured QoS

Network latency & bandwidth

Workload & Replicas

CPU & memory

Connections & priority bands

Page 35: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

Contributions of the DARPA ARMS Program

• Proved that dynamic resource allocation delivers significantly greater survivability of combat system functionality

• Demonstrated effective & reliable dynamic allocation of processes/tasks to processors

• Demonstrated survivability of dynamic resource allocation system itself

QoS

QoS

Air Defense Application

Land Attack Application

Requested QoS

Network latency & bandwidth

Workload & Replicas

CPU & memory

Connections & priority bands

Network latency & bandwidth

Workload & Replicas

CPU & memory

Connections & priority bands

Control Vars.

Control

Algorithm

Control

Algorithm

Control

Algorithm

Control

Algorithm

Control

Algorithm

Control

AlgorithmMeasured QoS

Network latency & bandwidth

Workload & Replicas

CPU & memory

Connections & priority bands

However, the Navy could not deploy ARMS DRM-based systems due to challenges in certifying adaptive systems

Page 36: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

Contributions of the DARPA ARMS Program

QoS

QoS

Air Defense Application

Land Attack Application

Requested QoS

Network latency & bandwidth

Workload & Replicas

CPU & memory

Connections & priority bands

Network latency & bandwidth

Workload & Replicas

CPU & memory

Connections & priority bands

Control Vars.

Control

Algorithm

Control

Algorithm

Control

Algorithm

Control

Algorithm

Control

Algorithm

Control

AlgorithmMeasured QoS

Network latency & bandwidth

Workload & Replicas

CPU & memory

Connections & priority bands

"We can’t certify that DDG-1000 doesn’t move into unstable, incorrect, or unsafe

operating configurations during system operations”

Page 37: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

Challenges of Adaptive Dynamic Computing Environments

• To ensure real-time predictable quality of service mission-/safety-critical DoD systems today are statically configured

www.acq.osd.mil/se/docs/DoD-Safety-Data-Package-Preparation-Guide.pdf

Page 38: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

Challenges of Adaptive Dynamic Computing Environments

• To ensure real-time predictable quality of service mission-/safety-critical DoD systems today are statically configured

• There are a range of time-proven techniques for certifying statically configured systems

Page 39: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

Challenges of Adaptive Dynamic Computing Environments

• To ensure real-time predictable quality of service mission-/safety-critical DoD systems today are statically configured

• There are a range of time-proven techniques for certifying statically configured systems

• Dynamically managed systems deliver far greater efficiency & survivability

Page 40: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

Challenges of Adaptive Dynamic Computing Environments

• To ensure real-time predictable quality of service mission-/safety-critical DoD systems today are statically configured

• There are a range of time-proven techniques for certifying statically configured systems

• Dynamically managed systems deliver far greater efficiency & survivability

• However, DRM techniques can’t currently be deployed in mission-/safety-critical DoD combat systems because they are not certifiable via conventional methods

Page 41: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

Towards Certification of Adaptive Dynamic Computing Environments

• The Certification of Adaptive Dynamic Computing Environments (CADYNCE) was a follow-on to ARMS that focused on two topics:

Page 42: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

• The Certification of Adaptive Dynamic Computing Environments (CADYNCE) was a follow-on to ARMS that focused on two topics:• How to constrain ARMS DRM to make it more “certification friendly”+Land Attack Configs

+Air Defense Configs

+Info Warfare Configs

+Auto-Failover Configs

Pre-compute 100’s of certified configs that are then used dynamically

Towards Certification of Adaptive Dynamic Computing Environments

Page 43: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

• The Certification of Adaptive Dynamic Computing Environments (CADYNCE) was a follow-on to ARMS that focused on two topics:• How to constrain ARMS DRM to make it more “certification friendly”• How to create & integrate an assisted certification tool chain

Towards Certification of Adaptive Dynamic Computing Environments

Page 44: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

Increasingly StaticConfiguration Management

At runtime select from many automatically

generated, analyzed, tested, & pre-certified

configurations

At runtime generate new configurations based on

conditions that affect current configuration

Increasingly DynamicConfiguration Management

At runtime select from a few manually

generated pre-certified

configurations

Continuous adaptation through fine-grained

monitoring of resources & applications

CADYNCE focused on enabling certification of 100’s of configurations

that can be selected at runtime

Availability of 100’s of certified configurations demo’d benefits of DRM behavior, while maintaining the

assurance of certification, but more R&D is needed

No technical approach enabled certification of more dynamic

configuration management approaches

Towards Certification of Adaptive Dynamic Computing Environments

Page 45: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

Concluding Remarks

www.darpa.mil/program/building-resource-adaptive-software-systems

• Adaptive dynamic computing environments remain a key topic for research & practice in mission-/safety-critical systems

Page 46: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

Concluding Remarks• Adaptive dynamic computing environments remain a key topic for research & practice in mission-/safety-critical systems

• Adaptive dynamic computing environments aren’t deployed in DoD combat systems sincethey aren’t yet certifiable via conventional methods

QoS

QoS

Air Defense Application

Land Attack Application

Requested QoS

Network latency & bandwidth

Workload & Replicas

CPU & memory

Connections & priority bands

Network latency & bandwidth

Workload & Replicas

CPU & memory

Connections & priority bands

Control Vars.

Control

Algorithm

Control

Algorithm

Control

Algorithm

Control

Algorithm

Control

Algorithm

Control

AlgorithmMeasured QoS

Network latency & bandwidth

Workload & Replicas

CPU & memory

Connections & priority bands

Page 47: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

Concluding Remarks• Adaptive dynamic computing environments remain a key topic for research & practice in mission-/safety-critical systems

• Adaptive dynamic computing environments aren’t deployed in DoD combat systems sincethey aren’t yet certifiable via conventional methods

• It’s easier to pitch programs on adaptive computing than to pitch programs on certification of adaptive computing..

Page 48: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

R 23G 156B 125

R 242G 148B 0

R 31G 130B 192

R 226G 0B 26

R 177G 200B 0

R 254G 239B 214

R 225G 227B 227

Panel Members

Dr. Mikael Lindvall Dr. Mikael Lindvall, is the Technology Director of Fraunhofer USA, Center Mid-Atlantic (CMA) and an adjunct professor at UMCP for more than 10 years. He recently created and delivered one of the first courses on Software Engineering for AI-based systems.

Dr. Jeffrey W. Herrmann is a professor at the University of Maryland, where he holds a joint appointment with the Department of Mechanical Engineering and the Institute for Systems Research.  He is the director of the Reliability Engineering Graduate Program and the director of the Systems Engineering Graduate Program at the University of Maryland.

Dr. Laura Freeman is a Research Associate Professor of Statistics and the Director of the Intelligent Systems Lab at the Virginia Tech Hume Center. Previously, she was the Assistant Director of the Operational Evaluation Division at the Institute for Defense Analyses.

Dr. Douglas C. Schmidt is the Cornelius Vanderbilt Professor of Computer Science, Associate Provost for Research Development and Technologies, Co-Chair of the Data Science Institute, and a Senior Researcher at the Institute for Software Integrated Systems, all at Vanderbilt University.

Page 49: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

© Fraunhofer 2019

Dr. Mikael LindvallFraunhofer USA CMA

AI-BASED SYSTEMS NEED BETTER ENGINEERING TOOLS

Page 50: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

© Fraunhofer 2019

Traditional Tools and Methods Don't Always Work With AI

▪ Visualization and Analysis▪ Requirements ▪ Testing

Page 51: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

© Fraunhofer 2019

Visualization and Analysis of Learned Behavior

Incorrect/New behavior

Page 52: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

© Fraunhofer 2019

Requirements and Testing of AI and Autonomous Behavior

Page 53: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

© Fraunhofer 2019

Testing and Protecting AI-Based Image Recognition Systems

Clean

Adversarial

Ocean Cruiser

Submarine

Carrier

Ocean Cruiser

Rotation – 0.5 degrees

Rotation – 0.5 degrees

Page 54: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

© Fraunhofer 2019

AI-Based Systems Need Better Engineering Tools and Methods

• That provide visibility into neural networks

• That allow us to model and simulate requirements

• That allow us to generate test cases and identify oddities in system behavior

• During testing and runtime

Page 55: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

© Fraunhofer 2019

WANT TO HEAR MORE?

For more information [email protected]

Page 56: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

R 23G 156B 125

R 242G 148B 0

R 31G 130B 192

R 226G 0B 26

R 177G 200B 0

R 254G 239B 214

R 225G 227B 227

Panel Members

Dr. Mikael Lindvall Dr. Mikael Lindvall, is the Technology Director of Fraunhofer USA, Center Mid-Atlantic (CMA) and an adjunct professor at UMCP for more than 10 years. He recently created and delivered one of the first courses on Software Engineering for AI-based systems.

Dr. Jeffrey W. Herrmann is a professor at the University of Maryland, where he holds a joint appointment with the Department of Mechanical Engineering and the Institute for Systems Research.  He is the director of the Reliability Engineering Graduate Program and the director of the Systems Engineering Graduate Program at the University of Maryland.

Dr. Laura Freeman is a Research Associate Professor of Statistics and the Director of the Intelligent Systems Lab at the Virginia Tech Hume Center. Previously, she was the Assistant Director of the Operational Evaluation Division at the Institute for Defense Analyses.

Dr. Douglas C. Schmidt is the Cornelius Vanderbilt Professor of Computer Science, Associate Provost for Research Development and Technologies, Co-Chair of the Data Science Institute, and a Senior Researcher at the Institute for Software Integrated Systems, all at Vanderbilt University.

Page 57: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

AI Systems as OrganizationsJeffrey W. Herrmann

1. Overview2. Case study: IMPACT (Behymer et al., 2017)3. Implications

57

Behymer, Kyle, Clayton Rothwell, Heath Ruff, Michael Patzek, Gloria Calhoun, Mark Draper, Scott Douglass, Derek Kingston, and Doug Lange. Initial Evaluation of the Intelligent Multi-UxV Planner with Adaptive Collaborative/Control Technologies (IMPACT). Infoscitex Corp. Beavercreek, Ohio, 2017.AFRL report number: AFRL-RH-WP-TR-2017-0011

Page 58: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

58

The IMPACT user interface (Behymer et al., 2017)

Page 59: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

59

Play calling flowchart (Behymer et al., 2017)

CCA = Cooperative control algorithmIA = Intelligent agentHAI = Human-autonomy interface

Page 60: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

60

The organization included three personnel, an intelligent agent, and an algorithm.

Commander

Vehicleoperator

Sensoroperator

Intelligentagent

Resumenormal patrol

Send UAV to critical facility

Unidentified vessel is a

fishing boat

Cooperative control

algorithm

Page 61: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

61

Receive command

Call play

Generatesolutions

Evaluatesolutions

Modify plan

Executeplan

Monitorexecution

Selectplan

This conceptual model describes the key activities in the decision-making system.

Page 62: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

62

Receive command

Call play

Generatesolutions

Evaluatesolutions

Modify plan

Executeplan

Monitorexecution

Selectplan

What additional activities are needed to control system performance?

Monitorperformance

Modifyalgorithms

Page 63: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

R 23G 156B 125

R 242G 148B 0

R 31G 130B 192

R 226G 0B 26

R 177G 200B 0

R 254G 239B 214

R 225G 227B 227

Panel Members

Dr. Mikael Lindvall Dr. Mikael Lindvall, is the Technology Director of Fraunhofer USA, Center Mid-Atlantic (CMA) and an adjunct professor at UMCP for more than 10 years. He recently created and delivered one of the first courses on Software Engineering for AI-based systems.

Dr. Jeffrey W. Herrmann is a professor at the University of Maryland, where he holds a joint appointment with the Department of Mechanical Engineering and the Institute for Systems Research.  He is the director of the Reliability Engineering Graduate Program and the director of the Systems Engineering Graduate Program at the University of Maryland.

Dr. Laura Freeman is a Research Associate Professor of Statistics and the Director of the Intelligent Systems Lab at the Virginia Tech Hume Center. Previously, she was the Assistant Director of the Operational Evaluation Division at the Institute for Defense Analyses.

Dr. Douglas C. Schmidt is the Cornelius Vanderbilt Professor of Computer Science, Associate Provost for Research Development and Technologies, Co-Chair of the Data Science Institute, and a Senior Researcher at the Institute for Software Integrated Systems, all at Vanderbilt University.

Page 64: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website
Page 65: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

Page 66: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

•••

••

••••••

••••••

Page 67: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website
Page 68: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website
Page 69: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

••••

Page 70: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

Trained to detect planes in Southern California

Mission to detect in Northern California

Page 71: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

Increasing Cost

Page 72: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

••

••

Page 73: SHOULD YOU RELY ON THAT AI? - UMD ARLIS Website

© Fraunhofer [email protected] Porter

Q & A