1
A Systems Engineer’s Virtual Assistant (SEVA) A Systems Engineer’s Virtual Assistant (SEVA) Jitin Krishnan George Mason University Trevor Reed University of Southern California Patrick Coronado Direct Readout Lab GSFC Code 606.3 SEVA’s Architecture Abstract Systems Engineer's Virtual Assistant (SEVA) is a realtime, interactive system designed to assist a Systems Engineer in their daily work environment through complex information management and high level query answering which will augment their problemsolving abilities. SEVA collects information by ingesting various types of disciplinespecific documents including text, tables, graphs, and keyboard input. It uses natural language processing tools to convert the information into a knowledge base which is represented as an Ontology. It has the ability to handle information relating to schedule and resources. All information is timetagged and saved, so that older versions of modified information can still be queried. SEVA is a personal system that becomes attuned to the individual using it. The main function of SEVA is to make logical inferences and derive new information when needed in order to answer questions asked by the user. SEVA also has an important capability to remember scenarios as experiences, thus making the knowledge representation a function of questions, answers, and rules which in turn keeps it free of traditional case based structures. This research effort proposes an efficient combination of tools to be implemented in SEVA’s information extraction, ontology building, and reasoning processes in addition to testing the soundness and feasibility of the designed system. The end result of this research will be a conceptual architecture for SEVA. Introduction Interactive Dialogue USER INTERFACE INFERENCE ENGINE KNOWLEDGE BASE SEVA is more than just a typical Expert System ( Basic research and development) UPDATE DOCUMENTS GRAPHS TABLES COMMAND LINE USER / SYSTEMS ENGINEER NATURAL LANGUAGE PARSER QUERY ANSWER All humans are mortal. Trevor is a human. Is Trevor mortal? Yes. What is the density of aluminum? What is density? Density is mass divided by volume. The density of aluminum is 2.7 grams per cubic centimeter What is the mass of component X? 3 kilograms. What was the previous mass? 2.5 kilograms. When did the mass last change? July 1, 2015. INPUT Tools: Open IE, ClausIE, CSD IE, TANGO, pdf2table INFORMATION EXTRACTION TRIPLES (Subject, Predicate, Object) (Helium, is a, gas) (Fuel, is a part of, Rocket) (Helium, is used as, fuel) (a, b, c) ... Examples: Information from MEL, Scheduler, Excel,operations manual… Triple: (Helium, is a, gas) DICTIONARY Helium, … Sodium, is a, has component, liquid, … gas … Helium, typeof, gas ID: (1345) (745) (6341) Using a dictionary… Tools: WordNet … build an ontology Tools: RDF Triples / RDF Graph ONTOLOGY used as part of Helium Fuel Rocket property class has component matter gas part of type subclass of inverse of type type of TOOLS: OWLRL Concepts extracted from the input RULES Axiomatic Rules Defining relation of ‘same as’ being symmetric: If triple (x, same as, y) is true, then (y, same as , x) is also true. In OWLRL: T(?x, owl:sameAs, ?y) T(?y, owl:sameAs, ?x) Contextual Rules T(?x, has component, ?y) T(?y, part of, ?x) Rules based on input T(Helium, type of, gas) true T(Helium, used as, fuel) true Rules from experience IF: T( X, instance of, Aerogel), T(Y, instance of, Niacin molecule), T(Y, has speed, S), T(S, less than, 2), … THEN: T(X, can capture, Y) true ELSE: T(X, can capture, Y) false INFERENCE ENGINE Two main tasks: Check consistency of information in the ontology Answer queries Example of inference by Forward chaining Information in the ontology: T(Trevor, type of, Human) true T(Human,has property,Mortal) true Inferred information: T(Trevor, has property, Mortal) true Tools: BaseVISor, Jess, Jena, DLEJena, OWLIM, RuQAr QUERY Recall Example: What, When Yes/ No Example: Can, is, true/false S S O P * S: Subject, P: Predicate, O: Object Not enough information (Missing Concepts) A concept in the query(subject, predicate, or object) is not the ontology. Undefined; I don’t know (Neither true nor false) This applies only to yes/no questions where the ontology is missing the predicate link. Answer! 42 is the answer to the ultimate question of Life, the Universe, and Everything. RESPONSES EXPERIENCE Can Aerogel capture a Niacin molecule moving at 5 km/s? I don’t know.. Start a rule Enter condition Depth (Aerogel ) * Density(Aerogel)/ Density (Niacin molecule) > 8mm Enter next condition Speed (Niacin molecule) < 2 km/s Enter next condition Done Rule saved. No, Aerogel cannot capture a Niacin molecule moving at 5 km/s. ONTOLOGYUPDATE 2 Add/update rules Concept of Time SEVA handles three types of time information. First is the time of information entry or update. Timetagging of information enables SEVA to save and reference old information. Second is the format of time or date within the information. Third is the ability to understand intervals and tense. EXPERIENCE Triples are a basic form of knowledge representation in the form: (Subject, Predicate, Object) RDF Triples add a layer of context to information by assigning unique IDs to each concept These tools extract plain text triples of information from unstructured text and tables. Most information used by SE’s is in the form of text, tables, and graphs. User queries can be converted to SPARQL (SPARQL Protocol and RDF Query Language) which is used for basic querying of RDF triple databases OwlExporter can convert natural text to an OWL Ontology, but it requires annotations to be added manually by the user. Why it matters to NASA? Systems engineers deal with huge amounts of data in their everyday work. Their role in mission planning requires systems engineers to handle all of this information and to keep track of diverse requirements and changing variables. This extensive “bookkeeping” is both tedious and potentially dangerous, as it allows for the possibility of human error to leak into a project or mission. The ability of systems engineers could be greatly enhanced by a system that could handle the bookkeeping while leaving the creative problem solving to the engineer. SEVA is being developed with this goal in mind: to assist systems engineers and enhance their problemsolving abilities by keeping track of the huge amounts of information of a project and using the information to answer reasoning, recall, and schedule queries from the user. Systems engineers are a vital part of any successful NASA mission, so the research and development of SEVA is directly beneficial to the future goals and missions of NASA. Acknowledgements Edward G. Amatucci Aerospace Engineer NASA GSFC Code 592 Logan Brentzel & Nick Perkins Summer Interns NASA GSFC Code 606.3 Patrick Coronado Direct Readout Laboratory NASA GSFC Code 606.3 Thanks to everyone in the Direct Readout Laboratory & Special thanks to Sallie M. Smith and Maria Ealey NASA GSFC Code 606.3 GSFC Education Office NASA GSFC Code 160 Tools: SPARQL INTERACTIVE ENVIRONMENT Types of queries Types of responses ONTOLOGY UPDATE 1 All interactions with missing concepts. Example: Your name is SEVA. Lets user create their own rules. SEVA can use it in the future when the user makes similar queries with the same predicates. Example: Can Titanium capture an Inositol molecule moving at 2 km/s? Convert to rules for rule based reasoning Reasoning using forward/ backward chaining or hybrid inference engines Hypothetical Mode During the interaction, the user can command SEVA to enter hypothetical mode, where the ontology is changed only temporarily, toanswer ‘what if’ scenarios. How is SEVA different? Other Expert Systems? SEVA is designed in such a way that the knowledge base is free of traditional casebased structures. An important capability of SEVA is to remember scenarios as Experiences from the interaction with the user. This is similar to casebased reasoning but the structure is created in realtime and specific to the user. Usual expert systems, such as Medical Ontologies, reflect knowledge of an entire domain whereas SEVA reflects the knowledge of one specific systems engineer. This implies that SEVA’s ontology is constantly evolving with each interaction. SEVA’s ability to handle the concepts of time and ingest different types of input documents makes it a unique system. MBSE? – Does not do inference Although SEVA is not intended to be an alternative to MBSE as their problems and approaches are different, conceptually a mature SEVA can be inclusive of all functionalities of MBSE. Similarities Differences Both MBSE and SEVA transitionfrom paper based book keepingto electronic form. The target of MBSE is a mission, its life cycles, and the integrationof multiple domain knowledge. The target of SEVA is an individualSystems Engineer and thus its knowledge reflects the knowledge of the user rather than the knowledge of an entire domain. Both MBSE and SEVA try to reduce human error and conflicts. SEVA is designed to have a much easier learning curve as it operates just like an assistant, quite literally! This means that the user won’t have to spend days learning how to operate the system.

JitinKrishnan ASystemsEngineer’sVirtualAssistant(SEVA)

  • Upload
    others

  • View
    3

  • Download
    0

Embed Size (px)

Citation preview

A  Systems  Engineer’s  Virtual  Assistant  (SEVA)A  Systems  Engineer’s  Virtual  Assistant  (SEVA)Jitin  Krishnan

George  Mason  UniversityTrevor  Reed

University  of  Southern  California

Patrick  CoronadoDirect  Readout  LabGSFC  Code  606.3

SEVA’s  ArchitectureAbstractSystems Engineer's Virtual Assistant (SEVA) is a real-­‐‑time, interactive system designed to assist a SystemsEngineer in their daily work environment through complex informationmanagement and high level query-­‐‑answering which will augment their problem-­‐‑solving abilities. SEVA collects information by ingestingvarious types of discipline-­‐‑specific documents including text, tables, graphs, and keyboard input. It usesnatural language processing tools to convert the information into a knowledge base which is represented asan Ontology. It has the ability to handle information relating to schedule and resources. All information istime-­‐‑tagged and saved, so that older versions of modified information can still be queried. SEVA is apersonal system that becomes attuned to the individual using it. The main function of SEVA is to makelogical inferences and derive new information when needed in order to answer questions asked by the user.SEVA also has an important capability to remember scenarios as experiences, thus making the knowledgerepresentation a function of questions, answers, and rules which in turn keeps it free of traditional case-­‐‑based structures. This research effort proposes an efficient combination of tools to be implemented inSEVA’s information extraction, ontology building, and reasoning processes in addition to testing thesoundness and feasibility of the designed system. The end result of this research will be a conceptualarchitecture forSEVA.

Introduction

Interactive  Dialogue

USER  INTERFACE

INFERENCE  ENGINE

KNOWLEDGE  BASE

SEVA  is  more  than  just  a  typical  Expert  System(                Basic  research  and  development)

UPDATE

DOCUMENTS  GRAPHSTABLES

COMMAND  LINE

USER  /  SYSTEMS  ENGINEER

NATURAL  LANGUAGE  PARSER

QUERY

ANSWER

All  humans  are  mortal.  Trevor  is  a  human.  Is  Trevor  

mortal?

Yes.

What  is  the  density  of  aluminum?

What  is  density?

Density  is  mass  divided  by  volume.

The  density  of  aluminum  is  2.7  grams  per  cubic  centimeter

What  is  the  mass  of  component  X?

3  kilograms.

What  was  the  previous  mass?

2.5  kilograms.

When  did  the  mass  last  change?

July  1,  2015.

INPUT

Tools:  Open  IE,  ClausIE,  CSD-­‐‑IE,  TANGO,  pdf2table

INFORMATION  EXTRACTION

TRIPLES

(Subject,  Predicate,  Object)

(Helium,  is  a,  gas)

(Fuel,  is  a  part  of,  Rocket)

(Helium,  is  used  as,  fuel)

(a,  b,  c)

.  .  .

Examples:  Information  from  MEL,  Scheduler,  Excel,  operations  manual…

Triple:  (Helium,  is  a,  gas)

DICTIONARY

Helium,  …  Sodium,  is  a,  has  component,  liquid,  …gas  …

Helium,    type  of,  gasID:    (1345)        (745)      (6341)

Using  a  dictionary…

Tools:  WordNet

…  build  an  ontology

Tools: RDF  Triples/  RDF  Graph

ONTOLOGY

used  as

part  ofHelium

Fuel

Rocket

propertyclass

has  componentmatter

gaspart  of  

type

subclass  of inverse  of

type

type  of

TOOLS:  OWLRL

Concepts  extracted  from  the  input

RULES

Axiomatic  RulesDefining  relation  of  ‘same  as’  being  symmetric:

If  triple  (x,  same   as,  y)  is  true,  then  (y,  same   as  ,  x)  is  also  true.

In  OWLRL:  T(?x,  owl:sameAs,  ?y)  à T(?y,  owl:sameAs,  ?x)

Contextual  Rules  T(?x,  has  component,  ?y)  à T(?y,  part  of,  ?x)

Rules  based  on  inputT(Helium,  type  of,  gas)  à trueT(Helium,  used  as,  fuel)  à true  

Rules   from  experienceIF:    T(  X,  instance  of,  Aerogel),  T(Y,  instance  of,    Niacin  molecule),  T(Y,  has  speed,  S),  T(S,  less  than,  2),  …  THEN:  T(X,  can  capture,  Y)  à trueELSE:  T(X,  can  capture,  Y)  à false

INFERENCE  ENGINE

Two  main  tasks:  -­‐‑ Check  consistency  of  information  in  

the  ontology  -­‐‑ Answer  queries

Example   of  inference  by  Forward  chainingInformation   in  the  ontology:T(Trevor,  type  of,  Human)  à trueT(Human,  has  property,  Mortal)  à true

Inferred   information:  T(Trevor,  has  property,  Mortal)

à true

Tools:  BaseVISor,  Jess,  Jena,  DLEJena,  OWLIM,  RuQAr

QUERY

Recall  

Example:  What,  When

Yes/  No

Example:  Can,  is,  true/false

S S OP

*  S:  Subject,  P:  Predicate,  O:  Object

Not  enough   information(Missing  Concepts)

A  concept  in  the  query(subject,  predicate,  or  object)  is  not  the  

ontology.

Undefined;   I  don’t  know(Neither   true  nor  false)

This  applies  only  to  yes/no  questions  where  the  ontology  is  missing  the  predicate  link.  

Answer!

42 is  the  answer  to  the  ultimate  question  of  Life,  the  Universe,  and  Everything.

RESPONSES

EXPERIENCE

Can  Aerogel  capture  a  Niacin  molecule  moving  at  5  km/s?

I  don’t  know..  Start  a  rule

Enter  condition

Depth  (Aerogel  )  *    Density(Aerogel)  /  Density  (Niacin  molecule)  >  8mm

Enter  next  condition

Speed  (Niacin  molecule)  <  2  km/s

Enter  next  conditionDone

Rule  saved.  No,  Aerogel  cannot  capture  a  Niacin  molecule  moving  

at  5  km/s.

ONTOLOGY  UPDATE  2Add/update  rules

Concept  of  Time

SEVA handles three types of time information. First is thetime of information entry or update. Time-­‐‑tagging ofinformation enables SEVA to save and reference oldinformation. Second is the format of time or date within theinformation. Third is the ability to understand intervals andtense.

EXPERIENCE

Triples  are  a  basic  form  of  knowledge  representation  

in  the  form:  (Subject,  Predicate,  

Object)  

RDF  Triples  add  a  layer  of  context  to  information  by  assigning  unique  IDs  to  

each  concept

These  tools  extract  plain  text  triples  of  information  from  unstructured  text  

and  tables.Most  information  used  by  SE’s  is  in  the  form  of  text,  

tables,  and  graphs.

User  queries  can  be  converted  to  SPARQL  (SPARQL  Protocol  and  RDF  Query  Language)  which  is  used  for  basic  querying  of  RDF   triple  

databases

OwlExporter can  convert  natural  text  to  an  OWL  Ontology,  but  it  requires  annotations  

to  be  added  manually  by  the  user.

Why  it  matters  to  NASA?Systems engineers deal with huge amounts of data in their everyday work. Their role in mission planningrequires systems engineers to handle all of this information and to keep track of diverse requirements andchanging variables. This extensive “bookkeeping” is both tedious and potentially dangerous, as it allows forthe possibility of human error to leak into a project or mission. The ability of systems engineers could begreatly enhanced by a system that could handle the bookkeeping while leaving the creative problem-­‐‑solving to the engineer. SEVA is being developed with this goal in mind: to assist systems engineers andenhance their problem-­‐‑solving abilities by keeping track of the huge amounts of information of a projectand using the information to answer reasoning, recall, and schedule queries from the user. Systemsengineers are a vital part of any successful NASA mission, so the research and development of SEVA isdirectly beneficial to the future goals and missions ofNASA.

Acknowledgements

Edward  G.  AmatucciAerospace  EngineerNASA  GSFC  Code  592

Logan  Brentzel&  Nick  PerkinsSummer  Interns

NASA  GSFC  Code  606.3

Patrick  CoronadoDirect  Readout  LaboratoryNASA  GSFC  Code  606.3

Thanks  to  everyone  in  theDirect  Readout  Laboratory

&Special  thanks  to  

Sallie  M.  Smith  and  Maria  Ealey  NASA  GSFC  Code  606.3

GSFC  Education  OfficeNASA  GSFC  Code  160

Tools:  SPARQL

INTERACTIVE    ENVIRO

NMENT

Types  of  queries

Types  of  responses

ONTOLOGY  UPDATE  1All   interactions  with  missing  concepts.

Example:  Your  name   is  SEVA.

Lets  user  create  their  own  rules.SEVA  can  use  it  in  the  future  when  the  user  makes  

similar  queries  with  the  same  predicates.Example:  Can Titanium capture an  Inositol  

molecule moving  at  2  km/s?

Convert to  rules  for  rule  based  reasoning

Reasoning  using  forward/  backward  chaining or  hybrid  inference  engines

Hypothetical  Mode

During  the  interaction,  the  user  can  command  SEVA  to  enter  hypothetical  mode,  where  the  ontology  is  changed  only  temporarily,  to  answer  ‘what  if’  scenarios.

How  is  SEVA  different?Other  Expert  Systems?SEVA is designed in such a way that the knowledge base is free of traditional case-­‐‑based structures. An important capability of SEVA is toremember scenarios as Experiences from the interaction with the user. This is similar to case-­‐‑based reasoning but the structure iscreated in real-­‐‑time and specific to the user. Usual expert systems, such as Medical Ontologies, reflect knowledge of an entire domainwhereas SEVA reflects the knowledge of one specific systems engineer. This implies that SEVA’s ontology is constantly evolving with eachinteraction. SEVA’s ability to handle the concepts of time and ingest different types of input documentsmakes it a unique system.

MBSE?  – Does  not  do  inferenceAlthough  SEVA  is  not  intended  to  be  an  alternative  to  MBSE  as  their  problems  and  approaches  are  different,  conceptually  a  mature SEVA  can  be  inclusive  of  all  functionalities  of  MBSE.  

Similarities DifferencesBoth  MBSE  and  SEVAtransition  from  paper-­‐based  book-­‐keeping  to  electronic  form.

The  target  of  MBSE  is  a  mission,  its  life  cycles,  and  the  integration  of  multiple  domain  knowledge.The  target  of  SEVA  is  an  individual  Systems  Engineer  and  thus  its  knowledge  reflects  the  knowledge  of  the  user  rather  than  the  knowledge  of  an  entire  domain.

Both  MBSE  and  SEVA  try  to  reduce  human  error  and  conflicts.  

SEVA  is  designed  to  have  a  much  easier  learning  curve  as  it  operates  just  like  an  assistant,  quite  literally!  This  means  that  the  user  won’t  have  to  spend  days  learning  how  to  operate  the  system.