CBIS 2007: CB CKB with the GIG

Scott Kothenbeutel, Battelle PM
Moses Kamai, Battelle PI



CB CKB Overview

Description:
• Chemical-Biological Common Knowledge Base
• CB CKB is an application and virtual data store that allows users to publish, locate, and retrieve relevant CB data while maintaining stewardship of access and control.
• 3 phases to the project: 1st – implementation planning; 2nd – pilot system; 3rd – secure on-line system.
• Current effort is the 2nd phase, which includes development of a pilot system focusing on M&S resources and locally available data such as CDMD, Agent Fate, and ASK.

Client need addressed:
– Secure on-line common knowledge base of chemical and biological weapon related data, accessible and validated.
– Identification of gaps in CB related data.
– Process to accredit data sources.
– Foster communication within the CB community.

Future Potential:
• Baseline system for input into DTRA Program of Record

CB CKB Overview

• Phase 1: Implementation Plan (May 05 – Jun 06)
• Phase 2: Pilot System (Jun 06 – May 07)
• Phase 3: Secure On-line CB CKB (Jun 07 – TBD)

The key objectives for this program are:
• Identify and fuse CBIAC and other CB repositories relevant to the CB Community
• Implement and sustain a DoD information assured compliant system
• Implement proven and best practices for the CB Community
• Leverage subject matter experts and available tools to identify and analyze relevant CB data

[Diagram: system architecture. Labels: User Query; Index and Search Engine; User Portal; User Access; Web Services - Data; Application Access; Services - Directory; Data Advertisement; Learning Lexicon; RDBMS VDS; VDS Hybrid]

Identified Source Repositories:
• ASK: Access
• Agent Fate: Excel Spreadsheets on FTP Server
• CBIAC: Oracle RDBMS
• BACWORTH 2: Native XML DB
• ASK: Native XML (xIndice)
• JACKS: Unknown Doc Mgmt Sys (DMS)
• CB IDE: LiveLink DMS


Information Fusion Build Process

[Diagram: build process from the Identified Source Repositories to the VDS. Labels: Data Architect Station; Analyzer; Harvester; XML Docs/flat file; Reports; RDBMS VDS; VDS Hybrid]

Identified Source Repositories:
• Access
• Excel Spreadsheets on FTP Server
• Oracle RDBMS
• Native XML (xIndice)

Connectors shown:
• ODBC MS Access
• ODBC MS Excel
• ODBC Oracle
• XML:DB API (TBD), shown twice

Data Architect Station

• Networked PC using ODBC

• Analyzer accesses data source locally or remotely

• Outputs data files and relationships (heuristic matching)

• Builds data maps

[Diagram labels: Data Model; Oracle RDBMS; ODBC Oracle; Manual Adjustments; Export as flat file]
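
The Analyzer bullets above mention heuristic matching that outputs relationships and builds data maps. The slides do not say which heuristics are used, so the following is only a minimal sketch, assuming a score that averages column-name similarity and value overlap. The function names, the 0.6 threshold, the equal weighting, and the sample columns are all hypothetical.

```python
from difflib import SequenceMatcher


def _normalize(name: str) -> str:
    """Lower-case a column name and drop separators such as '_' or spaces."""
    return "".join(ch for ch in name.lower() if ch.isalnum())


def name_similarity(a: str, b: str) -> float:
    """Similarity of two column names in [0, 1]."""
    return SequenceMatcher(None, _normalize(a), _normalize(b)).ratio()


def value_overlap(xs, ys) -> float:
    """Jaccard overlap of the distinct values found in two columns."""
    sx, sy = set(xs), set(ys)
    if not sx or not sy:
        return 0.0
    return len(sx & sy) / len(sx | sy)


def match_columns(source_a, source_b, threshold=0.6):
    """Return a data map {column in A: best-matching column in B}.

    Each source is a dict of column name -> list of values. The score is an
    equal-weight average of name similarity and value overlap (an assumption,
    not the Analyzer's documented method).
    """
    data_map = {}
    for col_a, vals_a in source_a.items():
        best, best_score = None, 0.0
        for col_b, vals_b in source_b.items():
            score = 0.5 * name_similarity(col_a, col_b) + 0.5 * value_overlap(vals_a, vals_b)
            if score > best_score:
                best, best_score = col_b, score
        if best is not None and best_score >= threshold:
            data_map[col_a] = best
    return data_map


# Hypothetical extracts from two repositories with different naming conventions.
access_table = {"Agent_Name": ["agent-1", "agent-2", "agent-3"], "temp_c": [20, 25, 30]}
oracle_table = {"AGENTNAME": ["agent-2", "agent-1", "agent-3"], "SURFACE": ["sand", "soil", "glass"]}

print(match_columns(access_table, oracle_table))
```

Columns that fall below the threshold are left unmapped, which mirrors the "Manual Adjustments" step: a reviewer would resolve them by hand.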

Information Fusion Example

[Diagram: Federated Data Source built from the Identified Source Repositories. Other labels: User Query; Index and Search Engine; User Portal; User Access; Web Services - Data; Application Access; Services - Directory; Data Advertisement; Learning Lexicon; RDBMS VDS; VDS Hybrid; Phase 2]

Structured/Semi-Structured Data Sources:
• Access
• Excel Spreadsheets on FTP Server
• Oracle RDBMS
• Native XML DB
• Native XML (xIndice)

Unstructured Data Sources:
• Document Management Sys (DMS)
• LiveLink DMS

Chemical-Biological Material Effects Database (CBMED Extract)

Typical Analyzer Results

[Chart: Automated Heuristic Matching. X-axis: # of Records Analyzed (500, 1500, 2500). Y-axis: Time in Seconds (0 to 100000). Legend: Time Required; Matched Data Samples; Expon. (Time Required)]
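
The chart legend includes an exponential trendline, "Expon. (Time Required)". As a rough illustration of how such a trend can be fitted, the sketch below uses the common log-linear approach: ordinary least squares on ln(y) to recover y = a * exp(b * x). The data points are synthetic, generated from a known curve, and are not the measurements behind the chart; `fit_exponential` is a hypothetical helper name.

```python
import math


def fit_exponential(xs, ys):
    """Fit y = a * exp(b * x) by linear least squares on ln(y)."""
    n = len(xs)
    logs = [math.log(y) for y in ys]
    mean_x = sum(xs) / n
    mean_l = sum(logs) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxl = sum((x - mean_x) * (l - mean_l) for x, l in zip(xs, logs))
    b = sxl / sxx                      # slope of ln(y) against x
    a = math.exp(mean_l - b * mean_x)  # intercept, mapped back from log space
    return a, b


# Synthetic points from a known curve; NOT the values shown in the chart.
true_a, true_b = 50.0, 0.003
records = [500, 1000, 1500, 2000, 2500]
seconds = [true_a * math.exp(true_b * r) for r in records]

a, b = fit_exponential(records, seconds)
print(f"a = {a:.3f}, b = {b:.6f}")
```

With real timings, a fitted b well above zero would indicate that matching time grows faster than linearly with the number of records analyzed.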

Demonstration
• Analyzer
  – Configuration and Automated Mapping
• Harvester: User interface to
  – Review Analyzer results
  – Correct/Update Mappings
  – Create Virtual Data Store (VDS)
• Federated Query
  – Simple query example
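
The slides do not show the federated query itself. As a minimal sketch of the idea, assume two sources with different native schemas and a data map that translates a common field name into each source's own field name. The table, field names, sample rows, and function names below are all hypothetical, and an in-memory SQLite database stands in for a relational source.

```python
import sqlite3

# Source 1 (hypothetical): a relational table, here an in-memory SQLite database.
db = sqlite3.connect(":memory:")
db.execute("CREATE TABLE fate (agent_name TEXT, surface TEXT)")
db.executemany("INSERT INTO fate VALUES (?, ?)",
               [("agent-1", "sand"), ("agent-2", "glass")])

# Source 2 (hypothetical): rows as they might be read from a spreadsheet.
spreadsheet_rows = [
    {"Agent": "agent-1", "Material": "concrete"},
    {"Agent": "agent-3", "Material": "soil"},
]

# Data map: common field name -> native field name, per source.
DATA_MAP = {
    "fate_db": {"agent": "agent_name", "substrate": "surface"},
    "sheet": {"agent": "Agent", "substrate": "Material"},
}


def query_fate_db(agent):
    m = DATA_MAP["fate_db"]
    # Column names come from the trusted data map; the value is a bound parameter.
    sql = f"SELECT {m['agent']}, {m['substrate']} FROM fate WHERE {m['agent']} = ?"
    return [{"agent": a, "substrate": s, "source": "fate_db"}
            for a, s in db.execute(sql, (agent,))]


def query_sheet(agent):
    m = DATA_MAP["sheet"]
    return [{"agent": r[m["agent"]], "substrate": r[m["substrate"]], "source": "sheet"}
            for r in spreadsheet_rows if r[m["agent"]] == agent]


def federated_query(agent):
    """Send the query to every source and merge results in the common schema."""
    results = []
    for run in (query_fate_db, query_sheet):
        results.extend(run(agent))
    return results


for row in federated_query("agent-1"):
    print(row)
```

Each result carries a `source` field so the caller can see which repository answered, which fits the stated goal of keeping stewardship with the data owners.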