30
Data Warehouse Data Warehouse Overview Overview September 28, 2012 September 28, 2012 presented by presented by Terry Bilskie Terry Bilskie

Data Warehouse Overview September 28, 2012 presented by Terry Bilskie

Embed Size (px)

Citation preview

  • Data WarehouseOverview

    September 28, 2012

    presented byTerry Bilskie

  • Presentation Objectives:Data Warehouse OverviewDefinitionBenefits & ConsiderationsTerminologyArchitectureInformation Access Maturity Roadmap to a more Data Driven Institution

  • Data Warehouse, is it clear to you ?

  • Data Warehouse Definition A data warehouse is-subject-oriented,-integrated,-time-variant,-nonvolatilecollection of data in support of managementsdecision making process.

  • Data Warehouse is not: A single physical piece of hardware or a software product. A single project with an end A single solution or product

  • Data Warehouse is: A necessary component in order to achieve higher end reporting and analysis capability with respect to historical data, current trends, and future projections. A data source A combination of software and hardware

  • Subject-orientedData warehouse is organized around subjects such as admissions, particular degree conferred and students.It focuses on modeling and analysis of data for decision makers.Excludes data not useful in decision support process.

  • IntegrationData Warehouse is constructed by integrating multiple heterogeneous sources.Data Preprocessing are applied to ensure consistency.RDBMSLegacySystemDataWarehouseFlat FileData ProcessingData Transformation

  • Time-variantProvides information from historical perspective e.g. past 5-10 yearsEvery key structure contains either implicitly or explicitly an element of time

  • NonvolatileData once recorded cannot be updated.Data warehouse requires two operations in data accessingInitial loading of dataAccess of dataloadaccess

  • Data Warehouse BenefitsSpeed up reportingReduce reporting load on transactional systemsMake institutional data more user-friendly and accessibleIntegrate data from different source systemsEnable point-in-time analysis and trending over timeTo help identify and resolve data integrity issues, either in the warehouse itself or in the source systems that collect the data

  • Data Warehouse BenefitsHas a subject area orientation Integrates data from multiple, diverse sources Allows for analysis of data over time Adds ad hoc reporting and enquiry Provides analysis capabilities to decision makers Relieves the development burden on IT

  • Data Warehouse BenefitsProvides improved performance for complex analytical queries Relieves processing burden on transaction oriented databases Allows for a continuous planning process Converts corporate data into strategic information

  • Data Warehouse Considerations

    High-level supportIdentification of reporting needs by subject area and organizational roleBridging the gap between reporting needs and technical specificationsPartnerships with central and campus administrative areasCustomer support and training

  • Data Warehouse TerminologyData WarehouseA copy of transaction data specifically structured for querying and reportingData MartA logical subset of the complete data warehouseOLAP (On-Line Analytic Processing)The activity of querying and presenting text and number data, usually with underlying multidimensional cubes of dataDimensional ModelingA specific discipline for modeling data that is an alternative to entity-relationship (E/R) modeling; usually employed in data warehouses and OLAP systems.

  • Data Warehouse ArchitectureWhat makes up a Data Warehouse ?

    ConceptsCharacteristicsLogical & PhysicalComponents

  • ArchitectedData MartReplicationData Set DistributionAccess & AnalysisResource Scheduling & DistributionEnd UserWorkstationsA Data Warehouse Is A Component

  • *Tiered ArchitectureDataWarehouseData MartsData StorageTier1:Data Warehouse ServerAnalysisQuery/ReportsData miningFront-End ToolsTier3:Clients

  • Data Warehouse ArchitectureData Warehouse serveralmost always a relational DBMS,rarely flat filesOLAP serversto support and operate on multi-dimensional data structuresClientsQuery and reporting toolsAnalysis toolsData mining tools

  • Data Warehouse from a logical perspective

  • Another look from a logical perspective

  • How it fits into Business Intelligence Viewpoint

  • Data Warehouse from a conceptual perspective

    A data warehouse is based on a multidimensional data model which views data in the form of a data cube

  • *Conceptual ModelStudent Profile Data ViewType of StudentCampusAt Rsiksum First TimeTransferReturning1234VincennesJasperIndianapolisOut of StateALL

  • Data to Knowledge Process

  • How a Data Warehouse fits within our overall Data Governance

  • Current Strategy / Approach

  • Current Data Access Delivery Mechanisms & ToolsAd-hoc Reporting AccessScheduled and On-Demand Report GenerationUsing tools such as e~print, discoverer, ms access and excel, jobsub, population selection, argos, etc.

  • Data Driven FrameworkPillars of Success

  • Questions and AnswersData Warehouse Concepts

    *********