144
SCDL – 4 th Semester – Data Mining LIST OF ATTEMPTED QUESTIONS AND ANSWERS Select The Blank Question Semantic integration of ________ genome database is the important task of DNA analysis. Correct Answer Heterogeneous and distributed Your Answer Heterogeneous and distributed Multiple Choice Single Answer Question Main advantage of following which method is it's fast processing? Correct Answer Grid based Your Answer Partioning based Select The Blank Question With the widespread option of ________ real-time connection is viable for data warehouse. Correct Answer TCP/IP Your Answer HTTP Select The Blank Question ________ are responsible for running queries and reports against data warehouse tables. Correct Answer End users Your Answer End users Multiple Choice Multiple Answer Question Advantages of Wavelet transformation for clustering are :- Correct Answer Unsupervised clustering , Detection of cluster for accuracy , Clustering is fast Your Answer Unsupervised clustering , Clustering is fast , Decomposition of cluster for accuracy Multiple Choice Single Answer Question Query tool is meant for :- Correct Answer Data acquisition Page 1 of 144

Data Mining

Embed Size (px)

DESCRIPTION

Data mining assignments.

Citation preview

Marks : 2

SCDL 4th Semester Data Mining

Top of Form

LIST OF ATTEMPTED QUESTIONS AND ANSWERS

Select The BlankQuestionSemantic integration of ________ genome database is the important task of DNA analysis.Correct AnswerHeterogeneous and distributedYour AnswerHeterogeneous and distributed

Multiple Choice Single AnswerQuestionMain advantage of following which method is it's fast processing?Correct AnswerGrid basedYour AnswerPartioning based

Select The BlankQuestionWith the widespread option of ________ real-time connection is viable for data warehouse.Correct AnswerTCP/IPYour AnswerHTTP

Select The BlankQuestion________ are responsible for running queries and reports against data warehouse tables.Correct AnswerEnd usersYour AnswerEnd users

Multiple Choice Multiple AnswerQuestionAdvantages of Wavelet transformation for clustering are :-Correct AnswerUnsupervised clustering , Detection of cluster for accuracy , Clustering is fast Your AnswerUnsupervised clustering , Clustering is fast , Decomposition of cluster for accuracy

Multiple Choice Single AnswerQuestionQuery tool is meant for :-Correct AnswerData acquisitionYour AnswerInformation delivery

Multiple Choice Single AnswerQuestionWhich of the following function involves data cleaning, data standardizing and summarizing?Correct AnswerTransforming dataYour AnswerStoring data

Multiple Choice Multiple AnswerQuestionWhich of the following clustering analysis method uses multiresolution approach?Correct AnswerSTING , Wave Cluster Your AnswerSTING , Wave Cluster

Multiple Choice Single AnswerQuestionWhich type of following clustering computes augumented cluster ordering?Correct AnswerOPTICSYour AnswerCLQUE

Multiple Choice Multiple AnswerQuestionTime variant nature of the data in data warehouse :-Correct AnswerAllows for analysis of the past , Relate information to the present , Enables forecasts for the future Your AnswerAllows for analysis of the past , Relate information to the present , Enables forecasts for the future

True/FalseQuestionThe Structure that brings all the components together is known as Architecture.Correct AnswerTrueYour AnswerTrue

Multiple Choice Multiple AnswerQuestionData compression is to compress the given data by encoding in terms of :-Correct AnswerAssociation rule , Decision tree , Cluster Your AnswerBytes , Cluster

Multiple Choice Multiple AnswerQuestionThe different definitions of metadata are :-Correct AnswerData about data , Catalog of data , Data warehouse roadmap Your AnswerData about data , Catalog of data , Data warehouse roadmap

True/FalseQuestionA distinct feature of DB Miner is its data cube based online analytical mining.Correct AnswerTrueYour AnswerFalse

Multiple Choice Single AnswerQuestionAssociation rules mining is based on :-Correct AnswerClustering and Employing rules for classificationYour AnswerClustering and Employing rules for classification

True/FalseQuestionA distinguishing feature of Clementine is its object oriented extended module interface.Correct AnswerTrueYour AnswerTrue

Select The BlankQuestion________ includes Normalization and Aggregation as data preprocessing procedures.Correct AnswerData transformationYour AnswerData transformation

True/FalseQuestionTo remove noise from data is called as Smoothing.Correct AnswerTrueYour AnswerTrue

Multiple Choice Single AnswerQuestionData matrix is :-Correct AnswerObject by variable structureYour AnswerObject by variable structure

True/FalseQuestionData updates are common place in an operational database.Correct AnswerTrueYour AnswerTrue

True/FalseQuestionIn decision tree internal nodes are denoted by ovals and leaf nodes are denoted by rectanglesCorrect AnswerFalseYour AnswerTrue

True/FalseQuestionFrom a Dataware house perspective data mining canbe viewed as an advanced stage of Online Analytical Programming.Correct AnswerTrueYour AnswerTrue

Match The FollowingQuestionCorrect AnswerYour AnswerDisparate dataProduction dataQuery and analysisNon volatile dataQuery and analysisArchive dataData granularityLevel of detailLevel of detailData from external sourceExternal dataExternal data

Multiple Choice Multiple AnswerQuestionIn physical design of data warehouse administration provides features like :-Correct AnswerAvoiding reorganizing of tables , Support backup and recovery , Query processing Your AnswerSupport backup and recovery , Manage store area , Query processing

Select The BlankQuestion________ is the user who has system access privileges but no database administration privileges as well as not for table and views.Correct AnswerNetwork administratorYour AnswerEnd user

Multiple Choice Multiple AnswerQuestionData mining Functionalities are :-Correct AnswerCharactrization and Discrimination , Association Analysis , Cluster Analysis Your AnswerAssociation Analysis , Cluster Analysis , Time series Data Analysis

Select The BlankQuestion________ dimension of database in which primitive level data are spatial but generalization becomes non spatial.Correct AnswerSpatial to non spatialYour AnswerSpatial to non spatial

Multiple Choice Multiple AnswerQuestionSource Data Component may be grouped into following categories :-Correct AnswerProduction Data , Internal External Data Your AnswerInternal External Data , Analyzed data , Non Analyzed data

Select The BlankQuestion________ technique is the statistical technique for analyzing data.Correct AnswerTime seriesYour AnswerTime series

Multiple Choice Multiple AnswerQuestionThe strategies for data reduction are :-Correct AnswerData aggregation , Dimension reduction , Numerocity reduction Your AnswerData aggregation , Dimension reduction , Numerocity reduction

Multiple Choice Single AnswerQuestionClassification rules are extracted fromCorrect AnswerDecision TreeYour AnswerRoot-Node

Match The FollowingQuestionCorrect AnswerYour AnswerData MiningKnowledge discoveryKnowledge discoveryMetadataRoadmap for userDetails of summaryData storageData managementData managementData stagingWorkbench for dataWorkbench for data

True/FalseQuestionData cube stores multidimensional aggregate information.Correct AnswerTrueYour AnswerTrue

Select The BlankQuestion________ is the method used to predict the value of response variable from one to more variables.Correct AnswerRegressionYour AnswerRegression

Select The BlankQuestion________ databases are one of the most poplularly available and rich information repositories.Correct AnswerRelationalYour AnswerObject oriented

True/FalseQuestionCOBWEB is a method of incremental conceptual clustering.Correct AnswerTrueYour AnswerTrue

Multiple Choice Single AnswerQuestionMany methods for data smoothing are also methods for data reduction involving :-Correct AnswerDiscretizationYour AnswerClustering

Multiple Choice Single AnswerQuestionDimensionality reduction reduces the data set size by removing :-Correct AnswerIrrelevant attributesYour AnswerIrrelevant attributes

Multiple Choice Single AnswerQuestionEffect of one attibute value on a given class is independent of values of other attibute is calledCorrect AnswerValue independenceYour AnswerClass Conditional independence

Multiple Choice Single AnswerQuestionWhich from the following are special programs that are stored on database and fired when certain predefined action occurs?Correct AnswerTriggersYour AnswerTriggers

Select The BlankQuestionA web server usually registers ________ entry for every access of a web pageCorrect AnswerWeblogYour AnswerLog

Multiple Choice Single AnswerQuestionBayes Theorem is :-Correct AnswerP(H|X)=P(X|H)(P)/P(X)Your AnswerP(H|X)=P(X|H)(P)/P(X)

True/FalseQuestionVisual display can help user to give clear impression and overview of the data characteristics in a database.Correct AnswerTrueYour AnswerTrue

Multiple Choice Single AnswerQuestionWhich of the following is based on set of density distribution function clustering?Correct AnswerDBSCANYour AnswerDBSCAN

Multiple Choice Multiple AnswerQuestionMetadata in a data warehouse falls into following categories :-Correct AnswerOperational Metadata , Extraction and Transformation metadata , End-user Metadata Your AnswerOperational Metadata , Extraction and Transformation metadata , End-user Metadata

Multiple Choice Multiple AnswerQuestionKnowledge discovery process includes :-Correct AnswerData Cleaning , Data Intergration , Data Selectin Your AnswerData Cleaning , Data Intergration , Data Selectin

Select The BlankQuestionHuman being have around ________ gene.Correct Answer100000Your Answer1000000

LIST OF ATTEMPTED QUESTIONS AND ANSWERS

Select The BlankQuestion

Creating ________is violation of Normalization principles.

Correct Answer

Array

Your Answer

Array

True/FalseQuestion

Data Mining refers to extracting knowledge from larger amount of data.

Correct Answer

True

Your Answer

True

Multiple Choice Single AnswerQuestion

Which of the following of Grid based clustering method explorates statistical information?

Correct Answer

STING

Your Answer

CLIQUE

Select The BlankQuestion

In ________ type smoothing, minimum and maximum values in given bin are identified as bin boundaries.

Correct Answer

Smoothing by bin boundaries

Your Answer

Smoothing by medians

Select The BlankQuestion

________ can store aggregate and detail data at varying levels of resolution or abstraction.

Correct Answer

Index tree

Your Answer

R-Tree

Select The BlankQuestion

________ is the platform for complex data transformation for the purpose of cleanse it

Correct Answer

Separate optimal Platform

Your Answer

Legacy platform

Multiple Choice Multiple AnswerQuestion

SMP provides the features like :-

Correct Answer

Each node has access to common set of disks , Controllers which are accessible to all processors , Each processor has full access to the shared memory though common bus

Your Answer

Controllers which are accessible to all processors , Each processor has full access to the shared memory though common bus , It is cluster of nodes

Multiple Choice Single AnswerQuestion

In intermediate data extraction data capture through transaction log uses transaction from :-

Correct Answer

Recovery from failure

Your Answer

Recovery from failure

Multiple Choice Multiple AnswerQuestion

In data storage area , DBA uses metadata for processes of :-

Correct Answer

Backup , Recovery , Tuning Database

Your Answer

Backup , Recovery , Management

Multiple Choice Multiple AnswerQuestion

Foundation infrastructure of warehouse includes many elements such as :-

Correct Answer

Basic Computing platform , Hardware and operating system , DBMS and Query

Your Answer

Basic Computing platform , DBMS and Query , Query processing components

Match The FollowingQuestion

Correct Answer

Your Answer

Data producer

Responsible for data quality

Responsible for data quality

Domain values

Prevalent problem

Foreign key preserved

Update security

Prevention of unauthorized updates

Prevalent problem

Referential integrity

Foreign key preserved

Prevention of unauthorized updates

Select The BlankQuestion

________ is density based clustering method which computes on augumented clustering ordering for automic ordering for automatic and interactive cluster analysis

Correct Answer

DBSCAN

Your Answer

DBSCAN

Multiple Choice Multiple AnswerQuestion

Building blocks of Data Warehouse are :-

Correct Answer

Source Data , Data Staging , Management and Control

Your Answer

Data Staging , Data Manager , Management and Control

True/FalseQuestion

All data extraction, transformation, integration and staging jobs run on selected hardware under chosen operating system.

Correct Answer

True

Your Answer

True

Multiple Choice Single AnswerQuestion

Real world databases are highly susceptible to noisy, missing and inconsistent data due to :-

Correct Answer

Huge size of data

Your Answer

Huge size of data

Match The FollowingQuestion

Correct Answer

Your Answer

Clustering tool

To group different cases

To detect unusual attribute

Data visualization tool

Transaction activity using graph

To filter unrelated attributes

Linkage analysis tool

To identify links

To group different cases

Classification tool

To filter unrelated attributes

To identify links

Multiple Choice Multiple AnswerQuestion

Generalized linear model includes :-

Correct Answer

Logistic regression , Poisson regression

Your Answer

Poisson regression , Linear regression , Polynomial Regression

True/FalseQuestion

Metadata acts like a nerve center.

Correct Answer

True

Your Answer

False

Multiple Choice Single AnswerQuestion

OLAP is used for :-

Correct Answer

Online Analytical Processing

Your Answer

Online Application Processing

Multiple Choice Multiple AnswerQuestion

The dimensions of spatial data cube are :-

Correct Answer

Non- spatial dimension , Spatial to non spatial , Spatial to spatial

Your Answer

Non- spatial dimension , Spatial to non spatial , Spatial to spatial

Multiple Choice Single AnswerQuestion

Maintenance of cache consistency is the limitation of :-

Correct Answer

MPP

Your Answer

NUMA

Select The BlankQuestion

In ________ duplicate sub trees exist within the tree.

Correct Answer

Repetition

Your Answer

Replication

Select The BlankQuestion

Indexed ________ engines search index,web pages and build huge keyword based indices which help to search sets of web pages containing certain keywords

Correct Answer

Web Search

Your Answer

Web Search

Multiple Choice Single AnswerQuestion

Redundancies can be deleted by :-

Correct Answer

Co-relational analysis

Your Answer

Coherent analysis

True/FalseQuestion

To detect money laundering and other financial crimes, it is important to integrate information for multiple databases.

Correct Answer

True

Your Answer

True

Multiple Choice Multiple AnswerQuestion

Common areas of application for mixed effect model includes :-

Correct Answer

Multiple data , Repeated measures data , Block designs

Your Answer

Multiple data , Dimensional data , Block designs

Select The BlankQuestion

In data ________, data encoding or transformations are applied to obtain reduced or compressed representation.

Correct Answer

Compression

Your Answer

Compression

Multiple Choice Single AnswerQuestion

Grouped data can be analyzed with the technique :-

Correct Answer

Mixed effect model

Your Answer

Factor analysis

Select The BlankQuestion

________ is the navigational map of data warehouse.

Correct Answer

End user Metadata

Your Answer

Extraction Metadata

Multiple Choice Multiple AnswerQuestion

Business metadata is useful for :-

Correct Answer

Providing support to end users , For external view of data , Provides technical support to search data

Your Answer

Providing support to end users , For external view of data , Provides technical support to search data

True/FalseQuestion

The elements of warehouse infrastructure are classified into operational and physical infrastructure.

Correct Answer

True

Your Answer

True

Multiple Choice Single AnswerQuestion

Data reduction by volume can be used for data representation using which type of reduction?

Correct Answer

Numerosity reduction

Your Answer

Histograms

True/FalseQuestion

Descriptive mining takes perform ingerence on current data which predictive mining characterize the general properties of data in database

Correct Answer

False

Your Answer

False

Multiple Choice Single AnswerQuestion

Queries run faster to find exact match using which type of indexing?

Correct Answer

Clustered index

Your Answer

Sequential index

Multiple Choice Single AnswerQuestion

Data can be smoothed by filling the data to function such as :-

Correct Answer

Regression

Your Answer

Clustering

True/FalseQuestion

Data classification is two step process in which first step includes classfication of model and in second step model describes set of data.

Correct Answer

False

Your Answer

True

Select The BlankQuestion

In data warehouse architecture, the ________ component interleaves with and connects other components.

Correct Answer

Metadata

Your Answer

Metadata

True/FalseQuestion

Legacy data resides on Hierarchical or Network database.

Correct Answer

True

Your Answer

True

LIST OF ATTEMPTED QUESTIONS AND ANSWERS

Multiple Choice Multiple AnswerQuestion

Metadata is essential for IT for :-

Correct Answer

Source data structures , Data summarization

Your Answer

Web enabling , Source data structures , Data summarization

Multiple Choice Multiple AnswerQuestion

Financial data called for banking and financial industry are often relatively :-

Correct Answer

Complete , Reliable , High Quality

Your Answer

Complete , Reliable , Correct

Select The BlankQuestion

________ option of warehouse architecture provides incremental growth.

Correct Answer

Cluster

Your Answer

Cluster

Match The FollowingQuestion

Correct Answer

Your Answer

Operating systems compatibility

Security, reliability, availability

Security, reliability, availability

Data Acquisition

Data Extraction, Transformation, clensing, integration

Data Extraction, Transformation, clensing, integration

Data Storage

Data loading , Archiving

Data loading , Archiving

Information Delivery

Report generation, query processing and complex analysis

Report generation, query processing and complex analysis

True/FalseQuestion

A cluster is a collection of similar data objects in same cluster and disimilar to objects in another cluster.

Correct Answer

True

Your Answer

True

Multiple Choice Single AnswerQuestion

Which of the following method creates copies of data in distributed environment?

Correct Answer

Replication

Your Answer

Replication

Multiple Choice Single AnswerQuestion

Capture at data source and that's why this method is quite reliable :-

Correct Answer

Capture by database Triggers

Your Answer

Capture in source application

Multiple Choice Single AnswerQuestion

For Banking and financial data which type of analysis is used?

Correct Answer

Multidimensional

Your Answer

Relational

Multiple Choice Single AnswerQuestion

Which of the following methods for regression is used on sparse data :-

Correct Answer

Regression and log-linear model

Your Answer

Regression and transformation

Multiple Choice Multiple AnswerQuestion

Following data transformation methods are used in analysis of time series data :-

Correct Answer

Scaling , Normalization , Windows Stiching

Your Answer

Scaling , Normalization , Windows Stiching

Select The BlankQuestion

________ function of data staging component involves many forms of combining pieces of data from different sources.

Correct Answer

Data Transformation

Your Answer

Data Loading

Multiple Choice Single AnswerQuestion

Real world databases are highly susceptible to noisy, missing and inconsistent data due to :-

Correct Answer

Huge size of data

Your Answer

Relational data

Select The BlankQuestion

Creating ________is violation of Normalization principles.

Correct Answer

Array

Your Answer

Structure

Multiple Choice Multiple AnswerQuestion

The tools of metadata falls in following categories :-

Correct Answer

Development tools for IT professional , Information access tool for End user

Your Answer

Access tool , Development tools for IT professional , Information access tool for End user

True/FalseQuestion

Architecture comes first, tools follows it.

Correct Answer

True

Your Answer

True

Select The BlankQuestion

________ is an alternative aggolomerative hierarchical clustering algorithm.

Correct Answer

ROCK

Your Answer

ROKE

Multiple Choice Multiple AnswerQuestion

In data storage area , DBA uses metadata for processes of :-

Correct Answer

Backup , Recovery , Tuning Database

Your Answer

Backup , Recovery , Tuning Database

True/FalseQuestion

Data cleansing means removing noisy and inconsistent data.

Correct Answer

True

Your Answer

True

Multiple Choice Multiple AnswerQuestion

Data processing techniques are :-

Correct Answer

Cleansing , Integration , Transformation

Your Answer

Cleansing , Transformation , Collection

Multiple Choice Single AnswerQuestion

Data can be smoothed by filling the data to function such as :-

Correct Answer

Regression

Your Answer

Clustering

Multiple Choice Single AnswerQuestion

Deviation based outlier detection identifes outliers by :-

Correct Answer

Examining character of objects in groups

Your Answer

Examining objects in group

Multiple Choice Single AnswerQuestion

Data partitioning, data clustering are the techniques for :-

Correct Answer

Performance enhancement

Your Answer

Performance enhancement

Multiple Choice Multiple AnswerQuestion

Following are the issues to consider during data integration :-

Correct Answer

Schema integration , Redundancy , Detection and resolution of data values

Your Answer

Schema integration , Redundancy , Inconsistency

True/FalseQuestion

Management architectural component manages and controls data acquisition functions.

Correct Answer

True

Your Answer

True

Match The FollowingQuestion

Correct Answer

Your Answer

Data loading tool

Primary key generation

Primary key generation

Data modeling tool

Reverse Engineering capabilities

Reverse Engineering capabilities

Data Extraction tool

Bulk extraction for full refresh

Bulk extraction for full refresh

Data transformation tool

Default values

Replication

Multiple Choice Multiple AnswerQuestion

DNA sequences are comprised of :-

Correct Answer

Adenine , Gaunine , Thymine

Your Answer

Adenine , Cytocine , Gaunine

Multiple Choice Single AnswerQuestion

Large number of indexes affects the loading process because :-

Correct Answer

Indexes are created for new records

Your Answer

Indexes are created for old records

Select The BlankQuestion

The technique of ________ enables concurrent input/output operations and improves file's access performance substantially.

Correct Answer

File striping

Your Answer

Data migration

Multiple Choice Multiple AnswerQuestion

Warehouse Operational infrastructure is to support each architecture component consists of :-

Correct Answer

People , Procedures , Management software

Your Answer

People , Procedures , Management software

True/FalseQuestion

In Purning method, postpruning requires more computation than prepruning yet generally leads to more reliable.

Correct Answer

True

Your Answer

False

Select The BlankQuestion

________ technique can be used to reduce the number of values for a given continuous attribute by dividing range of attributes into interval.

Correct Answer

Descretization

Your Answer

Compression

True/FalseQuestion

Data cubes created for varying levels of abstraction are referred as cuboids.

Correct Answer

True

Your Answer

True

Multiple Choice Single AnswerQuestion

Which of the following approach requires more computation?

Correct Answer

Filter approach

Your Answer

Filter approach

Select The BlankQuestion

________components consists all the different ways of making the information from the data warehouse available to the user.

Correct Answer

Information Delivery

Your Answer

Metadata

Multiple Choice Multiple AnswerQuestion

Data transformation includes :-

Correct Answer

Smoothing , Aggregation , Generalization

Your Answer

Smoothing , Aggregation

True/FalseQuestion

In Linear regression data are modeled to fit a straight line.

Correct Answer

True

Your Answer

True

Multiple Choice Multiple AnswerQuestion

Methods for outlier detection are categorised into following approaches :-

Correct Answer

Statistical , Distance based , Deviation based

Your Answer

Statistical , Distance based , Deviation based

Multiple Choice Multiple AnswerQuestion

Data base miner provides multiple data mining algorithms including :-

Correct Answer

Discovery driven OLAP analysis , Association , Classification

Your Answer

Discovery driven OLAP analysis , Association , Regression

LIST OF ATTEMPTED QUESTIONS AND ANSWERS

Multiple Choice Single AnswerQuestion

Deviation based outlier detection identifes outliers by :-

Correct Answer

Examining character of objects in groups

Your Answer

Examining character of objects in groups

Select The BlankQuestion

________ component of warehouse is responsible for coordinating services and activities within the data warehouse.

Correct Answer

Management and Control

Your Answer

Management and Control

True/FalseQuestion

Sequential pattern analysis and similarity search techniques have been developed in data mining.

Correct Answer

True

Your Answer

True

Select The BlankQuestion

For operational system, the stored data contains ________values.

Correct Answer

Current data

Your Answer

Current data

True/FalseQuestion

Intelligent miner is an IBM data mining product.

Correct Answer

True

Your Answer

True

Select The BlankQuestion

The technique of ________ enables concurrent input/output operations and improves file's access performance substantially.

Correct Answer

File striping

Your Answer

File striping

Multiple Choice Multiple AnswerQuestion

SMP provides the features like :-

Correct Answer

Controllers which are accessible to all processors , Each processor has full access to the shared memory though common bus , Each node has access to common set of disks

Your Answer

Controllers which are accessible to all processors , Each processor has full access to the shared memory though common bus

Match The FollowingQuestion

Correct Answer

Your Answer

Incremental data capture

Differed data capture

Differed data capture

Initial load of data warehouse

"as-is" data capture

"as-is" data capture

Static data

Capture of data in given point of time

Capture of data in given point of time

Data revision

Incremental data capture

Incremental data capture

True/FalseQuestion

In Purning method, postpruning requires more computation than prepruning yet generally leads to more reliable.

Correct Answer

True

Your Answer

False

True/FalseQuestion

Data preprocessing is an important step in knowledge discovery process.

Correct Answer

True

Your Answer

True

Multiple Choice Multiple AnswerQuestion

The dimensions of spatial data cube are :-

Correct Answer

Non- spatial dimension , Spatial to non spatial , Spatial to spatial

Your Answer

Non- spatial dimension , Spatial to non spatial , Spatial to spatial

True/FalseQuestion

Data mining often requires data integration.

Correct Answer

True

Your Answer

True

Multiple Choice Multiple AnswerQuestion

In data storage area , DBA uses metadata for processes of :-

Correct Answer

Backup , Recovery , Tuning Database

Your Answer

Backup , Recovery , Tuning Database

Select The BlankQuestion

________components consists all the different ways of making the information from the data warehouse available to the user.

Correct Answer

Information Delivery

Your Answer

Information Delivery

Multiple Choice Multiple AnswerQuestion

Data processing techniques are :-

Correct Answer

Cleansing , Integration , Transformation

Your Answer

Integration , Transformation , Cleansing

Match The FollowingQuestion

Correct Answer

Your Answer

Information Delivery

Report generation, query processing and complex analysis

Report generation, query processing and complex analysis

Operating systems compatibility

Security, reliability, availability

Security, reliability, availability

Data Acquisition

Data Extraction, Transformation, clensing, integration

Data Extraction, Transformation, clensing, integration

Data Storage

Data loading , Archiving

Data loading , Archiving

Select The BlankQuestion

In ________ type smoothing, minimum and maximum values in given bin are identified as bin boundaries.

Correct Answer

Smoothing by bin boundaries

Your Answer

Smoothing by bin boundaries

Multiple Choice Single AnswerQuestion

Data partitioning, data clustering are the techniques for :-

Correct Answer

Performance enhancement

Your Answer

Data extraction

Select The BlankQuestion

Most of the warehouses employ ________ database Management System.

Correct Answer

Relational

Your Answer

Relational

True/FalseQuestion

NUMA provides better scalability than SMP.

Correct Answer

True

Your Answer

True

Multiple Choice Single AnswerQuestion

Data migration affects performance requiring multiple blocks to be read which can be adjusted by :-

Correct Answer

Block percent free

Your Answer

Block percent free

Multiple Choice Single AnswerQuestion

Redundancies can be deleted by :-

Correct Answer

Co-relational analysis

Your Answer

Co-relational analysis

Multiple Choice Multiple AnswerQuestion

The functions of data acquisition are :-

Correct Answer

Data Transformation , Data Extraction

Your Answer

Data Extraction , Data Transformation , Data cleansing

Multiple Choice Single AnswerQuestion

SMP stands for :-

Correct Answer

Symmetric Multiprocessing

Your Answer

Symmetric Multiprocessing

Multiple Choice Multiple AnswerQuestion

Mining values can be removed by :-

Correct Answer

Filling values manually , Use of global constant , Use of attribute mean

Your Answer

Filling values manually , Use of attribute mean

Multiple Choice Single AnswerQuestion

Which from the following is used for classification and prediction?

Correct Answer

Regression trees

Your Answer

Regression

Multiple Choice Multiple AnswerQuestion

Before moving data to data warehouse is has to go through :-

Correct Answer

Transformation , Integration , Consolidation

Your Answer

Transformation , Integration , Consolidation

Select The BlankQuestion

________ is the navigational map of data warehouse.

Correct Answer

End user Metadata

Your Answer

Operational Metadata

True/FalseQuestion

Architecture comes first, tools follows it.

Correct Answer

True

Your Answer

True

Multiple Choice Single AnswerQuestion

Which technique analyze experimental data?

Correct Answer

Analysis of variance

Your Answer

Regression

Multiple Choice Multiple AnswerQuestion

The need for metadata is for :-

Correct Answer

Using data warehouse , Building data warehouse , Administration of warehouse

Your Answer

Building data warehouse , Administration of warehouse

Multiple Choice Single AnswerQuestion

Development and deployment of your data warehouse is joint effort between :-

Correct Answer

IT staff and user representatives

Your Answer

IT staff and user representatives

Select The BlankQuestion

________ function of data staging component involves many forms of combining pieces of data from different sources.

Correct Answer

Data Transformation

Your Answer

Data Transformation

Multiple Choice Multiple AnswerQuestion

When you use tool for design and development, following things take place with metadata :-

Correct Answer

Metadata is no longer passive document , Metadata takes part in process , Metadata aids in automation of data warehouse process

Your Answer

Metadata is no longer passive document , Metadata takes part in process , Metadata aids in automation of data warehouse process

Multiple Choice Multiple AnswerQuestion

The main categories of Metadata in warehouse are :-

Correct Answer

Operational , Extraction and transformation Metadata , End user Metadata

Your Answer

Operational , Extraction and transformation Metadata , End user Metadata

Select The BlankQuestion

________ is the type of pilot for early delivery with broader scope and may be integrated.

Correct Answer

Broad business pilot

Your Answer

Proof of concept pilot

True/FalseQuestion

A process of grouping a set of physical or abstract objects into classes of similar objects is called clusiering

Correct Answer

True

Your Answer

True

LIST OF ATTEMPTED QUESTIONS AND ANSWERS

Multiple Choice Single AnswerQuestion

Which type of Grid clustering depends on the granularity of lowest level of grid structure?

Correct Answer

STING

Your Answer

OPTICS

Multiple Choice Single AnswerQuestion

Which of the following option of data extraction is known as application assisted data capture?

Correct Answer

Capture in source application

Your Answer

Capture by comparing files

True/FalseQuestion

Moving data into staging area and performing data transformation function is a part of data acquisition.

Correct Answer

True

Your Answer

True

Multiple Choice Multiple AnswerQuestion

The objective for physical design of data warehouse are :-

Correct Answer

Improve performance , Ensure scalability , Manage store

Your Answer

Improve performance , Ensure scalability , Manage database

Multiple Choice Multiple AnswerQuestion

User must have proper access to metadata for performing responsibilities of :-

Correct Answer

Design , Administration

Your Answer

Design , Administration , Management

Multiple Choice Multiple AnswerQuestion

In Intelligent miner the data mining product provides data mining algorithm including

Correct Answer

Association , Classification , Regression

Your Answer

Association , Regression , Aggregation

Multiple Choice Single AnswerQuestion

The big difference between data warehouse and any operational system is its :-

Correct Answer

Usage

Your Answer

Organization

True/FalseQuestion

Loan payment prediction and customer credit analysis are critical to business of bank.

Correct Answer

True

Your Answer

False

Multiple Choice Single AnswerQuestion

Which of the option is not considered as the major function needed to get data ready?

Correct Answer

Storing data

Your Answer

Extracting data

True/FalseQuestion

In the data acquisition area, the data flow begins at the data sources and pauses at staging area.

Correct Answer

True

Your Answer

True

Select The BlankQuestion

Most of the warehouses employ ________ database Management System.

Correct Answer

Relational

Your Answer

Relational

True/FalseQuestion

NUMA provides better scalability than SMP.

Correct Answer

True

Your Answer

True

Match The FollowingQuestion

Correct Answer

Your Answer

Interactive visual data mining

Visualization tool

Audio signal

Data visualization

Visual display

Graphical display

Data mining result visualization

Presentation of knowledge

Visualization tool

Data mining process visualization

Data mining in visual format

Data mining in visual format

Multiple Choice Single AnswerQuestion

Deliberate splitting of a table and its index data into manageable part is known as :-

Correct Answer

Partitioning

Your Answer

Decomposing

Multiple Choice Multiple AnswerQuestion

Data mining is applicable to :-

Correct Answer

Relational Database , Data Warehouse , Transaction Database

Your Answer

Relational Database , Data Warehouse , Transaction Database

True/FalseQuestion

Data mining is not that much powerful tool for vast data such as gene sequences in DNA analysis.

Correct Answer

True

Your Answer

False

True/FalseQuestion

Data cleansing means removing noisy and inconsistent data.

Correct Answer

True

Your Answer

True

Multiple Choice Single AnswerQuestion

Which from the following is used for classification and prediction?

Correct Answer

Regression trees

Your Answer

Generalized linear model

Multiple Choice Multiple AnswerQuestion

Data cleansing routines work to clean the data by :-

Correct Answer

Filling missing values , Smoothing noisy data

Your Answer

Filling missing values , Smoothing noisy data , Resolving inconsistency

Select The BlankQuestion

________ is the type of pilot for early delivery with broader scope and may be integrated.

Correct Answer

Broad business pilot

Your Answer

Proof of concept pilot

Multiple Choice Single AnswerQuestion

The data warehouse DBMS executes on :-

Correct Answer

Data server component

Your Answer

Data server component

True/FalseQuestion

A process of grouping a set of physical or abstract objects into classes of similar objects is called clusiering

Correct Answer

True

Your Answer

False

Select The BlankQuestion

________ component of warehouse is responsible for coordinating services and activities within the data warehouse.

Correct Answer

Management and Control

Your Answer

Management and Control

Multiple Choice Single AnswerQuestion

Large number of indexes affects the loading process because :-

Correct Answer

Indexes are created for new records

Your Answer

Records are reshuffled

Match The FollowingQuestion

Correct Answer

Your Answer

Chasm

Challenges

Method to solve problem

Early majority

Nature technology

Technology to die out

Innovators

Method to solve problem

Challenges

Early adaptors

Increased interest

Increased interest

Select The BlankQuestion

________ is an alternative aggolomerative hierarchical clustering algorithm.

Correct Answer

ROCK

Your Answer

ROKE

Multiple Choice Single AnswerQuestion

Which technique is used to predict categorical response variable?

Correct Answer

Discriminant analysis

Your Answer

Factor analysis

Multiple Choice Single AnswerQuestion

Deviation based outlier detection identifes outliers by :-

Correct Answer

Examining character of objects in groups

Your Answer

Examining character of objects in groups

Multiple Choice Multiple AnswerQuestion

The information delivery methods from data warehouse are :-

Correct Answer

Complex queries , MD Analysis , Statistical Analysis

Your Answer

Complex queries , MD Analysis , ETS System

Select The BlankQuestion

________ does not handle categorical attributes.

Correct Answer

CURE

Your Answer

Chameleon

Multiple Choice Multiple AnswerQuestion

Data warehouse environment is functionally divided into following areas :-

Correct Answer

Data acquisition , Data storage , Information delivery

Your Answer

Data storage , Information delivery , Data transformation

True/FalseQuestion

Data mining often requires data integration.

Correct Answer

True

Your Answer

True

Select The BlankQuestion

________ method of regression is useful when errors fails to satisfy normal conditions.

Correct Answer

Robust

Your Answer

Polynomial

Multiple Choice Multiple AnswerQuestion

The areas of classification for metadata are :-

Correct Answer

Development/usage , Technical/business , BackRoom/Front Room

Your Answer

Development/usage , Technical/business , Administration

Multiple Choice Multiple AnswerQuestion

Data base miner provides multiple data mining algorithms including :-

Correct Answer

Discovery driven OLAP analysis , Association , Classification

Your Answer

Association , Classification , Regression

Select The BlankQuestion

The ________ record is one-to-many relationship with corresponding fact table record.

Correct Answer

Dimension tables

Your Answer

Fact table

Multiple Choice Single AnswerQuestion

For Incremental data loads the sequence is :-

Correct Answer

Triggering ->Filtering ->data extraction -> Transformation ->Integration ->cleansing

Your Answer

Triggering ->Filtering ->data extraction -> Transformation ->Integration ->cleansing

Multiple Choice Multiple AnswerQuestion

The platform of Data warehouse consists of :-

Correct Answer

Basic hardware components , Operating System , Network and Network software

Your Answer

Basic hardware components , Network and Network software , Utility software

Multiple Choice Multiple AnswerQuestion

The smoothing techniques are :-

Correct Answer

Binning , Clustering , Regression

Your Answer

Clustering , Regression , Insertion

LIST OF ATTEMPTED QUESTIONS AND ANSWERS

Select The BlankQuestion

________ method of regression is useful when errors fails to satisfy normal conditions.

Correct Answer

Robust

Your Answer

Robust

True/FalseQuestion

Data classification is two step process in which first step includes classfication of model and in second step model describes set of data.

Correct Answer

False

Your Answer

True

True/FalseQuestion

Data cleansing means removing noisy and inconsistent data.

Correct Answer

True

Your Answer

True

Multiple Choice Multiple AnswerQuestion

Following factors play important role in financial analysis :-

Correct Answer

Data warehouse , Data cubes , Outliner analysis

Your Answer

Data warehouse , Data cubes , Data accuracy

Multiple Choice Multiple AnswerQuestion

The dimensions of spatial data cube are :-

Correct Answer

Non- spatial dimension , Spatial to non spatial , Spatial to spatial

Your Answer

Non- spatial dimension , Spatial to non spatial , Spatial to spatial

Multiple Choice Single AnswerQuestion

OLAP is used for :-

Correct Answer

Online Analytical Processing

Your Answer

Online Analytical Processing

True/FalseQuestion

Metadata acts like a nerve center.

Correct Answer

True

Your Answer

True

Match The FollowingQuestion

Correct Answer

Your Answer

Constructive merge

New record supercedes

Populating data warehouse table first time

Initial Load

Populating data warehouse table first time

Populating data warehouse table first time

Incremental Load

Applying ongoing changes

Applying ongoing changes

Load Image

To correspond to target files

Applying data

Multiple Choice Single AnswerQuestion

Disparity is the significant & disturbing characteristic of which type of data?

Correct Answer

Production data

Your Answer

Production data

True/FalseQuestion

Audio data mining can be an interesting alternative to visual mining.

Correct Answer

True

Your Answer

True

Select The BlankQuestion

________ platform is the platform on which the data warehouse DBMS runs and database exist.

Correct Answer

Data storage

Your Answer

Data storage

True/FalseQuestion

Smoothing by bin means each value in bin is replaced by the mean value of the bucket.

Correct Answer

True

Your Answer

True

Multiple Choice Single AnswerQuestion

Following clustering method is classified as being agglomerative or divisive :-

Correct Answer

Grid based

Your Answer

Hierarchical Method

Multiple Choice Multiple AnswerQuestion

Data processing is done for :-

Correct Answer

Improving the efficiency , Ease of mining

Your Answer

Improving the efficiency , Ease of mining , Removing redundancy

Multiple Choice Single AnswerQuestion

For Banking and financial data which type of analysis is used?

Correct Answer

Multidimensional

Your Answer

Relational

Multiple Choice Multiple AnswerQuestion

Preprocessing steps of data in order to help improve accuracy, efficiency and scalability of classification & prediction are :-

Correct Answer

Data Cleaning , Relevance Analysis , Data Transformation

Your Answer

Data Cleaning , Relevance Analysis , Data Transformation

Multiple Choice Multiple AnswerQuestion

The functions of data acquisition are :-

Correct Answer

Data Extraction , Data Transformation

Your Answer

Data Extraction , Data Transformation , Data cleansing

Multiple Choice Single AnswerQuestion

Data partitioning, data clustering are the techniques for :-

Correct Answer

Performance enhancement

Your Answer

Performance enhancement

Multiple Choice Multiple AnswerQuestion

The Main areas of Data Warehouse are :-

Correct Answer

Data acquisition , Data Storage , Information Delivery

Your Answer

Data acquisition , Data Storage , Information Delivery

Select The BlankQuestion

________ is an alternative aggolomerative hierarchical clustering algorithm.

Correct Answer

ROCK

Your Answer

ROCK

Select The BlankQuestion

________ is the platform for complex data transformation for the purpose of cleanse it

Correct Answer

Separate optimal Platform

Your Answer

Separate optimal Platform

Multiple Choice Multiple AnswerQuestion

Metadata recorded in information delivery functional area is related to :-

Correct Answer

Predefined queries , Input parameter definition , Reports

Your Answer

Predefined queries , Reports

True/FalseQuestion

Data cubes created for varying levels of abstraction are referred as cuboids.

Correct Answer

True

Your Answer

True

True/FalseQuestion

Moving data into staging area and performing data transformation function is a part of data acquisition.

Correct Answer

True

Your Answer

True

Multiple Choice Multiple AnswerQuestion

Methods for outlier detection are categorised into following approaches :-

Correct Answer

Statistical , Distance based , Deviation based

Your Answer

Statistical , Distance based , Deviation based

Multiple Choice Single AnswerQuestion

The first step of attibute oriented induction is :-

Correct Answer

Data focusing

Your Answer

Data Collection

True/FalseQuestion

Legacy data resides on Hierarchical or Network database.

Correct Answer

True

Your Answer

True

Select The BlankQuestion

________ option of warehouse architecture provides incremental growth.

Correct Answer

Cluster

Your Answer

Cluster

Multiple Choice Single AnswerQuestion

Data can be smoothed by filling the data to function such as :-

Correct Answer

Regression

Your Answer

Regression

Multiple Choice Multiple AnswerQuestion

Data mining is applicable to :-

Correct Answer

Relational Database , Data Warehouse , Transaction Database

Your Answer

Relational Database , Data Warehouse , Transaction Database

Multiple Choice Single AnswerQuestion

The data warehouse DBMS executes on :-

Correct Answer

Data server component

Your Answer

Data server component

Match The FollowingQuestion

Correct Answer

Your Answer

Metadata

Roadmap for user

Details of summary

Data storage

Data management

Data management

Data staging

Workbench for data

Workbench for data

Data Mining

Knowledge discovery

Knowledge discovery

True/FalseQuestion

Data Mining refers to extracting knowledge from larger amount of data.

Correct Answer

True

Your Answer

True

Select The BlankQuestion

Most of the warehouses employ ________ database Management System.

Correct Answer

Relational

Your Answer

Relational

HTMLCONTROL Forms.HTML:Hidden.1 LIST OF ATTEMPTED QUESTIONS AND ANSWERS

Multiple Choice Single AnswerQuestion

The technique of data clustering facilitates :-

Correct Answer

Serial access

Your Answer

Indexed access

Select The BlankQuestion

In ________ type smoothing, minimum and maximum values in given bin are identified as bin boundaries.

Correct Answer

Smoothing by bin boundaries

Your Answer

Smoothing by bin boundaries

Multiple Choice Multiple AnswerQuestion

The ways of Intra query parallelization are :-

Correct Answer

Horizontal parallelization , Vertical Parallelization , Hybrid parallelization

Your Answer

Vertical Parallelization , Homogenous parallelization

True/FalseQuestion

One of the most important search problem in genetic analysis is similarity search and comparison among DNA sequence.

Correct Answer

True

Your Answer

True

Multiple Choice Multiple AnswerQuestion

User must have proper access to metadata for performing responsibilities of :-

Correct Answer

Design , Administration

Your Answer

Administration , Management , Accessing

Select The BlankQuestion

________ is the platform for complex data transformation for the purpose of cleanse it

Correct Answer

Separate optimal Platform

Your Answer

Legacy platform

Multiple Choice Multiple AnswerQuestion

Classification and Prediction have following applications :-

Correct Answer

Credit approval , Medical Diagnosis , Performance Prediction

Your Answer

Credit approval , Selective Marketing

Multiple Choice Multiple AnswerQuestion

In data storage area , DBA uses metadata for processes of :-

Correct Answer

Tuning Database , Backup , Recovery

Your Answer

Tuning Database , Management

Multiple Choice Single AnswerQuestion

Data can be smoothed by filling the data to function such as :-

Correct Answer

Regression

Your Answer

Binning

True/FalseQuestion

Tools perform major functions in data warehouse environment.

Correct Answer

True

Your Answer

True

Select The BlankQuestion

________ option of warehouse architecture provides incremental growth.

Correct Answer

Cluster

Your Answer

Cluster

True/FalseQuestion

Data staging and data storage may start out on same computing platform.

Correct Answer

True

Your Answer

False

Match The FollowingQuestion

Correct Answer

Your Answer

Middleware & connectivity tool

Transparent access to source system

Assist data ware house administration

Data Quality tool

Locating data errors

Locating data errors

OLAP tools

Channel queries

Channel queries

Alert system tool

Users attention on exceptions

Users attention on exceptions

Multiple Choice Single AnswerQuestion

Attribute construction is the part of :-

Correct Answer

Transformation

Your Answer

Smoothing

Multiple Choice Single AnswerQuestion

Deliberate splitting of a table and its index data into manageable part is known as :-

Correct Answer

Partitioning

Your Answer

Partitioning

Multiple Choice Single AnswerQuestion

Simple matching approach is used for computing disimilarity between two objects for :-

Correct Answer

Nominal variable

Your Answer

Invariant variable

True/FalseQuestion

Data mining is not that much powerful tool for vast data such as gene sequences in DNA analysis.

Correct Answer

True

Your Answer

False

Multiple Choice Single AnswerQuestion

Following clustering method is classified as being agglomerative or divisive :-

Correct Answer

Grid based

Your Answer

Density based

Select The BlankQuestion

________ clustering method follows statistical and neural network approach.

Correct Answer

Model based

Your Answer

Grid based

Multiple Choice Multiple AnswerQuestion

The different analysis tools which are useful to detect unusual patterns such as large amount of cash flow at certain period by certain group of people are :-

Correct Answer

Linkage analysis tool , Outlier analysis tool , Sequential pattern analysis tool

Your Answer

Linkage analysis tool , Outlier analysis tool , Complexity definition tool

Multiple Choice Multiple AnswerQuestion

DNA sequences are comprised of :-

Correct Answer

Adenine , Gaunine , Thymine

Your Answer

Adenine , Cytocine , Gaunine , Thymine

True/FalseQuestion

Management architectural component manages and controls data acquisition functions.

Correct Answer

True

Your Answer

False

Multiple Choice Single AnswerQuestion

If many indexes are needed, then on which table which option is more preferable?

Correct Answer

Splitting of tables

Your Answer

Splitting of tables

True/FalseQuestion

To detect money laundering and other financial crimes, it is important to integrate information for multiple databases.

Correct Answer

True

Your Answer

True

Select The BlankQuestion

It is good practice to drop ________ before initial load.

Correct Answer

Index

Your Answer

Index

True/FalseQuestion

All data extraction, transformation, integration and staging jobs run on selected hardware under chosen operating system.

Correct Answer

True

Your Answer

True

Multiple Choice Single AnswerQuestion

Deviation based outlier detection identifes outliers by :-

Correct Answer

Examining character of objects in groups

Your Answer

Examining character of objects in groups

Select The BlankQuestion

________ method of regression is useful when errors fails to satisfy normal conditions.

Correct Answer

Robust

Your Answer

Polynomial

Multiple Choice Multiple AnswerQuestion

The functional areas of metadata are :-

Correct Answer

Data Acquisition , Data storage , Information delivery

Your Answer

Data Acquisition , Data storage , Information delivery

Match The FollowingQuestion

Correct Answer

Your Answer

Load Utility

High performance data loading, recovery

High performance data loading, recovery

Query Governer

Abort runaway query

Balancing extraction of query

Query Optimizer

Parsing, optimizing query

Parsing, optimizing query

Query Management

Balancing extraction of query

Execution and rescheduling queries

Multiple Choice Single AnswerQuestion

The first step of attibute oriented induction is :-

Correct Answer

Data focusing

Your Answer

Data Classification

True/FalseQuestion

Architecture comes first, tools follows it.

Correct Answer

True

Your Answer

True

Multiple Choice Multiple AnswerQuestion

Data cleansing routines work to clean the data by :-

Correct Answer

Filling missing values , Smoothing noisy data

Your Answer

Smoothing noisy data , Resolving inconsistency

Select The BlankQuestion

Most of the warehouses employ ________ database Management System.

Correct Answer

Relational

Your Answer

Multidimensional

Multiple Choice Single AnswerQuestion

Which of the following method creates copies of data in distributed environment?

Correct Answer

Replication

Your Answer

Replication

Select The BlankQuestion

Human being have around ________ gene.

Correct Answer

100000

Your Answer

100000

LIST OF ATTEMPTED QUESTIONS AND ANSWERS

Multiple Choice Multiple AnswerQuestion

DNA sequences are comprised of :-

Correct Answer

Gaunine , Thymine , Adenine

Your Answer

Gaunine , Thymine , Adenine , Cytocine

True/FalseQuestion

Loan payment prediction and customer credit analysis are critical to business of bank.

Correct Answer

True

Your Answer

True

Multiple Choice Multiple AnswerQuestion

Preprocessing steps of data in order to help improve accuracy, efficiency and scalability of classification & prediction are :-

Correct Answer

Data Cleaning , Relevance Analysis , Data Transformation

Your Answer

Data Cleaning , Relevance Analysis , Data Transformation

Multiple Choice Single AnswerQuestion

The big difference between data warehouse and any operational system is its :-

Correct Answer

Usage

Your Answer

Usage

True/FalseQuestion

Data cleansing means removing noisy and inconsistent data.

Correct Answer

True

Your Answer

True

True/FalseQuestion

Moving data into staging area and performing data transformation function is a part of data acquisition.

Correct Answer

True

Your Answer

True

Select The BlankQuestion

________ option of warehouse architecture provides incremental growth.

Correct Answer

Cluster

Your Answer

Cluster

Select The BlankQuestion

For operational system, the stored data contains ________values.

Correct Answer

Current data

Your Answer

Current data

Multiple Choice Multiple AnswerQuestion

Splitting of data into smaller partition decision tree induction is prone to :-

Correct Answer

Fragmentation , Replication , Repetation

Your Answer

Fragmentation , Generalization

Multiple Choice Single AnswerQuestion

Bitmapped indexes are more suitable for data warehouse environment than for an OLTP system

Correct Answer

Bitmapped index

Your Answer

Clustered index

Select The BlankQuestion

________ is the type of pilot for early delivery with broader scope and may be integrated.

Correct Answer

Broad business pilot

Your Answer

User tool appreciation

Match The FollowingQuestion

Correct Answer

Your Answer

Data Mining

Knowledge discovery

Knowledge discovery

Metadata

Roadmap for user

Roadmap for user

Data storage

Data management

Data management

Data staging

Workbench for data

Workbench for data

Multiple Choice Single AnswerQuestion

A gene is usually comprised of hundreds of individual :-

Correct Answer

Nucleotides

Your Answer

Nucleotides

True/FalseQuestion

NUMA provides better scalability than SMP.

Correct Answer

True

Your Answer

True

Multiple Choice Single AnswerQuestion

Deviation based outlier detection identifes outliers by :-

Correct Answer

Examining character of objects in groups

Your Answer

Examining distance between objects

Select The BlankQuestion

________ is density based clustering method which computes on augumented clustering ordering for automic ordering for automatic and interactive cluster analysis

Correct Answer

DBSCAN

Your Answer

DBSCAN

Multiple Choice Single AnswerQuestion

Enterprise miner technique provides data mining algorithms including distinguishing feature as :-

Correct Answer

Advanced Statistical and advanced visualization tool

Your Answer

Advanced Statistical and classification tool

Match The FollowingQuestion

Correct Answer

Your Answer

Load Image

To correspond to target files

To correspond to target files

Constructive merge

New record supercedes

New record supercedes

Initial Load

Populating data warehouse table first time

Populating data warehouse table first time

Incremental Load

Applying ongoing changes

Applying ongoing changes

True/FalseQuestion

A process of grouping a set of physical or abstract objects into classes of similar objects is called clusiering

Correct Answer

True

Your Answer

True

Multiple Choice Single AnswerQuestion

Development and deployment of your data warehouse is joint effort between :-

Correct Answer

IT staff and user representatives

Your Answer

IT staff and user representatives

Multiple Choice Single AnswerQuestion

Attribute construction is the part of :-

Correct Answer

Transformation

Your Answer

Aggregation

Multiple Choice Single AnswerQuestion

Which of the following data warehouse component includes dependent data marts, special multidimensional database and full range of query and reporting facilities?

Correct Answer

Information Delivery component

Your Answer

Data Staging component

Multiple Choice Single AnswerQuestion

Which technique analyze experimental data?

Correct Answer

Analysis of variance

Your Answer

Analysis of variance

Select The BlankQuestion

________ function of data staging component involves many forms of combining pieces of data from different sources.

Correct Answer

Data Transformation

Your Answer

Data Transformation

Multiple Choice Multiple AnswerQuestion

Metadata is essential for IT for :-

Correct Answer

Source data structures , Data summarization

Your Answer

Source data structures , Data summarization , Aggregation

Multiple Choice Multiple AnswerQuestion

Methods for outlier detection are categorised into following approaches :-

Correct Answer

Statistical , Distance based , Deviation based

Your Answer

Statistical , Distance based , Deviation based

Multiple Choice Multiple AnswerQuestion

Data base miner provides multiple data mining algorithms including :-

Correct Answer

Discovery driven OLAP analysis , Association , Classification

Your Answer

Discovery driven OLAP analysis , Association , Classification

True/FalseQuestion

In Linear regression data are modeled to fit a straight line.

Correct Answer

True

Your Answer

True

True/FalseQuestion

Data in data warehouse cuts across application.

Correct Answer

True

Your Answer

True

Multiple Choice Single AnswerQuestion

If many indexes are needed, then on which table which option is more preferable?

Correct Answer

Splitting of tables

Your Answer

Rearranging of tables

Multiple Choice Single AnswerQuestion

Which technique is used to predict categorical response variable?

Correct Answer

Discriminant analysis

Your Answer

Discriminant analysis

Multiple Choice Multiple AnswerQuestion

Following data transformation methods are used in analysis of time series data :-

Correct Answer

Scaling , Normalization , Windows Stiching

Your Answer

Scaling , Normalization , Windows Stiching

Multiple Choice Single AnswerQuestion

Concept Description generates description for :-

Correct Answer

Charaterisation and Comparison

Your Answer

Charaterisation and Comparison

True/FalseQuestion

Data preprocessing is an important step in knowledge discovery process.

Correct Answer

True

Your Answer

True

Multiple Choice Multiple AnswerQuestion

Data Mining means :-

Correct Answer

Knowledge mining from database , Data /Pattern analysis , Data Archelogy

Your Answer

Knowledge mining from database , Data /Pattern analysis , Data Archelogy

Multiple Choice Single AnswerQuestion

What improves accuracy and speed of subsequent mining process?

Correct Answer

Integration

Your Answer

Integration

Multiple Choice Multiple AnswerQuestion

Data mining is applicable to :-

Correct Answer

Relational Database , Data Warehouse , Transaction Database

Your Answer

Relational Database , Data Warehouse , Transaction Database

LIST OF ATTEMPTED QUESTIONS AND ANSWERS

Select The BlankQuestion

________ is a summarization of general characteristics or features of a target class of data.

Correct Answer

Data Characterization

Your Answer

Data Generalization

Multiple Choice Single AnswerQuestion

The pilot which is useful for user and project team both as it touches all important functions is :-

Correct Answer

Expanded seed pilot

Your Answer

User tool appreciation pilot

Multiple Choice Single AnswerQuestion

Which of the following technique involves placing and managing related units of data in same physical block of storage

Correct Answer

Clustering

Your Answer

Clustering

Multiple Choice Multiple AnswerQuestion

History of metadata includes :-

Correct Answer

Changes to source system , Data extraction methods , Data transformation algorithm

Your Answer

Changes to source system , Data extraction methods

Multiple Choice Single AnswerQuestion

Which of the following approach requires more computation?

Correct Answer

Filter approach

Your Answer

Filter approach

True/FalseQuestion

The substantial part of historical data comes form antiquated legacy system.

Correct Answer

True

Your Answer

True

Multiple Choice Multiple AnswerQuestion

Data reduction includes :-

Correct Answer

Single value decomposition , Wavelets , Regression

Your Answer

Single value decomposition , Wavelets , Regression

Multiple Choice Single AnswerQuestion

Establish the importance of data quality, Form data quality steering committee, Institute a data quality framework, Assign roles and responsibilities. These are the steps of :-

Correct Answer

Data purification

Your Answer

Data quality control

Multiple Choice Single AnswerQuestion

Which is the typical example of Grid based clustering method

Correct Answer

STING

Your Answer

STING

Match The FollowingQuestion

Correct Answer

Your Answer

Normalization

Scattered data

Constructing small units of data

Smoothing

Removal of noisy data

Removal of noisy data

Aggregation

Summary operations

Constructing new attributes

Generalization

Data hierarchies

Data hierarchies

True/FalseQuestion

Bitmapped indexing does not apply to fault tables.

Correct Answer

True

Your Answer

True

Multiple Choice Multiple AnswerQuestion

For processing metadata in informal delivery area, data can be referred back for :-

Correct Answer

Source data configuration , Data structure , Data transformation

Your Answer

Source data configuration , Data structure , Data transformation

True/FalseQuestion

The precision measure is the % of retrieved documents that are in fact relevant to query.

Correct Answer

True

Your Answer

False

Select The BlankQuestion

Analysis of frequent sequential patterns is important in analysis ________ in generic sequence.

Correct Answer

Dismilarity and similarity

Your Answer

Similarity

Select The BlankQuestion

________ is the clustering method which encounters difficultes regarding the selection of merge/split points

Correct Answer

Hierachical

Your Answer

Hierachical

Multiple Choice Single AnswerQuestion

Following clustering method is classified as being agglomerative or divisive :-

Correct Answer

Grid based

Your Answer

Grid based

Multiple Choice Multiple AnswerQuestion

Normalization improves :-

Correct Answer

Efficiency , Accuracy

Your Answer

Efficiency , Accuracy

Multiple Choice Single AnswerQuestion

A Wavelet transformation is :-

Correct Answer

Single processing Technique that decomposes signals into different frequency subbands

Your Answer

Single processing Technique that decomposes signals into different frequency subbands

Multiple Choice Single AnswerQuestion

The Clustering method DBSCAN stands for :-

Correct Answer

Desity Based Spatial clustering of Application with Noise

Your Answer

Desity Based Spatial clustering of Application with Noise

Select The BlankQuestion

________ can store aggregate and detail data at varying levels of resolution or abstraction.

Correct Answer

Index tree

Your Answer

Index tree

Multiple Choice Single AnswerQuestion

Behavioral data of objects can be derived by the application of :-

Correct Answer

Method

Your Answer

Method

Select The BlankQuestion

________ is the type of pilot for early delivery with broader scope and may be integrated.

Correct Answer

Broad business pilot

Your Answer

Broad business pilot

Multiple Choice Multiple AnswerQuestion

Metadata types can be classified as :-

Correct Answer

Business metadata , Technical metadata

Your Answer

Business metadata , Technical metadata

Multiple Choice Single AnswerQuestion

Simple matching approach is used for computing disimilarity between two objects for :-

Correct Answer

Nominal variable

Your Answer

Nominal variable

Multiple Choice Multiple AnswerQuestion

The different analysis tools which are useful to detect unusual patterns such as large amount of cash flow at certain period by certain group of people are :-

Correct Answer

Linkage analysis tool , Outlier analysis tool , Sequential pattern analysis tool

Your Answer

Linkage analysis tool , Outlier analysis tool , Sequential pattern analysis tool

Multiple Choice Single AnswerQuestion

When DDL statements are created using database software, so to create an index system creates :-

Correct Answer

B-Tree index

Your Answer

B-Tree index

Multiple Choice Multiple AnswerQuestion

Data processing techniques are :-

Correct Answer

Cleansing , Integration , Transformation

Your Answer

Cleansing , Integration , Transformation

Match The FollowingQuestion

Correct Answer

Your Answer

Load Utility

High performance data loading, recovery

High performance data loading, recovery

Query Governer

Abort runaway query

Abort runaway query

Query Optimizer

Parsing, optimizing query

Parsing, optimizing query

Query Management

Balancing extraction of query

Balancing extraction of query

Select The BlankQuestion

Indexed ________ engines search index, web pages and build huge keyword based indices which help to search sets of web pages containing certain keywords

Correct Answer

Web Search Engines

Your Answer

Web Search Engines

True/FalseQuestion

To detect money laundering and other financial crimes, it is important to integrate information for multiple databases.

Correct Answer

True

Your Answer

True

Select The BlankQuestion

________ is the time consuming and less feasible approach for filling missing values.

Correct Answer

Filling missing values manually

Your Answer

Filling missing values manually

Multiple Choice Single AnswerQuestion

Which from the following is used for classification and prediction?

Correct Answer

Regression trees

Your Answer

Regression trees

Multiple Choice Multiple AnswerQuestion

Multimedia database stores and manages large collection of database such as :-

Correct Answer

Audio and Video , Sequence data , Text Markup and linkage

Your Answer

Audio and Video , Sequence data

Select The BlankQuestion

________ is an alternative aggolomerative hierarchical clustering algorithm.

Correct Answer

ROCK

Your Answer

ROCK

Select The BlankQuestion

________ architecture is more concerned with data access than memory access.

Correct Answer

MPP

Your Answer

MPP

True/FalseQuestion

Architecture comes first, tools follows it.

Correct Answer

True

Your Answer

True

True/FalseQuestion

Task of selection in data transformation forms part of extraction function.

Correct Answer

True

Your Answer

False

LIST OF ATTEMPTED QUESTIONS AND ANSWERS

True/FalseQuestionMatching the choice of DBMS with selected server hardware is not important for warehouse.Correct AnswerFalseYour AnswerFalse

Match The FollowingQuestionCorrect AnswerYour AnswerMetadataRoadmap for userRoadmap for userData storageData managementData managementData stagingWorkbench for dataWorkbench for dataData MiningKnowledge discoveryKnowledge discovery

True/FalseQuestionDatabase systems, data warehouse system and world wide web have become mainstream information system.Correct AnswerTrueYour AnswerTrue

Multiple Choice Single AnswerQuestionBitmapped indexes are more suitable for data warehouse environment than for an OLTP systemCorrect AnswerBitmapped indexYour AnswerBitmapped index

Multiple Choice Single AnswerQuestionThe big difference between data warehouse and any operational system is its :-Correct AnswerUsageYour AnswerUsage

Multiple Choice Single AnswerQuestionOne major effort within data transformation is :-Correct AnswerImprovement of data qualityYour AnswerAnalysis of data quality

Multiple Choice Single AnswerQuestionWhich of the following technique is used to display group summary statistics?Correct AnswerQuality controlYour AnswerSurvival analysis

Select The BlankQuestion________ platform is the platform on which the data warehouse DBMS runs and database exist.Correct AnswerData storageYour AnswerData storage

Multiple Choice Multiple AnswerQuestionClass Comparison is performed through following steps :-Correct AnswerData Collection , Dimension relevance analysis , Presentation of derived comparison Your AnswerData Collection , Dimension relevance analysis , Presentation of derived comparison

Select The BlankQuestionIt is good practice to drop ________ before initial load.Correct AnswerIndexYour AnswerIndex

Select The BlankQuestion________ is the time consuming and less feasible approach for filling missing values.Correct AnswerFilling missing values manuallyYour AnswerFilling missing values manually

Multiple Choice Multiple AnswerQuestionBasic Heuristic method of attribute subset selection includes following techniques :-Correct AnswerStepwise forward selection , Stepwise backward elimination Your AnswerStepwise forward selection , Stepwise backward elimination , Combination of forward selection and backward elimination

True/FalseQuestionFor maintaining the quality of data proper naming conventions help to make data elements well understood by users.Correct AnswerTrueYour AnswerTrue

Select The BlankQuestionIn ________ duplicate sub trees exist within the tree.Correct AnswerRepetitionYour AnswerRepetition

Select The BlankQuestionThe technique of ________ enables concurrent input/output operations and improves file's access performance substantially.Correct AnswerFile stripingYour AnswerFile striping

Select The BlankQuestion________ does not handle categorical attributes.Correct AnswerCUREYour AnswerCURE

Select The BlankQuestionCreating ________is violation of Normalization principles.Correct AnswerArrayYour AnswerArray

True/FalseQuestionData in warehouse is primarily for query.Correct AnswerTrueYour AnswerTrue

Multiple Choice Multiple AnswerQuestionPreprocessing steps of data in order to help improve accuracy, efficiency and scalability of classification & prediction are :-Correct AnswerData Cleaning , Relevance Analysis , Data Transformation Your AnswerData Cleaning , Relevance Analysis , Data Transformation

Multiple Choice Single AnswerQuestionWhich task in data transformation includes types of data manipulation on selected parts of source data?Correct AnswerSplitting/JoiningYour AnswerSplitting/Joining

True/FalseQuestionBusiness metadata is like a roadmap or easy to use information directory showing contents and how to get there.Correct AnswerTrueYour AnswerTrue

True/FalseQuestionData error discovery and data correction are two parts of data cleansing process.Correct AnswerTrueYour AnswerFalse

Multiple Choice Multiple AnswerQuestionThe dimensions of spatial data cube are :-Correct AnswerNon- spatial dimension , Spatial to non spatial , Spatial to spatial Your AnswerNon- spatial dimension , Spatial to non spatial , Spatial to spatial

Select The BlankQuestion________ technique is known as snapshot differential technique.Correct AnswerCapture based on comparing filesYour AnswerCapture based on comparing files

Multiple Choice Multiple AnswerQuestionThe benefits of improved data quality are :-Correct AnswerBetter customer service , Improved productivity , Reliable strategic decision making Your AnswerBetter customer service , Improved productivity , Reliable strategic decision making

Multiple Choice Single AnswerQuestionWhich technique of data extraction is available to non relational databases?Correct AnswerCapture through transaction logYour AnswerCapture of static data

True/FalseQuestionNoise in data means error or variance in measured variable.Correct AnswerTrueYour AnswerTrue

Multiple Choice Multiple AnswerQuestionData mining at home can help to mine data related to :-Correct AnswerMedical History , Cancer , Chromosome abnormalities Your AnswerMedical History , Chromosome abnormalities , Physiological conditions

True/FalseQuestionData Mining refers to extracting knowledge from larger amount of data.Correct AnswerTrueYour AnswerTrue

Multiple Choice Single AnswerQuestionSimple matching approach is used for computing disimilarity between two objects for :-Correct AnswerNominal variableYour AnswerNominal variable

Multiple Choice Multiple AnswerQuestionFollowing are the reasons for getting data polluted :-Correct AnswerData aging , Input errors , Fraud Your AnswerData aging , Input errors , Processing errors

Select The BlankQuestion________ is the type of pilot for early delivery with broader scope and may be integrated.Correct AnswerBroad business pilotYour AnswerBroad business pilot

Multiple Choice Multiple AnswerQuestionFollowing are the issues to consider during data integration :-Correct AnswerSchema integration , Redundancy , Detection and resolution of data values Your AnswerSchema integration , Redundancy , Detection and resolution of data values

Match The FollowingQuestionCorrect AnswerYour AnswerRough set ApproachNoisy DataPreviously unseen datak-Nearest Neighbour ClassifiersLearning AnalogyNoisy DataClass based TestingInstanace BasedLearning AnalogyGeneric AlgorithmsNatural EvolutionNatural Evolution

Multiple Choice Single AnswerQuestionWhen DDL statements are created using database software, so to create an index system creates :-Correct AnswerB-Tree indexYour AnswerB-Tree index

True/FalseQuestionThe difficulties encountered in data transformation function relate to heterogeneity of the source system.Correct AnswerTrueYour AnswerFalse

True/FalseQuestionData mining is not that much powerful tool for vast data such as gene sequences in DNA analysis.Correct AnswerTrueYour AnswerTrue

Multiple Choice Single AnswerQuestionWhen current extent on disk storage for a file is full, DBMS finds new extent and allows an insertion of new record is known as :-Correct AnswerDynamic extensionYour AnswerDynamic extension

Multiple Choice Multiple AnswerQuestionFollowing are the types of normalization :-Correct AnswerMin-Max Normalization , Z-score normalization , Normalization by scaling Your AnswerMin-Max Normalization , Z-score normalization , Normalization by scaling

Multiple Choice Multiple AnswerQuestionIn generation of numerical hierarchies for cluster analysis following techniques are useful :-Correct AnswerBinning , Histogram analysis , Clustering Your AnswerBinning , Histogram analysis , Segmentation

Select The BlankQuestion________ is an alternative aggolomerative hierarchical clustering algorithm.Correct AnswerROCKYour AnswerROCK

Multiple Choice Multiple AnswerQuestionGeneralized linear model includes :-Correct AnswerLogistic regression , Poisson regression Your AnswerLogistic regression , Poisson regression

Multiple Choice Single AnswerQuestionInherently Architected, Single, central storage of data about content, Centralized rules and control, Seek quick result, these are the advantages of which type of data extraction?Correct AnswerTop down approachYour AnswerTop down approach

Multiple Choice Single AnswerQuestionQueries run faster to find exact match using which type of indexing?Correct AnswerClustered indexYour AnswerClustered index

LIST OF ATTEMPTED QUESTIONS AND ANSWERSSelect The BlankQuestion: ________ function of data staging component involves many forms of combining pieces of data from different sources.

Correct Answer: Data Transformation

Your Answer: Data Transformation

Multiple Choice Multiple AnswerQuestion: The Main areas of Data Warehouse are :-

Correct Answer: Data acquisition , Data Storage , Information Delivery

Your Answer: Data acquisition , Data Storage , Information Delivery

Select The BlankQuestion: Data cleansing and ________ methods of data mining helps in integration of genetic data and construction of warehouse for genetic data analysis.

Correct Answer: Integration

Your Answer: Integration

Multiple Choice Multiple AnswerQuestion: The dimensions of spatial data cube are :-

Correct Answer: Non- spatial dimension , Spatial to non spatial , Spatial to spatial

Your Answer: Non- spatial dimension , Spatial to non spatial , Spatial to spatial

Multiple Choice Single AnswerQuestion: In data reduction, the cluster representations of data are used to :-

Correct Answer: Replace data

Your Answer: Represent actual data

Multiple Choice Multiple AnswerQuestion: Distinguishing characteristics of data warehouse architecture are :-

Correct Answer: Different Objective Scope , Data Content , Flexible and Dynamic

Your Answer: Different Objective Scope , Complete Analysis and Quick Response , Flexible and Dynamic

Select The BlankQuestion: In data warehouse architecture, the ________ component interleaves with and connects other components.

Correct Answer: Metadata

Your Answer: Metadata

Multiple Choice Multiple AnswerQuestion: Methods for outlier detection are categorised into following approaches :-

Correct Answer: Statistical , Distance based , Deviation based

Your Answer: Statistical , Distance based , Deviation based

True/FalseQuestion: Metadata describes all the pertinent aspects of the data in data warehouse.

Correct Answer: True

Your Answer: True

Multiple Choice Multiple AnswerQuestion: Financial data called for banking and financial industry are often relatively :-

Correct Answer: Complete , Reliable , High Quality

Your Answer: Complete , Reliable , High Quality

Multiple Choice Multiple AnswerQuestion: Classification and Prediction have following applications :-

Correct Answer: Credit approval , Medical Diagnosis , Performance Prediction

Your Answer: Credit approval , Medical Diagnosis , Performance Prediction

True/FalseQuestion: Data Integration means multiple resourses may be combined.

Correct Answer: True

Your Answer: True

Select The BlankQuestion: ________ can store aggregate and detail data at varying levels of resolution or abstraction.

Correct Answer: Index tree

Your Answer: Multidimensional index tree

True/FalseQuestion: Moving data into staging area and performing data transformation function is a part of data acquisition.

Correct Answer: True

Your Answer: True

True/FalseQuestion: Lower the level of detail, finer the data granularity.

Correct Answer: True

Your Answer: True

Select The BlankQuestion: ________ is an alternative aggolomerative hierarchical clustering algorithm.

Correct Answer: ROCK

Your Answer: ROCK

Multiple Choice Single AnswerQuestion: Real world databases are highly susceptible to noisy, missing and inconsistent data due to :-

Correct Answer: Huge size of data

Your Answer: Huge size of data