Marks : 2
SCDL 4th Semester Data Mining
LIST OF ATTEMPTED QUESTIONS AND ANSWERS
Select The BlankQuestion
Semantic integration of ________ genome databases is an important task of DNA analysis.
Correct Answer
Heterogeneous and distributed
Your Answer
Heterogeneous and distributed
Multiple Choice Single AnswerQuestion
Fast processing is the main advantage of which of the following methods?
Correct Answer
Grid based
Your Answer
Partitioning based
Select The BlankQuestion
With the widespread adoption of ________, a real-time connection is viable for the data warehouse.
Correct Answer
TCP/IP
Your Answer
HTTP
Select The BlankQuestion
________ are responsible for running queries and reports against data warehouse tables.
Correct Answer
End users
Your Answer
End users
Multiple Choice Multiple AnswerQuestion
Advantages of Wavelet transformation for clustering are :-
Correct Answer
Unsupervised clustering , Detection of cluster for accuracy , Clustering is fast
Your Answer
Unsupervised clustering , Clustering is fast , Decomposition of cluster for accuracy
Multiple Choice Single AnswerQuestion
Query tool is meant for :-
Correct Answer
Data acquisition
Your Answer
Information delivery
Multiple Choice Single AnswerQuestion
Which of the following functions involves data cleaning, data standardizing and summarizing?
Correct Answer
Transforming data
Your Answer
Storing data
Multiple Choice Multiple AnswerQuestion
Which of the following clustering analysis methods use a multiresolution approach?
Correct Answer
STING , Wave Cluster
Your Answer
STING , Wave Cluster
Multiple Choice Single AnswerQuestion
Which of the following clustering methods computes an augmented cluster ordering?
Correct Answer
OPTICS
Your Answer
CLIQUE
Multiple Choice Multiple AnswerQuestion
The time-variant nature of the data in a data warehouse :-
Correct Answer
Allows for analysis of the past , Relate information to the present , Enables forecasts for the future
Your Answer
Allows for analysis of the past , Relate information to the present , Enables forecasts for the future
True/FalseQuestion
The structure that brings all the components together is known as Architecture.
Correct Answer
True
Your Answer
True
Multiple Choice Multiple AnswerQuestion
Data compression is to compress the given data by encoding in terms of :-
Correct Answer
Association rule , Decision tree , Cluster
Your Answer
Bytes , Cluster
Multiple Choice Multiple AnswerQuestion
The different definitions of metadata are :-
Correct Answer
Data about data , Catalog of data , Data warehouse roadmap
Your Answer
Data about data , Catalog of data , Data warehouse roadmap
True/FalseQuestion
A distinct feature of DBMiner is its data cube based online analytical mining.
Correct Answer
True
Your Answer
False
Multiple Choice Single AnswerQuestion
Association rules mining is based on :-
Correct Answer
Clustering and Employing rules for classification
Your Answer
Clustering and Employing rules for classification
True/FalseQuestion
A distinguishing feature of Clementine is its object oriented extended module interface.
Correct Answer
True
Your Answer
True
Select The BlankQuestion
________ includes Normalization and Aggregation as data preprocessing procedures.
Correct Answer
Data transformation
Your Answer
Data transformation
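Normalization, one of the two procedures named in the question above, can be illustrated with a minimal sketch of min-max scaling. The function name `min_max_normalize` and the sample values are assumptions made for this example, and the sketch assumes the input values are not all equal.

```python
def min_max_normalize(values, new_min=0.0, new_max=1.0):
    """Linearly rescale values into the range [new_min, new_max]."""
    old_min, old_max = min(values), max(values)
    span = old_max - old_min  # assumed non-zero in this sketch
    return [(v - old_min) / span * (new_max - new_min) + new_min for v in values]


# Hypothetical income values; the smallest maps to 0.0 and the largest to 1.0.
scaled = min_max_normalize([12000, 73600, 98000])
print(scaled)
```

The smallest value always maps to `new_min` and the largest to `new_max`, so attributes measured on very different scales become comparable.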
True/FalseQuestion
Removing noise from data is called smoothing.
Correct Answer
True
Your Answer
True
Multiple Choice Single AnswerQuestion
Data matrix is :-
Correct Answer
Object by variable structure
Your Answer
Object by variable structure
True/FalseQuestion
Data updates are commonplace in an operational database.
Correct Answer
True
Your Answer
True
True/FalseQuestion
In a decision tree, internal nodes are denoted by ovals and leaf nodes are denoted by rectangles.
Correct Answer
False
Your Answer
True
True/FalseQuestion
From a data warehouse perspective, data mining can be viewed as an advanced stage of Online Analytical Processing.
Correct Answer
True
Your Answer
True
Match The FollowingQuestion
Correct Answer
Your Answer
Disparate data
Production data
Query and analysis
Non volatile data
Query and analysis
Archive data
Data granularity
Level of detail
Level of detail
Data from external source
External data
External data
Multiple Choice Multiple AnswerQuestion
In the physical design of a data warehouse, administration provides features like :-
Correct Answer
Avoiding reorganizing of tables , Support backup and recovery , Query processing
Your Answer
Support backup and recovery , Manage store area , Query processing
Select The BlankQuestion
________ is the user who has system access privileges but no database administration privileges and no privileges on tables and views.
Correct Answer
Network administrator
Your Answer
End user
Multiple Choice Multiple AnswerQuestion
Data mining functionalities are :-
Correct Answer
Characterization and Discrimination , Association Analysis , Cluster Analysis
Your Answer
Association Analysis , Cluster Analysis , Time series Data Analysis
Select The BlankQuestion
________ is the dimension of a database in which primitive level data are spatial but the generalization becomes non spatial.
Correct Answer
Spatial to non spatial
Your Answer
Spatial to non spatial
Multiple Choice Multiple AnswerQuestion
The source data component may be grouped into the following categories :-
Correct Answer
Production Data , Internal External Data
Your Answer
Internal External Data , Analyzed data , Non Analyzed data
Select The BlankQuestion
________ technique is the statistical technique for analyzing data.
Correct Answer
Time series
Your Answer
Time series
Multiple Choice Multiple AnswerQuestion
The strategies for data reduction are :-
Correct Answer
Data aggregation , Dimension reduction , Numerosity reduction
Your Answer
Data aggregation , Dimension reduction , Numerosity reduction
Multiple Choice Single AnswerQuestion
Classification rules are extracted from :-
Correct Answer
Decision Tree
Your Answer
Root-Node
Match The FollowingQuestion
Correct Answer
Your Answer
Data Mining
Knowledge discovery
Knowledge discovery
Metadata
Roadmap for user
Details of summary
Data storage
Data management
Data management
Data staging
Workbench for data
Workbench for data
True/FalseQuestion
Data cube stores multidimensional aggregate information.
Correct Answer
True
Your Answer
True
Select The BlankQuestion
________ is the method used to predict the value of a response variable from one or more variables.
Correct Answer
Regression
Your Answer
Regression
Select The BlankQuestion
________ databases are one of the most popularly available and rich information repositories.
Correct Answer
Relational
Your Answer
Object oriented
True/FalseQuestion
COBWEB is a method of incremental conceptual clustering.
Correct Answer
True
Your Answer
True
Multiple Choice Single AnswerQuestion
Many methods for data smoothing are also methods for data reduction involving :-
Correct Answer
Discretization
Your Answer
Clustering
Multiple Choice Single AnswerQuestion
Dimensionality reduction reduces the data set size by removing :-
Correct Answer
Irrelevant attributes
Your Answer
Irrelevant attributes
Multiple Choice Single AnswerQuestion
The assumption that the effect of one attribute value on a given class is independent of the values of the other attributes is called :-
Correct Answer
Value independence
Your Answer
Class Conditional independence
Multiple Choice Single AnswerQuestion
Which of the following are special programs that are stored in the database and fired when certain predefined actions occur?
Correct Answer
Triggers
Your Answer
Triggers
Select The BlankQuestion
A web server usually registers a ________ entry for every access of a web page.
Correct Answer
Weblog
Your Answer
Log
Multiple Choice Single AnswerQuestion
Bayes Theorem is :-
Correct Answer
P(H|X) = P(X|H)P(H)/P(X)
Your Answer
P(H|X) = P(X|H)P(H)/P(X)
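As a worked illustration of Bayes' theorem, P(H|X) = P(X|H)P(H)/P(X), the sketch below computes a posterior from a prior and two likelihoods. The function name `posterior` and all the probabilities are made up for this example. P(X) is expanded with the law of total probability over H and its complement.

```python
def posterior(prior_h, likelihood_x_given_h, likelihood_x_given_not_h):
    """Return P(H|X) = P(X|H) * P(H) / P(X)."""
    # P(X) = P(X|H) P(H) + P(X|not H) P(not H)
    evidence = (likelihood_x_given_h * prior_h
                + likelihood_x_given_not_h * (1.0 - prior_h))
    return likelihood_x_given_h * prior_h / evidence


# Hypothetical numbers: P(H) = 0.3, P(X|H) = 0.8, P(X|not H) = 0.2.
p = posterior(prior_h=0.3, likelihood_x_given_h=0.8, likelihood_x_given_not_h=0.2)
print(round(p, 4))
```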
True/FalseQuestion
Visual display can help give the user a clear impression and overview of the data characteristics in a database.
Correct Answer
True
Your Answer
True
Multiple Choice Single AnswerQuestion
Which of the following clustering methods is based on a set of density distribution functions?
Correct Answer
DBSCAN
Your Answer
DBSCAN
Multiple Choice Multiple AnswerQuestion
Metadata in a data warehouse falls into the following categories :-
Correct Answer
Operational Metadata , Extraction and Transformation metadata , End-user Metadata
Your Answer
Operational Metadata , Extraction and Transformation metadata , End-user Metadata
Multiple Choice Multiple AnswerQuestion
Knowledge discovery process includes :-
Correct Answer
Data Cleaning , Data Integration , Data Selection
Your Answer
Data Cleaning , Data Integration , Data Selection
Select The BlankQuestion
Human beings have around ________ genes.
Correct Answer
100000
Your Answer
1000000
Select The BlankQuestion
Creating ________ is a violation of normalization principles.
Correct Answer
Array
Your Answer
Array
True/FalseQuestion
Data mining refers to extracting knowledge from large amounts of data.
Correct Answer
True
Your Answer
True
Multiple Choice Single AnswerQuestion
Which of the following grid-based clustering methods exploits statistical information?
Correct Answer
STING
Your Answer
CLIQUE
Select The BlankQuestion
In ________ type smoothing, minimum and maximum values in given bin are identified as bin boundaries.
Correct Answer
Smoothing by bin boundaries
Your Answer
Smoothing by medians
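Smoothing by bin boundaries, as described in the question above, can be sketched as follows. The data are sorted and split into equal-frequency bins, and each value is replaced by whichever of its bin's minimum or maximum is closer. The function name, the sample values, and the rule that ties go to the lower boundary are assumptions made for this example.

```python
def smooth_by_bin_boundaries(values, bin_size):
    """Equal-frequency binning, then snap each value to the nearer bin boundary."""
    ordered = sorted(values)
    smoothed = []
    for start in range(0, len(ordered), bin_size):
        bin_values = ordered[start:start + bin_size]
        low, high = bin_values[0], bin_values[-1]  # bin boundaries
        for v in bin_values:
            # Ties go to the lower boundary (an arbitrary choice for this sketch).
            smoothed.append(low if v - low <= high - v else high)
    return smoothed


result = smooth_by_bin_boundaries([4, 8, 15, 21, 21, 24, 25, 28, 34], bin_size=3)
print(result)  # [4, 4, 15, 21, 21, 24, 25, 25, 34]
```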
Select The BlankQuestion
________ can store aggregate and detail data at varying levels of resolution or abstraction.
Correct Answer
Index tree
Your Answer
R-Tree
Select The BlankQuestion
________ is the platform for complex data transformation for the purpose of cleansing the data.
Correct Answer
Separate optimal Platform
Your Answer
Legacy platform
Multiple Choice Multiple AnswerQuestion
SMP provides the features like :-
Correct Answer
Each node has access to common set of disks , Controllers which are accessible to all processors , Each processor has full access to the shared memory through a common bus
Your Answer
Controllers which are accessible to all processors , Each processor has full access to the shared memory through a common bus , It is cluster of nodes
Multiple Choice Single AnswerQuestion
In immediate data extraction, data capture through the transaction log uses the transaction logs maintained for :-
Correct Answer
Recovery from failure
Your Answer
Recovery from failure
Multiple Choice Multiple AnswerQuestion
In data storage area , DBA uses metadata for processes of :-
Correct Answer
Backup , Recovery , Tuning Database
Your Answer
Backup , Recovery , Management
Multiple Choice Multiple AnswerQuestion
Foundation infrastructure of warehouse includes many elements such as :-
Correct Answer
Basic Computing platform , Hardware and operating system , DBMS and Query
Your Answer
Basic Computing platform , DBMS and Query , Query processing components
Match The FollowingQuestion
Correct Answer
Your Answer
Data producer
Responsible for data quality
Responsible for data quality
Domain values
Prevalent problem
Foreign key preserved
Update security
Prevention of unauthorized updates
Prevalent problem
Referential integrity
Foreign key preserved
Prevention of unauthorized updates
Select The BlankQuestion
________ is a density-based clustering method which computes an augmented cluster ordering for automatic and interactive cluster analysis.
Correct Answer
DBSCAN
Your Answer
DBSCAN
Multiple Choice Multiple AnswerQuestion
Building blocks of Data Warehouse are :-
Correct Answer
Source Data , Data Staging , Management and Control
Your Answer
Data Staging , Data Manager , Management and Control
True/FalseQuestion
All data extraction, transformation, integration and staging jobs run on selected hardware under chosen operating system.
Correct Answer
True
Your Answer
True
Multiple Choice Single AnswerQuestion
Real world databases are highly susceptible to noisy, missing and inconsistent data due to :-
Correct Answer
Huge size of data
Your Answer
Huge size of data
Match The FollowingQuestion
Correct Answer
Your Answer
Clustering tool
To group different cases
To detect unusual attribute
Data visualization tool
Transaction activity using graph
To filter unrelated attributes
Linkage analysis tool
To identify links
To group different cases
Classification tool
To filter unrelated attributes
To identify links
Multiple Choice Multiple AnswerQuestion
Generalized linear model includes :-
Correct Answer
Logistic regression , Poisson regression
Your Answer
Poisson regression , Linear regression , Polynomial Regression
True/FalseQuestion
Metadata acts like a nerve center.
Correct Answer
True
Your Answer
False
Multiple Choice Single AnswerQuestion
OLAP is used for :-
Correct Answer
Online Analytical Processing
Your Answer
Online Application Processing
Multiple Choice Multiple AnswerQuestion
The dimensions of spatial data cube are :-
Correct Answer
Non- spatial dimension , Spatial to non spatial , Spatial to spatial
Your Answer
Non- spatial dimension , Spatial to non spatial , Spatial to spatial
Multiple Choice Single AnswerQuestion
Maintenance of cache consistency is the limitation of :-
Correct Answer
MPP
Your Answer
NUMA
Select The BlankQuestion
In ________ duplicate sub trees exist within the tree.
Correct Answer
Repetition
Your Answer
Replication
Select The BlankQuestion
Index-based ________ engines search the web, index web pages, and build huge keyword-based indices which help to locate sets of web pages containing certain keywords.
Correct Answer
Web Search
Your Answer
Web Search
Multiple Choice Single AnswerQuestion
Redundancies can be detected by :-
Correct Answer
Correlation analysis
Your Answer
Coherent analysis
True/FalseQuestion
To detect money laundering and other financial crimes, it is important to integrate information for multiple databases.
Correct Answer
True
Your Answer
True
Multiple Choice Multiple AnswerQuestion
Common areas of application for mixed effect model includes :-
Correct Answer
Multiple data , Repeated measures data , Block designs
Your Answer
Multiple data , Dimensional data , Block designs
Select The BlankQuestion
In data ________, data encoding or transformations are applied to obtain reduced or compressed representation.
Correct Answer
Compression
Your Answer
Compression
Multiple Choice Single AnswerQuestion
Grouped data can be analyzed with the technique :-
Correct Answer
Mixed effect model
Your Answer
Factor analysis
Select The BlankQuestion
________ is the navigational map of data warehouse.
Correct Answer
End user Metadata
Your Answer
Extraction Metadata
Multiple Choice Multiple AnswerQuestion
Business metadata is useful for :-
Correct Answer
Providing support to end users , For external view of data , Provides technical support to search data
Your Answer
Providing support to end users , For external view of data , Provides technical support to search data
True/FalseQuestion
The elements of warehouse infrastructure are classified into operational and physical infrastructure.
Correct Answer
True
Your Answer
True
Multiple Choice Single AnswerQuestion
Data reduction by volume can be used for data representation using which type of reduction?
Correct Answer
Numerosity reduction
Your Answer
Histograms
True/FalseQuestion
Descriptive mining performs inference on current data, while predictive mining characterizes the general properties of data in the database.
Correct Answer
False
Your Answer
False
Multiple Choice Single AnswerQuestion
Queries run faster to find exact match using which type of indexing?
Correct Answer
Clustered index
Your Answer
Sequential index
Multiple Choice Single AnswerQuestion
Data can be smoothed by fitting the data to a function such as :-
Correct Answer
Regression
Your Answer
Clustering
True/FalseQuestion
Data classification is a two-step process in which the first step includes classification of the model and in the second step the model describes a set of data.
Correct Answer
False
Your Answer
True
Select The BlankQuestion
In data warehouse architecture, the ________ component interleaves with and connects other components.
Correct Answer
Metadata
Your Answer
Metadata
True/FalseQuestion
Legacy data resides on Hierarchical or Network database.
Correct Answer
True
Your Answer
True
Multiple Choice Multiple AnswerQuestion
Metadata is essential for IT for :-
Correct Answer
Source data structures , Data summarization
Your Answer
Web enabling , Source data structures , Data summarization
Multiple Choice Multiple AnswerQuestion
Financial data collected in the banking and financial industry are often relatively :-
Correct Answer
Complete , Reliable , High Quality
Your Answer
Complete , Reliable , Correct
Select The BlankQuestion
________ option of warehouse architecture provides incremental growth.
Correct Answer
Cluster
Your Answer
Cluster
Match The FollowingQuestion
Correct Answer
Your Answer
Operating systems compatibility
Security, reliability, availability
Security, reliability, availability
Data Acquisition
Data Extraction, Transformation, cleansing, integration
Data Extraction, Transformation, cleansing, integration
Data Storage
Data loading , Archiving
Data loading , Archiving
Information Delivery
Report generation, query processing and complex analysis
Report generation, query processing and complex analysis
True/FalseQuestion
A cluster is a collection of data objects that are similar to one another within the same cluster and dissimilar to objects in other clusters.
Correct Answer
True
Your Answer
True
Multiple Choice Single AnswerQuestion
Which of the following method creates copies of data in distributed environment?
Correct Answer
Replication
Your Answer
Replication
Multiple Choice Single AnswerQuestion
Which method captures data at the source and is therefore quite reliable?
Correct Answer
Capture by database Triggers
Your Answer
Capture in source application
Multiple Choice Single AnswerQuestion
For Banking and financial data which type of analysis is used?
Correct Answer
Multidimensional
Your Answer
Relational
Multiple Choice Single AnswerQuestion
Which of the following methods for regression is used on sparse data :-
Correct Answer
Regression and log-linear model
Your Answer
Regression and transformation
Multiple Choice Multiple AnswerQuestion
Following data transformation methods are used in analysis of time series data :-
Correct Answer
Scaling , Normalization , Window Stitching
Your Answer
Scaling , Normalization , Window Stitching
Select The BlankQuestion
________ function of data staging component involves many forms of combining pieces of data from different sources.
Correct Answer
Data Transformation
Your Answer
Data Loading
Multiple Choice Single AnswerQuestion
Real world databases are highly susceptible to noisy, missing and inconsistent data due to :-
Correct Answer
Huge size of data
Your Answer
Relational data
Select The BlankQuestion
Creating ________ is a violation of normalization principles.
Correct Answer
Array
Your Answer
Structure
Multiple Choice Multiple AnswerQuestion
The tools of metadata falls in following categories :-
Correct Answer
Development tools for IT professional , Information access tool for End user
Your Answer
Access tool , Development tools for IT professional , Information access tool for End user
True/FalseQuestion
Architecture comes first, tools follows it.
Correct Answer
True
Your Answer
True
Select The BlankQuestion
________ is an alternative agglomerative hierarchical clustering algorithm.
Correct Answer
ROCK
Your Answer
ROKE
Multiple Choice Multiple AnswerQuestion
In data storage area , DBA uses metadata for processes of :-
Correct Answer
Backup , Recovery , Tuning Database
Your Answer
Backup , Recovery , Tuning Database
True/FalseQuestion
Data cleansing means removing noisy and inconsistent data.
Correct Answer
True
Your Answer
True
Multiple Choice Multiple AnswerQuestion
Data processing techniques are :-
Correct Answer
Cleansing , Integration , Transformation
Your Answer
Cleansing , Transformation , Collection
Multiple Choice Single AnswerQuestion
Data can be smoothed by fitting the data to a function such as :-
Correct Answer
Regression
Your Answer
Clustering
Multiple Choice Single AnswerQuestion
Deviation based outlier detection identifies outliers by :-
Correct Answer
Examining character of objects in groups
Your Answer
Examining objects in group
Multiple Choice Single AnswerQuestion
Data partitioning, data clustering are the techniques for :-
Correct Answer
Performance enhancement
Your Answer
Performance enhancement
Multiple Choice Multiple AnswerQuestion
Following are the issues to consider during data integration :-
Correct Answer
Schema integration , Redundancy , Detection and resolution of data values
Your Answer
Schema integration , Redundancy , Inconsistency
True/FalseQuestion
Management architectural component manages and controls data acquisition functions.
Correct Answer
True
Your Answer
True
Match The FollowingQuestion
Correct Answer
Your Answer
Data loading tool
Primary key generation
Primary key generation
Data modeling tool
Reverse Engineering capabilities
Reverse Engineering capabilities
Data Extraction tool
Bulk extraction for full refresh
Bulk extraction for full refresh
Data transformation tool
Default values
Replication
Multiple Choice Multiple AnswerQuestion
DNA sequences are comprised of :-
Correct Answer
Adenine , Guanine , Thymine
Your Answer
Adenine , Cytosine , Guanine
Multiple Choice Single AnswerQuestion
Large number of indexes affects the loading process because :-
Correct Answer
Indexes are created for new records
Your Answer
Indexes are created for old records
Select The BlankQuestion
The technique of ________ enables concurrent input/output operations and improves file's access performance substantially.
Correct Answer
File striping
Your Answer
Data migration
Multiple Choice Multiple AnswerQuestion
The warehouse operational infrastructure that supports each architectural component consists of :-
Correct Answer
People , Procedures , Management software
Your Answer
People , Procedures , Management software
True/FalseQuestion
In the pruning method, postpruning requires more computation than prepruning, yet generally leads to a more reliable tree.
Correct Answer
True
Your Answer
False
Select The BlankQuestion
________ technique can be used to reduce the number of values for a given continuous attribute by dividing range of attributes into interval.
Correct Answer
Discretization
Your Answer
Compression
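The technique named in the question above, dividing the range of a continuous attribute into intervals, can be sketched with equal-width binning. This is only one possible way to choose the intervals. The function name and the sample ages are assumptions made for this example.

```python
def equal_width_discretize(values, num_intervals):
    """Map each value to the index of the equal-width interval it falls in."""
    low, high = min(values), max(values)
    width = (high - low) / num_intervals
    if width == 0:
        # All values are identical, so everything lands in interval 0.
        return [0 for _ in values]
    labels = []
    for v in values:
        # The maximum value would fall just past the last interval, so clamp it.
        labels.append(min(int((v - low) / width), num_intervals - 1))
    return labels


# Hypothetical ages split into three intervals of width 19: 13-32, 32-51, 51-70.
labels = equal_width_discretize([13, 15, 22, 35, 46, 52, 70], num_intervals=3)
print(labels)  # [0, 0, 0, 1, 1, 2, 2]
```

Seven distinct ages are reduced to three interval labels, which is the reduction in the number of values that the question refers to.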
True/FalseQuestion
Data cubes created for varying levels of abstraction are referred as cuboids.
Correct Answer
True
Your Answer
True
Multiple Choice Single AnswerQuestion
Which of the following approach requires more computation?
Correct Answer
Filter approach
Your Answer
Filter approach
Select The BlankQuestion
The ________ component consists of all the different ways of making the information from the data warehouse available to the user.
Correct Answer
Information Delivery
Your Answer
Metadata
Multiple Choice Multiple AnswerQuestion
Data transformation includes :-
Correct Answer
Smoothing , Aggregation , Generalization
Your Answer
Smoothing , Aggregation
True/FalseQuestion
In Linear regression data are modeled to fit a straight line.
Correct Answer
True
Your Answer
True
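Fitting a straight line to data, as the statement above describes, can be sketched with ordinary least squares for a single predictor. The function name `fit_line` and the sample points are assumptions made for this example. The sketch assumes at least two distinct x values.

```python
def fit_line(xs, ys):
    """Least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sums of cross-deviations and squared deviations about the means.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# The sample points lie exactly on y = 2x + 1.
slope, intercept = fit_line([1, 2, 3, 4], [3, 5, 7, 9])
print(slope, intercept)
```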
Multiple Choice Multiple AnswerQuestion
Methods for outlier detection are categorised into following approaches :-
Correct Answer
Statistical , Distance based , Deviation based
Your Answer
Statistical , Distance based , Deviation based
Multiple Choice Multiple AnswerQuestion
DBMiner provides multiple data mining algorithms including :-
Correct Answer
Discovery driven OLAP analysis , Association , Classification
Your Answer
Discovery driven OLAP analysis , Association , Regression
Multiple Choice Single AnswerQuestion
Deviation based outlier detection identifies outliers by :-
Correct Answer
Examining character of objects in groups
Your Answer
Examining character of objects in groups
Select The BlankQuestion
________ component of warehouse is responsible for coordinating services and activities within the data warehouse.
Correct Answer
Management and Control
Your Answer
Management and Control
True/FalseQuestion
Sequential pattern analysis and similarity search techniques have been developed in data mining.
Correct Answer
True
Your Answer
True
Select The BlankQuestion
For operational system, the stored data contains ________values.
Correct Answer
Current data
Your Answer
Current data
True/FalseQuestion
Intelligent miner is an IBM data mining product.
Correct Answer
True
Your Answer
True
Select The BlankQuestion
The technique of ________ enables concurrent input/output operations and improves file's access performance substantially.
Correct Answer
File striping
Your Answer
File striping
Multiple Choice Multiple AnswerQuestion
SMP provides the features like :-
Correct Answer
Controllers which are accessible to all processors , Each processor has full access to the shared memory through a common bus , Each node has access to common set of disks
Your Answer
Controllers which are accessible to all processors , Each processor has full access to the shared memory through a common bus
Match The FollowingQuestion
Correct Answer
Your Answer
Incremental data capture
Deferred data capture
Deferred data capture
Initial load of data warehouse
"as-is" data capture
"as-is" data capture
Static data
Capture of data in given point of time
Capture of data in given point of time
Data revision
Incremental data capture
Incremental data capture
True/FalseQuestion
In the pruning method, postpruning requires more computation than prepruning, yet generally leads to a more reliable tree.
Correct Answer
True
Your Answer
False
True/FalseQuestion
Data preprocessing is an important step in knowledge discovery process.
Correct Answer
True
Your Answer
True
Multiple Choice Multiple AnswerQuestion
The dimensions of spatial data cube are :-
Correct Answer
Non- spatial dimension , Spatial to non spatial , Spatial to spatial
Your Answer
Non- spatial dimension , Spatial to non spatial , Spatial to spatial
True/FalseQuestion
Data mining often requires data integration.
Correct Answer
True
Your Answer
True
Multiple Choice Multiple AnswerQuestion
In data storage area , DBA uses metadata for processes of :-
Correct Answer
Backup , Recovery , Tuning Database
Your Answer
Backup , Recovery , Tuning Database
Select The BlankQuestion
The ________ component consists of all the different ways of making the information from the data warehouse available to the user.
Correct Answer
Information Delivery
Your Answer
Information Delivery
Multiple Choice Multiple AnswerQuestion
Data processing techniques are :-
Correct Answer
Cleansing , Integration , Transformation
Your Answer
Integration , Transformation , Cleansing
Match The FollowingQuestion
Correct Answer
Your Answer
Information Delivery
Report generation, query processing and complex analysis
Report generation, query processing and complex analysis
Operating systems compatibility
Security, reliability, availability
Security, reliability, availability
Data Acquisition
Data Extraction, Transformation, cleansing, integration
Data Extraction, Transformation, cleansing, integration
Data Storage
Data loading , Archiving
Data loading , Archiving
Select The BlankQuestion
In ________ type smoothing, minimum and maximum values in given bin are identified as bin boundaries.
Correct Answer
Smoothing by bin boundaries
Your Answer
Smoothing by bin boundaries
Multiple Choice Single AnswerQuestion
Data partitioning, data clustering are the techniques for :-
Correct Answer
Performance enhancement
Your Answer
Data extraction
Select The BlankQuestion
Most of the warehouses employ ________ database Management System.
Correct Answer
Relational
Your Answer
Relational
True/FalseQuestion
NUMA provides better scalability than SMP.
Correct Answer
True
Your Answer
True
Multiple Choice Single AnswerQuestion
Data migration affects performance requiring multiple blocks to be read which can be adjusted by :-
Correct Answer
Block percent free
Your Answer
Block percent free
Multiple Choice Single AnswerQuestion
Redundancies can be detected by :-
Correct Answer
Correlation analysis
Your Answer
Correlation analysis
Multiple Choice Multiple AnswerQuestion
The functions of data acquisition are :-
Correct Answer
Data Transformation , Data Extraction
Your Answer
Data Extraction , Data Transformation , Data cleansing
Multiple Choice Single AnswerQuestion
SMP stands for :-
Correct Answer
Symmetric Multiprocessing
Your Answer
Symmetric Multiprocessing
Multiple Choice Multiple AnswerQuestion
Missing values can be handled by :-
Correct Answer
Filling values manually , Use of global constant , Use of attribute mean
Your Answer
Filling values manually , Use of attribute mean
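Two of the strategies listed in the correct answer above, filling gaps with a global constant and filling them with the attribute mean, can be sketched as follows. Missing entries are represented by `None`. The function names and the sample data are assumptions made for this example.

```python
def fill_with_constant(values, constant):
    """Replace every missing entry (None) with one global constant."""
    return [constant if v is None else v for v in values]


def fill_with_mean(values):
    """Replace every missing entry (None) with the mean of the present values."""
    present = [v for v in values if v is not None]
    mean = sum(present) / len(present)  # assumes at least one present value
    return [mean if v is None else v for v in values]


# Hypothetical attribute with two missing entries.
incomes = [30, None, 50, 40, None]
print(fill_with_constant(incomes, -1))  # [30, -1, 50, 40, -1]
print(fill_with_mean(incomes))          # [30, 40.0, 50, 40, 40.0]
```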
Multiple Choice Single AnswerQuestion
Which from the following is used for classification and prediction?
Correct Answer
Regression trees
Your Answer
Regression
Multiple Choice Multiple AnswerQuestion
Before moving data to the data warehouse, it has to go through :-
Correct Answer
Transformation , Integration , Consolidation
Your Answer
Transformation , Integration , Consolidation
Select The BlankQuestion
________ is the navigational map of data warehouse.
Correct Answer
End user Metadata
Your Answer
Operational Metadata
True/FalseQuestion
Architecture comes first, tools follows it.
Correct Answer
True
Your Answer
True
Multiple Choice Single AnswerQuestion
Which technique analyzes experimental data?
Correct Answer
Analysis of variance
Your Answer
Regression
Multiple Choice Multiple AnswerQuestion
The need for metadata is for :-
Correct Answer
Using data warehouse , Building data warehouse , Administration of warehouse
Your Answer
Building data warehouse , Administration of warehouse
Multiple Choice Single AnswerQuestion
Development and deployment of your data warehouse is a joint effort between :-
Correct Answer
IT staff and user representatives
Your Answer
IT staff and user representatives
Select The BlankQuestion
________ function of data staging component involves many forms of combining pieces of data from different sources.
Correct Answer
Data Transformation
Your Answer
Data Transformation
Multiple Choice Multiple AnswerQuestion
When you use tool for design and development, following things take place with metadata :-
Correct Answer
Metadata is no longer passive document , Metadata takes part in process , Metadata aids in automation of data warehouse process
Your Answer
Metadata is no longer passive document , Metadata takes part in process , Metadata aids in automation of data warehouse process
Multiple Choice Multiple AnswerQuestion
The main categories of Metadata in warehouse are :-
Correct Answer
Operational , Extraction and transformation Metadata , End user Metadata
Your Answer
Operational , Extraction and transformation Metadata , End user Metadata
Select The BlankQuestion
________ is the type of pilot for early delivery with broader scope and may be integrated.
Correct Answer
Broad business pilot
Your Answer
Proof of concept pilot
True/FalseQuestion
The process of grouping a set of physical or abstract objects into classes of similar objects is called clustering.
Correct Answer
True
Your Answer
True
Multiple Choice Single AnswerQuestion
Which type of Grid clustering depends on the granularity of lowest level of grid structure?
Correct Answer
STING
Your Answer
OPTICS
Multiple Choice Single AnswerQuestion
Which of the following option of data extraction is known as application assisted data capture?
Correct Answer
Capture in source application
Your Answer
Capture by comparing files
True/FalseQuestion
Moving data into staging area and performing data transformation function is a part of data acquisition.
Correct Answer
True
Your Answer
True
Multiple Choice Multiple AnswerQuestion
The objectives of the physical design of a data warehouse are :-
Correct Answer
Improve performance , Ensure scalability , Manage store
Your Answer
Improve performance , Ensure scalability , Manage database
Multiple Choice Multiple AnswerQuestion
User must have proper access to metadata for performing responsibilities of :-
Correct Answer
Design , Administration
Your Answer
Design , Administration , Management
Multiple Choice Multiple AnswerQuestion
The Intelligent Miner data mining product provides data mining algorithms including :-
Correct Answer
Association , Classification , Regression
Your Answer
Association , Regression , Aggregation
Multiple Choice Single AnswerQuestion
The big difference between data warehouse and any operational system is its :-
Correct Answer
Usage
Your Answer
Organization
True/FalseQuestion
Loan payment prediction and customer credit analysis are critical to business of bank.
Correct Answer
True
Your Answer
False
Multiple Choice Single AnswerQuestion
Which of the option is not considered as the major function needed to get data ready?
Correct Answer
Storing data
Your Answer
Extracting data
True/FalseQuestion
In the data acquisition area, the data flow begins at the data sources and pauses at staging area.
Correct Answer
True
Your Answer
True
Select The BlankQuestion
Most of the warehouses employ ________ database Management System.
Correct Answer
Relational
Your Answer
Relational
True/FalseQuestion
NUMA provides better scalability than SMP.
Correct Answer
True
Your Answer
True
Match The FollowingQuestion
Correct Answer
Your Answer
Interactive visual data mining
Visualization tool
Audio signal
Data visualization
Visual display
Graphical display
Data mining result visualization
Presentation of knowledge
Visualization tool
Data mining process visualization
Data mining in visual format
Data mining in visual format
Multiple Choice Single AnswerQuestion
Deliberate splitting of a table and its index data into manageable part is known as :-
Correct Answer
Partitioning
Your Answer
Decomposing
Multiple Choice Multiple AnswerQuestion
Data mining is applicable to :-
Correct Answer
Relational Database , Data Warehouse , Transaction Database
Your Answer
Relational Database , Data Warehouse , Transaction Database
True/FalseQuestion
Data mining is not that much powerful tool for vast data such as gene sequences in DNA analysis.
Correct Answer
True
Your Answer
False
True/FalseQuestion
Data cleansing means removing noisy and inconsistent data.
Correct Answer
True
Your Answer
True
Multiple Choice Single AnswerQuestion
Which from the following is used for classification and prediction?
Correct Answer
Regression trees
Your Answer
Generalized linear model
Multiple Choice Multiple AnswerQuestion
Data cleansing routines work to clean the data by :-
Correct Answer
Filling missing values , Smoothing noisy data
Your Answer
Filling missing values , Smoothing noisy data , Resolving inconsistency
Select The BlankQuestion
________ is the type of pilot for early delivery with broader scope and may be integrated.
Correct Answer
Broad business pilot
Your Answer
Proof of concept pilot
Multiple Choice Single AnswerQuestion
The data warehouse DBMS executes on :-
Correct Answer
Data server component
Your Answer
Data server component
True/FalseQuestion
A process of grouping a set of physical or abstract objects into classes of similar objects is called clustering.
Correct Answer
True
Your Answer
False
Select The BlankQuestion
________ component of warehouse is responsible for coordinating services and activities within the data warehouse.
Correct Answer
Management and Control
Your Answer
Management and Control
Multiple Choice Single AnswerQuestion
Large number of indexes affects the loading process because :-
Correct Answer
Indexes are created for new records
Your Answer
Records are reshuffled
Match The FollowingQuestion
Correct Answer
Your Answer
Chasm
Challenges
Method to solve problem
Early majority
Nature technology
Technology to die out
Innovators
Method to solve problem
Challenges
Early adaptors
Increased interest
Increased interest
Select The BlankQuestion
________ is an alternative agglomerative hierarchical clustering algorithm.
Correct Answer
ROCK
Your Answer
ROKE
Multiple Choice Single AnswerQuestion
Which technique is used to predict categorical response variable?
Correct Answer
Discriminant analysis
Your Answer
Factor analysis
Multiple Choice Single AnswerQuestion
Deviation based outlier detection identifies outliers by :-
Correct Answer
Examining character of objects in groups
Your Answer
Examining character of objects in groups
Multiple Choice Multiple AnswerQuestion
The information delivery methods from data warehouse are :-
Correct Answer
Complex queries , MD Analysis , Statistical Analysis
Your Answer
Complex queries , MD Analysis , ETS System
Select The BlankQuestion
________ does not handle categorical attributes.
Correct Answer
CURE
Your Answer
Chameleon
Multiple Choice Multiple AnswerQuestion
Data warehouse environment is functionally divided into following areas :-
Correct Answer
Data acquisition , Data storage , Information delivery
Your Answer
Data storage , Information delivery , Data transformation
True/FalseQuestion
Data mining often requires data integration.
Correct Answer
True
Your Answer
True
Select The BlankQuestion
________ method of regression is useful when the errors fail to satisfy normality conditions.
Correct Answer
Robust
Your Answer
Polynomial
Multiple Choice Multiple AnswerQuestion
The areas of classification for metadata are :-
Correct Answer
Development/usage , Technical/business , BackRoom/Front Room
Your Answer
Development/usage , Technical/business , Administration
Multiple Choice Multiple AnswerQuestion
Data base miner provides multiple data mining algorithms including :-
Correct Answer
Discovery driven OLAP analysis , Association , Classification
Your Answer
Association , Classification , Regression
Select The BlankQuestion
The ________ record is one-to-many relationship with corresponding fact table record.
Correct Answer
Dimension tables
Your Answer
Fact table
Multiple Choice Single AnswerQuestion
For Incremental data loads the sequence is :-
Correct Answer
Triggering ->Filtering ->data extraction -> Transformation ->Integration ->cleansing
Your Answer
Triggering ->Filtering ->data extraction -> Transformation ->Integration ->cleansing
Multiple Choice Multiple AnswerQuestion
The platform of Data warehouse consists of :-
Correct Answer
Basic hardware components , Operating System , Network and Network software
Your Answer
Basic hardware components , Network and Network software , Utility software
Multiple Choice Multiple AnswerQuestion
The smoothing techniques are :-
Correct Answer
Binning , Clustering , Regression
Your Answer
Clustering , Regression , Insertion
LIST OF ATTEMPTED QUESTIONS AND ANSWERS
Select The BlankQuestion
________ method of regression is useful when the errors fail to satisfy normality conditions.
Correct Answer
Robust
Your Answer
Robust
True/FalseQuestion
Data classification is a two-step process in which the first step includes classification of the model and in the second step the model describes a set of data.
Correct Answer
False
Your Answer
True
True/FalseQuestion
Data cleansing means removing noisy and inconsistent data.
Correct Answer
True
Your Answer
True
Multiple Choice Multiple AnswerQuestion
Following factors play important role in financial analysis :-
Correct Answer
Data warehouse , Data cubes , Outlier analysis
Your Answer
Data warehouse , Data cubes , Data accuracy
Multiple Choice Multiple AnswerQuestion
The dimensions of spatial data cube are :-
Correct Answer
Non- spatial dimension , Spatial to non spatial , Spatial to spatial
Your Answer
Non- spatial dimension , Spatial to non spatial , Spatial to spatial
Multiple Choice Single AnswerQuestion
OLAP is used for :-
Correct Answer
Online Analytical Processing
Your Answer
Online Analytical Processing
True/FalseQuestion
Metadata acts like a nerve center.
Correct Answer
True
Your Answer
True
Match The FollowingQuestion
Correct Answer
Your Answer
Constructive merge
New record supersedes
Populating data warehouse table first time
Initial Load
Populating data warehouse table first time
Populating data warehouse table first time
Incremental Load
Applying ongoing changes
Applying ongoing changes
Load Image
To correspond to target files
Applying data
Multiple Choice Single AnswerQuestion
Disparity is the significant & disturbing characteristic of which type of data?
Correct Answer
Production data
Your Answer
Production data
True/FalseQuestion
Audio data mining can be an interesting alternative to visual mining.
Correct Answer
True
Your Answer
True
Select The BlankQuestion
________ platform is the platform on which the data warehouse DBMS runs and database exist.
Correct Answer
Data storage
Your Answer
Data storage
True/FalseQuestion
In smoothing by bin means, each value in a bin is replaced by the mean value of the bin.
Correct Answer
True
Your Answer
True
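Note: the statement above can be illustrated with a minimal sketch. It assumes equal-frequency bins built from the sorted values; the function name, the sample data and the bin size are illustrative and not part of the quiz.

```python
def smooth_by_bin_means(values, bin_size):
    """Sort the values, split them into equal-frequency bins,
    and replace every value by the mean of its bin."""
    ordered = sorted(values)
    smoothed = []
    for start in range(0, len(ordered), bin_size):
        bin_values = ordered[start:start + bin_size]
        mean = sum(bin_values) / len(bin_values)
        # Every member of the bin takes the bin mean.
        smoothed.extend([mean] * len(bin_values))
    return smoothed


# Hypothetical sample data, three values per bin.
prices = [4, 8, 15, 21, 21, 24, 25, 28, 34]
print(smooth_by_bin_means(prices, 3))
# [9.0, 9.0, 9.0, 22.0, 22.0, 22.0, 29.0, 29.0, 29.0]
```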
Multiple Choice Single AnswerQuestion
Following clustering method is classified as being agglomerative or divisive :-
Correct Answer
Grid based
Your Answer
Hierarchical Method
Multiple Choice Multiple AnswerQuestion
Data processing is done for :-
Correct Answer
Improving the efficiency , Ease of mining
Your Answer
Improving the efficiency , Ease of mining , Removing redundancy
Multiple Choice Single AnswerQuestion
For Banking and financial data which type of analysis is used?
Correct Answer
Multidimensional
Your Answer
Relational
Multiple Choice Multiple AnswerQuestion
Preprocessing steps of data in order to help improve accuracy, efficiency and scalability of classification & prediction are :-
Correct Answer
Data Cleaning , Relevance Analysis , Data Transformation
Your Answer
Data Cleaning , Relevance Analysis , Data Transformation
Multiple Choice Multiple AnswerQuestion
The functions of data acquisition are :-
Correct Answer
Data Extraction , Data Transformation
Your Answer
Data Extraction , Data Transformation , Data cleansing
Multiple Choice Single AnswerQuestion
Data partitioning, data clustering are the techniques for :-
Correct Answer
Performance enhancement
Your Answer
Performance enhancement
Multiple Choice Multiple AnswerQuestion
The Main areas of Data Warehouse are :-
Correct Answer
Data acquisition , Data Storage , Information Delivery
Your Answer
Data acquisition , Data Storage , Information Delivery
Select The BlankQuestion
________ is an alternative agglomerative hierarchical clustering algorithm.
Correct Answer
ROCK
Your Answer
ROCK
Select The BlankQuestion
________ is the platform for complex data transformation for the purpose of cleansing the data.
Correct Answer
Separate optimal Platform
Your Answer
Separate optimal Platform
Multiple Choice Multiple AnswerQuestion
Metadata recorded in information delivery functional area is related to :-
Correct Answer
Predefined queries , Input parameter definition , Reports
Your Answer
Predefined queries , Reports
True/FalseQuestion
Data cubes created for varying levels of abstraction are referred as cuboids.
Correct Answer
True
Your Answer
True
True/FalseQuestion
Moving data into staging area and performing data transformation function is a part of data acquisition.
Correct Answer
True
Your Answer
True
Multiple Choice Multiple AnswerQuestion
Methods for outlier detection are categorised into following approaches :-
Correct Answer
Statistical , Distance based , Deviation based
Your Answer
Statistical , Distance based , Deviation based
Multiple Choice Single AnswerQuestion
The first step of attribute oriented induction is :-
Correct Answer
Data focusing
Your Answer
Data Collection
True/FalseQuestion
Legacy data resides on Hierarchical or Network database.
Correct Answer
True
Your Answer
True
Select The BlankQuestion
________ option of warehouse architecture provides incremental growth.
Correct Answer
Cluster
Your Answer
Cluster
Multiple Choice Single AnswerQuestion
Data can be smoothed by fitting the data to a function such as :-
Correct Answer
Regression
Your Answer
Regression
Multiple Choice Multiple AnswerQuestion
Data mining is applicable to :-
Correct Answer
Relational Database , Data Warehouse , Transaction Database
Your Answer
Relational Database , Data Warehouse , Transaction Database
Multiple Choice Single AnswerQuestion
The data warehouse DBMS executes on :-
Correct Answer
Data server component
Your Answer
Data server component
Match The FollowingQuestion
Correct Answer
Your Answer
Metadata
Roadmap for user
Details of summary
Data storage
Data management
Data management
Data staging
Workbench for data
Workbench for data
Data Mining
Knowledge discovery
Knowledge discovery
True/FalseQuestion
Data Mining refers to extracting knowledge from larger amount of data.
Correct Answer
True
Your Answer
True
Select The BlankQuestion
Most of the warehouses employ ________ database Management System.
Correct Answer
Relational
Your Answer
Relational
LIST OF ATTEMPTED QUESTIONS AND ANSWERS
Multiple Choice Single AnswerQuestion
The technique of data clustering facilitates :-
Correct Answer
Serial access
Your Answer
Indexed access
Select The BlankQuestion
In ________ type smoothing, minimum and maximum values in given bin are identified as bin boundaries.
Correct Answer
Smoothing by bin boundaries
Your Answer
Smoothing by bin boundaries
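Note: a minimal sketch of smoothing by bin boundaries, assuming equal-frequency bins over the sorted values and the usual textbook rule that each value is then replaced by the closer of the two boundaries (ties go to the lower boundary here, which is an arbitrary choice). The function name and sample data are illustrative.

```python
def smooth_by_bin_boundaries(values, bin_size):
    """Sort the values, split them into equal-frequency bins, and replace
    each value by the nearest bin boundary (the bin minimum or maximum)."""
    ordered = sorted(values)
    smoothed = []
    for start in range(0, len(ordered), bin_size):
        bin_values = ordered[start:start + bin_size]
        low, high = bin_values[0], bin_values[-1]  # the bin boundaries
        for value in bin_values:
            # Pick whichever boundary is closer; ties go to the lower one.
            smoothed.append(low if value - low <= high - value else high)
    return smoothed


# Hypothetical sample data, three values per bin.
prices = [4, 8, 15, 21, 21, 24, 25, 28, 34]
print(smooth_by_bin_boundaries(prices, 3))
# [4, 4, 15, 21, 21, 24, 25, 25, 34]
```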
Multiple Choice Multiple AnswerQuestion
The ways of Intra query parallelization are :-
Correct Answer
Horizontal parallelization , Vertical Parallelization , Hybrid parallelization
Your Answer
Vertical Parallelization , Homogenous parallelization
True/FalseQuestion
One of the most important search problems in genetic analysis is similarity search and comparison among DNA sequences.
Correct Answer
True
Your Answer
True
Multiple Choice Multiple AnswerQuestion
User must have proper access to metadata for performing responsibilities of :-
Correct Answer
Design , Administration
Your Answer
Administration , Management , Accessing
Select The BlankQuestion
________ is the platform for complex data transformation for the purpose of cleansing the data.
Correct Answer
Separate optimal Platform
Your Answer
Legacy platform
Multiple Choice Multiple AnswerQuestion
Classification and Prediction have following applications :-
Correct Answer
Credit approval , Medical Diagnosis , Performance Prediction
Your Answer
Credit approval , Selective Marketing
Multiple Choice Multiple AnswerQuestion
In data storage area , DBA uses metadata for processes of :-
Correct Answer
Tuning Database , Backup , Recovery
Your Answer
Tuning Database , Management
Multiple Choice Single AnswerQuestion
Data can be smoothed by fitting the data to a function such as :-
Correct Answer
Regression
Your Answer
Binning
True/FalseQuestion
Tools perform major functions in data warehouse environment.
Correct Answer
True
Your Answer
True
Select The BlankQuestion
________ option of warehouse architecture provides incremental growth.
Correct Answer
Cluster
Your Answer
Cluster
True/FalseQuestion
Data staging and data storage may start out on same computing platform.
Correct Answer
True
Your Answer
False
Match The FollowingQuestion
Correct Answer
Your Answer
Middleware & connectivity tool
Transparent access to source system
Assist data ware house administration
Data Quality tool
Locating data errors
Locating data errors
OLAP tools
Channel queries
Channel queries
Alert system tool
Users attention on exceptions
Users attention on exceptions
Multiple Choice Single AnswerQuestion
Attribute construction is the part of :-
Correct Answer
Transformation
Your Answer
Smoothing
Multiple Choice Single AnswerQuestion
Deliberate splitting of a table and its index data into manageable part is known as :-
Correct Answer
Partitioning
Your Answer
Partitioning
Multiple Choice Single AnswerQuestion
Simple matching approach is used for computing dissimilarity between two objects for :-
Correct Answer
Nominal variable
Your Answer
Invariant variable
True/FalseQuestion
Data mining is not that much powerful tool for vast data such as gene sequences in DNA analysis.
Correct Answer
True
Your Answer
False
Multiple Choice Single AnswerQuestion
Following clustering method is classified as being agglomerative or divisive :-
Correct Answer
Grid based
Your Answer
Density based
Select The BlankQuestion
________ clustering method follows statistical and neural network approach.
Correct Answer
Model based
Your Answer
Grid based
Multiple Choice Multiple AnswerQuestion
The different analysis tools which are useful to detect unusual patterns such as large amount of cash flow at certain period by certain group of people are :-
Correct Answer
Linkage analysis tool , Outlier analysis tool , Sequential pattern analysis tool
Your Answer
Linkage analysis tool , Outlier analysis tool , Complexity definition tool
Multiple Choice Multiple AnswerQuestion
DNA sequences are comprised of :-
Correct Answer
Adenine , Guanine , Thymine
Your Answer
Adenine , Cytosine , Guanine , Thymine
True/FalseQuestion
Management architectural component manages and controls data acquisition functions.
Correct Answer
True
Your Answer
False
Multiple Choice Single AnswerQuestion
If many indexes are needed, then on which table which option is more preferable?
Correct Answer
Splitting of tables
Your Answer
Splitting of tables
True/FalseQuestion
To detect money laundering and other financial crimes, it is important to integrate information from multiple databases.
Correct Answer
True
Your Answer
True
Select The BlankQuestion
It is good practice to drop ________ before initial load.
Correct Answer
Index
Your Answer
Index
True/FalseQuestion
All data extraction, transformation, integration and staging jobs run on selected hardware under chosen operating system.
Correct Answer
True
Your Answer
True
Multiple Choice Single AnswerQuestion
Deviation based outlier detection identifies outliers by :-
Correct Answer
Examining character of objects in groups
Your Answer
Examining character of objects in groups
Select The BlankQuestion
________ method of regression is useful when the errors fail to satisfy normality conditions.
Correct Answer
Robust
Your Answer
Polynomial
Multiple Choice Multiple AnswerQuestion
The functional areas of metadata are :-
Correct Answer
Data Acquisition , Data storage , Information delivery
Your Answer
Data Acquisition , Data storage , Information delivery
Match The FollowingQuestion
Correct Answer
Your Answer
Load Utility
High performance data loading, recovery
High performance data loading, recovery
Query Governor
Abort runaway query
Balancing extraction of query
Query Optimizer
Parsing, optimizing query
Parsing, optimizing query
Query Management
Balancing extraction of query
Execution and rescheduling queries
Multiple Choice Single AnswerQuestion
The first step of attribute oriented induction is :-
Correct Answer
Data focusing
Your Answer
Data Classification
True/FalseQuestion
Architecture comes first, tools follow it.
Correct Answer
True
Your Answer
True
Multiple Choice Multiple AnswerQuestion
Data cleansing routines work to clean the data by :-
Correct Answer
Filling missing values , Smoothing noisy data
Your Answer
Smoothing noisy data , Resolving inconsistency
Select The BlankQuestion
Most of the warehouses employ ________ database Management System.
Correct Answer
Relational
Your Answer
Multidimensional
Multiple Choice Single AnswerQuestion
Which of the following method creates copies of data in distributed environment?
Correct Answer
Replication
Your Answer
Replication
Select The BlankQuestion
Human beings have around ________ genes.
Correct Answer
100000
Your Answer
100000
LIST OF ATTEMPTED QUESTIONS AND ANSWERS
Multiple Choice Multiple AnswerQuestion
DNA sequences are comprised of :-
Correct Answer
Guanine , Thymine , Adenine
Your Answer
Guanine , Thymine , Adenine , Cytosine
True/FalseQuestion
Loan payment prediction and customer credit analysis are critical to business of bank.
Correct Answer
True
Your Answer
True
Multiple Choice Multiple AnswerQuestion
Preprocessing steps of data in order to help improve accuracy, efficiency and scalability of classification & prediction are :-
Correct Answer
Data Cleaning , Relevance Analysis , Data Transformation
Your Answer
Data Cleaning , Relevance Analysis , Data Transformation
Multiple Choice Single AnswerQuestion
The big difference between data warehouse and any operational system is its :-
Correct Answer
Usage
Your Answer
Usage
True/FalseQuestion
Data cleansing means removing noisy and inconsistent data.
Correct Answer
True
Your Answer
True
True/FalseQuestion
Moving data into staging area and performing data transformation function is a part of data acquisition.
Correct Answer
True
Your Answer
True
Select The BlankQuestion
________ option of warehouse architecture provides incremental growth.
Correct Answer
Cluster
Your Answer
Cluster
Select The BlankQuestion
For operational system, the stored data contains ________values.
Correct Answer
Current data
Your Answer
Current data
Multiple Choice Multiple AnswerQuestion
Splitting of data into smaller partition decision tree induction is prone to :-
Correct Answer
Fragmentation , Replication , Repetition
Your Answer
Fragmentation , Generalization
Multiple Choice Single AnswerQuestion
Bitmapped indexes are more suitable for data warehouse environment than for an OLTP system
Correct Answer
Bitmapped index
Your Answer
Clustered index
Select The BlankQuestion
________ is the type of pilot for early delivery with broader scope and may be integrated.
Correct Answer
Broad business pilot
Your Answer
User tool appreciation
Match The FollowingQuestion
Correct Answer
Your Answer
Data Mining
Knowledge discovery
Knowledge discovery
Metadata
Roadmap for user
Roadmap for user
Data storage
Data management
Data management
Data staging
Workbench for data
Workbench for data
Multiple Choice Single AnswerQuestion
A gene is usually comprised of hundreds of individual :-
Correct Answer
Nucleotides
Your Answer
Nucleotides
True/FalseQuestion
NUMA provides better scalability than SMP.
Correct Answer
True
Your Answer
True
Multiple Choice Single AnswerQuestion
Deviation based outlier detection identifies outliers by :-
Correct Answer
Examining character of objects in groups
Your Answer
Examining distance between objects
Select The BlankQuestion
________ is a density based clustering method which computes an augmented cluster ordering for automatic and interactive cluster analysis.
Correct Answer
DBSCAN
Your Answer
DBSCAN
Multiple Choice Single AnswerQuestion
Enterprise miner technique provides data mining algorithms including distinguishing feature as :-
Correct Answer
Advanced Statistical and advanced visualization tool
Your Answer
Advanced Statistical and classification tool
Match The FollowingQuestion
Correct Answer
Your Answer
Load Image
To correspond to target files
To correspond to target files
Constructive merge
New record supersedes
New record supersedes
Initial Load
Populating data warehouse table first time
Populating data warehouse table first time
Incremental Load
Applying ongoing changes
Applying ongoing changes
True/FalseQuestion
A process of grouping a set of physical or abstract objects into classes of similar objects is called clustering.
Correct Answer
True
Your Answer
True
Multiple Choice Single AnswerQuestion
Development and deployment of your data warehouse is joint effort between :-
Correct Answer
IT staff and user representatives
Your Answer
IT staff and user representatives
Multiple Choice Single AnswerQuestion
Attribute construction is the part of :-
Correct Answer
Transformation
Your Answer
Aggregation
Multiple Choice Single AnswerQuestion
Which of the following data warehouse component includes dependent data marts, special multidimensional database and full range of query and reporting facilities?
Correct Answer
Information Delivery component
Your Answer
Data Staging component
Multiple Choice Single AnswerQuestion
Which technique analyzes experimental data?
Correct Answer
Analysis of variance
Your Answer
Analysis of variance
Select The BlankQuestion
________ function of data staging component involves many forms of combining pieces of data from different sources.
Correct Answer
Data Transformation
Your Answer
Data Transformation
Multiple Choice Multiple AnswerQuestion
Metadata is essential for IT for :-
Correct Answer
Source data structures , Data summarization
Your Answer
Source data structures , Data summarization , Aggregation
Multiple Choice Multiple AnswerQuestion
Methods for outlier detection are categorised into following approaches :-
Correct Answer
Statistical , Distance based , Deviation based
Your Answer
Statistical , Distance based , Deviation based
Multiple Choice Multiple AnswerQuestion
Data base miner provides multiple data mining algorithms including :-
Correct Answer
Discovery driven OLAP analysis , Association , Classification
Your Answer
Discovery driven OLAP analysis , Association , Classification
True/FalseQuestion
In Linear regression data are modeled to fit a straight line.
Correct Answer
True
Your Answer
True
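Note: fitting a straight line can be sketched with ordinary least squares. This is one common way to fit the line, not necessarily the method the course text has in mind; the function name and the sample points are illustrative.

```python
def fit_line(xs, ys):
    """Return (slope, intercept) of the least-squares line y = slope * x + intercept."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need at least two (x, y) pairs of equal length")
    x_mean = sum(xs) / len(xs)
    y_mean = sum(ys) / len(ys)
    # Slope = covariance of x and y divided by variance of x.
    numerator = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    denominator = sum((x - x_mean) ** 2 for x in xs)
    if denominator == 0:
        raise ValueError("all x values are identical; the slope is undefined")
    slope = numerator / denominator
    intercept = y_mean - slope * x_mean
    return slope, intercept


# Hypothetical points lying exactly on y = 2x + 1.
print(fit_line([1, 2, 3, 4], [3, 5, 7, 9]))
# (2.0, 1.0)
```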
True/FalseQuestion
Data in data warehouse cuts across application.
Correct Answer
True
Your Answer
True
Multiple Choice Single AnswerQuestion
If many indexes are needed, then on which table which option is more preferable?
Correct Answer
Splitting of tables
Your Answer
Rearranging of tables
Multiple Choice Single AnswerQuestion
Which technique is used to predict categorical response variable?
Correct Answer
Discriminant analysis
Your Answer
Discriminant analysis
Multiple Choice Multiple AnswerQuestion
Following data transformation methods are used in analysis of time series data :-
Correct Answer
Scaling , Normalization , Window stitching
Your Answer
Scaling , Normalization , Window stitching
Multiple Choice Single AnswerQuestion
Concept Description generates description for :-
Correct Answer
Characterisation and Comparison
Your Answer
Characterisation and Comparison
True/FalseQuestion
Data preprocessing is an important step in knowledge discovery process.
Correct Answer
True
Your Answer
True
Multiple Choice Multiple AnswerQuestion
Data Mining means :-
Correct Answer
Knowledge mining from database , Data /Pattern analysis , Data Archaeology
Your Answer
Knowledge mining from database , Data /Pattern analysis , Data Archaeology
Multiple Choice Single AnswerQuestion
What improves accuracy and speed of subsequent mining process?
Correct Answer
Integration
Your Answer
Integration
Multiple Choice Multiple AnswerQuestion
Data mining is applicable to :-
Correct Answer
Relational Database , Data Warehouse , Transaction Database
Your Answer
Relational Database , Data Warehouse , Transaction Database
LIST OF ATTEMPTED QUESTIONS AND ANSWERS
Select The BlankQuestion
________ is a summarization of general characteristics or features of a target class of data.
Correct Answer
Data Characterization
Your Answer
Data Generalization
Multiple Choice Single AnswerQuestion
The pilot which is useful for user and project team both as it touches all important functions is :-
Correct Answer
Expanded seed pilot
Your Answer
User tool appreciation pilot
Multiple Choice Single AnswerQuestion
Which of the following technique involves placing and managing related units of data in same physical block of storage
Correct Answer
Clustering
Your Answer
Clustering
Multiple Choice Multiple AnswerQuestion
History of metadata includes :-
Correct Answer
Changes to source system , Data extraction methods , Data transformation algorithm
Your Answer
Changes to source system , Data extraction methods
Multiple Choice Single AnswerQuestion
Which of the following approach requires more computation?
Correct Answer
Filter approach
Your Answer
Filter approach
True/FalseQuestion
The substantial part of historical data comes from antiquated legacy systems.
Correct Answer
True
Your Answer
True
Multiple Choice Multiple AnswerQuestion
Data reduction includes :-
Correct Answer
Singular value decomposition , Wavelets , Regression
Your Answer
Singular value decomposition , Wavelets , Regression
Multiple Choice Single AnswerQuestion
Establish the importance of data quality, Form data quality steering committee, Institute a data quality framework, Assign roles and responsibilities. These are the steps of :-
Correct Answer
Data purification
Your Answer
Data quality control
Multiple Choice Single AnswerQuestion
Which is the typical example of Grid based clustering method
Correct Answer
STING
Your Answer
STING
Match The FollowingQuestion
Correct Answer
Your Answer
Normalization
Scattered data
Constructing small units of data
Smoothing
Removal of noisy data
Removal of noisy data
Aggregation
Summary operations
Constructing new attributes
Generalization
Data hierarchies
Data hierarchies
True/FalseQuestion
Bitmapped indexing does not apply to fault tables.
Correct Answer
True
Your Answer
True
Multiple Choice Multiple AnswerQuestion
For processing metadata in information delivery area, data can be referred back for :-
Correct Answer
Source data configuration , Data structure , Data transformation
Your Answer
Source data configuration , Data structure , Data transformation
True/FalseQuestion
The precision measure is the % of retrieved documents that are in fact relevant to query.
Correct Answer
True
Your Answer
False
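Note: the precision measure in the statement above can be sketched as the share of retrieved documents that are also relevant. The function name and document identifiers are illustrative; returning 0.0 for an empty result set is an assumption made here to avoid division by zero.

```python
def precision(retrieved, relevant):
    """Fraction of retrieved documents that are relevant to the query."""
    retrieved_set = set(retrieved)
    relevant_set = set(relevant)
    if not retrieved_set:
        return 0.0  # assumption: define precision as 0 when nothing is retrieved
    return len(retrieved_set & relevant_set) / len(retrieved_set)


# Hypothetical query result: 4 documents retrieved, 2 of them relevant.
retrieved_docs = ["d1", "d2", "d3", "d4"]
relevant_docs = ["d2", "d4", "d7"]
print(precision(retrieved_docs, relevant_docs) * 100)
# 50.0
```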
Select The BlankQuestion
Analysis of frequent sequential patterns is important in the analysis of ________ in genetic sequences.
Correct Answer
Dissimilarity and similarity
Your Answer
Similarity
Select The BlankQuestion
________ is the clustering method which encounters difficulties regarding the selection of merge/split points.
Correct Answer
Hierarchical
Your Answer
Hierarchical
Multiple Choice Single AnswerQuestion
Following clustering method is classified as being agglomerative or divisive :-
Correct Answer
Grid based
Your Answer
Grid based
Multiple Choice Multiple AnswerQuestion
Normalization improves :-
Correct Answer
Efficiency , Accuracy
Your Answer
Efficiency , Accuracy
Multiple Choice Single AnswerQuestion
A Wavelet transformation is :-
Correct Answer
Signal processing technique that decomposes signals into different frequency subbands
Your Answer
Signal processing technique that decomposes signals into different frequency subbands
Multiple Choice Single AnswerQuestion
The Clustering method DBSCAN stands for :-
Correct Answer
Density Based Spatial Clustering of Applications with Noise
Your Answer
Density Based Spatial Clustering of Applications with Noise
Select The BlankQuestion
________ can store aggregate and detail data at varying levels of resolution or abstraction.
Correct Answer
Index tree
Your Answer
Index tree
Multiple Choice Single AnswerQuestion
Behavioral data of objects can be derived by the application of :-
Correct Answer
Method
Your Answer
Method
Select The BlankQuestion
________ is the type of pilot for early delivery with broader scope and may be integrated.
Correct Answer
Broad business pilot
Your Answer
Broad business pilot
Multiple Choice Multiple AnswerQuestion
Metadata types can be classified as :-
Correct Answer
Business metadata , Technical metadata
Your Answer
Business metadata , Technical metadata
Multiple Choice Single AnswerQuestion
Simple matching approach is used for computing dissimilarity between two objects for :-
Correct Answer
Nominal variable
Your Answer
Nominal variable
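Note: the simple matching approach for nominal variables is commonly written as d(i, j) = (p - m) / p, where p is the number of attributes and m is the number on which the two objects agree. A minimal sketch, with an illustrative function name and sample objects:

```python
def simple_matching_dissimilarity(obj_a, obj_b):
    """Dissimilarity between two objects described by nominal attributes:
    (p - m) / p, with p attributes in total and m matching attributes."""
    if len(obj_a) != len(obj_b) or not obj_a:
        raise ValueError("objects need the same, non-zero number of attributes")
    p = len(obj_a)
    m = sum(1 for a, b in zip(obj_a, obj_b) if a == b)
    return (p - m) / p


# Hypothetical objects with three nominal attributes (colour, size, shape).
first = ("red", "small", "round")
second = ("red", "large", "round")
print(simple_matching_dissimilarity(first, second))  # one mismatch out of three
```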
Multiple Choice Multiple AnswerQuestion
The different analysis tools which are useful to detect unusual patterns such as large amount of cash flow at certain period by certain group of people are :-
Correct Answer
Linkage analysis tool , Outlier analysis tool , Sequential pattern analysis tool
Your Answer
Linkage analysis tool , Outlier analysis tool , Sequential pattern analysis tool
Multiple Choice Single AnswerQuestion
When DDL statements are created using database software, so to create an index system creates :-
Correct Answer
B-Tree index
Your Answer
B-Tree index
Multiple Choice Multiple AnswerQuestion
Data processing techniques are :-
Correct Answer
Cleansing , Integration , Transformation
Your Answer
Cleansing , Integration , Transformation
Match The FollowingQuestion
Correct Answer
Your Answer
Load Utility
High performance data loading, recovery
High performance data loading, recovery
Query Governor
Abort runaway query
Abort runaway query
Query Optimizer
Parsing, optimizing query
Parsing, optimizing query
Query Management
Balancing extraction of query
Balancing extraction of query
Select The BlankQuestion
Indexed ________ engines search index, web pages and build huge keyword based indices which help to search sets of web pages containing certain keywords
Correct Answer
Web Search Engines
Your Answer
Web Search Engines
True/FalseQuestion
To detect money laundering and other financial crimes, it is important to integrate information from multiple databases.
Correct Answer
True
Your Answer
True
Select The BlankQuestion
________ is the time consuming and less feasible approach for filling missing values.
Correct Answer
Filling missing values manually
Your Answer
Filling missing values manually
Multiple Choice Single AnswerQuestion
Which of the following is used for classification and prediction?
Correct Answer
Regression trees
Your Answer
Regression trees
Multiple Choice Multiple AnswerQuestion
Multimedia database stores and manages large collections of multimedia data such as :-
Correct Answer
Audio and Video , Sequence data , Text Markup and linkage
Your Answer
Audio and Video , Sequence data
Select The BlankQuestion
________ is an alternative agglomerative hierarchical clustering algorithm.
Correct Answer
ROCK
Your Answer
ROCK
Select The BlankQuestion
________ architecture is more concerned with data access than memory access.
Correct Answer
MPP
Your Answer
MPP
True/FalseQuestion
Architecture comes first, tools follow it.
Correct Answer
True
Your Answer
True
True/FalseQuestion
The task of selection in data transformation forms part of the extraction function.
Correct Answer
True
Your Answer
False
LIST OF ATTEMPTED QUESTIONS AND ANSWERS
True/FalseQuestion: Matching the choice of DBMS with selected server hardware is not important for the warehouse.
Correct Answer: False
Your Answer: False
Match The FollowingQuestion
Correct Answer
Your Answer
Metadata
Roadmap for user
Roadmap for user
Data storage
Data management
Data management
Data staging
Workbench for data
Workbench for data
Data Mining
Knowledge discovery
Knowledge discovery
True/FalseQuestion: Database systems, data warehouse systems and the World Wide Web have become mainstream information systems.
Correct Answer: True
Your Answer: True
Multiple Choice Single AnswerQuestion: Bitmapped indexes are more suitable for data warehouse environment than for an OLTP system
Correct Answer: Bitmapped index
Your Answer: Bitmapped index
Multiple Choice Single AnswerQuestion: The big difference between data warehouse and any operational system is its :-
Correct Answer: Usage
Your Answer: Usage
Multiple Choice Single AnswerQuestion: One major effort within data transformation is :-
Correct Answer: Improvement of data quality
Your Answer: Analysis of data quality
Multiple Choice Single AnswerQuestion: Which of the following techniques is used to display group summary statistics?
Correct Answer: Quality control
Your Answer: Survival analysis
Select The BlankQuestion: ________ platform is the platform on which the data warehouse DBMS runs and database exist.
Correct Answer: Data storage
Your Answer: Data storage
Multiple Choice Multiple AnswerQuestion: Class Comparison is performed through following steps :-
Correct Answer: Data Collection , Dimension relevance analysis , Presentation of derived comparison
Your Answer: Data Collection , Dimension relevance analysis , Presentation of derived comparison
Select The BlankQuestion: It is good practice to drop ________ before initial load.
Correct Answer: Index
Your Answer: Index
Select The BlankQuestion: ________ is the time consuming and less feasible approach for filling missing values.
Correct Answer: Filling missing values manually
Your Answer: Filling missing values manually
Multiple Choice Multiple AnswerQuestion: Basic Heuristic method of attribute subset selection includes following techniques :-
Correct Answer: Stepwise forward selection , Stepwise backward elimination
Your Answer: Stepwise forward selection , Stepwise backward elimination , Combination of forward selection and backward elimination
True/FalseQuestion: For maintaining the quality of data proper naming conventions help to make data elements well understood by users.
Correct Answer: True
Your Answer: True
Select The BlankQuestion: In ________ duplicate sub trees exist within the tree.
Correct Answer: Repetition
Your Answer: Repetition
Select The BlankQuestion: The technique of ________ enables concurrent input/output operations and improves file's access performance substantially.
Correct Answer: File striping
Your Answer: File striping
Select The BlankQuestion: ________ does not handle categorical attributes.
Correct Answer: CURE
Your Answer: CURE
Select The BlankQuestion: Creating ________ is a violation of Normalization principles.
Correct Answer: Array
Your Answer: Array
True/FalseQuestion: Data in warehouse is primarily for query.
Correct Answer: True
Your Answer: True
Multiple Choice Multiple AnswerQuestion: Preprocessing steps of data in order to help improve accuracy, efficiency and scalability of classification & prediction are :-
Correct Answer: Data Cleaning , Relevance Analysis , Data Transformation
Your Answer: Data Cleaning , Relevance Analysis , Data Transformation
Multiple Choice Single AnswerQuestion: Which task in data transformation includes types of data manipulation on selected parts of source data?
Correct Answer: Splitting/Joining
Your Answer: Splitting/Joining
True/FalseQuestion: Business metadata is like a roadmap or easy to use information directory showing contents and how to get there.
Correct Answer: True
Your Answer: True
True/FalseQuestion: Data error discovery and data correction are two parts of data cleansing process.
Correct Answer: True
Your Answer: False
Multiple Choice Multiple AnswerQuestion: The dimensions of spatial data cube are :-
Correct Answer: Non- spatial dimension , Spatial to non spatial , Spatial to spatial
Your Answer: Non- spatial dimension , Spatial to non spatial , Spatial to spatial
Select The BlankQuestion: ________ technique is known as snapshot differential technique.
Correct Answer: Capture based on comparing files
Your Answer: Capture based on comparing files
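Note: capture based on comparing files can be sketched as a comparison of two full snapshots of the same source. The sketch below assumes each snapshot is a dictionary keyed by a primary key; the function name, keys and records are hypothetical and only illustrate the idea.

```python
def snapshot_differential(old, new):
    """Compare two snapshots keyed by primary key and report the changes."""
    inserted = {k: new[k] for k in new.keys() - old.keys()}   # keys only in the new snapshot
    deleted = {k: old[k] for k in old.keys() - new.keys()}    # keys only in the old snapshot
    updated = {k: new[k] for k in old.keys() & new.keys()     # keys in both, values differ
               if old[k] != new[k]}
    return inserted, deleted, updated


# Hypothetical customer snapshots taken on two consecutive days.
yesterday = {1: ("Asha", "Pune"), 2: ("Ravi", "Mumbai"), 3: ("Meera", "Delhi")}
today = {1: ("Asha", "Pune"), 2: ("Ravi", "Nagpur"), 4: ("Kiran", "Goa")}

inserted, deleted, updated = snapshot_differential(yesterday, today)
```

Only the three change sets need to be carried to the staging area; unchanged records such as key 1 are skipped. The cost is that both full snapshots must be kept and compared.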
Multiple Choice Multiple AnswerQuestion: The benefits of improved data quality are :-
Correct Answer: Better customer service , Improved productivity , Reliable strategic decision making
Your Answer: Better customer service , Improved productivity , Reliable strategic decision making
Multiple Choice Single AnswerQuestion: Which technique of data extraction is available to non relational databases?
Correct Answer: Capture through transaction log
Your Answer: Capture of static data
True/FalseQuestion: Noise in data means error or variance in measured variable.
Correct Answer: True
Your Answer: True
Multiple Choice Multiple AnswerQuestion: Data mining at home can help to mine data related to :-
Correct Answer: Medical History , Cancer , Chromosome abnormalities
Your Answer: Medical History , Chromosome abnormalities , Physiological conditions
True/FalseQuestion: Data Mining refers to extracting knowledge from larger amount of data.
Correct Answer: True
Your Answer: True
Multiple Choice Single AnswerQuestion: Simple matching approach is used for computing dissimilarity between two objects for :-
Correct Answer: Nominal variable
Your Answer: Nominal variable
Multiple Choice Multiple AnswerQuestion: Following are the reasons for getting data polluted :-
Correct Answer: Data aging , Input errors , Fraud
Your Answer: Data aging , Input errors , Processing errors
Select The BlankQuestion: ________ is the type of pilot for early delivery with broader scope and may be integrated.
Correct Answer: Broad business pilot
Your Answer: Broad business pilot
Multiple Choice Multiple AnswerQuestion: Following are the issues to consider during data integration :-
Correct Answer: Schema integration , Redundancy , Detection and resolution of data values
Your Answer: Schema integration , Redundancy , Detection and resolution of data values
Match The FollowingQuestion
Correct Answer
Your Answer
Rough set Approach
Noisy Data
Previously unseen data
k-Nearest Neighbour Classifiers
Learning Analogy
Noisy Data
Class based Testing
Instance Based
Learning Analogy
Genetic Algorithms
Natural Evolution
Natural Evolution
Multiple Choice Single AnswerQuestion: When an index is created through DDL statements in database software, the system creates :-
Correct Answer: B-Tree index
Your Answer: B-Tree index
True/FalseQuestion: The difficulties encountered in data transformation function relate to heterogeneity of the source system.
Correct Answer: True
Your Answer: False
True/FalseQuestion: Data mining is not a very powerful tool for vast data such as gene sequences in DNA analysis.
Correct Answer: True
Your Answer: True
Multiple Choice Single AnswerQuestion: When the current extent on disk storage for a file is full, the DBMS finds a new extent and allows insertion of a new record. This is known as :-
Correct Answer: Dynamic extension
Your Answer: Dynamic extension
Multiple Choice Multiple AnswerQuestion: Following are the types of normalization :-
Correct Answer: Min-Max Normalization , Z-score normalization , Normalization by scaling
Your Answer: Min-Max Normalization , Z-score normalization , Normalization by scaling
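Note: the three normalization types can be sketched as below. This assumes that "normalization by scaling" refers to decimal scaling (dividing by a power of ten so that every absolute value falls below 1), and that z-score normalization uses the population standard deviation. The function names and the sample values are illustrative only.

```python
import statistics


def min_max(values, new_min=0.0, new_max=1.0):
    """Linearly map values from [min, max] to [new_min, new_max]."""
    lo, hi = min(values), max(values)
    if hi == lo:
        raise ValueError("min-max normalization is undefined when all values are equal")
    return [(v - lo) / (hi - lo) * (new_max - new_min) + new_min for v in values]


def z_score(values):
    """Centre on the mean and scale by the (population) standard deviation."""
    mean = statistics.mean(values)
    std = statistics.pstdev(values)
    if std == 0:
        raise ValueError("z-score normalization is undefined when all values are equal")
    return [(v - mean) / std for v in values]


def decimal_scaling(values):
    """Divide by 10**j, with j the smallest integer such that max(|v|) / 10**j < 1."""
    max_abs = max(abs(v) for v in values)
    j = 0
    while max_abs / 10 ** j >= 1:
        j += 1
    return [v / 10 ** j for v in values]


# Hypothetical attribute values.
data = [200, 300, 400, 600, 1000]
scaled_01 = min_max(data)          # every value now lies in [0, 1]
standardized = z_score(data)       # mean 0, standard deviation 1
shifted = decimal_scaling(data)    # every absolute value now below 1
```

Min-max preserves the relative spacing of the values but depends on the observed minimum and maximum, while z-score is the usual choice when those bounds are unknown or distorted by outliers.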
Multiple Choice Multiple AnswerQuestion: In generation of numerical hierarchies for cluster analysis following techniques are useful :-
Correct Answer: Binning , Histogram analysis , Clustering
Your Answer: Binning , Histogram analysis , Segmentation
Select The BlankQuestion: ________ is an alternative agglomerative hierarchical clustering algorithm.
Correct Answer: ROCK
Your Answer: ROCK
Multiple Choice Multiple AnswerQuestion: Generalized linear model includes :-
Correct Answer: Logistic regression , Poisson regression
Your Answer: Logistic regression , Poisson regression
Multiple Choice Single AnswerQuestion: Inherently Architected, Single, central storage of data about content, Centralized rules and control, Seek quick result, these are the advantages of which type of data extraction?
Correct Answer: Top down approach
Your Answer: Top down approach
Multiple Choice Single AnswerQuestion: Queries run faster to find exact match using which type of indexing?
Correct Answer: Clustered index
Your Answer: Clustered index
LIST OF ATTEMPTED QUESTIONS AND ANSWERS
Select The BlankQuestion: ________ function of data staging component involves many forms of combining pieces of data from different sources.
Correct Answer: Data Transformation
Your Answer: Data Transformation
Multiple Choice Multiple AnswerQuestion: The Main areas of Data Warehouse are :-
Correct Answer: Data acquisition , Data Storage , Information Delivery
Your Answer: Data acquisition , Data Storage , Information Delivery
Select The BlankQuestion: Data cleansing and ________ methods of data mining help in integration of genetic data and construction of warehouse for genetic data analysis.
Correct Answer: Integration
Your Answer: Integration
Multiple Choice Multiple AnswerQuestion: The dimensions of spatial data cube are :-
Correct Answer: Non- spatial dimension , Spatial to non spatial , Spatial to spatial
Your Answer: Non- spatial dimension , Spatial to non spatial , Spatial to spatial
Multiple Choice Single AnswerQuestion: In data reduction, the cluster representations of data are used to :-
Correct Answer: Replace data
Your Answer: Represent actual data
Multiple Choice Multiple AnswerQuestion: Distinguishing characteristics of data warehouse architecture are :-
Correct Answer: Different Objective Scope , Data Content , Flexible and Dynamic
Your Answer: Different Objective Scope , Complete Analysis and Quick Response , Flexible and Dynamic
Select The BlankQuestion: In data warehouse architecture, the ________ component interleaves with and connects other components.
Correct Answer: Metadata
Your Answer: Metadata
Multiple Choice Multiple AnswerQuestion: Methods for outlier detection are categorised into following approaches :-
Correct Answer: Statistical , Distance based , Deviation based
Your Answer: Statistical , Distance based , Deviation based
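Note: two of the three approaches can be sketched briefly for one-dimensional data. The statistical version assumes a z-score test with an arbitrary threshold, and the distance-based version assumes the common definition in which an object is an outlier if at least a given fraction of the other objects lie farther away than a given distance. The function names, thresholds and readings are illustrative assumptions. The deviation-based approach is not shown.

```python
import statistics


def statistical_outliers(values, threshold=2.0):
    """Flag values whose z-score magnitude exceeds the threshold."""
    mean = statistics.mean(values)
    std = statistics.pstdev(values)
    if std == 0:
        return []                      # all values identical, nothing stands out
    return [v for v in values if abs(v - mean) / std > threshold]


def distance_based_outliers(values, min_fraction, distance):
    """Flag values for which at least min_fraction of the other values are farther than distance."""
    n = len(values)
    if n < 2:
        return []
    outliers = []
    for i, v in enumerate(values):
        far = sum(1 for j, w in enumerate(values) if j != i and abs(v - w) > distance)
        if far / (n - 1) >= min_fraction:
            outliers.append(v)
    return outliers


# Hypothetical sensor readings with one suspicious value.
readings = [10, 11, 12, 11, 10, 12, 11, 50]
by_statistics = statistical_outliers(readings)
by_distance = distance_based_outliers(readings, min_fraction=0.9, distance=5)
```

The statistical test depends on an assumed distribution of the data, whereas the distance-based test needs only a distance measure and two user-chosen parameters.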
True/FalseQuestion: Metadata describes all the pertinent aspects of the data in data warehouse.
Correct Answer: True
Your Answer: True
Multiple Choice Multiple AnswerQuestion: Financial data collected in the banking and financial industry are often relatively :-
Correct Answer: Complete , Reliable , High Quality
Your Answer: Complete , Reliable , High Quality
Multiple Choice Multiple AnswerQuestion: Classification and Prediction have following applications :-
Correct Answer: Credit approval , Medical Diagnosis , Performance Prediction
Your Answer: Credit approval , Medical Diagnosis , Performance Prediction
True/FalseQuestion: Data Integration means multiple resources may be combined.
Correct Answer: True
Your Answer: True
Select The BlankQuestion: ________ can store aggregate and detail data at varying levels of resolution or abstraction.
Correct Answer: Index tree
Your Answer: Multidimensional index tree
True/FalseQuestion: Moving data into staging area and performing data transformation function is a part of data acquisition.
Correct Answer: True
Your Answer: True
True/FalseQuestion: Lower the level of detail, finer the data granularity.
Correct Answer: True
Your Answer: True
Select The BlankQuestion: ________ is an alternative agglomerative hierarchical clustering algorithm.
Correct Answer: ROCK
Your Answer: ROCK
Multiple Choice Single AnswerQuestion: Real world databases are highly susceptible to noisy, missing and inconsistent data due to :-
Correct Answer: Huge size of data
Your Answer: Huge size of data