Best 50 Data Mining Notes MCQ Questions & Answers with Explanation

Data Mining Notes

There is a huge amount of data available in the Information Industry. This data is of no use until it is converted into useful information. Applications of data mining, data mining tasks, motivation and challenges, types of data attributes and measurements, data quality. It is necessary to analyze this huge amount of data and extract useful information from it.

What are the tasks of Data Mining ?

Data mining involves six common classes of tasks:
1. Anomaly detection (Outlier/change/deviation detection)
2. Association rule learning (Dependency modelling)
3. Clustering
4. Classification
5. Regression
6. Summarization

What is Data Mining?

Data Mining is defined as extracting information from huge sets of data. In other words, we can say that data mining is the procedure of mining knowledge from data. The information or knowledge extracted so can be used for any of the following applications −

  • Market Analysis
  • Fraud Detection
  • Customer Retention
  • Production Control
  • Science Exploration
OLAP to OLAM Data Mining

Data Mining Applications

Data mining is highly useful in the following domains −

  • Market Analysis and Management
  • Corporate Analysis & Risk Management
  • Fraud Detection

Apart from these, data mining can also be used in the areas of production control, customer retention, science exploration, sports, astrology, and Internet Web Surf-Aid

Mining Methodology and User Interaction Issues

It refers to the following kinds of issues −

  • Mining different kinds of knowledge in databases − Different users may be interested in different kinds of knowledge. Therefore it is necessary for data mining to cover a broad range of knowledge discovery task.
  • Interactive mining of knowledge at multiple levels of abstraction − The data mining process needs to be interactive because it allows users to focus the search for patterns, providing and refining data mining requests based on the returned results.
  • Incorporation of background knowledge − To guide discovery process and to express the discovered patterns, the background knowledge can be used. Background knowledge may be used to express the discovered patterns not only in concise terms but at multiple levels of abstraction.
  • Data mining query languages and ad hoc data mining − Data Mining Query language that allows the user to describe ad hoc mining tasks, should be integrated with a data warehouse query language and optimized for efficient and flexible data mining.
  • Presentation and visualization of data mining results − Once the patterns are discovered it needs to be expressed in high level languages, and visual representations. These representations should be easily understandable.
  • Handling noisy or incomplete data − The data cleaning methods are required to handle the noise and incomplete objects while mining the data regularities. If the data cleaning methods are not there then the accuracy of the discovered patterns will be poor.
  • Pattern evaluation − The patterns discovered should be interesting because either they represent common knowledge or lack novelty.

Data Mining MCQ Questions 2021

What do you think Go is a case sensitive language?
a) True
b) False

Answer : This is True , Go is a case sensitive language.

Which of the following association measure helps in identifying how frequently the item appears in a dataset?
Choose the correct answer from below list
a) Confidence
b) Lift
c) Support

Answer:-c) Support

Clustering process works on _ measure.
Choose the correct answer from below list
a) Lift
b) Support
c) Confidence
d) Probability
e) Distance

Answer:-e) Distance

__ step of KDD process helps in identifying valuable patterns.
Choose the correct answer from below list
a) Pattern Evaluation
b) Knowledge Presentation
c) Data Mining

Answer:-a) Pattern Evaluation

__ aids in identifying associations, correlations, and frequent patterns in data.
Choose the correct answer from below list
a) Association Rule Mining
b) Classification
c) Clustering

Answer:-a) Association Rule Mining

__ term portrays the process of discovering small pieces from a large volume of raw material.
a) Choose the correct answer from below list
b) Data
c) Data Cleaning
d) Mining

Answer:-d) Mining

__ outlier significantly deviates based on the context selected.
Choose the correct answer from below list
a) Collective Outlier
b) Global Outlier
c) Contextual Outlier
d) None of the options

Answer:-c) Contextual Outlier

__________statistics provides inferences on population.
Choose the correct answer from below list
a) Descriptive
b) Inferential

Answer:-b) Inferential

In Association Rules, the Antecedent and Consequent form a disjoint set.
Choose the correct answer from below list
a) True
b) False

Answer:-a) True

Classification predicts the value of __ variable.
Choose the correct answer from below list
a) Continuous
b) Categorical

Answer:-b) Categorical

Advanced Data Mining Questions 2021

Derived relationships in Association Rule Mining are represented in the form of __.
Choose the correct answer from below list
a) Charts
b) Decision Tree
c) All the options
d) Rules

Answer:-d) Rules

The science of collecting, interpreting, and analyzing data is known as __.
Choose the correct answer from below list
a) Statistics
b) Probability
c) Data Collection
d) Data Description

Answer:-a) Statistics

Descriptive statistics is used in __ datasets.
Choose the correct answer from below list
a) Sample
b) Population
c) All the options

Answer:-a) Sample

__ parameter of regression helps in identifying the direction of relationship between variables.
Choose the correct answer from below list
a) Measure of Discrepancy
b) Regression Coefficient

Answer:-b) Regression Coefficient

Which among the following is/are (an) outlier detection method(s)?
Choose the correct answer from below list
a) All the options
b) None of the options
c) Proximity-based approach
d) Clustering-based approach
e) Classification approach
f) Statistical approach

Answer:-a) All the options

__ stage of data science process helps in converting raw data into a machine-readable format.
Choose the correct answer from below list
a) Data Description
b) Data Cleaning
c) Exploratory Data Analysis
d) Data Gathering

Answer:-c) Exploratory Data Analysis

Inferential statistics is used in __ datasets.
Choose the correct answer from below list
a) Sample
b) Population
c) All the Options

Answer:-b) Population

Which of the following helps in measuring the dispersion range of the data?
Choose the correct answer from below list
a) Variance
b) None of the options
c) All the options
d) Standard Deviation
e) Range
f) Interquartile range

Answer:-c) All the options

Distance measure(s) used in clustering process of Numeric Dataset is/are __.
a) Minkowski
b) Hamming
c) All the options
d) Manhattan Distance

Answer:-c) All the options

Jacard Index distance measure is used on __.
Choose the correct answer from below list
a) Numeric dataset
b) Non-numeric dataset

Answer:-b) Non-numeric dataset

Which of the following helps in measuring the central tendency of the dataset?
Choose the correct option from below list
a) Median
b) Mode
c) All the options
d) Mean

Answer:-c) All the options

__________association measure compares the confidence with the expected confidence.
Choose the correct option from below list
a) Lift
b) Confidence
c) Support

Answer:-a) Lift

Identify the Unsupervised Learning method.
Choose the correct option from below list
a) Classification
b) Clustering
c) Association Rule Mining

Answer:-b) Clustering

Regression can be used in predicting/forecasting Applications.
Choose the correct answer from below list
a) True
b) False

Answer:-a) True

Collective outlier significantly deviates from the entire dataset.
Choose the correct answer from below list
a) True
b) False

Answer:-b) False

What is KDD and its Process
Some of the people says that data mining as a synonym of Knowledge Discovery in Databases or KDD and some others consider Data Mining as a vital step in the KDD process.

Below are the steps in KDD Process

a) Data Cleaning – Here we will Remove the noisy and inconsistent data.
b) Data Integration – Here data from diverse sources are unified.
c) Data Selection – Here we will get retrieved the relevant data.
d) Data Transformation – Here Data is transformed into appropriate forms.
e) Data Mining -This is Intelligent methods which is applied to extract knowledge and patterns.
f) Pattern Evaluation – This is used to identifies valuable patterns.
g) Knowledge Presentation- Visualization and presentation of the extracted knowledge and the identified patterns.

Identify the algorithm that works based on the concept of clustering.
Choose the correct answer from below list
a) K-Means
b) SVM
c) Decision Tree

Answer:-a) K-Means

Data Mining Questions for Fresher’s 2021

__ step of classification contributes to the construction of learning model.
Choose the correct answer from below list
a) Classification Step
b) Learning Step

Answer:-b) Learning Step

Which process of KDD aids in unifying data from different sources?
Choose the correct answer from below list
a) Data Cleaning
b) Data Selection
c) Data Mining
d) Pattern Evaluation
e) Data Integration

Answer:-e) Data Integration

Explanatory variable is a __.
Choose the correct answer from below list
a) Predictor Variable
b) Dependent Variable
c) None of the options
d) All the options
e) Response variable

Answer:-a) Predictor Variable

Response variable is a __.
Choose the correct answer from below list
a) Dependent Variable
b) Predictor Variable
c) Explanatory Variable
d) All the options

Answer:-a) Dependent Variable

Classification is a __ task.
Choose the correct answer from below list
a) Data Analysis
b) Data Transformation
c) Data Integration
d) Data Cleaning

Answer:-a) Data Analysis

Data Warehouse vs Data Mining -Detail Explanation Course

About Author


After years of Technical Work, I feel like an expert when it comes to Develop wordpress website. Check out How to Create a Wordpress Website in 5 Mins, and Earn Money Online Follow me on Facebook for all the latest updates.

Leave a Comment