Select Category
All Categories
Accounting
Agriculture
Astrobiology
Banking & Finance
Business World
Civilizations
Commerce & Economics
Competitive exams
Computer Science & IT
Current affairs
Earth Science
Education
Environment
Explore the Animal World
Food chain
General Knowledge
Geography Map
Green Life
History
Human Health
Journalism
Law
Science
Social Science
TAX
About Us
Contact Us
Login
Data Mining - Part 1
1
of
25
💡
Hints:
3
Q1. ___ often used for both the prelimi nary investigation of the data and the f inal data analysis
A. Aggregation
B. Feature creation
C. Sampling
D. Attribution transformation
Q2. DBSCAN has a drawback over OPTICS
A. fixed sized radiius
B. fixed sized points in radiius
C. core points
D. none of above
Q3. a data warehouse can include
A. flat-files
B. database table
C. online data
D. all
Q4. ___ refers to the grouping of records, observations, or cases into classes of similar objects
A. Clustering
B. Grouping
C. Classification
D. Gathering
Q5. Select the tools that can be used for data mining
A. KNIME
B. WEKA
C. RATTLE
D. All
Q6. Lift dari rule K ___ E adalah ___
×
A. 0.4
B. 0.6
C. 0.8
D. 1
Q7. k-means algorithm is sensitive to outliers
A. TRUE
B. FALSE
Q8. Of the following what are the distance based clustering algorithms?
A. K-Means
B. K-Medoids
C. Hierarchical
D. All
Q9. Same person with multiple email addresses is an example of ___
A. Noise
B. Outliers
C. Missing values
D. Duplicate data
Q10. Select the skills mainly required as a competent data analyst/scientist/miner
A. SQL
B. R
C. Java
D. All
Q11. Of the following which is not a distance based clustering algorithms?
A. K-Means
B. K-Medoids
C. BIRCH
D. DBSCAN
Q12. Which of the following is not involve in data mining
A. Data archaeology
B. Knowledge extraction
C. Data transformation
D. Data exploration
Q13. A collection of integrated, subject oriented databases designed to support the decision-support functions
A. Database
B. Data Collection
C. Data Warehouse
D. Data retrieval
Q14. Find the median of these numbers:4,2,7,4,3
A. 2
B. 5
C. 7
D. 4
Q15. Is Logistic regression a supervised machine learning algorithm?
A. TRUE
B. FALSE
Q16. In statistical distribution the outlier is identified using ___
A. working hypothesis
B. discontency test
C. alternative hypothesis
D. none of the above
Q17. The mass of the beaker was 122 g.
A. Qualitative Data
B. Quantitative Data
Q18. This is a term that describes the large volume of data; both structured and unstructured.
A. Big Data
B. Big Knowledge
C. Big Information
D. Data at rest
Q19. The view over an operational data warehouse is known as virtual warehouse
A. TRUE
B. FALSE
Q20. Suppose a cluster contain the points (1, 3), (3, 3), (2, 1). What is the centroid of the cluster?
A. (2, 2.33)
B. (2.33, 2)
C. (2, 3)
D. None
Q21. Ordinal is an example of ___
A. Ratio class
B. Interval class
C. Categorical class
D. Range class
Q22. ___ is the output of KDD
A. Query
B. Useful Information
C. Data
D. Information
Q23. Application server and data server are kept separately in.
A. Peer to Peer based Processing
B. Master slave based Processing
C. Host based Processing
D. 3-Tier Client Server model
Q24. ___ refers to the mapping or classifi cation of a class with some predefined group or class.
A. Data Discrimination
B. Data Characterization
C. Data Definition
D. Data Visualization
Q25. Data mining is ___
A. A time variant non-Volatile collec tion of data
B. The actual discovery phase of a Knowledge
C. The stage of selecting the right data
D. None of these
Submitting Your Quiz...
Please wait while we process your answers