2.Cluster Analysis, Outlier Analysis
3.Machine Learning
Supervised learning
Unsupervised learning
Semi-supervised learning
Active learning
4. Present an example where data mining is crucial to the success of a business. What data
mining functionalities does this business need (e.g., think of the kinds of patterns that
could be mined)? Can such patterns be generated alternatively by data query processing
or simple statistical analysis?
5 Explain the difference and similarity between discrimination and classification, between
characterization and clustering, and between classification and regression.
6 Based on your observations, describe another possible kind of knowledge that needs to
be discovered by data mining methods but has not been listed in this chapter. Does it
require a mining methodology that is quite different from those outlined in this chapter?
7 Outliers are often discarded as noise. However, one person’s garbage could be another’s
treasure. For example, exceptions in credit card transactions can help us detect the
fraudulent use of credit cards. Using fraudulence detection as an example, propose two
methods that can be used to detect outliers and discuss which one is more reliable.
8 Describe three challenges to data mining regarding data mining methodology and user
interaction issues.
9 What are the major challenges of mining a huge amount of data (e.g., billions of tuples)
in comparison with mining a small amount of data (e.g., data set of a few hundred
tuple)?
10 Outline the major research challenges of data mining in one specific application domain,
such as stream/sensor data analysis, spatiotemporal data analysis, or bioinformatics.
No comments:
Post a Comment