1 / 88
文档名称:

Introduction to Data Mining - Chapter 6. Association Analysis - Basic Concepts and Algorithms.pdf

格式:pdf   页数:88
下载后只包含 1 个 PDF 格式的文档,没有任何的图纸或源代码,查看文件列表

如果您已付费下载过本站文档,您可以点这里二次下载

Introduction to Data Mining - Chapter 6. Association Analysis - Basic Concepts and Algorithms.pdf

上传人:bolee65 2014/1/9 文件大小:0 KB

下载得到文件列表

Introduction to Data Mining - Chapter 6. Association Analysis - Basic Concepts and Algorithms.pdf

文档介绍

文档介绍:6
Association Analysis:
Basic Concepts and
Algorithms
Many business enterprises accumulate large quantities of data from their day-
to-day operations. For example, huge amounts of customer purchase data are
collected daily at the checkout counters of grocery stores. Table illustrates
an example of such data, commonly known as market basket transactions.
Each row in this table corresponds to a transaction, which contains a unique
identifier labeled TID and a set of items bought by a given customer. Retail-
ers are interested in analyzing the data to learn about the purchasing behavior
of their customers. Such valuable information can be used to support a vari-
ety of business-related applications such as marketing promotions, inventory
management, and customer relationship management.
This chapter presents a methodology known as association analysis,
which is useful for discovering interesting relationships hidden in large data
sets. The uncovered relationships can be represented in the form of associa-
Table . An example of market basket transactions.
TID Items
1 {Bread, Milk}
2 {Bread, Diapers, Beer, Eggs}
3 {Milk, Diapers, Beer, Cola}
4 {Bread, Milk, Diapers, Beer}
5 {Bread, Milk, Diapers, Cola}
328 Chapter 6 Association Analysis
tion rules or sets of frequent items. For example, the following rule can be
extracted from the data set shown in Table :
{Diapers}−→{Beer}.
The rule suggests that a strong relationship exists between the sale of diapers
and beer because many customers who buy diapers also buy beer. Retailers
can use this type of rules to help them identify new opportunities for cross-
selling their products to the customers.
Besides market basket data, association analysis is also applicable to other
application domains such as bioinformatics, medical diagnosis, Web mining,
and scientific data analysis. In the analysis of Earth science data, for example,
the association patterns may reveal interesting