1 / 90
文档名称:

数据挖掘教材.ppt

格式:ppt   大小:1,308KB   页数:90页
下载后只包含 1 个 PPT 格式的文档,没有任何的图纸或源代码,查看文件列表

如果您已付费下载过本站文档,您可以点这里二次下载

分享

预览

数据挖掘教材.ppt

上传人:allap 2020/11/5 文件大小:1.28 MB

下载得到文件列表

数据挖掘教材.ppt

文档介绍

文档介绍:Data Mining: An Overview from a Database Perspective Tutorial Outline PAKDD Conference 1998
Jiawei Han
Simon Fraser University, Canada
Where to Find the Updated Set of Tutorial Notes?
/~han
click <Other interested links>
PAKDD’98 tutorial notes
1. On-Line Analytical Mining
2. Clustering Techniques
3. Spatial Data Mining
4. Visual Data Mining
5. Towards Parallel On-Line Analytical Mining
6. Data Mining: The Next Step
7. References
Outline
Motivation
Data Warehousing and OLAP: An Overview
Data Mining: A General Introduction
On-Line Analytical Mining (OLAM): An Integration of OLAP with Mining
DBMiner: A System which Performs OLAP Mining
Future Research on On-Line Analytical Mining
Motivation: “Necessity is the Mother of Invention”
Data warehousing: Integration of data from multiple sources into large warehouses and support of on-line analytical processing and business decision making.
Data mining (knowledge discovery in databases): Extraction of interesting knowledge (rules, regularities, patterns, constraints) from data in large databases.
Necessity: Data explosion problem --- computerized data collection tools and mature database technology lead to tremendous amounts of data stored in databases.
We are drowning in data, but starving for knowledge!
Why Data Mining? Potential Applications
Marketing
Corporate Analysis
Fraud Detection
Other Applications
Marketing
Sales Analysis
associations between product sales
beer and diapers
Customer Profiling
data mining can tell you what types of customers buy what products
Identifying Customer Requirements
identify the best products for different customers
use prediction to find what factors will attract new customers
Corporate Analysis
Finances
cash flow analysis and prediction
Resources
summarize and compare the resources and spending
Competition
compare with other competitors by summarizing data to the same lev