文档介绍:Machine Learning with WEKA
WEKA: the bird
Copyright: Martin Kramer (******@)
9/4/2017
2
University of Waikato
WEKA: the software
Machine learning/data mining software written in Java (distributed under the GNU Public License)
Used for research, education, and plements “Data Mining” by Witten & Frank
Main features:
Comprehensive set of data pre-processing tools, learning algorithms and evaluation methods
Graphical user interfaces (incl. data visualization)
Environment paring learning algorithms
9/4/2017
3
University of Waikato
WEKA: versions
There are several versions of WEKA:
WEKA : “book version” compatible with description in data mining book
WEKA : “GUI version” adds graphical user interfaces (book version mand-line only)
WEKA : “development version” with lots of improvements
This talk is based on the latest snapshot of WEKA (soon to be WEKA )
9/4/2017
4
University of Waikato
***@relation heart-disease-simplified
***@attribute age numeric
***@attribute sex { female, male}
***@attribute chest_pain_type { typ_angina, asympt, non_anginal, atyp_angina}
***@attribute cholesterol numeric
***@attribute exercise_induced_angina { no, yes}
***@attribute class { present, not_present}
***@data
63,male,typ_angina,233,no,not_present
67,male,asympt,286,yes,present
67,male,asympt,229,yes,present
38,female,non_anginal,?,no,not_present
...
WEKA only deals with “flat” files
Flat file in
9/4/2017
5
University of Waikato
***@relation heart-disease-simplified
***@attribute age numeric
***@attribute sex { female, male}
***@attribute chest_pain_type { typ_angina, asympt, non_anginal, atyp_angina}
***@attribute cholesterol numeric
***@attribute exercise_induced_angina { no, yes}
***@attribute class { present, not_present}
***@data
63,male,typ_angina,233,no,not_present
67,male,asympt,286,yes,present
67,male,asympt,229,yes,present
38,female,non_anginal,?,no,not_present
...
WEKA only deals with “flat” files
numeric attribute
nominal attribute
9/4/2017
6
University of Waikato
9/4/201