1 / 15
文档名称:

机器学习十大算法:K-means.pdf

格式:pdf   大小:826KB   页数:15页
下载后只包含 1 个 PDF 格式的文档,没有任何的图纸或源代码,查看文件列表

如果您已付费下载过本站文档,您可以点这里二次下载

分享

预览

机器学习十大算法:K-means.pdf

上传人:jllzaxwb 2016/10/12 文件大小:826 KB

下载得到文件列表

机器学习十大算法:K-means.pdf

相关文档

文档介绍

文档介绍:Chapter 2K-MeansJoydeep Ghosh and Alexander Introduction............................................................ The k-means Algorithm................................................. Available Software...................................................... Examples............................................................... Advanced Topics........................................................ Summary............................................................... Exercises...............................................................33References................................................................... IntroductionIn this chapter, we describe thek-meansalgorithm, a straightforward and widelyused clustering algorithm. Given a set of objects (records), the goal of clusteringor segmentation is to divide these objects into groups or “clusters” such that objectswithin a group tend to be moresimilar to oneanother paredto objects belongingto different groups. In other words, clustering algorithms place similar points in thesame cluster while placing dissimilar points in different clusters. Note that, in contrasttosupervisedtasks such as regression or classi?cation where there is a notion of atarget value or class label, the objects that form the inputs to a clustering proceduredo e with an associated target. Therefore,clusteringis often referred toas unsupervised learning. Because there is no need for labeled data, unsupervisedalgorithms are suitable for many applications where labeled data is dif?cult to tasks such as clustering are also often used to explore and characterizethe dataset before running a supervised learning task. Since clustering makes no useof class labels, some notion of similarity must be de?ned based on the attributes of theobjects. Thede?nition of similarity and the methodin whichpoints are clustered differbased on the clustering algorithm being applied. Thus, diff