1 / 4
文档名称:

数据挖掘.pdf

格式:pdf   页数:4页
下载后只包含 1 个 PDF 格式的文档,没有任何的图纸或源代码,查看文件列表

如果您已付费下载过本站文档,您可以点这里二次下载

分享

预览

数据挖掘.pdf

上传人:sxlw2016 2016/4/25 文件大小:0 KB

下载得到文件列表

数据挖掘.pdf

相关文档

文档介绍

文档介绍:Dynamic Partial Coverage Based Feature Selection Method Yu Huang 1,2, Gongde Guo 1,2 , Tianqiang Huang 1,2 and Hong Chen 1,2 1 School of Mathematics puter Science, Fujian Normal University 2 Key Laboratory work Security and Cryptography ,Fujian Normal University Fuzhou, Fujian, China, 350007 yellowfish2001@, ******@fjnu. Abstract In this paper, we propose a novel feature selection method based on spatial coverage relations of features in multidimensional data space. As a filter solution, the algorithm can evaluate the weight of each feature by calculating the spatial coverage relations of features of instances with the same and different class labels in multidimensional data space. And the approach is simple to implement. The experimental results evaluated on some public data set downloaded from the UCI machine learning repository show that the proposed pares well with some classical feature selection methods such as Relief and SVMAttributeEval which are implemented in Weka. 1. Introduction In real-world applications, . in text mining or in predictive toxicology, the number of features in a dataset maybe very large as it could include many irrelevant or redundant features. Most of learning algorithms could not work well with the high dimensional data. This is so-called “curse of dimensionality”[1]. It is necessary to reduce the number of features in or