Journal of Computer Research and Development (计算机研究与发展)
ISSN 1000- 11-1777/TP
50(Suppl.):170-179, 2013
Map-Reduce Based Entity Identification in Big Data
Huo Ran (霍然), Wang Hongzhi (王宏志), Zhu Rong (朱鎔), Li Jianzhong (李建中), and Gao Hong (高宏)
(School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001)
(******@hit.)
Abstract With the development of information technology, problems caused by “big data” and “dirty
data” have aroused widespread concern, making the management of data quality and quantity an
extensive research focus. Entity identification technology is one of the key problems for
quality-quantity management in big data and plays a decisive role in improving data quality. Its
task is to identify different records that describe the same object, as well as identical record
forms that represent different objects. We propose an entity identification algorithm (EIBM) for big
data based on map-reduce, under the background of big data information […]. The algorithm first
computes an attribute-value based similarity between record pairs using map-reduce, then outputs
entity identification results by graph […]. We have performed extensive experiments
on the Hadoop platform using a real dataset and an artificial dataset. The experiment results evaluate
the degree of p