A High-Performance Distributed Algorithm for Mining Association Rules
Assaf Schuster, Ran Wolff, and Dan Trock
Technion – Israel Institute of Technology
Email: {assaf,ranw,dtrock} ***@
Abstract

We present a new distributed association rule mining (D-ARM) algorithm that demonstrates superlinear speedup with the number of computing nodes. The algorithm is the first D-ARM algorithm to perform a single scan over the database. As such, its performance is unmatched by any previous algorithm. Scale-up experiments over standard synthetic benchmarks demonstrate stable run time regardless of the number of computers. Theoretical analysis reveals a tighter bound on error probability than the one shown in the corresponding sequential algorithm.

… the largest itemset in Apriori [3], to typically just a single scan in modern ARM algorithms such as Sampling and DIC [17, 5].

Much progress has also been made in parallelized algorithms. With these, the architecture of the parallel system plays a key role. For instance, many algorithms were proposed that take advantage of the fast interconnect, or the shared memory, of parallel computers. The latest development with these is [18], in which each process makes just two passes over its portion of the database.

Parallel computers are, however, very costly. Hence, although these algorithms were shown to scale up to 128 processors, …