Document name: Design and Implementation of a Vertical Search Engine System Based on Lucene (基于Lucene的垂直搜索引擎系统设计与实现论文.doc)

Format: doc   Size: 1,713 KB   Pages: 42
Uploaded by: xunlai783, 2018/1/26. File size: 1.67 MB

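Abstract
Search engines make it easier for people to find information resources on an Internet that holds massive amounts of information. However, the explosive growth of online information resources has made it increasingly difficult to obtain useful information. As a result, more attention is being paid to how to extract latent, valuable information from massive amounts of online information quickly and effectively, so that it can play a useful role in management and decision-making. This situation calls for a search engine system that focuses on a specific domain, has low hardware requirements, covers its data comprehensively and in depth, and is updated promptly. In recent years, research on and application of domain-specific search engines has become a fairly active direction in search technology. A search engine built for a particular domain, a particular group of users, or a particular need is called a vertical search engine.
This thesis introduces the principles of vertical search engines, analyzes the indexing and retrieval techniques of the open-source toolkit Lucene, and compares Lucene with relational databases. Building on the Lucene retrieval architecture, it uses the Spring framework together with the web crawler Heritrix, Ajax, and related technologies to implement a Java-based Lucene vertical search engine system for mobile phones, illustrating a simple practical application of these techniques.
Keywords: Lucene; vertical search engine; web crawler; Chinese word segmentation; retrieval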
ABSTRACT

Search engines provide a convenient way to search for information resources on a network that holds a massive amount of information. But network information resources are growing explosively, and it is becoming more and more difficult for people to get useful information. People have therefore begun to pay more attention to how to extract latent and valuable information quickly and efficiently from this massive amount of information, so that it can be used effectively in management and decision-making. To address this problem, we need a search engine system that targets a specific domain, requires only a modest hardware configuration, covers its data in depth, and is updated in a timely manner. In recent years, the research and application of domain-specific search technology has become a hot research direction. We call a search engine that targets a specific domain, a specific group of people, or a specific demand a vertical search engine.
This thesis introduces vertical search engines, analyzes the indexing and retrieval technology of the open-source toolkit Lucene, and compares Lucene with relational databases. We then illustrate its application through a Java-based vertical search engine for mobile phones, which builds on Lucene's retrieval architecture, uses the Spring framework, and incorporates the web crawler Heritrix, Ajax, and other technologies.
Keywords: Lucene; Vertical Search Engine; Web Crawler; Chinese Word Segmentation; Retrieval
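Contents
Abstract
ABSTRACT
Chapter 1 Introduction
Background and Significance of the Topic
System Overview
Main Work and Organization of the Thesis
Chapter 2 Vertical Search Engine Systems
Concept and Characteristics of Vertical Search Engines
Advantages and Current State of Vertical Search Engines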