文档介绍:-
LW42 网络搜索引擎的研究与开发(Eclipse+Tomcat+MySQL)
摘要:的迅速发展,上的信息成指数增长。由于网络信息资源的飞速增加,给人们在网上寻找所需信息带来了很大的困难。搜索引擎的出现增强了人们收集和定位所需信息的能力,能够帮助人们迅速找到所需要的信息。以后的几年里搜索引擎技术开始不断的发展,上的搜索引擎数量也是急剧的增加,的搜索引擎技术成为了研究的热点。随着搜索引擎应用的广泛化,人们对于搜索引擎的要求也越来越高,查准率和查全率成为衡量搜索引擎的新标准,无用信息的过滤成为人们开始关注的问题。
如今搜索引擎不仅仅考虑能够搜索信息,还要考虑最快速的获取用户所需要的信息。本文针对搜索的特点研究了搜索引擎的构建技术,包括从网页文档抓取、解析、再到建立索引、发布搜索、用户界面搭建的全过程,并基于开源的Lucene软件包实现了一个原型系统,取得了较好的搜索效果.
本设计说明书主要介绍了本课题的开发意义、完成的功能和开发过程,并着重说明了开发设计的思想、技术难点和解决方案。
关键词:Lucene, 搜索引擎, , 爬虫
毕业设计(论文)外文摘要
The research and development of Web Search Engine
Abstract: With the fast development of , the information of it growing very rapidly. Because of it, it is very difficult for people to search the information they need. Search engine improves the people’s ability to collect and locate the useful information and help them to find the information the need rapidly. In the following several years the technology of search engine begins developing continually, the amount of search engine system grows very rapidly, the technology of search engine system basing on the has e a hot research. Along with search engine application widespread, people are also getting higher and higher regarding search engine's request, the accuracy ratio and the recall e new standard weigh search engine, and people began to e concerned about filtering the information that is useless.
Now the search engine have to consider no