Document introduction: Design of a Campus Network Search Engine
Abstract
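With the rapid development and widespread use of the Internet, the volume of information on the network grows daily, and how to quickly locate information of interest within this mass of data has become one of the questions people care about most. Search engine technology builds a bridge between users and information sources, giving users an effective means of information retrieval. Therefore, with the aim of integrating campus network resources, and on the basis of a study of the basic principles, core technologies, and processing flow of search engines, combined with the personalized requirements of a campus network search engine, this paper designs a campus network search engine system that is flexible, configurable, highly extensible, and reasonably efficient.
The paper introduces the background of the system's development and the current state of search engine technology in China and abroad, and describes in detail the development process and methods of the search engine system. It first analyzes the personalized requirements of a campus network search engine from two perspectives, functional requirements and non-functional requirements. Based on the results of that analysis, it then sets out the system's implementation goals and principles. Next, it describes the system's overall functionality and overall flow in terms of its functional architecture and technical architecture. Finally, it describes the design of the plug-in mechanism and the detailed design of several key modules: the crawling module, the document parsing module, and the retrieval and indexing module.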
Keywords: campus network; search engine; web crawler; document parsing; indexing
The Design of a Campus Network Search Engine
ABSTRACT
With the rapid development and widespread use of the Internet, the amount of information online grows daily, and how to quickly locate information of interest within this mass of data has become one of the questions people care about most. Search engine technology builds a bridge between users and information sources and provides users with an effective means of information retrieval. Therefore, with the aim of integrating campus network resources, and on the basis of a study of the basic principles, core technologies, and processing flow of search engines, combined with the personalized requirements of a campus network search engine, this paper designs a campus network search engine system that is flexible, configurable, highly extensible, and reasonably efficient.
This paper introduces the background of the system's development and the current state of search engine technology at home and abroad, and describes in detail the development process and methods of the search engine system. First, it analyzes the personalized requirements of a campus network search engine from two perspectives, functional requirements and non-functional requirements. Based on the results of that analysis, it then sets out the system's implementation goals and principles. Next, it describes the system's overall functionality and overall flow in terms of its functional architecture and technical architecture. Finally, it describes the design of the plug-in mechanism and the detailed design of several key modules: the crawling module, the document parsing module, and the retrieval and indexing module.