JIANG Hui, YANG Xiao-hua, LIU Zhi-ming, YAN Shi-yu, MA Jia-yu, LI Xiao-yun, LI Meng, ZHOU Zuo. Website Search Engine Design and Implementation Based on a Document Representation Model[J]. Journal of University of South China (Science and Technology), 2013, 27(4): 77-81.
Website Search Engine Design and Implementation Based on a Document Representation Model
Keywords: Lucene; website search engine; search engine; information retrieval
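Funding: Natural Science Foundation of Hunan Province (11JJ6047); Hengyang Science and Technology Program (2011KJ14; 2013KG67); Hunan Provincial Science and Technology Program (2011FJ3087); University of South China Key Discipline Fund for Computer Science and Technology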
Abstract views: 3121
Full-text downloads: 3117
Abstract:
According to comprehensive information theory, epistemological information is the trinity of syntactic, semantic, and pragmatic information, and incorporating pragmatic information into the retrieval process can effectively improve retrieval quality. A document representation model based on query and content makes good use of pragmatic information and helps improve the precision of a website search engine. Lucene is an open-source full-text search engine framework written in Java. This paper uses Lucene to design and implement a website search engine based on the query-and-content document representation model. Experimental results show that the model effectively improves the precision of information retrieval.
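The abstract does not spell out the model's formulas, so the following is only a minimal sketch of the general idea, assuming that "pragmatic information" is captured by the queries users issued before clicking a document. The class `QueryContentIndex`, the `query_weight` parameter, and the simple term-count scoring are all illustrative assumptions; this is plain Python, not Lucene, and it should not be read as the authors' implementation.

```python
from collections import Counter


def tokenize(text):
    """Lowercase whitespace tokenizer (a stand-in for a real analyzer)."""
    return text.lower().split()


class QueryContentIndex:
    """Toy index that represents each document by two term bags:
    its own content, and the queries that previously led to it."""

    def __init__(self, query_weight=2.0):
        # query_weight is a hypothetical tuning knob, not taken from the paper.
        self.query_weight = query_weight
        self.content = {}   # doc_id -> Counter of content terms
        self.queries = {}   # doc_id -> Counter of terms from clicked queries

    def add_document(self, doc_id, text):
        self.content[doc_id] = Counter(tokenize(text))
        self.queries.setdefault(doc_id, Counter())

    def record_click(self, query, doc_id):
        """Attach a past query to a document (the assumed pragmatic signal)."""
        self.queries[doc_id].update(tokenize(query))

    def score(self, query, doc_id):
        content_terms = self.content[doc_id]
        query_terms = self.queries[doc_id]
        return sum(
            content_terms[t] + self.query_weight * query_terms[t]
            for t in tokenize(query)
        )

    def search(self, query):
        """Return ids of matching documents, best score first (ties by id)."""
        scored = [(self.score(query, d), d) for d in self.content]
        scored.sort(key=lambda pair: (-pair[0], pair[1]))
        return [d for s, d in scored if s > 0]


index = QueryContentIndex()
index.add_document("a", "library opening hours and contact")
index.add_document("b", "library opening hours for the reading room")
print(index.search("library hours"))   # ['a', 'b']: equal content scores
index.record_click("library hours", "b")
print(index.search("library hours"))   # ['b', 'a']: the click history lifts b
```

In a Lucene-based system the same effect could plausibly be obtained by indexing the associated queries as an additional, more heavily weighted field, but the abstract does not confirm that design.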