A New Filtering Algorithm for Duplicate Document Based on Concept Analysis. A New Filtering Algorithm for Duplicate Document Based on Concept Analysis.

A New Filtering Algorithm for Duplicate Document Based on Concept Analysis‪.‬

Journal of Computer Science 2006, May, 2, 5

    • US$5.99
    • US$5.99

출판사 설명

Abstract: Data bases and web pages contain currently a huge number of duplicate document. It is then fundamental to have a filter which can be embedded, for instance, within an information retrieval system like a search engine in order to prohibit the redundant documents references to appear on the screen as a reply to the user's query. This filter can save the user time and increases his satisfaction. In this study, we propose a new algorithm based on concept analysis principle, which can act as a filter for duplicate document. It can be applied on a collection of documents or databases and reduce their storage spaces by eliminating redundant documents without loosing knowledge. Our experiments show that this algorithm increases the precision of the information retrieval system and improves its performance. Key words: Duplicate document, concept analysis, information retrieval, information filtering

장르
컴퓨터 및 인터넷
출시일
2006년
5월 1일
언어
EN
영어
길이
21
페이지
출판사
Science Publications
판매자
The Gale Group, Inc., a Delaware corporation and an affiliate of Cengage Learning, Inc.
크기
193.7
KB
Analysis of Virus Algorithms (Report) Analysis of Virus Algorithms (Report)
2006년
A Fast Approximate String Searching Algorithm. A Fast Approximate String Searching Algorithm.
2005년
Management Information Systems Role in Decision-Making During Crises: Case Study (Report) Management Information Systems Role in Decision-Making During Crises: Case Study (Report)
2010년
A Study of the Contracting and Procurement Process for COTS Software Projects (Commercial Off-The-Shelf) A Study of the Contracting and Procurement Process for COTS Software Projects (Commercial Off-The-Shelf)
2007년
Fast Algorithms for Outlier Detection. Fast Algorithms for Outlier Detection.
2008년
The Cyber Space and Information, Communication and Technology: A Tool for Westernization Or Orientalism Or both (Report) The Cyber Space and Information, Communication and Technology: A Tool for Westernization Or Orientalism Or both (Report)
2011년