A New Filtering Algorithm for Duplicate Document Based on Concept Analysis. A New Filtering Algorithm for Duplicate Document Based on Concept Analysis.

A New Filtering Algorithm for Duplicate Document Based on Concept Analysis‪.‬

Journal of Computer Science 2006, May, 2, 5

    • $5.99
    • $5.99

Publisher Description

Abstract: Data bases and web pages contain currently a huge number of duplicate document. It is then fundamental to have a filter which can be embedded, for instance, within an information retrieval system like a search engine in order to prohibit the redundant documents references to appear on the screen as a reply to the user's query. This filter can save the user time and increases his satisfaction. In this study, we propose a new algorithm based on concept analysis principle, which can act as a filter for duplicate document. It can be applied on a collection of documents or databases and reduce their storage spaces by eliminating redundant documents without loosing knowledge. Our experiments show that this algorithm increases the precision of the information retrieval system and improves its performance. Key words: Duplicate document, concept analysis, information retrieval, information filtering

GENRE
Computers & Internet
RELEASED
2006
May 1
LANGUAGE
EN
English
LENGTH
21
Pages
PUBLISHER
Science Publications
SELLER
The Gale Group, Inc., a Delaware corporation and an affiliate of Cengage Learning, Inc.
SIZE
193.7
KB
Experiment and Evaluation in Information Retrieval Models Experiment and Evaluation in Information Retrieval Models
2017
Transactions on Large-Scale Data- and Knowledge-Centered Systems XXIII Transactions on Large-Scale Data- and Knowledge-Centered Systems XXIII
2015
Information Retrieval Technology Information Retrieval Technology
2008
Advances in Distributed Agent-Based Retrieval Tools Advances in Distributed Agent-Based Retrieval Tools
2010
Advances in Databases and Information Systems Advances in Databases and Information Systems
2018
Information Search, Integration, and Personalization Information Search, Integration, and Personalization
2020
Analysis of Virus Algorithms (Report) Analysis of Virus Algorithms (Report)
2006
A Fast Approximate String Searching Algorithm. A Fast Approximate String Searching Algorithm.
2005
Management Information Systems Role in Decision-Making During Crises: Case Study (Report) Management Information Systems Role in Decision-Making During Crises: Case Study (Report)
2010
A Study of the Contracting and Procurement Process for COTS Software Projects (Commercial Off-The-Shelf) A Study of the Contracting and Procurement Process for COTS Software Projects (Commercial Off-The-Shelf)
2007
Fast Algorithms for Outlier Detection. Fast Algorithms for Outlier Detection.
2008
The Cyber Space and Information, Communication and Technology: A Tool for Westernization Or Orientalism Or both (Report) The Cyber Space and Information, Communication and Technology: A Tool for Westernization Or Orientalism Or both (Report)
2011