A New Filtering Algorithm for Duplicate Document Based on Concept Analysis. A New Filtering Algorithm for Duplicate Document Based on Concept Analysis.

A New Filtering Algorithm for Duplicate Document Based on Concept Analysis‪.‬

Journal of Computer Science 2006, May, 2, 5

    • ‏5٫99 US$
    • ‏5٫99 US$

وصف الناشر

Abstract: Data bases and web pages contain currently a huge number of duplicate document. It is then fundamental to have a filter which can be embedded, for instance, within an information retrieval system like a search engine in order to prohibit the redundant documents references to appear on the screen as a reply to the user's query. This filter can save the user time and increases his satisfaction. In this study, we propose a new algorithm based on concept analysis principle, which can act as a filter for duplicate document. It can be applied on a collection of documents or databases and reduce their storage spaces by eliminating redundant documents without loosing knowledge. Our experiments show that this algorithm increases the precision of the information retrieval system and improves its performance. Key words: Duplicate document, concept analysis, information retrieval, information filtering

النوع
كمبيوتر وإنترنت
تاريخ النشر
٢٠٠٦
١ مايو
اللغة
EN
الإنجليزية
عدد الصفحات
٢١
الناشر
Science Publications
البائع
The Gale Group, Inc., a Delaware corporation and an affiliate of Cengage Learning, Inc.
الحجم
١٩٣٫٧
ك.ب.
Experiment and Evaluation in Information Retrieval Models Experiment and Evaluation in Information Retrieval Models
٢٠١٧
Transactions on Large-Scale Data- and Knowledge-Centered Systems XXIII Transactions on Large-Scale Data- and Knowledge-Centered Systems XXIII
٢٠١٥
Information Retrieval Technology Information Retrieval Technology
٢٠٠٨
Advances in Distributed Agent-Based Retrieval Tools Advances in Distributed Agent-Based Retrieval Tools
٢٠١٠
Advances in Databases and Information Systems Advances in Databases and Information Systems
٢٠١٨
Information Search, Integration, and Personalization Information Search, Integration, and Personalization
٢٠٢٠
Analysis of Virus Algorithms (Report) Analysis of Virus Algorithms (Report)
٢٠٠٦
A Fast Approximate String Searching Algorithm. A Fast Approximate String Searching Algorithm.
٢٠٠٥
Management Information Systems Role in Decision-Making During Crises: Case Study (Report) Management Information Systems Role in Decision-Making During Crises: Case Study (Report)
٢٠١٠
A Study of the Contracting and Procurement Process for COTS Software Projects (Commercial Off-The-Shelf) A Study of the Contracting and Procurement Process for COTS Software Projects (Commercial Off-The-Shelf)
٢٠٠٧
Fast Algorithms for Outlier Detection. Fast Algorithms for Outlier Detection.
٢٠٠٨
The Cyber Space and Information, Communication and Technology: A Tool for Westernization Or Orientalism Or both (Report) The Cyber Space and Information, Communication and Technology: A Tool for Westernization Or Orientalism Or both (Report)
٢٠١١