T-Stem--a Superior Stemmer and Temporal Extractor for Arabic Texts. T-Stem--a Superior Stemmer and Temporal Extractor for Arabic Texts.

T-Stem--a Superior Stemmer and Temporal Extractor for Arabic Texts‪.‬

Journal of Digital Information Management 2005, Sept, 3, 3

    • ‏5٫99 US$
    • ‏5٫99 US$

وصف الناشر

ABSTRACT: Stemming has a large effect on Arabic information indexing and retrieval, at least partially due to the highly inflected nature of the language. Our work demonstrates the process of improving other stemmers, mainly that of [1]. We reached a recall difference of 28% over the work of [1]. The main part of improvement was due to the addition of more grammatical rules that facilitate the process of stemming. Following this part, we implemented a procedure that extracts the temporal references from the texts. This procedure is highly dependable on the stemming process. A list of all the temporal references is used. The type of the temporal word decides the procedure to treat this word and gives the importance of this temporal reference. These conditions, with the help of the stemmer, produced an excellent result of 95% precision rate and of 91% recall rate. Categories and Subject Descriptors

النوع
كمبيوتر وإنترنت
تاريخ النشر
٢٠٠٥
١ سبتمبر
اللغة
EN
الإنجليزية
عدد الصفحات
٣٤
الناشر
Digital Information Research Foundation
البائع
The Gale Group, Inc., a Delaware corporation and an affiliate of Cengage Learning, Inc.
الحجم
٢١٩٫٩
ك.ب.
Arabic Language Processing: From Theory to Practice Arabic Language Processing: From Theory to Practice
٢٠١٩
Computational Linguistics and Intelligent Text Processing Computational Linguistics and Intelligent Text Processing
٢٠٠٨
Computational Processing of the Portuguese Language Computational Processing of the Portuguese Language
٢٠١٦
Arabic Language Processing: From Theory to Practice Arabic Language Processing: From Theory to Practice
٢٠١٨
Computational Linguistics Computational Linguistics
٢٠١٦
Human Language Technology. Challenges of the Information Society Human Language Technology. Challenges of the Information Society
٢٠٠٩
Semantic Notation and Retrieval in Art and Architecture Image Collections. Semantic Notation and Retrieval in Art and Architecture Image Collections.
٢٠٠٥
A Model to Predict Whether an Online RPG Makes Gamers Loyal. A Model to Predict Whether an Online RPG Makes Gamers Loyal.
٢٠٠٣
Collaborative Information Searching in an Information-Intensive Work Domain: Preliminary Results. Collaborative Information Searching in an Information-Intensive Work Domain: Preliminary Results.
٢٠٠٤
The City in Four Dimensions: The Nu.M.E. Project. The City in Four Dimensions: The Nu.M.E. Project.
٢٠٠٤
Cluster Based Mixed Coding Schemes for Inverted File Index Compression. Cluster Based Mixed Coding Schemes for Inverted File Index Compression.
٢٠٠٨
Citation Auctions As a Method to Improve Selection of Scientific Papers (Report) Citation Auctions As a Method to Improve Selection of Scientific Papers (Report)
٢٠٠٨