T-Stem--a Superior Stemmer and Temporal Extractor for Arabic Texts. T-Stem--a Superior Stemmer and Temporal Extractor for Arabic Texts.

T-Stem--a Superior Stemmer and Temporal Extractor for Arabic Texts‪.‬

Journal of Digital Information Management 2005, Sept, 3, 3

    • $5.99
    • $5.99

Publisher Description

ABSTRACT: Stemming has a large effect on Arabic information indexing and retrieval, at least partially due to the highly inflected nature of the language. Our work demonstrates the process of improving other stemmers, mainly that of [1]. We reached a recall difference of 28% over the work of [1]. The main part of improvement was due to the addition of more grammatical rules that facilitate the process of stemming. Following this part, we implemented a procedure that extracts the temporal references from the texts. This procedure is highly dependable on the stemming process. A list of all the temporal references is used. The type of the temporal word decides the procedure to treat this word and gives the importance of this temporal reference. These conditions, with the help of the stemmer, produced an excellent result of 95% precision rate and of 91% recall rate. Categories and Subject Descriptors

GENRE
Computers & Internet
RELEASED
2005
September 1
LANGUAGE
EN
English
LENGTH
34
Pages
PUBLISHER
Digital Information Research Foundation
SELLER
The Gale Group, Inc., a Delaware corporation and an affiliate of Cengage Learning, Inc.
SIZE
219.9
KB
Arabic Language Processing: From Theory to Practice Arabic Language Processing: From Theory to Practice
2019
Computational Linguistics and Intelligent Text Processing Computational Linguistics and Intelligent Text Processing
2008
Computational Processing of the Portuguese Language Computational Processing of the Portuguese Language
2016
Arabic Language Processing: From Theory to Practice Arabic Language Processing: From Theory to Practice
2018
Computational Linguistics Computational Linguistics
2016
Human Language Technology. Challenges of the Information Society Human Language Technology. Challenges of the Information Society
2009
Semantic Notation and Retrieval in Art and Architecture Image Collections. Semantic Notation and Retrieval in Art and Architecture Image Collections.
2005
A Model to Predict Whether an Online RPG Makes Gamers Loyal. A Model to Predict Whether an Online RPG Makes Gamers Loyal.
2003
Collaborative Information Searching in an Information-Intensive Work Domain: Preliminary Results. Collaborative Information Searching in an Information-Intensive Work Domain: Preliminary Results.
2004
The City in Four Dimensions: The Nu.M.E. Project. The City in Four Dimensions: The Nu.M.E. Project.
2004
Cluster Based Mixed Coding Schemes for Inverted File Index Compression. Cluster Based Mixed Coding Schemes for Inverted File Index Compression.
2008
Citation Auctions As a Method to Improve Selection of Scientific Papers (Report) Citation Auctions As a Method to Improve Selection of Scientific Papers (Report)
2008