A New Text Mining Approach for Finding Protein-To-Disease Associations. A New Text Mining Approach for Finding Protein-To-Disease Associations.

A New Text Mining Approach for Finding Protein-To-Disease Associations‪.‬

American Journal of Biochemistry and Biotechnology 2005, Summer, 1, 3

    • $5.99
    • $5.99

Publisher Description

Abstract: Discovering significant relationships between biological entities from text documents is an important task for biologists in order to develop biological models for research and discovery, especially with the existing gigantic amounts of biomedical documents and the rate at which they are increasing everyday. We propose a new text mining method to extract associations between biological entities from text documents; and we focus and apply the method in our experiments on discovering proteins-to-diseases associations. The proposed method uses two sets of documents on the topic of interest [a negative set and positive (or relevant) set] and utilizes the concepts of expectation (ex), evidence (ev) and Z-scores in combining positive and negative evidences in determining the significant associations. Moreover, the method offers an efficient way to handle protein names, aliases and abbreviations and to disambiguate them from common abbreviations, gene symbols and such. We evaluated the method in discovering protein-to-disease associations from Medline abstracts and the results are very encouraging. We confirmed the correctness of the results, in each experiment, through articles from Medline. Our method was able to discover associations between certain proteins and various diseases like Alzheimer, Creutzfeldt-Jakob, Crohn Disease, Dengue, Jaundice, Lung cancer and more. For example, in Alzheimer test, the method ran on 83,933 abstracts and discovered that Alzheimer has significant association with 6 proteins, among them, Amyloid beta A4 protein precursor, Apolipoprotein E precursor and Presenilin 1 [PMIDs: 8596911, 1465129, 8346443, 12614323, 8766720 and 8878479]. We further tested our method on some already discovered and published relationships between genes and diseases and the method was also successful in supporting those discoveries. Key words: Biomedical text mining, information extraction, text mining, bioinformatics

GENRE
Professional & Technical
RELEASED
2005
22 June
LANGUAGE
EN
English
LENGTH
24
Pages
PUBLISHER
Science Publications
SELLER
The Gale Group, Inc., a Delaware corporation and an affiliate of Cengage Learning, Inc.
SIZE
226.8
KB
Biomedical Text Mining Biomedical Text Mining
2022
Computation in BioInformatics Computation in BioInformatics
2021
Proteomics for Biological Discovery Proteomics for Biological Discovery
2019
Proteomic Applications in Cancer Detection and Discovery Proteomic Applications in Cancer Detection and Discovery
2013
Translational Bioinformatics and Systems Biology Methods for Personalized Medicine Translational Bioinformatics and Systems Biology Methods for Personalized Medicine
2017
Between the Lines of Genetic Code Between the Lines of Genetic Code
2013
Protein Molecular Structures, Protein Subfractions, And Protein Availability Affected by Heat Processing: A Review (Report) Protein Molecular Structures, Protein Subfractions, And Protein Availability Affected by Heat Processing: A Review (Report)
2007
Camel's Milk Protects Against Aluminum Chloride-Induced Toxicity in the Liver and Kidney of White Albino Rats (Report) Camel's Milk Protects Against Aluminum Chloride-Induced Toxicity in the Liver and Kidney of White Albino Rats (Report)
2009
Effect of Stages of Maturity and Ripening Conditions on the Biochemical Characteristics of Tomato. Effect of Stages of Maturity and Ripening Conditions on the Biochemical Characteristics of Tomato.
2008
Perception of Extension Specialists About the Role of Extension in the Production and Adoption of the Genetically Modified Crops in Iran. Perception of Extension Specialists About the Role of Extension in the Production and Adoption of the Genetically Modified Crops in Iran.
2008
Soluble CD14, Sialic Acid and L-Fucose in Breast Milk and Their Role in Increasing the Immunity of Breast-Fed Infants (Report) Soluble CD14, Sialic Acid and L-Fucose in Breast Milk and Their Role in Increasing the Immunity of Breast-Fed Infants (Report)
2011
Cardiovascular Risk Factors in North Indians: A Case-Control Study. Cardiovascular Risk Factors in North Indians: A Case-Control Study.
2006