Low Resource Social Media Text Mining
SpringerBriefs in Computer Science

    • 52,99 €

Publisher Description

This book focuses on methods that are unsupervised or require minimal supervision, which is vital in the low-resource domain. Over the past few years, rapid growth in Internet access across the globe has resulted in an explosion of user-generated text content on social media platforms. This effect is especially pronounced in linguistically diverse areas of the world like South Asia, where over 400 million people regularly access social media platforms. YouTube, Facebook, and Twitter report a monthly active user base in excess of 200 million from this region. Natural language processing (NLP) research and publicly available resources such as models and corpora prioritize Web content authored primarily by a Western user base. Such content is authored in English by a user base fluent in the language and can be processed by a broad range of off-the-shelf NLP tools. In contrast, text from linguistically diverse regions features high levels of multilinguality, code-switching, and varied language skill levels. Resources like corpora and models are also scarce. Due to these factors, newer methods are needed to process such text.

This book is designed for NLP practitioners well versed in recent advances in the field but unfamiliar with the landscape of low-resource multilingual NLP. The contents of this book introduce the various challenges associated with social media content, quantify these issues, and provide solutions and intuition. When possible, the methods discussed are evaluated on real-world social media data sets to emphasize their robustness to the noisy nature of the social media environment.


On completing the book, the reader will be well versed in the complexity of text mining in multilingual, low-resource environments; will be aware of a broad set of off-the-shelf tools that can be applied to various problems; and will be able to conduct sophisticated analyses of such text.

GENRE
Computers & Internet
RELEASED
2021
October 1
LANGUAGE
EN
English
LENGTH
71
Pages
PUBLISHER
Springer Nature Singapore
PROVIDER INFO
Springer Science & Business Media LLC
SIZE
5.3
 MB