Building and Using Comparable Corpora for Multilingual Natural Language Processing Building and Using Comparable Corpora for Multilingual Natural Language Processing
Synthesis Lectures on Human Language Technologies

Building and Using Comparable Corpora for Multilingual Natural Language Processing

Serge Sharoff and Others
    • $34.99
    • $34.99

Publisher Description

This book provides a comprehensive overview of methods to build comparable corpora and of their applications, including machine translation, cross-lingual transfer, and various kinds of multilingual natural language processing. The authors begin with a brief history on the topic followed by a comparison to parallel resources and an explanation of why comparable corpora have become more widely used. In particular, they provide the basis for the multilingual capabilities of pre-trained models, such as BERT or GPT. The book then focuses on building comparable corpora, aligning their sentences to create a database of suitable translations, and using these sentence translations to produce dictionaries and term banks. Then, it is explained how comparable corpora can be used to build machine translation engines and to develop a wide variety of multilingual applications.

GENRE
Computers & Internet
RELEASED
2023
August 23
LANGUAGE
EN
English
LENGTH
141
Pages
PUBLISHER
Springer International Publishing
SELLER
Springer Nature B.V.
SIZE
8.2
MB

More Books by Serge Sharoff, Reinhard Rapp & Pierre Zweigenbaum

A Frequency Dictionary of Russian A Frequency Dictionary of Russian
2014
Building and Using Comparable Corpora Building and Using Comparable Corpora
2013
Genres on the Web Genres on the Web
2010

Other Books in This Series

Lifelong and Continual Learning Dialogue Systems Lifelong and Continual Learning Dialogue Systems
2024
Automatic Language Identification in Texts Automatic Language Identification in Texts
2024
Cognitive Plausibility in Natural Language Processing Cognitive Plausibility in Natural Language Processing
2023