Designing and Evaluating Language Corpora

A Practical Framework for Corpus Representativeness

Jesse Egbert and Others
    • 35,99 €

Publisher Description

Corpora are ubiquitous in linguistic research, yet to date, there has been no consensus on how to conceptualize corpus representativeness and collect corpus samples. This pioneering book bridges this gap by introducing a conceptual and methodological framework for corpus design and representativeness. Written by experts in the field, it shows how corpora can be designed and built in a way that is both optimally suited to specific research agendas, and adequately representative of the types of language use in question. It considers questions such as 'what types of texts should be included in the corpus?', and 'how many texts are required?' – highlighting that the degree of representativeness rests on the dual pillars of domain considerations and distribution considerations. The authors introduce, explain, and illustrate all aspects of this corpus representativeness framework in a step-by-step fashion, using examples and activities to help readers develop practical skills in corpus design and evaluation.

Genre: Professional & Technical
Released: 14 April 2022
Language: English (EN)
Length: 391 pages
Publisher: Cambridge University Press
Size: 14.6 MB

More Books by Jesse Egbert, Douglas Biber & Bethany Gray

The Register-Functional Approach to Grammatical Complexity (2021)
Using Corpus Methods to Triangulate Linguistic Analysis (2019)
Doing Linguistics with a Corpus (2020)
Register Variation Online (2018)
Triangulating Methodological Approaches in Corpus Linguistic Research (2016)