1.

Record Nr.

UNINA9910491024503321

Autore

Dash Niladri Sekhar <1967->

Titolo

Language Corpora Annotation and Processing / / by Niladri Sekhar Dash

Pubbl/distr/stampa

Singapore : , : Springer Nature Singapore : , : Imprint : Springer, , 2021

ISBN

981-16-2960-9

Edizione

[1st ed. 2021.]

Descrizione fisica

1 online resource (292 pages)

Disciplina

410.188

Soggetti

Computational linguistics

Linguistics - Methodology

Computational Linguistics

Research Methods in Language and Linguistics

Corpus (Lingüística)

Lingüística computacional

Llibres electrònics

Lingua di pubblicazione

Inglese

Formato

Materiale a stampa

Livello bibliografico

Monografia

Nota di bibliografia

Includes bibliographical references and index.

Nota di contenuto

Introduction -- Chapter 1. Corpora Annotation: Definition and Types -- Chapter 2. Maxims, Principles, & Rules of Text Annotation -- Chapter 3. Extratextual Documentative Annotation -- Chapter 4. Etymological Annotation -- Chapter 5. Concordance, KWIC, LWG and Collocation -- Chapter 6. Morphological Processing of Words -- Chapter 7. Part-of-Speech Tagging -- Chapter 8. Lemmatization of Inflected Nouns -- Chapter 9. Decomposition of Inflected Verbs -- Chapter 10. Parsing Sentences in a Text. .

Sommario/riassunto

This book addresses the research, analysis, and description of the methods and processes that are used in the annotation and processing of language corpora in advanced, semi-advanced, and non-advanced languages. It provides the background information and empirical data needed to understand the nature and depth of problems related to corpus annotation and text processing and shows readers how the linguistic elements found in texts are analyzed and applied to develop language technology systems and devices. As such, it offers valuable insights for researchers, educators, and students of linguistics and



language technology.