1.

Record Nr.

UNINA9910788810903321

Autore

Ogrodniczuk Maciej

Titolo

Coreference : annotation, resolution and evaluation in Polish / / Maciej Ogrodniczuk [and four others]

Pubbl/distr/stampa

Berlin, [Germany] : , : Walter de Gruyter, Inc., , 2015

©2015

ISBN

1-61451-838-6

1-61451-995-1

Descrizione fisica

1 online resource (298 p.)

Disciplina

491.8/501456

Soggetti

Polish language - Semantics

Polish language - Syntax

Reference (Linguistics)

Anaphora (Linguistics)

Lingua di pubblicazione

Inglese

Formato

Materiale a stampa

Livello bibliografico

Monografia

Note generali

Description based upon print version of record.

Nota di bibliografia

Includes bibliographical references.

Nota di contenuto

Front matter -- Contents -- Preface -- 1. Reference, anaphora, coreference -- 2. Polish coreference-related studies -- 3. Related work -- 4. Annotation models -- 5. Annotation guidelines -- 6. Annotation methodology -- 7. Annotation tools -- 8. Polish Coreference Corpus -- 9. Resolution approaches -- 10. Mention detection -- 11. Rule-based approach -- 12. Statistical approach -- 13. Manual annotation evaluation -- 14. Evaluation approaches -- 15. Evaluation results -- 16. Conclusions -- 17. Perspectives -- Acknowledgements -- Bibliography

Sommario/riassunto

‘Coreference’ presents specificities of reference, anaphora and coreference in Polish, establish identity-of-reference annotation model and present methodology used to create the corpus of Polish general nominal coreference. Various resolution approaches are presented, followed by their evaluation. By discussing the subsequent steps of building a coreference-related component of the natural language processing toolset and offering deeper explanation of the decisions taken, this volume might also serve as a reference book on state-of the art methods of carrying out coreference projects for new languages and



a tutorial for NLP practitioners. Apart from serving as a description of the fi rst complete approach to annotation and resolution of direct nominal coreference for Polish, this book is a useful starting point for further work on other types of anaphora/coreference, semantic annotation, cognitive linguistics (related to the topic of near-identity, discussed in the book) etc. With extended tutorial-like sections on important subtopics, such as evaluation metrics for coreference resolution, it can prove useful to both researchers and practitioners interested in semantic description of Balto-Slavic languages and their processing, engineers developing language resources, tools and linguistic processing chains, as well as computational linguists in general.