Challenging the myth of monolingual corpora / edited by Arja Nurmi, Tanja Rütten, Päivi Pahta |
Publication/distribution/printing | Leiden, The Netherlands ; Boston, Massachusetts : Brill Rodopi, 2018 |
Physical description | 1 online resource (248 pages) : illustrations, tables |
Discipline | 410.188 |
Series | Language and Computers : Studies in digital linguistics |
Topical subject |
Pragmatics
Corpora (Linguistics)
Indo-European languages - Discourse analysis
Applied linguistics
Linguistic analysis (Linguistics)
Computational linguistics |
Genre/form subject | Electronic books. |
ISBN | 90-04-27669-6 |
Format | Printed material |
Bibliographic level | Monograph |
Language of publication | eng |
Contents note | Preliminary Material -- How Many Languages are there in a Monolingual Corpus? / Arja Nurmi and Tanja Rütten -- Indian English or Indian Englishes? Accounting for Speakers’ Multilingual Repertoires in Corpora of Postcolonial Englishes / Claudia Lange -- Mono- and Multilingualism in a Specialized Corpus of New Zealand Stories / Alexander Onysko and Marta Degani -- What Happens to Ongoing Change in Multilingual Settings? A Corpus Compiler’s Perspective on New Data and New Research Prospects / Mikko Laitinen -- Multilingual Speakers, Multilingual Texts: Multilingual Practices in Learner Corpora / Marcus Callies and Leonie Wiemeyer -- Multilingualism in English as a Lingua Franca: Flagging as an Indicator of Perceived Acceptability and Intelligibility / Niina Hynninen, Kaisa S. Pietikäinen and Svetlana Vetchinnikova -- English Commonplace Books as Multilingual Receiver Corpora / Thomas Kohnen -- Multilingual Practices in the Corpus of English Religious Prose: Annotation and Access / Tanja Rütten -- Semi-automatic Discovery of Multilingual Elements in English Historical Corpora: Methods and Challenges / Jukka Tyrkkö, Arja Nurmi and Jukka Tuominen -- ‘Multilinguality’ in Learner Corpora: The Case of the MILE / Rolf Kreyer -- Multilingualism and Quotations from a Corpus-Linguistic Perspective: A Case Study of Samuel Taylor Coleridge’s Biographia Literaria / Mark Kaunisto. |
Record Nr. | UNINA-9910480269103321 |
Find it here: Univ. Federico II |
|
Challenging the myth of monolingual corpora / edited by Arja Nurmi, Tanja Rütten, Päivi Pahta |
Publication/distribution/printing | Leiden, The Netherlands ; Boston, Massachusetts : Brill Rodopi, 2018 |
Physical description | 1 online resource (248 pages) : illustrations, tables |
Discipline | 410.188 |
Series | Language and Computers : Studies in digital linguistics |
Topical subject |
Pragmatics
Corpora (Linguistics)
Indo-European languages - Discourse analysis
Applied linguistics
Linguistic analysis (Linguistics)
Computational linguistics |
ISBN | 90-04-27669-6 |
Format | Printed material |
Bibliographic level | Monograph |
Language of publication | eng |
Contents note | Preliminary Material -- How Many Languages are there in a Monolingual Corpus? / Arja Nurmi and Tanja Rütten -- Indian English or Indian Englishes? Accounting for Speakers’ Multilingual Repertoires in Corpora of Postcolonial Englishes / Claudia Lange -- Mono- and Multilingualism in a Specialized Corpus of New Zealand Stories / Alexander Onysko and Marta Degani -- What Happens to Ongoing Change in Multilingual Settings? A Corpus Compiler’s Perspective on New Data and New Research Prospects / Mikko Laitinen -- Multilingual Speakers, Multilingual Texts: Multilingual Practices in Learner Corpora / Marcus Callies and Leonie Wiemeyer -- Multilingualism in English as a Lingua Franca: Flagging as an Indicator of Perceived Acceptability and Intelligibility / Niina Hynninen, Kaisa S. Pietikäinen and Svetlana Vetchinnikova -- English Commonplace Books as Multilingual Receiver Corpora / Thomas Kohnen -- Multilingual Practices in the Corpus of English Religious Prose: Annotation and Access / Tanja Rütten -- Semi-automatic Discovery of Multilingual Elements in English Historical Corpora: Methods and Challenges / Jukka Tyrkkö, Arja Nurmi and Jukka Tuominen -- ‘Multilinguality’ in Learner Corpora: The Case of the MILE / Rolf Kreyer -- Multilingualism and Quotations from a Corpus-Linguistic Perspective: A Case Study of Samuel Taylor Coleridge’s Biographia Literaria / Mark Kaunisto. |
Record Nr. | UNINA-9910796605803321 |
Find it here: Univ. Federico II |
|
Challenging the myth of monolingual corpora / edited by Arja Nurmi, Tanja Rütten, Päivi Pahta |
Publication/distribution/printing | Leiden, The Netherlands ; Boston, Massachusetts : Brill Rodopi, 2018 |
Physical description | 1 online resource (248 pages) : illustrations, tables |
Discipline | 410.188 |
Series | Language and Computers : Studies in digital linguistics |
Topical subject |
Pragmatics
Corpora (Linguistics)
Indo-European languages - Discourse analysis
Applied linguistics
Linguistic analysis (Linguistics)
Computational linguistics |
ISBN | 90-04-27669-6 |
Format | Printed material |
Bibliographic level | Monograph |
Language of publication | eng |
Contents note | Preliminary Material -- How Many Languages are there in a Monolingual Corpus? / Arja Nurmi and Tanja Rütten -- Indian English or Indian Englishes? Accounting for Speakers’ Multilingual Repertoires in Corpora of Postcolonial Englishes / Claudia Lange -- Mono- and Multilingualism in a Specialized Corpus of New Zealand Stories / Alexander Onysko and Marta Degani -- What Happens to Ongoing Change in Multilingual Settings? A Corpus Compiler’s Perspective on New Data and New Research Prospects / Mikko Laitinen -- Multilingual Speakers, Multilingual Texts: Multilingual Practices in Learner Corpora / Marcus Callies and Leonie Wiemeyer -- Multilingualism in English as a Lingua Franca: Flagging as an Indicator of Perceived Acceptability and Intelligibility / Niina Hynninen, Kaisa S. Pietikäinen and Svetlana Vetchinnikova -- English Commonplace Books as Multilingual Receiver Corpora / Thomas Kohnen -- Multilingual Practices in the Corpus of English Religious Prose: Annotation and Access / Tanja Rütten -- Semi-automatic Discovery of Multilingual Elements in English Historical Corpora: Methods and Challenges / Jukka Tyrkkö, Arja Nurmi and Jukka Tuominen -- ‘Multilinguality’ in Learner Corpora: The Case of the MILE / Rolf Kreyer -- Multilingualism and Quotations from a Corpus-Linguistic Perspective: A Case Study of Samuel Taylor Coleridge’s Biographia Literaria / Mark Kaunisto. |
Record Nr. | UNINA-9910814795203321 |
Find it here: Univ. Federico II |
|
Challenging the myth of monolingual corpora / edited by Arja Nurmi, Tanja Rütten, Päivi Pahta |
Edition | [1st ed.] |
Publication/distribution/printing | Leiden ; Boston : Brill/Rodopi, [2017] |
Physical description | 1 online resource (248 pages) : illustrations, tables |
Discipline | 410.1/88 |
Other authors (persons) |
Nurmi, Arja
Rütten, Tanja
Pahta, Päivi |
Series | Language and computers: Studies in digital linguistics |
Topical subject |
Pragmatics
Corpora (Linguistics)
Indo-European languages - Discourse analysis
Applied linguistics
Linguistic analysis (Linguistics)
Computational linguistics |
ISBN |
9789004276697
9004276696 |
Format | Printed material |
Bibliographic level | Monograph |
Language of publication | eng |
Contents note | Preliminary Material -- How Many Languages are there in a Monolingual Corpus? / Arja Nurmi and Tanja Rütten -- Indian English or Indian Englishes? Accounting for Speakers’ Multilingual Repertoires in Corpora of Postcolonial Englishes / Claudia Lange -- Mono- and Multilingualism in a Specialized Corpus of New Zealand Stories / Alexander Onysko and Marta Degani -- What Happens to Ongoing Change in Multilingual Settings? A Corpus Compiler’s Perspective on New Data and New Research Prospects / Mikko Laitinen -- Multilingual Speakers, Multilingual Texts: Multilingual Practices in Learner Corpora / Marcus Callies and Leonie Wiemeyer -- Multilingualism in English as a Lingua Franca: Flagging as an Indicator of Perceived Acceptability and Intelligibility / Niina Hynninen, Kaisa S. Pietikäinen and Svetlana Vetchinnikova -- English Commonplace Books as Multilingual Receiver Corpora / Thomas Kohnen -- Multilingual Practices in the Corpus of English Religious Prose: Annotation and Access / Tanja Rütten -- Semi-automatic Discovery of Multilingual Elements in English Historical Corpora: Methods and Challenges / Jukka Tyrkkö, Arja Nurmi and Jukka Tuominen -- ‘Multilinguality’ in Learner Corpora: The Case of the MILE / Rolf Kreyer -- Multilingualism and Quotations from a Corpus-Linguistic Perspective: A Case Study of Samuel Taylor Coleridge’s Biographia Literaria / Mark Kaunisto. |
Record Nr. | UNINA-9910962813203321 |
Find it here: Univ. Federico II |
|
Corpora and the changing society : studies in the evolution of English / edited by Paula Rautionaho, Arja Nurmi, Juhani Klemola |
Publication/distribution/printing | Amsterdam ; Philadelphia : John Benjamins Publishing Company, [2020] |
Physical description | 1 online resource (319 pages) |
Discipline | 420.9 |
Series | Studies in corpus linguistics |
Topical subject |
English language - Grammar - Data processing
English language - Research - Data processing
English language - Discourse analysis - Data processing
English language - Variation - History
English language - Social aspects
Computational linguistics |
ISBN | 90-272-6131-8 |
Format | Printed material |
Bibliographic level | Monograph |
Language of publication | eng |
Contents note | Intro -- Corpora and the Changing Society -- Editorial page -- Title page -- Copyright page -- Table of contents -- Acknowledgements -- Introduction: Corpora and the changing society -- Part I. Changing society -- The great temptation: What diachronic corpora do and do not reveal about social change -- 1. Introduction -- 2. Five pitfalls in the analysis of diachronic corpus data -- 2.1 Corpus frequencies (semasiological frequencies) are not always equivalent to frequencies of entities and events in the real world (onomasiological frequencies) -- 2.2 Corpus frequencies of polysemous words need to be broken down into sense-specific and construction-specific frequencies -- 2.3 Correlations in large datasets may be spurious -- 2.4 Comparisons of frequency trends in diachronic corpora require adequate statistical treatment -- 2.5 It is not always easy to disentangle social change and linguistic change -- 3. Giving in to temptation: A case study of the English make-causative -- 3.1 The English make-causative construction -- 3.2 Corpus data and descriptive statistics -- 3.3 Using distributional semantics to study the development of the make-causative -- 3.4 Discussion -- 4. Conclusions -- References -- Corpora -- Other references -- Changes in society and language: Charting poverty -- 1. Introduction -- 2. Data and pre-processing -- 2.1 The EEBO Collection as sampler corpus -- 2.2 The CLMET3.0 corpus -- 2.3 The pre-processing step of spelling normalization -- 3. Methods -- 3.1 Data-based and data-driven approaches -- 3.2 Document classification -- 3.3 Topic modelling -- 3.4 Conceptual maps -- 4. Results and discussion -- 4.1 Dictionary-based approach -- 4.2 Topic modelling -- 4.3 Conceptual maps -- 5. Conclusions -- References -- Corpora and software -- Other references -- Finding evidence for a changing society: A collocational study of medical discourse in 1500-1800 -- 1. Introduction -- 2. Background -- 3. Materials and method -- 4. Results -- 4.1 The Corpus of Early Modern English Medical Texts (1500-1700) -- 4.2 The Corpus of Late Modern English Medical Texts (1700-1800) -- 5. Discussion -- 6. Conclusion -- References -- Corpora and software -- Other references -- Semantic neology: Challenges in matching corpus-based semantic change to real-world change -- 1. Introduction -- 2. Data and methods -- 2.1 Data and tools -- 2.2 Tracking the neosemes -- 3. Case studies -- 3.1 Case study 1: Birther -- 3.2 Case study 2: Normalisation -- 3.3 Case study 3: Cougar -- 3.4 Case study 4: Snowflake -- 3.5 Case study 5: Ghosting -- 4. Discussion -- 4.1 Challenges -- 4.2 Measures shown to allow or enhance system performance -- 4.3 Sociolinguistic insights gained in the study -- 5. Conclusion -- References -- Corpora and tools -- Other references. |
Record Nr. | UNINA-9910794181303321 |
Find it here: Univ. Federico II |
|
Corpora and the changing society : studies in the evolution of English / edited by Paula Rautionaho, Arja Nurmi, Juhani Klemola |
Publication/distribution/printing | Amsterdam ; Philadelphia : John Benjamins Publishing Company, [2020] |
Physical description | 1 online resource (319 pages) |
Discipline | 420.9 |
Series | Studies in corpus linguistics |
Topical subject |
English language - Grammar - Data processing
English language - Research - Data processing
English language - Discourse analysis - Data processing
English language - Variation - History
English language - Social aspects
Computational linguistics |
ISBN | 90-272-6131-8 |
Format | Printed material |
Bibliographic level | Monograph |
Language of publication | eng |
Contents note | Intro -- Corpora and the Changing Society -- Editorial page -- Title page -- Copyright page -- Table of contents -- Acknowledgements -- Introduction: Corpora and the changing society -- Part I. Changing society -- The great temptation: What diachronic corpora do and do not reveal about social change -- 1. Introduction -- 2. Five pitfalls in the analysis of diachronic corpus data -- 2.1 Corpus frequencies (semasiological frequencies) are not always equivalent to frequencies of entities and events in the real world (onomasiological frequencies) -- 2.2 Corpus frequencies of polysemous words need to be broken down into sense-specific and construction-specific frequencies -- 2.3 Correlations in large datasets may be spurious -- 2.4 Comparisons of frequency trends in diachronic corpora require adequate statistical treatment -- 2.5 It is not always easy to disentangle social change and linguistic change -- 3. Giving in to temptation: A case study of the English make-causative -- 3.1 The English make-causative construction -- 3.2 Corpus data and descriptive statistics -- 3.3 Using distributional semantics to study the development of the make-causative -- 3.4 Discussion -- 4. Conclusions -- References -- Corpora -- Other references -- Changes in society and language: Charting poverty -- 1. Introduction -- 2. Data and pre-processing -- 2.1 The EEBO Collection as sampler corpus -- 2.2 The CLMET3.0 corpus -- 2.3 The pre-processing step of spelling normalization -- 3. Methods -- 3.1 Data-based and data-driven approaches -- 3.2 Document classification -- 3.3 Topic modelling -- 3.4 Conceptual maps -- 4. Results and discussion -- 4.1 Dictionary-based approach -- 4.2 Topic modelling -- 4.3 Conceptual maps -- 5. Conclusions -- References -- Corpora and software -- Other references -- Finding evidence for a changing society: A collocational study of medical discourse in 1500-1800 -- 1. Introduction -- 2. Background -- 3. Materials and method -- 4. Results -- 4.1 The Corpus of Early Modern English Medical Texts (1500-1700) -- 4.2 The Corpus of Late Modern English Medical Texts (1700-1800) -- 5. Discussion -- 6. Conclusion -- References -- Corpora and software -- Other references -- Semantic neology: Challenges in matching corpus-based semantic change to real-world change -- 1. Introduction -- 2. Data and methods -- 2.1 Data and tools -- 2.2 Tracking the neosemes -- 3. Case studies -- 3.1 Case study 1: Birther -- 3.2 Case study 2: Normalisation -- 3.3 Case study 3: Cougar -- 3.4 Case study 4: Snowflake -- 3.5 Case study 5: Ghosting -- 4. Discussion -- 4.1 Challenges -- 4.2 Measures shown to allow or enhance system performance -- 4.3 Sociolinguistic insights gained in the study -- 5. Conclusion -- References -- Corpora and tools -- Other references. |
Record Nr. | UNINA-9910819865103321 |
Find it here: Univ. Federico II |
|