Computational Methods for Corpus Annotation and Analysis [electronic resource] / by Xiaofei Lu
Author Lu Xiaofei
Edition [1st ed. 2014.]
Publication Dordrecht : Springer Netherlands : Imprint: Springer, 2014
Physical description 1 online resource (192 p.)
Classification 006.35
Topical subject Applied linguistics
Natural language processing (Computer science)
Applied Linguistics
Natural Language Processing (NLP)
Corpus (Lingüística)
Lingüística computacional
Genre / form subject Llibres electrònics
ISBN 94-017-8645-3
Format Printed material
Bibliographic level Monograph
Language of publication eng
Contents note Preface -- Chapter 1 Introduction -- Chapter 2 Text Processing with the Command Line Interface -- Chapter 3 Lexical Annotation -- Chapter 4 Lexical Analysis -- Chapter 5 Syntactic Annotation -- Chapter 6 Syntactic Analysis -- Chapter 7 Semantic, Pragmatic and Discourse Analysis -- Chapter 8 Summary and Outlook -- Appendix.
Record no. UNINA-9910484108103321
Find it at: Univ. Federico II
English corpus linguistics [electronic resource] : an introduction / Charles F. Meyer
Author Meyer Charles F.
Publication Cambridge : Cambridge University Press, 2002
Physical description 1 online resource (xvi, 168 pages) : digital, PDF file(s)
Classification 420/.285
Series Studies in English language
Topical subject Inglés (Lengua) - Análisis del discurso
Lingüística - Informática
Lingüística aplicada
Corpus (Lingüística)
Análisis del discurso - Metodología
English language - Research - Data processing
English language - Discourse analysis - Data processing
Computational linguistics
ISBN 1-107-12483-2
0-511-04200-0
1-280-43640-9
0-511-17679-1
0-511-15769-X
0-511-32980-6
0-511-60631-1
0-511-04475-5
Format Printed material
Bibliographic level Monograph
Language of publication eng
Contents note Cover; Half-title; Series-title; Title; Copyright; Dedication; Contents; Preface; 1 Corpus analysis and linguistic theory; 2 Planning the construction of a corpus; 3 Collecting and computerizing data; 4 Annotating a corpus; 5 Analyzing a corpus; 6 Future prospects in corpus linguistics; Appendix 1 Corpus resources; Appendix 2 Concordancing programs; References; Index
Record no. UNINA-9910777340303321
Find it at: Univ. Federico II
English corpus linguistics [electronic resource] : an introduction / Charles F. Meyer
Author Meyer Charles F.
Publication Cambridge : Cambridge University Press, 2002
Physical description 1 online resource (xvi, 168 pages) : digital, PDF file(s)
Classification 420/.285
Series Studies in English language
Topical subject Inglés (Lengua) - Análisis del discurso
Lingüística - Informática
Lingüística aplicada
Corpus (Lingüística)
Análisis del discurso - Metodología
English language - Research - Data processing
English language - Discourse analysis - Data processing
Computational linguistics
ISBN 1-107-12483-2
0-511-04200-0
1-280-43640-9
0-511-17679-1
0-511-15769-X
0-511-32980-6
0-511-60631-1
0-511-04475-5
Format Printed material
Bibliographic level Monograph
Language of publication eng
Contents note Cover; Half-title; Series-title; Title; Copyright; Dedication; Contents; Preface; 1 Corpus analysis and linguistic theory; 2 Planning the construction of a corpus; 3 Collecting and computerizing data; 4 Annotating a corpus; 5 Analyzing a corpus; 6 Future prospects in corpus linguistics; Appendix 1 Corpus resources; Appendix 2 Concordancing programs; References; Index
Record no. UNINA-9910811329103321
Find it at: Univ. Federico II
Language corpora annotation and processing / Niladri Sekhar Dash
Author Dash Niladri Sekhar <1967->
Publication Gateway East, Singapore : Springer, [2021]
Physical description 1 online resource (292 pages)
Classification 410.188
Topical subject Corpora (Linguistics)
Corpus (Lingüística)
Lingüística computacional
Genre / form subject Llibres electrònics
ISBN 981-16-2960-9
Format Printed material
Bibliographic level Monograph
Language of publication eng
Contents note Intro -- Preface -- Acknowledgments -- Introduction -- Why This Book -- Text Annotation Versus Text Processing -- Summary of the Chapters -- Value of this Book -- References -- Contents -- About the Author -- Abbreviations -- 1 Corpus Text Annotation -- 1.1 Introduction: What Is Text Annotation? -- 1.2 Characteristics of Annotation -- 1.3 Kinds of Text Annotation -- 1.4 Criteria of Text Annotation -- 1.5 Maxims of Text Annotation -- 1.6 Justification of Text Annotation -- 1.7 Annotation Schemas and Models -- 1.8 Types of Text Annotation -- 1.8.1 Intralinguistic Annotation -- 1.8.2 Extralinguistic Annotation -- 1.9 Present State of Text Annotation -- 1.10 Utilization of Annotated Texts -- References -- 2 Principles and Rules of Part-of-Speech Annotation -- 2.1 Introduction -- 2.2 Principles of POS Annotation -- 2.3 Rules for POS Annotation -- 2.4 Conclusion -- References -- 3 Part-of-Speech Annotation -- 3.1 Introduction -- 3.2 Concept of POS Annotation -- 3.3 Morphological Analysis Versus POS Annotation -- 3.4 Levels of POS Annotation -- 3.5 Stages of POS Annotation -- 3.5.1 Pre-editing Stage -- 3.5.2 POS Assignment Stage -- 3.5.3 Postediting Stage -- 3.6 Earlier Methods POS Annotation -- 3.7 POS Annotation: Indian Scenario -- 3.8 The BIS POS Tagset -- 3.9 The BIS Tagset and Bengali -- 3.9.1 Utility of a POS Annotated Text -- References -- 4 Extratextual Annotation -- 4.1 Introduction -- 4.2 Definition of Extratextual Annotation -- 4.3 Intratextual and Extratextual Annotation -- 4.4 Relevance of Extratextual Annotation -- 4.5 Extratextual Annotation: Some Early Works -- 4.6 Extratextual Annotation Types -- 4.6.1 File Name: A Gateway -- 4.6.2 Annotation of Text Categories -- 4.6.3 Subject Category Annotation -- 4.6.4 Annotating Title of a Text -- 4.6.5 Header Part: Metadata Depository -- 4.7 Conclusion -- References -- 5 Etymological Annotation.
5.1 Introduction -- 5.2 Lexical Borrowing: Some Scenarios -- 5.3 Vocabulary Classification -- 5.4 Defining Etymological Annotation Tagset -- 5.5 Defining Annotation Strategy -- 5.5.1 Annotating Borrowed Words -- 5.5.2 Annotating Portmanteau Words -- 5.5.3 Annotating Affixed Words -- 5.5.4 Annotating Inflected Words -- 5.6 Process of Etymological Annotation -- 5.7 Findings of an Etymologically Annotated Text -- 5.8 Frequently Used English Words in Bengali -- 5.9 System Adoption at Lexical Level -- 5.10 Conclusion -- References -- 6 More Types of Corpus Annotation -- 6.1 Introduction -- 6.2 Orthographic Annotation -- 6.3 Prosodic Annotation -- 6.4 Semantic Annotation -- 6.5 Discourse Annotation -- 6.6 Rhetoric Annotation -- 6.7 Conclusion -- References -- 7 Morphological Processing of Words -- 7.1 Introduction -- 7.2 Models and Approaches -- 7.2.1 Two-Level Morphology-Based Approach -- 7.2.2 Paradigm-Based Approach -- 7.2.3 Stemmer-Based Approach -- 7.2.4 Acyclic Graph-Based Approach -- 7.2.5 Morph-Based Approach -- 7.2.6 Corpus-Based Approach -- 7.2.7 Suffix Stripping-Based Approach -- 7.3 Issues in Morphological Processing -- 7.4 Method of Information Storage -- 7.5 Method of Morphological Processing -- 7.6 Processing Detached Words -- 7.7 Results of Morphological Processing -- 7.7.1 Rightly Processed Words -- 7.7.2 Wrongly Processed Words -- 7.7.3 Double Processed Words -- 7.7.4 Non-Processed Words -- 7.8 Ambiguity in Morphological Processing -- 7.9 Conclusion -- References -- 8 Lemmatization of Inflected Nouns -- 8.1 Introduction -- 8.2 Lemma(-tization) -- 8.3 Lemmatization and Stemming -- 8.3.1 Lemmatization is Similar to Stemming -- 8.3.2 Lemmatization is Different from Stemming -- 8.4 Lemmatization in English and Other Languages -- 8.5 Surface Structure of Bengali Nouns -- 8.6 Stages for Noun Lemmatization -- 8.6.1 Stage 1: POS Annotation.
8.6.2 Stage 2: Noun Identification and Isolation -- 8.6.3 Stage 3: Alphabetical Sorting of Nouns -- 8.6.4 Stage 4: Noun Classification -- 8.6.5 Stage 5: Tokenization -- 8.7 Operation of Lemmatization Process -- 8.8 Conclusion -- Appendix -- References -- 9 Decomposition of Inflected Verbs -- 9.1 Introduction -- 9.2 Lexical Decomposition -- 9.3 Some Early Works -- 9.4 Morpheme Structure of Bengali Verbs -- 9.4.1 Root Part (dhātu) -- 9.4.2 Suffix Part (Bibhakti) -- 9.5 Conjugation of Bengali Verbs -- 9.6 Categorization of Bengali Verb Roots -- 9.6.1 Verb Root Categorization -- 9.6.2 Verb Suffix Categorization -- 9.7 Issues in Lexical Decomposition -- 9.8 Method of Information Storage -- 9.9 Decomposing Non-conjugated Verbs -- 9.10 Decomposing Conjugated Verbs -- 9.11 Data Creation and Storage -- 9.11.1 Suffix and Root Detection -- 9.11.2 Suffix--Root Mapping -- 9.12 Some Special Cases -- 9.13 The Resultant Output -- 9.14 Performance of the System -- 9.15 Conclusion -- References -- 10 Syntactic Annotation -- 10.1 Introduction -- 10.2 Ambiguity of the Term -- 10.3 Transition of the Concept of Syntactic Annotation -- 10.4 Goals of Syntactic Annotation -- 10.5 Challenges Involved in Syntactic Annotation -- 10.6 What is a Syntactic Annotator (Parser)? -- 10.6.1 Detection of End of a Sentence -- 10.6.2 Tokenization of Words in Sentences -- 10.6.3 Grammatical Annotation -- 10.6.4 Chunking Larger Lexical Blocks -- 10.6.5 Finding Matrix Verb in Sentence -- 10.6.6 Identifying Non-Matrix Clauses -- 10.6.7 Identification of Phrases -- 10.7 Types of Syntactic Annotation -- 10.8 Treebank -- 10.9 Utility of Syntactic Annotated Texts -- 10.10 Conclusion -- References -- Author Index -- Subject Index.
Record no. UNINA-9910491024503321
Find it at: Univ. Federico II
Learner corpora : construction and explorations in Chinese and related languages / edited by Howard Hao-Jan Chen, Keiko Mochizuki, Hongyin Tao
Edition [1st ed. 2023.]
Publication Singapore : Springer Nature Singapore : Imprint: Springer, 2023
Physical description 1 online resource (VI, 333 p.) : 383 illus., 41 illus. in color
Classification 495.180071
Series Chinese Language Learning Sciences
Topical subject Chinese language - Study and teaching - Foreign speakers
Corpora (Linguistics)
Xinès
Ensenyament de llengües estrangeres
Corpus (Lingüística)
Genre / form subject Llibres electrònics
ISBN 981-19-5731-2
Format Printed material
Bibliographic level Monograph
Language of publication eng
Contents note Part 1 -- 1. Introduction -- Part 2. Learner Corpus Construction and Processing -- 2. Building a New Generation Learner Corpus of Chinese -- 3. Some Pragmatic Issues in Learner Corpus: A CSL Perspective -- 4. The Construction of a Learner Error Corpus of Chinese and Its Pedagogical -- 5. A Preliminary Study on Chinese Learner’s Written Errors Based on an Error-Tagged Learner Corpus -- Part 3. Explorations in Learner Corpora -- 6. A Corpus Study of Verb Complement of Manner Constructions in Chinese as a Second Language -- 7. The Acquisition of Relative Clauses in L2 Chinese: A Corpus-Based Study -- 8. Mis-Selection of Near Synonyms: A Study Based on a Written Chinese Corpus -- 9. A Study on the Common V-N Collection Errors Based on Chinese Learner Corpus -- 10. Acquisition of the Chinese Indefinite Determiner “One + Classifier” and English Articles in Two-Way Learner Corpora -- 11. The Acquisition of Aspect in Chinese Based on a Chinese Learner Corpus by English L1 and Japanese L1 Learners -- 12. Referent Tracking and Event Grounding in Oral Narratives of French Learners of Chinese.
Record no. UNINA-9910686776003321
Find it at: Univ. Federico II
New perspectives on corpus translation studies / edited by Vincent X. Wang, Lily Lim, and Defeng Li
Publication Singapore : Springer, [2021]
Physical description 1 online resource (324 pages)
Classification 848.02
Series New Frontiers in Translation Studies
Topical subject Translating and interpreting
Corpora (Linguistics)
Corpus (Lingüística)
Traducció
Genre / form subject Llibres electrònics
ISBN 981-16-4918-9
Format Printed material
Bibliographic level Monograph
Language of publication eng
Contents note Intro -- Acknowledgements -- Introduction -- References -- Contents -- Editors and Contributors -- Part I Translation Pedagogy -- 1 Using Corpora in Translation Pedagogy -- Abstract -- 1 Introduction: Expanding Holmes's Vision for Translation Studies -- 2 Corpora in Language Teaching -- 3 Corpora in Translator Training -- 4 Concluding Remarks -- Acknowledgements -- References -- 2 A Corpus-Based Examination of the Translation of the Suffix -ism into Chinese -- Abstract -- 1 Introduction -- 2 A Brief Literature Survey -- 3 Conventional Tools for Understanding -ism and Identifying Its Chinese Translations -- 3.1 The Senses of -ism in Monolingual Dictionary -- 3.2 Translation Candidates for -ism Words in English-Chinese Dictionaries -- 3.3 Monolingual Corpora of English and Chinese: Comparing -ism and -主义 Words -- 4 Querying a Parallel Corpus: The UM-Corpus at SkE -- 4.1 The Coverage of -ism Words -- 4.2 Differentiating Three Types of -ism Words -- 4.3 Translation Candidates in Chinese: The Repertoire and Retrieval Methods -- 4.4 From Chinese Translations to the Senses of -ism Words -- 5 Discussion -- 5.1 Traditional Tools Versus Parallel Corpora for Translators -- 5.2 Parallel Corpus for Translators: Drawbacks and Merits -- 5.3 Technological Skills for Translators -- 6 Conclusion -- Acknowledgements -- References -- References in Chinese -- Part II Translation Norms and Styles -- 3 New Trends in Corpus-Based Translator's Style Studies -- Abstract -- 1 Introduction -- 2 A Review and the State of the Art -- 2.1 The Pre-Corpus Period -- 2.2 The Corpus-Based Period -- 3 New Trends in the Study of Translator's Style -- 3.1 Research Topics -- 3.1.1 Interpreter's Style -- 3.1.2 Group Translators' Style -- 3.1.3 Self-Translating Style -- 3.1.4 Diachronic Shift in Translating Style of Specific Translators -- 3.2 Potential Perspectives.
3.2.1 Sociology of Translation -- 3.2.2 Quantitative Linguistic Analysis -- 3.2.3 Multi-Dimensional/Factorial Perspectives Analysis -- 4 Concluding Remarks -- References -- References in Chinese -- 4 Building a New-Generation Corpus for Empirical Translation Studies: The Dutch Parallel Corpus 2.0 -- Abstract -- 1 Introduction -- 2 Data Collection -- 3 Data Processing -- 3.1 Sentence Alignment -- 3.2 Linguistic Annotation: PoS-Tagging and Lemmatization -- 4 Metadata -- 4.1 Translation- and Translator-Related Metadata -- 4.2 Text-Related Metadata (Translated Texts and Source Texts) -- 5 Classifying Texts into Registers -- 6 Corpus Exploitation -- 7 Conclusion -- Appendix -- References -- 5 Probing a Two-Way Parallel T&I Corpus for the Lexical Choices of Translators and Interpreters -- Abstract -- 1 Introduction -- 2 Characterising Translationese and Interpretese -- 3 Corpus Data as Manifestation of Cognitive Processes -- 4 The Current Study -- 5 A Two-Way Parallel T&I Corpus -- 5.1 Speech Sources -- 5.2 Speech Transcription -- 5.3 Parallel Components in the Corpus -- 6 Frequency and Lexical Availability -- 6.1 Sampling -- 6.2 Results -- 6.3 Qualitative Characteristics -- 6.3.1 Syllabicity or Word Length -- 6.3.2 Polysemy and Generalness -- 6.3.3 Subtle Semantic Differences -- 7 The Big Picture -- 7.1 Some Words Are Always Available -- 7.2 Interpretese and Lexical Variety -- 7.3 Beyond the Lexical Boundary -- 7.3.1 Formal Equivalence Versus Idiomaticity -- 7.3.2 Context-Dependent Interpretation and Omission -- 7.4 Implications on Lexical Access and Interpreter Training -- 8 Conclusion -- Acknowledgements -- References -- 6 Translationese and Register Variation in English-To-Russian Professional Translation -- Abstract -- 1 Motivation and Aim -- 2 Theoretical Background -- 2.1 Key Concepts and Approaches -- 2.2 Translationese and Register.
3 Methodology -- 3.1 Feature Sets -- 3.2 Research Corpora -- 3.2.1 Data -- 3.2.2 Linguistic Resources -- 3.3 Methods -- 4 Results -- 4.1 Translationese and Register Distinctions -- 4.2 Euclidean Distances Between Translations and Non-Translations -- 4.3 Translationese Effects and Features -- 5 Register-Based Translationese Varieties -- 6 Conclusion -- Acknowledgements -- Appendix -- References -- 7 Normalization, Motivation, and Reception: A Corpus-Based Lexical Study of the Four English Translations of Louis Cha's Martial Arts Fiction -- Abstract -- 1 Introduction -- 2 Texts and Corpora -- 2.1 The Four Complete English Translations of Louis Cha's Martial Arts Fiction -- 2.2 Corpus Design and Compilation -- 3 Findings -- 3.1 Lexical Richness -- 3.2 Normalized POS Distribution -- 3.3 Overlapping Rates of High-Frequency Words -- 3.4 Translation Shifts of Martial-Arts-Specific Terminology -- 4 Discussion -- 4.1 Translators' Motivations and Strategies -- 4.1.1 Promoting Chinese Literature Overseas -- 4.1.2 Winning Readership in English-Speaking Worlds -- 4.1.3 Introducing the Chinese Martial Arts Culture to the West -- 4.1.4 Learning the Chinese Language and Cultures -- 4.2 Readers' Reception of the Translations in the English-Speaking World -- 5 Conclusion -- Acknowledgements -- References -- Part III Cognition and Translation Equivalents -- 8 Translating Principles of Translation: Cross-Cultural and Multi-Brain Perspectives -- Abstract -- 1 Introduction -- 2 Literature Review -- 2.1 Tytler's Three General Laws of Translation -- 2.2 Yan Fu's Translation Principles -- 2.3 The Possible Influence of Tytler's Theory on Yan Fu's Translation Principles -- 2.4 Controversial Understandings of the Order of Importance for Xìn, Dá, Yǎ -- 3 Methodology -- 4 Findings -- 4.1 The Structural and Cultural Meaning of First in English.
4.2 The Structural and Cultural Meaning of Sān in Chinese -- 5 From Tytler to Yan Fu: Three Principles of Translation Shaped by Translation -- 5.1 Translating Principles of Translation: Cross-Cultural Perspective -- 5.2 Translation as a Multi-Brain Activity -- 6 Conclusion -- Acknowledgments -- References -- Website Resources -- 9 Going to Understand 柴? Evidence and Significance of Metonymic Chains in Chinese/English Translation -- Abstract -- 1 Introduction -- 2 Research Methodologies -- 2.1 The 明白 Project -- 2.2 The Be Going to Project -- 2.3 The 柴 Project -- 3 The Data and Findings -- 3.1 Findings and Analyses of The 明白 Project -- 3.2 A Brief Account of the Story of "be Going to" and Its Chinese Equivalents -- 3.3 The Mathematical Model for the One-To-Many Relationship in English/Chinese Translation -- 3.4 English Translations of Wang Wei's 柴 [Lù Chái/zhài]: Small Data and the Metonymic Chain -- 4 Conclusion -- References -- 10 Mind the Source Data! Translation Equivalents and Translation Stimuli from Parallel Corpora -- Abstract -- 1 Introduction -- 2 Translation Versus Translation Equivalent -- 3 Translation Equivalent Versus Translation Stimulus -- 4 Does Any Word Have Translation Equivalents? -- 5 Conclusions -- References -- 11 An Intralingual Parallel Corpus of Translations into German Easy Language (Geasy Corpus): What Sentence Alignments Can Tell Us About Translation Strategies in Intralingual Translation -- Abstract -- 1 Introduction -- 2 A Quick Introduction to German Easy Language -- 3 The German Easy Language (Geasy) Corpus -- 4 Alignment Characteristics -- 5 Translation Strategies -- 6 Tree Alignment -- 7 Conclusion and Future Research -- References -- 12 Making Sense of the Prefix de- with an English-Chinese Parallel Corpus -- Abstract -- 1 Introduction -- 2 Relevant Studies -- 2.1 Sense Disambiguation Using Translated Texts.
2.2 Research on Morphemes with Translation Corpora -- 3 Method -- 3.1 List of de- Words -- 3.2 Collecting Corresponding Chinese Items for the Top de- Verbs -- 3.3 Chinese Characters and Words Mapped to the Senses of the Prefix de- -- 4 Results and Discussion -- 4.1 List of de- words -- 4.2 Chinese Lexical Items Corresponding to the Top de- Verbs -- 4.3 Chinese Characters and Words Mapped to Different Senses of the Prefix de- -- 5 Concluding Remarks -- Acknowledgements -- References -- Reference in Chinese.
Record no. UNINA-9910502639003321
Find it at: Univ. Federico II