Computational Methods for Corpus Annotation and Analysis / / by Xiaofei Lu |
Autore | Lu Xiaofei |
Edizione | [1st ed. 2014.] |
Pubbl/distr/stampa | Dordrecht : , : Springer Netherlands : , : Imprint : Springer, , 2014 |
Descrizione fisica | 1 online resource (192 p.) |
Disciplina | 006.35 |
Soggetto topico |
Applied linguistics
Natural language processing (Computer science) Applied Linguistics Natural Language Processing (NLP) Corpus (Lingüística) Lingüística computacional |
Soggetto genere / forma | Llibres electrònics |
ISBN | 94-017-8645-3 |
Formato | Materiale a stampa |
Livello bibliografico | Monografia |
Lingua di pubblicazione | eng |
Nota di contenuto | Preface -- Chapter 1 Introduction -- Chapter 2 Text Processing with the Command Line Interface -- Chapter 3 Lexical Annotation -- Chapter 4 Lexical Analysis -- Chapter 5 Syntactic Annotation -- Chapter 6 Syntactic Analysis -- Chapter 7 Semantic, Pragmatic and Discourse Analysis -- Chapter 8 Summary and Outlook -- Appendix. |
Record Nr. | UNINA-9910484108103321 |
Lu Xiaofei | ||
Dordrecht : , : Springer Netherlands : , : Imprint : Springer, , 2014 | ||
Materiale a stampa | ||
Lo trovi qui: Univ. Federico II | ||
|
Corpus pragmatics : international journal of corpus linguistics and pragmatics |
Pubbl/distr/stampa | [Cham, Switzerland], : Springer International Publishing, , [2017]- |
Descrizione fisica | 1 online resource |
Soggetto topico |
Corpora (Linguistics)
Pragmatics Linguistics Pragmàtica (Lingüística) Lingüística computacional |
Soggetto genere / forma |
Periodicals.
Revistes electròniques. |
Soggetto non controllato | Philology & Linguistics |
ISSN | 2509-9515 |
Formato | Materiale a stampa |
Livello bibliografico | Periodico |
Lingua di pubblicazione | eng |
Altri titoli varianti | International journal of corpus linguistics and pragmatics |
Record Nr. | UNINA-9910481993603321 |
[Cham, Switzerland], : Springer International Publishing, , [2017]- | ||
Materiale a stampa | ||
Lo trovi qui: Univ. Federico II | ||
|
Deep Learning in Natural Language Processing / / edited by Li Deng, Yang Liu |
Pubbl/distr/stampa | Singapore, : Springer Singapore, : Imprint : Springer, 2018 |
Descrizione fisica | 1 online resource (338 pages) |
Soggetto topico |
Artificial intelligence
Natural language processing (Computer science) Mathematical statistics Tractament del llenguatge natural (Informàtica) Lingüística computacional |
Soggetto genere / forma | Llibres electrònics |
ISBN | 981-10-5209-3 |
Formato | Materiale a stampa |
Livello bibliografico | Monografia |
Lingua di pubblicazione | eng |
Record Nr. | UNINA-9910299297603321 |
Singapore, : Springer Singapore, : Imprint : Springer, 2018 | ||
Materiale a stampa | ||
Lo trovi qui: Univ. Federico II | ||
|
Finite-state computational morphology : an analyzer and generator for Georgian / / Irina Lobzhanidze |
Autore | Lobzhanidze Irina |
Pubbl/distr/stampa | Cham, Switzerland : , : Springer, , [2022] |
Descrizione fisica | 1 online resource (229 pages) |
Disciplina | 499.96959 |
Soggetto topico |
Georgian language - Grammar
Computational linguistics Georgià (Llengua) Gramàtica Lingüística computacional |
Soggetto genere / forma | Llibres electrònics |
ISBN |
9783030902483
9783030902476 |
Formato | Materiale a stampa |
Livello bibliografico | Monografia |
Lingua di pubblicazione | eng |
Record Nr. | UNINA-9910544864303321 |
Lobzhanidze Irina | ||
Cham, Switzerland : , : Springer, , [2022] | ||
Materiale a stampa | ||
Lo trovi qui: Univ. Federico II | ||
|
From Complex Sentences to a Formal Semantic Representation Using Syntactic Text Simplification and Open Information Extraction / / Christina Niklaus |
Autore | Niklaus Christina |
Edizione | [First edition.] |
Pubbl/distr/stampa | Wiesbaden, Germany : , : Springer Vieweg, Springer Fachmedien Wiesbaden GmbH, part of Springer Nature, , [2022] |
Descrizione fisica | 1 online resource (0 pages) |
Disciplina | 410.285 |
Soggetto topico |
Computational linguistics
Natural language processing (Computer science) Text processing (Computer science) Tractament del llenguatge natural (Informàtica) Tractament de textos Lingüística computacional |
Soggetto genere / forma | Llibres electrònics |
ISBN | 3-658-38697-5 |
Formato | Materiale a stampa |
Livello bibliografico | Monografia |
Lingua di pubblicazione | eng |
Record Nr. | UNINA-9910590079103321 |
Niklaus Christina | ||
Wiesbaden, Germany : , : Springer Vieweg, Springer Fachmedien Wiesbaden GmbH, part of Springer Nature, , [2022] | ||
Materiale a stampa | ||
Lo trovi qui: Univ. Federico II | ||
|
Journal of computational social science |
Pubbl/distr/stampa | [Singapore] : , : Springer Nature Singapore Pte Ltd., , [2018]- |
Descrizione fisica | 1 online resource |
Soggetto topico |
Social sciences - Data processing
Social sciences - Computer simulation Social sciences - Mathematical models Big data Complexity (Linguistics) Computational linguistics Social sciences Lingüística computacional Dades massives |
Soggetto genere / forma |
Periodicals.
Revistes electròniques. |
ISSN | 2432-2725 |
Formato | Materiale a stampa |
Livello bibliografico | Periodico |
Lingua di pubblicazione | eng |
Altri titoli varianti |
J Comput Soc Sc
JCSS |
Record Nr. | UNINA-9910481976103321 |
[Singapore] : , : Springer Nature Singapore Pte Ltd., , [2018]- | ||
Materiale a stampa | ||
Lo trovi qui: Univ. Federico II | ||
|
Language Corpora Annotation and Processing / / by Niladri Sekhar Dash |
Autore | Dash Niladri Sekhar <1967-> |
Edizione | [1st ed. 2021.] |
Pubbl/distr/stampa | Singapore : , : Springer Nature Singapore : , : Imprint : Springer, , 2021 |
Descrizione fisica | 1 online resource (292 pages) |
Disciplina | 410.188 |
Soggetto topico |
Computational linguistics
Linguistics - Methodology Computational Linguistics Research Methods in Language and Linguistics Corpus (Lingüística) Lingüística computacional |
Soggetto genere / forma | Llibres electrònics |
ISBN | 981-16-2960-9 |
Formato | Materiale a stampa |
Livello bibliografico | Monografia |
Lingua di pubblicazione | eng |
Nota di contenuto | Introduction -- Chapter 1. Corpora Annotation: Definition and Types -- Chapter 2. Maxims, Principles, & Rules of Text Annotation -- Chapter 3. Extratextual Documentative Annotation -- Chapter 4. Etymological Annotation -- Chapter 5. Concordance, KWIC, LWG and Collocation -- Chapter 6. Morphological Processing of Words -- Chapter 7. Part-of-Speech Tagging -- Chapter 8. Lemmatization of Inflected Nouns -- Chapter 9. Decomposition of Inflected Verbs -- Chapter 10. Parsing Sentences in a Text. . |
Record Nr. | UNINA-9910491024503321 |
Dash Niladri Sekhar <1967-> | ||
Singapore : , : Springer Nature Singapore : , : Imprint : Springer, , 2021 | ||
Materiale a stampa | ||
Lo trovi qui: Univ. Federico II | ||
|
Literature, Language and Computing : Russian Contribution / / edited by Polina Eismont, Maria Khokhlova, Mikhail Koryshev, Elena Riekhakaynen |
Autore | Eismont Polina |
Edizione | [1st ed. 2023.] |
Pubbl/distr/stampa | Singapore : , : Springer Nature Singapore : , : Imprint : Springer, , 2023 |
Descrizione fisica | 1 online resource (254 pages) |
Disciplina | 410.285 |
Altri autori (Persone) |
KhokhlovaMaria
KoryshevMikhail RiekhakaynenElena |
Soggetto topico |
Computational linguistics
Linguistics—Methodology Translating and interpreting Applied linguistics Computational Linguistics Research Methods in Language and Linguistics Language Translation Applied Linguistics Lingüística computacional Lingüística aplicada |
Soggetto genere / forma | Llibres electrònics |
ISBN | 981-9936-04-7 |
Formato | Materiale a stampa |
Livello bibliografico | Monografia |
Lingua di pubblicazione | eng |
Nota di contenuto | Chapter 1. Literature, Language and Computing: Russian Contribution -- Chapter 2. Self-Repair in Russian Spoken Discourse in Psycholinguistics Aspect: Correlation Analysis and Quantitative Data -- Chapter 3. A Lexicographic Portrait of a Russian Microsyntactic Unit -- Chapter 4. “Plain and Natural” vs “Accurate and Unambiguous”: Pronominal Intrasentential Anaphora in Russian Legislative Texts -- Chapter 5. The Old Church Slavonic Corpora and Their Use in Language Studies at the University -- Chapter 6. Core coordination units in macro- and microdiachrony: experimental data -- Chapter 7. The Use of Futur Antérieur in the Past in Old French: Experience of a Corpus-Based Study -- Chapter 8. Nachhaltigkeit in media crisis discourse -- Chapter 9. Using Corpora for Verifying Language Choices in Translation -- Chapter 10. Stylometric Methods in Comparative Analysis of Text -- Chapter 11. Lexical Diversity of Russian Poets -- Chapter 12. A semantic corpus of Russian literature of 18 century: its current state and its future -- Chapter 13. Multimedia dictionary of verbal vocabulary: concept, structure, implementation -- Chapter 14. Incorporating informal e-learning into foreign language teaching through collaborative personalization -- Chapter 15. Pedagogical peer-to-peer online practice as a means of forming professional competence in distant learning format -- Chapter 16. To the East Slavonic proverbs of the thematic group “Learning - inattention” (as seen in the new Electronic dictionary of current active East Slavonic proverbs) -- Chapter 17. Opportunities of using Dental Internet resources in teaching the language of specialty in the course of Russian as a foreign language -- Chapter 18. Machine Translation vs Human Translation of Artionyms -- Chapter 19. The Emotion in Text Analyzer: How to Visualize its Output? -- Chapter 20. The Multimedia Corpus of Russian Ironic Speech for Phonetic Analysis -- Chapter 21. Theory of Mind and the Mechanism of Imagination for a Companion Robot. |
Record Nr. | UNINA-9910734873703321 |
Eismont Polina | ||
Singapore : , : Springer Nature Singapore : , : Imprint : Springer, , 2023 | ||
Materiale a stampa | ||
Lo trovi qui: Univ. Federico II | ||
|
Statistical universals of language : mathematical chance vs. human choice / / Kumiko Tanaka-Ishii |
Autore | Tanaka-Ishii Kumiko |
Pubbl/distr/stampa | Cham, Switzerland : , : Springer, , [2021] |
Descrizione fisica | 1 online resource (226 pages) : illustrations |
Disciplina | 410.151 |
Collana | Mathematics in Mind |
Soggetto topico |
Mathematical linguistics
Computational linguistics Lingüística matemàtica Lingüística computacional |
Soggetto genere / forma | Llibres electrònics |
ISBN | 3-030-59377-0 |
Formato | Materiale a stampa |
Livello bibliografico | Monografia |
Lingua di pubblicazione | eng |
Nota di contenuto |
Intro -- Contents -- Part I Language as a Complex System -- 1 Introduction -- 1.1 Aims -- 1.2 Structure of This Book -- 1.3 Position of This Book -- 1.3.1 Statistical Universals as Computational Properties of Natural Language -- 1.3.2 A Holistic Approach to Language via Complex Systems Theory -- 1.4 Prospectus -- 2 Universals -- 2.1 Language Universals -- 2.2 Layers of Universals -- 2.3 Universal, Stylized Hypothesis, and Law -- 3 Language as a Complex System -- 3.1 Sequence and Corpus -- 3.1.1 Definition of Corpus -- 3.1.2 On Meaning -- 3.1.3 On Infinity -- 3.1.4 On Randomness -- 3.2 Power Functions -- 3.3 Scale-Free Property: Statistical Self-Similarity -- 3.4 Complex Systems -- 3.5 Two Basic Random Processes -- Part II Property of Population -- 4 Relation Between Rank and Frequency -- 4.1 Zipf's Law -- 4.2 Scale-Free Property and Hapax Legomena -- 4.3 Monkey Text -- 4.4 Power Law of n-grams -- 4.5 Relative Rank-Frequency Distribution -- 5 Bias in Rank-Frequency Relation -- 5.1 Literary Texts -- 5.2 Speech, Music, Programs, and More -- 5.3 Deviations from Power Law -- 5.3.1 Scale -- 5.3.2 Speaker Maturity -- 5.3.3 Characters vs. Words -- 5.4 Nature of Deviations -- 6 Related Statistical Universals -- 6.1 Density Function -- 6.2 Vocabulary Growth -- Part III Property of Sequences -- 7 Returns -- 7.1 Word Returns -- 7.2 Distribution of Return Interval Lengths -- 7.3 Exceedance Probability -- 7.4 Bias Underlying Return Intervals -- 7.5 Rare Words as a Set -- 7.6 Behavior of Rare Words -- 8 Long-Range Correlation -- 8.1 Long-Range Correlation Analysis -- 8.2 Mutual Information -- 8.3 Autocorrelation Function -- 8.4 Correlation of Word Intervals -- 8.5 Nonstationarity of Language -- 8.6 Weak Long-Range Correlation -- 9 Fluctuation -- 9.1 Fluctuation Analysis -- 9.2 Taylor Analysis -- 9.3 Differences Between the Two Fluctuation Analyses.
9.4 Dimensions of Linguistic Fluctuation -- 9.5 Relations Among Methods -- 10 Complexity -- 10.1 Complexity of Sequence -- 10.2 Entropy Rate -- 10.3 Hilberg's Ansatz -- 10.4 Computing Entropy Rate of Human Language -- 10.5 Reconsidering the Question of Entropy Rate -- Part IV Relation to Linguistic Elements and Structure -- 11 Articulation of Elements -- 11.1 Harris's Hypothesis -- 11.2 Information-Theoretic Reformulation -- 11.3 Accuracy of Articulation by Harris's Scheme -- 12 Word Meaning and Value -- 12.1 Meaning as Use and Distributional Semantics -- 12.2 Weber-Fechner Law -- 12.3 Word Frequency and Familiarity -- 12.4 Vector Representation of Words -- 12.5 Compositionality of Meaning -- 12.6 Statistical Universals and Meaning -- 13 Size and Frequency -- 13.1 Zipf Abbreviation of Words -- 13.2 Compound Length and Frequency -- 14 Grammatical Structure and Long Memory -- 14.1 Simple Grammatical Framework -- 14.2 Phrase Structure Grammar -- 14.3 Long-Range Dependence in Sentences -- 14.4 Grammatical Structure and Long-Range Correlation -- 14.5 Nature of Long Memory Underlying Language -- Part V Mathematical Models -- 15 Theories Behind Zipf's Law -- 15.1 Communication Optimization -- 15.2 A Limit Theorem -- 15.3 Significance of Statistical Universals -- 16 Mathematical Generative Models -- 16.1 Criteria for Statistical Universals -- 16.2 Independent and Identically Distributed Sequences -- 16.3 Simon Model and Variants -- 16.4 Random Walk Models -- 17 Language Models -- 17.1 Language Models and Statistical Universals -- 17.2 Building Language Models -- 17.3 N-Gram Models -- 17.4 Grammatical Models -- 17.5 Neural Models -- 17.6 Future Directions for Generative Models -- Part VI Ending Remarks -- 18 Conclusion -- 19 Acknowledgments -- Part VII Appendix -- 20 Glossary and Notations -- 20.1 Glossary -- 20.2 Mathematical Notation. 20.3 Other Conventions -- 21 Mathematical Details -- 21.1 Fitting Functions -- 21.2 Proof that Monkey Typing Follows a Power Law -- 21.3 Relation Between η and ζ -- 21.4 Relation Between η and ξ -- 21.5 Proof That Interval Lengths of I.I.D. Process Follow Exponential Distribution -- 21.6 Proof of α=0.5 and ν=1.0 for I.I.D. Process -- 21.7 Summary of Shannon's Method to Estimate Entropy Rate -- 21.8 Relation of h, Perplexity, and Cross Entropy -- 21.9 Type Counts, Shannon Entropy, and Yule's K, via Generalized Entropy -- 21.10 Upper Bound of Compositional Distance -- 21.11 Rough Summary of Mandelbrot's Communication Optimization Rationale to Deduce a Power Law -- 21.12 Rough Definition of Central Limit Theorem -- 21.13 Definition of Simon Model -- 22 Data -- 22.1 Literary Texts -- 22.2 Large Corpora -- 22.3 Other Kinds of Data Related to Language -- 22.4 Corpora for Scripts -- References -- Index. |
Record Nr. | UNISA-996466553203316 |
Tanaka-Ishii Kumiko | ||
Cham, Switzerland : , : Springer, , [2021] | ||
Materiale a stampa | ||
Lo trovi qui: Univ. di Salerno | ||
|
Statistical universals of language : mathematical chance vs. human choice / / Kumiko Tanaka-Ishii |
Autore | Tanaka-Ishii Kumiko |
Pubbl/distr/stampa | Cham, Switzerland : , : Springer, , [2021] |
Descrizione fisica | 1 online resource (226 pages) : illustrations |
Disciplina | 410.151 |
Collana | Mathematics in Mind |
Soggetto topico |
Mathematical linguistics
Computational linguistics Lingüística matemàtica Lingüística computacional |
Soggetto genere / forma | Llibres electrònics |
ISBN | 3-030-59377-0 |
Formato | Materiale a stampa |
Livello bibliografico | Monografia |
Lingua di pubblicazione | eng |
Nota di contenuto |
Intro -- Contents -- Part I Language as a Complex System -- 1 Introduction -- 1.1 Aims -- 1.2 Structure of This Book -- 1.3 Position of This Book -- 1.3.1 Statistical Universals as Computational Properties of Natural Language -- 1.3.2 A Holistic Approach to Language via Complex Systems Theory -- 1.4 Prospectus -- 2 Universals -- 2.1 Language Universals -- 2.2 Layers of Universals -- 2.3 Universal, Stylized Hypothesis, and Law -- 3 Language as a Complex System -- 3.1 Sequence and Corpus -- 3.1.1 Definition of Corpus -- 3.1.2 On Meaning -- 3.1.3 On Infinity -- 3.1.4 On Randomness -- 3.2 Power Functions -- 3.3 Scale-Free Property: Statistical Self-Similarity -- 3.4 Complex Systems -- 3.5 Two Basic Random Processes -- Part II Property of Population -- 4 Relation Between Rank and Frequency -- 4.1 Zipf's Law -- 4.2 Scale-Free Property and Hapax Legomena -- 4.3 Monkey Text -- 4.4 Power Law of n-grams -- 4.5 Relative Rank-Frequency Distribution -- 5 Bias in Rank-Frequency Relation -- 5.1 Literary Texts -- 5.2 Speech, Music, Programs, and More -- 5.3 Deviations from Power Law -- 5.3.1 Scale -- 5.3.2 Speaker Maturity -- 5.3.3 Characters vs. Words -- 5.4 Nature of Deviations -- 6 Related Statistical Universals -- 6.1 Density Function -- 6.2 Vocabulary Growth -- Part III Property of Sequences -- 7 Returns -- 7.1 Word Returns -- 7.2 Distribution of Return Interval Lengths -- 7.3 Exceedance Probability -- 7.4 Bias Underlying Return Intervals -- 7.5 Rare Words as a Set -- 7.6 Behavior of Rare Words -- 8 Long-Range Correlation -- 8.1 Long-Range Correlation Analysis -- 8.2 Mutual Information -- 8.3 Autocorrelation Function -- 8.4 Correlation of Word Intervals -- 8.5 Nonstationarity of Language -- 8.6 Weak Long-Range Correlation -- 9 Fluctuation -- 9.1 Fluctuation Analysis -- 9.2 Taylor Analysis -- 9.3 Differences Between the Two Fluctuation Analyses.
9.4 Dimensions of Linguistic Fluctuation -- 9.5 Relations Among Methods -- 10 Complexity -- 10.1 Complexity of Sequence -- 10.2 Entropy Rate -- 10.3 Hilberg's Ansatz -- 10.4 Computing Entropy Rate of Human Language -- 10.5 Reconsidering the Question of Entropy Rate -- Part IV Relation to Linguistic Elements and Structure -- 11 Articulation of Elements -- 11.1 Harris's Hypothesis -- 11.2 Information-Theoretic Reformulation -- 11.3 Accuracy of Articulation by Harris's Scheme -- 12 Word Meaning and Value -- 12.1 Meaning as Use and Distributional Semantics -- 12.2 Weber-Fechner Law -- 12.3 Word Frequency and Familiarity -- 12.4 Vector Representation of Words -- 12.5 Compositionality of Meaning -- 12.6 Statistical Universals and Meaning -- 13 Size and Frequency -- 13.1 Zipf Abbreviation of Words -- 13.2 Compound Length and Frequency -- 14 Grammatical Structure and Long Memory -- 14.1 Simple Grammatical Framework -- 14.2 Phrase Structure Grammar -- 14.3 Long-Range Dependence in Sentences -- 14.4 Grammatical Structure and Long-Range Correlation -- 14.5 Nature of Long Memory Underlying Language -- Part V Mathematical Models -- 15 Theories Behind Zipf's Law -- 15.1 Communication Optimization -- 15.2 A Limit Theorem -- 15.3 Significance of Statistical Universals -- 16 Mathematical Generative Models -- 16.1 Criteria for Statistical Universals -- 16.2 Independent and Identically Distributed Sequences -- 16.3 Simon Model and Variants -- 16.4 Random Walk Models -- 17 Language Models -- 17.1 Language Models and Statistical Universals -- 17.2 Building Language Models -- 17.3 N-Gram Models -- 17.4 Grammatical Models -- 17.5 Neural Models -- 17.6 Future Directions for Generative Models -- Part VI Ending Remarks -- 18 Conclusion -- 19 Acknowledgments -- Part VII Appendix -- 20 Glossary and Notations -- 20.1 Glossary -- 20.2 Mathematical Notation. 20.3 Other Conventions -- 21 Mathematical Details -- 21.1 Fitting Functions -- 21.2 Proof that Monkey Typing Follows a Power Law -- 21.3 Relation Between η and ζ -- 21.4 Relation Between η and ξ -- 21.5 Proof That Interval Lengths of I.I.D. Process Follow Exponential Distribution -- 21.6 Proof of α=0.5 and ν=1.0 for I.I.D. Process -- 21.7 Summary of Shannon's Method to Estimate Entropy Rate -- 21.8 Relation of h, Perplexity, and Cross Entropy -- 21.9 Type Counts, Shannon Entropy, and Yule's K, via Generalized Entropy -- 21.10 Upper Bound of Compositional Distance -- 21.11 Rough Summary of Mandelbrot's Communication Optimization Rationale to Deduce a Power Law -- 21.12 Rough Definition of Central Limit Theorem -- 21.13 Definition of Simon Model -- 22 Data -- 22.1 Literary Texts -- 22.2 Large Corpora -- 22.3 Other Kinds of Data Related to Language -- 22.4 Corpora for Scripts -- References -- Index. |
Record Nr. | UNINA-9910484715103321 |
Tanaka-Ishii Kumiko | ||
Cham, Switzerland : , : Springer, , [2021] | ||
Materiale a stampa | ||
Lo trovi qui: Univ. Federico II | ||
|