top

  Info

  • Utilizzare la checkbox di selezione a fianco di ciascun documento per attivare le funzionalità di stampa, invio email, download nei formati disponibili del (i) record.

  Info

  • Utilizzare questo link per rimuovere la selezione effettuata.
Computational Methods for Corpus Annotation and Analysis / / by Xiaofei Lu
Computational Methods for Corpus Annotation and Analysis / / by Xiaofei Lu
Autore Lu Xiaofei
Edizione [1st ed. 2014.]
Pubbl/distr/stampa Dordrecht : , : Springer Netherlands : , : Imprint : Springer, , 2014
Descrizione fisica 1 online resource (192 p.)
Disciplina 006.35
Soggetto topico Applied linguistics
Natural language processing (Computer science)
Applied Linguistics
Natural Language Processing (NLP)
Corpus (Lingüística)
Lingüística computacional
Soggetto genere / forma Llibres electrònics
ISBN 94-017-8645-3
Formato Materiale a stampa
Livello bibliografico Monografia
Lingua di pubblicazione eng
Nota di contenuto Preface -- Chapter 1 Introduction -- Chapter 2 Text Processing with the Command Line Interface -- Chapter 3 Lexical Annotation -- Chapter 4 Lexical Analysis -- Chapter 5 Syntactic Annotation -- Chapter 6 Syntactic Analysis -- Chapter 7 Semantic, Pragmatic and Discourse Analysis -- Chapter 8 Summary and Outlook -- Appendix.
Record Nr. UNINA-9910484108103321
Lu Xiaofei  
Dordrecht : , : Springer Netherlands : , : Imprint : Springer, , 2014
Materiale a stampa
Lo trovi qui: Univ. Federico II
Opac: Controlla la disponibilità qui
Finite-state computational morphology : an analyzer and generator for Georgian / / Irina Lobzhanidze
Finite-state computational morphology : an analyzer and generator for Georgian / / Irina Lobzhanidze
Autore Lobzhanidze Irina
Pubbl/distr/stampa Cham, Switzerland : , : Springer, , [2022]
Descrizione fisica 1 online resource (229 pages)
Disciplina 499.96959
Soggetto topico Georgian language - Grammar
Computational linguistics
Georgià (Llengua)
Gramàtica
Lingüística computacional
Soggetto genere / forma Llibres electrònics
ISBN 9783030902483
9783030902476
Formato Materiale a stampa
Livello bibliografico Monografia
Lingua di pubblicazione eng
Record Nr. UNINA-9910544864303321
Lobzhanidze Irina  
Cham, Switzerland : , : Springer, , [2022]
Materiale a stampa
Lo trovi qui: Univ. Federico II
Opac: Controlla la disponibilità qui
From Complex Sentences to a Formal Semantic Representation Using Syntactic Text Simplification and Open Information Extraction / / Christina Niklaus
From Complex Sentences to a Formal Semantic Representation Using Syntactic Text Simplification and Open Information Extraction / / Christina Niklaus
Autore Niklaus Christina
Edizione [First edition.]
Pubbl/distr/stampa Wiesbaden, Germany : , : Springer Vieweg, Springer Fachmedien Wiesbaden GmbH, part of Springer Nature, , [2022]
Descrizione fisica 1 online resource (0 pages)
Disciplina 410.285
Soggetto topico Computational linguistics
Natural language processing (Computer science)
Text processing (Computer science)
Tractament del llenguatge natural (Informàtica)
Tractament de textos
Lingüística computacional
Soggetto genere / forma Llibres electrònics
ISBN 3-658-38697-5
Formato Materiale a stampa
Livello bibliografico Monografia
Lingua di pubblicazione eng
Record Nr. UNINA-9910590079103321
Niklaus Christina  
Wiesbaden, Germany : , : Springer Vieweg, Springer Fachmedien Wiesbaden GmbH, part of Springer Nature, , [2022]
Materiale a stampa
Lo trovi qui: Univ. Federico II
Opac: Controlla la disponibilità qui
Language Corpora Annotation and Processing / / by Niladri Sekhar Dash
Language Corpora Annotation and Processing / / by Niladri Sekhar Dash
Autore Dash Niladri Sekhar <1967->
Edizione [1st ed. 2021.]
Pubbl/distr/stampa Singapore : , : Springer Nature Singapore : , : Imprint : Springer, , 2021
Descrizione fisica 1 online resource (292 pages)
Disciplina 410.188
Soggetto topico Computational linguistics
Linguistics - Methodology
Computational Linguistics
Research Methods in Language and Linguistics
Corpus (Lingüística)
Lingüística computacional
Soggetto genere / forma Llibres electrònics
ISBN 981-16-2960-9
Formato Materiale a stampa
Livello bibliografico Monografia
Lingua di pubblicazione eng
Nota di contenuto Introduction -- Chapter 1. Corpora Annotation: Definition and Types -- Chapter 2. Maxims, Principles, & Rules of Text Annotation -- Chapter 3. Extratextual Documentative Annotation -- Chapter 4. Etymological Annotation -- Chapter 5. Concordance, KWIC, LWG and Collocation -- Chapter 6. Morphological Processing of Words -- Chapter 7. Part-of-Speech Tagging -- Chapter 8. Lemmatization of Inflected Nouns -- Chapter 9. Decomposition of Inflected Verbs -- Chapter 10. Parsing Sentences in a Text. .
Record Nr. UNINA-9910491024503321
Dash Niladri Sekhar <1967->  
Singapore : , : Springer Nature Singapore : , : Imprint : Springer, , 2021
Materiale a stampa
Lo trovi qui: Univ. Federico II
Opac: Controlla la disponibilità qui
Literature, Language and Computing : Russian Contribution / / edited by Polina Eismont, Maria Khokhlova, Mikhail Koryshev, Elena Riekhakaynen
Literature, Language and Computing : Russian Contribution / / edited by Polina Eismont, Maria Khokhlova, Mikhail Koryshev, Elena Riekhakaynen
Autore Eismont Polina
Edizione [1st ed. 2023.]
Pubbl/distr/stampa Singapore : , : Springer Nature Singapore : , : Imprint : Springer, , 2023
Descrizione fisica 1 online resource (254 pages)
Disciplina 410.285
Altri autori (Persone) KhokhlovaMaria
KoryshevMikhail
RiekhakaynenElena
Soggetto topico Computational linguistics
Linguistics—Methodology
Translating and interpreting
Applied linguistics
Computational Linguistics
Research Methods in Language and Linguistics
Language Translation
Applied Linguistics
Lingüística computacional
Lingüística aplicada
Soggetto genere / forma Llibres electrònics
ISBN 981-9936-04-7
Formato Materiale a stampa
Livello bibliografico Monografia
Lingua di pubblicazione eng
Nota di contenuto Chapter 1. Literature, Language and Computing: Russian Contribution -- Chapter 2. Self-Repair in Russian Spoken Discourse in Psycholinguistics Aspect: Correlation Analysis and Quantitative Data -- Chapter 3. A Lexicographic Portrait of a Russian Microsyntactic Unit -- Chapter 4. “Plain and Natural” vs “Accurate and Unambiguous”: Pronominal Intrasentential Anaphora in Russian Legislative Texts -- Chapter 5. The Old Church Slavonic Corpora and Their Use in Language Studies at the University -- Chapter 6. Core coordination units in macro- and microdiachrony: experimental data -- Chapter 7. The Use of Futur Antérieur in the Past in Old French: Experience of a Corpus-Based Study -- Chapter 8. Nachhaltigkeit in media crisis discourse -- Chapter 9. Using Corpora for Verifying Language Choices in Translation -- Chapter 10. Stylometric Methods in Comparative Analysis of Text -- Chapter 11. Lexical Diversity of Russian Poets -- Chapter 12. A semantic corpus of Russian literature of 18 century: its current state and its future -- Chapter 13. Multimedia dictionary of verbal vocabulary: concept, structure, implementation -- Chapter 14. Incorporating informal e-learning into foreign language teaching through collaborative personalization -- Chapter 15. Pedagogical peer-to-peer online practice as a means of forming professional competence in distant learning format -- Chapter 16. To the East Slavonic proverbs of the thematic group “Learning - inattention” (as seen in the new Electronic dictionary of current active East Slavonic proverbs) -- Chapter 17. Opportunities of using Dental Internet resources in teaching the language of specialty in the course of Russian as a foreign language -- Chapter 18. Machine Translation vs Human Translation of Artionyms -- Chapter 19. The Emotion in Text Analyzer: How to Visualize its Output? -- Chapter 20. The Multimedia Corpus of Russian Ironic Speech for Phonetic Analysis -- Chapter 21. Theory of Mind and the Mechanism of Imagination for a Companion Robot.
Record Nr. UNINA-9910734873703321
Eismont Polina  
Singapore : , : Springer Nature Singapore : , : Imprint : Springer, , 2023
Materiale a stampa
Lo trovi qui: Univ. Federico II
Opac: Controlla la disponibilità qui
Statistical universals of language : mathematical chance vs. human choice / / Kumiko Tanaka-Ishii
Statistical universals of language : mathematical chance vs. human choice / / Kumiko Tanaka-Ishii
Autore Tanaka-Ishii Kumiko
Pubbl/distr/stampa Cham, Switzerland : , : Springer, , [2021]
Descrizione fisica 1 online resource (226 pages) : illustrations
Disciplina 410.151
Collana Mathematics in Mind
Soggetto topico Mathematical linguistics
Computational linguistics
Lingüística matemàtica
Lingüística computacional
Soggetto genere / forma Llibres electrònics
ISBN 3-030-59377-0
Formato Materiale a stampa
Livello bibliografico Monografia
Lingua di pubblicazione eng
Nota di contenuto Intro -- Contents -- Part I Language as a Complex System -- 1 Introduction -- 1.1 Aims -- 1.2 Structure of This Book -- 1.3 Position of This Book -- 1.3.1 Statistical Universals as Computational Properties of Natural Language -- 1.3.2 A Holistic Approach to Language via Complex Systems Theory -- 1.4 Prospectus -- 2 Universals -- 2.1 Language Universals -- 2.2 Layers of Universals -- 2.3 Universal, Stylized Hypothesis, and Law -- 3 Language as a Complex System -- 3.1 Sequence and Corpus -- 3.1.1 Definition of Corpus -- 3.1.2 On Meaning -- 3.1.3 On Infinity -- 3.1.4 On Randomness -- 3.2 Power Functions -- 3.3 Scale-Free Property: Statistical Self-Similarity -- 3.4 Complex Systems -- 3.5 Two Basic Random Processes -- Part II Property of Population -- 4 Relation Between Rank and Frequency -- 4.1 Zipf's Law -- 4.2 Scale-Free Property and Hapax Legomena -- 4.3 Monkey Text -- 4.4 Power Law of n-grams -- 4.5 Relative Rank-Frequency Distribution -- 5 Bias in Rank-Frequency Relation -- 5.1 Literary Texts -- 5.2 Speech, Music, Programs, and More -- 5.3 Deviations from Power Law -- 5.3.1 Scale -- 5.3.2 Speaker Maturity -- 5.3.3 Characters vs. Words -- 5.4 Nature of Deviations -- 6 Related Statistical Universals -- 6.1 Density Function -- 6.2 Vocabulary Growth -- Part III Property of Sequences -- 7 Returns -- 7.1 Word Returns -- 7.2 Distribution of Return Interval Lengths -- 7.3 Exceedance Probability -- 7.4 Bias Underlying Return Intervals -- 7.5 Rare Words as a Set -- 7.6 Behavior of Rare Words -- 8 Long-Range Correlation -- 8.1 Long-Range Correlation Analysis -- 8.2 Mutual Information -- 8.3 Autocorrelation Function -- 8.4 Correlation of Word Intervals -- 8.5 Nonstationarity of Language -- 8.6 Weak Long-Range Correlation -- 9 Fluctuation -- 9.1 Fluctuation Analysis -- 9.2 Taylor Analysis -- 9.3 Differences Between the Two Fluctuation Analyses.
9.4 Dimensions of Linguistic Fluctuation -- 9.5 Relations Among Methods -- 10 Complexity -- 10.1 Complexity of Sequence -- 10.2 Entropy Rate -- 10.3 Hilberg's Ansatz -- 10.4 Computing Entropy Rate of Human Language -- 10.5 Reconsidering the Question of Entropy Rate -- Part IV Relation to Linguistic Elements and Structure -- 11 Articulation of Elements -- 11.1 Harris's Hypothesis -- 11.2 Information-Theoretic Reformulation -- 11.3 Accuracy of Articulation by Harris's Scheme -- 12 Word Meaning and Value -- 12.1 Meaning as Use and Distributional Semantics -- 12.2 Weber-Fechner Law -- 12.3 Word Frequency and Familiarity -- 12.4 Vector Representation of Words -- 12.5 Compositionality of Meaning -- 12.6 Statistical Universals and Meaning -- 13 Size and Frequency -- 13.1 Zipf Abbreviation of Words -- 13.2 Compound Length and Frequency -- 14 Grammatical Structure and Long Memory -- 14.1 Simple Grammatical Framework -- 14.2 Phrase Structure Grammar -- 14.3 Long-Range Dependence in Sentences -- 14.4 Grammatical Structure and Long-Range Correlation -- 14.5 Nature of Long Memory Underlying Language -- Part V Mathematical Models -- 15 Theories Behind Zipf's Law -- 15.1 Communication Optimization -- 15.2 A Limit Theorem -- 15.3 Significance of Statistical Universals -- 16 Mathematical Generative Models -- 16.1 Criteria for Statistical Universals -- 16.2 Independent and Identically Distributed Sequences -- 16.3 Simon Model and Variants -- 16.4 Random Walk Models -- 17 Language Models -- 17.1 Language Models and Statistical Universals -- 17.2 Building Language Models -- 17.3 N-Gram Models -- 17.4 Grammatical Models -- 17.5 Neural Models -- 17.6 Future Directions for Generative Models -- Part VI Ending Remarks -- 18 Conclusion -- 19 Acknowledgments -- Part VII Appendix -- 20 Glossary and Notations -- 20.1 Glossary -- 20.2 Mathematical Notation.
20.3 Other Conventions -- 21 Mathematical Details -- 21.1 Fitting Functions -- 21.2 Proof that Monkey Typing Follows a Power Law -- 21.3 Relation Between η and ζ -- 21.4 Relation Between η and ξ -- 21.5 Proof That Interval Lengths of I.I.D. Process Follow Exponential Distribution -- 21.6 Proof of α=0.5 and ν=1.0 for I.I.D. Process -- 21.7 Summary of Shannon's Method to Estimate Entropy Rate -- 21.8 Relation of h, Perplexity, and Cross Entropy -- 21.9 Type Counts, Shannon Entropy, and Yule's K, via Generalized Entropy -- 21.10 Upper Bound of Compositional Distance -- 21.11 Rough Summary of Mandelbrot's Communication Optimization Rationale to Deduce a Power Law -- 21.12 Rough Definition of Central Limit Theorem -- 21.13 Definition of Simon Model -- 22 Data -- 22.1 Literary Texts -- 22.2 Large Corpora -- 22.3 Other Kinds of Data Related to Language -- 22.4 Corpora for Scripts -- References -- Index.
Record Nr. UNISA-996466553203316
Tanaka-Ishii Kumiko  
Cham, Switzerland : , : Springer, , [2021]
Materiale a stampa
Lo trovi qui: Univ. di Salerno
Opac: Controlla la disponibilità qui
Statistical universals of language : mathematical chance vs. human choice / / Kumiko Tanaka-Ishii
Statistical universals of language : mathematical chance vs. human choice / / Kumiko Tanaka-Ishii
Autore Tanaka-Ishii Kumiko
Pubbl/distr/stampa Cham, Switzerland : , : Springer, , [2021]
Descrizione fisica 1 online resource (226 pages) : illustrations
Disciplina 410.151
Collana Mathematics in Mind
Soggetto topico Mathematical linguistics
Computational linguistics
Lingüística matemàtica
Lingüística computacional
Soggetto genere / forma Llibres electrònics
ISBN 3-030-59377-0
Formato Materiale a stampa
Livello bibliografico Monografia
Lingua di pubblicazione eng
Nota di contenuto Intro -- Contents -- Part I Language as a Complex System -- 1 Introduction -- 1.1 Aims -- 1.2 Structure of This Book -- 1.3 Position of This Book -- 1.3.1 Statistical Universals as Computational Properties of Natural Language -- 1.3.2 A Holistic Approach to Language via Complex Systems Theory -- 1.4 Prospectus -- 2 Universals -- 2.1 Language Universals -- 2.2 Layers of Universals -- 2.3 Universal, Stylized Hypothesis, and Law -- 3 Language as a Complex System -- 3.1 Sequence and Corpus -- 3.1.1 Definition of Corpus -- 3.1.2 On Meaning -- 3.1.3 On Infinity -- 3.1.4 On Randomness -- 3.2 Power Functions -- 3.3 Scale-Free Property: Statistical Self-Similarity -- 3.4 Complex Systems -- 3.5 Two Basic Random Processes -- Part II Property of Population -- 4 Relation Between Rank and Frequency -- 4.1 Zipf's Law -- 4.2 Scale-Free Property and Hapax Legomena -- 4.3 Monkey Text -- 4.4 Power Law of n-grams -- 4.5 Relative Rank-Frequency Distribution -- 5 Bias in Rank-Frequency Relation -- 5.1 Literary Texts -- 5.2 Speech, Music, Programs, and More -- 5.3 Deviations from Power Law -- 5.3.1 Scale -- 5.3.2 Speaker Maturity -- 5.3.3 Characters vs. Words -- 5.4 Nature of Deviations -- 6 Related Statistical Universals -- 6.1 Density Function -- 6.2 Vocabulary Growth -- Part III Property of Sequences -- 7 Returns -- 7.1 Word Returns -- 7.2 Distribution of Return Interval Lengths -- 7.3 Exceedance Probability -- 7.4 Bias Underlying Return Intervals -- 7.5 Rare Words as a Set -- 7.6 Behavior of Rare Words -- 8 Long-Range Correlation -- 8.1 Long-Range Correlation Analysis -- 8.2 Mutual Information -- 8.3 Autocorrelation Function -- 8.4 Correlation of Word Intervals -- 8.5 Nonstationarity of Language -- 8.6 Weak Long-Range Correlation -- 9 Fluctuation -- 9.1 Fluctuation Analysis -- 9.2 Taylor Analysis -- 9.3 Differences Between the Two Fluctuation Analyses.
9.4 Dimensions of Linguistic Fluctuation -- 9.5 Relations Among Methods -- 10 Complexity -- 10.1 Complexity of Sequence -- 10.2 Entropy Rate -- 10.3 Hilberg's Ansatz -- 10.4 Computing Entropy Rate of Human Language -- 10.5 Reconsidering the Question of Entropy Rate -- Part IV Relation to Linguistic Elements and Structure -- 11 Articulation of Elements -- 11.1 Harris's Hypothesis -- 11.2 Information-Theoretic Reformulation -- 11.3 Accuracy of Articulation by Harris's Scheme -- 12 Word Meaning and Value -- 12.1 Meaning as Use and Distributional Semantics -- 12.2 Weber-Fechner Law -- 12.3 Word Frequency and Familiarity -- 12.4 Vector Representation of Words -- 12.5 Compositionality of Meaning -- 12.6 Statistical Universals and Meaning -- 13 Size and Frequency -- 13.1 Zipf Abbreviation of Words -- 13.2 Compound Length and Frequency -- 14 Grammatical Structure and Long Memory -- 14.1 Simple Grammatical Framework -- 14.2 Phrase Structure Grammar -- 14.3 Long-Range Dependence in Sentences -- 14.4 Grammatical Structure and Long-Range Correlation -- 14.5 Nature of Long Memory Underlying Language -- Part V Mathematical Models -- 15 Theories Behind Zipf's Law -- 15.1 Communication Optimization -- 15.2 A Limit Theorem -- 15.3 Significance of Statistical Universals -- 16 Mathematical Generative Models -- 16.1 Criteria for Statistical Universals -- 16.2 Independent and Identically Distributed Sequences -- 16.3 Simon Model and Variants -- 16.4 Random Walk Models -- 17 Language Models -- 17.1 Language Models and Statistical Universals -- 17.2 Building Language Models -- 17.3 N-Gram Models -- 17.4 Grammatical Models -- 17.5 Neural Models -- 17.6 Future Directions for Generative Models -- Part VI Ending Remarks -- 18 Conclusion -- 19 Acknowledgments -- Part VII Appendix -- 20 Glossary and Notations -- 20.1 Glossary -- 20.2 Mathematical Notation.
20.3 Other Conventions -- 21 Mathematical Details -- 21.1 Fitting Functions -- 21.2 Proof that Monkey Typing Follows a Power Law -- 21.3 Relation Between η and ζ -- 21.4 Relation Between η and ξ -- 21.5 Proof That Interval Lengths of I.I.D. Process Follow Exponential Distribution -- 21.6 Proof of α=0.5 and ν=1.0 for I.I.D. Process -- 21.7 Summary of Shannon's Method to Estimate Entropy Rate -- 21.8 Relation of h, Perplexity, and Cross Entropy -- 21.9 Type Counts, Shannon Entropy, and Yule's K, via Generalized Entropy -- 21.10 Upper Bound of Compositional Distance -- 21.11 Rough Summary of Mandelbrot's Communication Optimization Rationale to Deduce a Power Law -- 21.12 Rough Definition of Central Limit Theorem -- 21.13 Definition of Simon Model -- 22 Data -- 22.1 Literary Texts -- 22.2 Large Corpora -- 22.3 Other Kinds of Data Related to Language -- 22.4 Corpora for Scripts -- References -- Index.
Record Nr. UNINA-9910484715103321
Tanaka-Ishii Kumiko  
Cham, Switzerland : , : Springer, , [2021]
Materiale a stampa
Lo trovi qui: Univ. Federico II
Opac: Controlla la disponibilità qui