LEADER 03993nam 22004935 450 001 9910337563903321 005 20200703072923.0 010 $a3-030-14771-1 024 7 $a10.1007/978-3-030-14771-6 035 $a(CKB)4100000007823611 035 $a(MiAaPQ)EBC5744666 035 $a(DE-He213)978-3-030-14771-6 035 $a(PPN)235669415 035 $a(EXLCZ)994100000007823611 100 $a20190402d2019 u| 0 101 0 $aeng 135 $aurcnu|||||||| 181 $ctxt$2rdacontent 182 $cc$2rdamedia 183 $acr$2rdacarrier 200 10$aSyntactic n-grams in Computational Linguistics /$fby Grigori Sidorov 205 $a1st ed. 2019. 210 1$aCham :$cSpringer International Publishing :$cImprint: Springer,$d2019. 215 $a1 online resource (94 pages) 225 1 $aSpringerBriefs in Computer Science,$x2191-5768 311 $a3-030-14770-3 327 $aPreface -- Introduction -- PART I. VECTOR SPACE MODEL IN THE ANALYSIS OF SIMILARITY BETWEEN TEXTS -- Chapter 1. Formalization in computational linguistics -- Chapter 2. Vector space model -- Chapter 3. Vector space model for texts and the tf-idf measure -- Chapter 4. Latent Semantic Analysis (LSA): reduction of dimensions -- Chapter 5. Design of experiments in computational linguistics -- Chapter 6. Example of application of n-grams: authorship attribution using n-grams of syllables -- PART II. NON-LINEAR CONSTRUCTION OF N-GRAMS -- Chapter 7. Syntactic n-grams: the concept -- Chapter 8. Types of syntactic n-grams according to their components -- Chapter 9. Continuous and non-continuous syntactic n-grams -- Chapter 10. Metalanguage of syntactic n-grams representation -- Chapter 11. Examples of construction of non-continuous syntactic n-grams -- Chapter 12. Automatic analysis of authorship using syntactic n-grams -- Chapter 13. Filtered n-grams -- Chapter 14. Generalized n-grams. 330 $aThis book is about a new approach in the field of computational linguistics related to the idea of constructing n-grams in non-linear manner, while the traditional approach consists in using the data from the surface structure of texts, i.e., the linear structure. In this book, we propose and systematize the concept of syntactic n-grams, which allows using syntactic information within the automatic text processing methods related to classification or clustering. It is a very interesting example of application of linguistic information in the automatic (computational) methods. Roughly speaking, the suggestion is to follow syntactic trees and construct n-grams based on paths in these trees. There are several types of non-linear n-grams; future work should determine, which types of n-grams are more useful in which natural language processing (NLP) tasks. This book is intended for specialists in the field of computational linguistics. However, we made an effort to explain in a clear manner how to use n-grams; we provide a large number of examples, and therefore we believe that the book is also useful for graduate students who already have some previous background in the field. 410 0$aSpringerBriefs in Computer Science,$x2191-5768 606 $aNatural language processing (Computer science) 606 $aComputational linguistics 606 $aNatural Language Processing (NLP)$3https://scigraph.springernature.com/ontologies/product-market-codes/I21040 606 $aComputational Linguistics$3https://scigraph.springernature.com/ontologies/product-market-codes/N22000 615 0$aNatural language processing (Computer science). 615 0$aComputational linguistics. 615 14$aNatural Language Processing (NLP). 615 24$aComputational Linguistics. 676 $a410.285 676 $a410.285 700 $aSidorov$b Grigori$4aut$4http://id.loc.gov/vocabulary/relators/aut$01061319 906 $aBOOK 912 $a9910337563903321 996 $aSyntactic n-grams in Computational Linguistics$92518411 997 $aUNINA