1.

Record Nr.

UNINA9910827005303321

Titolo

Sequences in language and text / / edited by George K. Mikros, Ján Macutek

Pubbl/distr/stampa

Berlin ; ; Boston : , : De Gruyter Mouton, , [2015]

©2015

ISBN

3-11-039477-4

3-11-036287-2

Descrizione fisica

1 online resource (268 p.)

Collana

Quantitative linguistics ; ; Ql69

Disciplina

006.3/5

Soggetti

Computational linguistics - Research

Lingua di pubblicazione

Inglese

Formato

Materiale a stampa

Livello bibliografico

Monografia

Note generali

Description based upon print version of record.

Nota di bibliografia

Includes bibliographical references and index.

Nota di contenuto

Front matter -- Foreword -- Contents -- Introduction -- Linguistic Analysis Based on Fuzzy Similarity Models -- Textual navigation and autocorrelation -- Menzerath-Altmann law versus random model -- Text length and the lambda frequency structure of a text -- Linguistic Motifs -- Linguistic Modelling of Sequential Phenomena -- Menzerath-Altmann Law for Word Length Motifs -- Is the Distribution of L-Motifs Inherited from the Word Length Distribution? -- Sequential Structures in “Dalimil’s Chronicle” -- Comparative Evaluation of String Similarity Measures for Automatic Language Classification -- Predicting Sales Trends -- Where Alice Meets Little Prince -- A Probabilistic Model for the Arc Length in Quantitative Linguistics -- Subject Index -- Authors Index -- Authors’ addresses

Sommario/riassunto

The edited volume Sequences in Language and Text is the first collection of original research in the area of the quantitative analysis of sequentially organized linguistic data. Linguistic sequences are extremely useful textual structures in almost all areas of Language Technology. Character and word n-grams are by far the most successful features in text classification tasks such as authorship identification, text categorization, genre classification, sentiment analysis etc. Furthermore character linguistic sequences are the basis for linguistic modeling and subsequent applications such as speech recognition, language identification etc. In addition to the above



language technology oriented research, the present volume aims to give insight to the theoretical value of linguistic sequences. Sequences in texts can be produced by a number of different factors, either external to the linguistic system or by its own grammatical structure. This volume hosts contributions which will analyze linguistic sequences using quantitative methods under the synergetic theoretical framework that can explain their role in the linguistic system.