1.

Record Nr.

UNINA9910971330603321

Titolo

Corpus linguistics and the web / / edited by Marianne Hundt, Nadja Nesselhauf and Carolin Biewer

Pubbl/distr/stampa

Amsterdam : , : Rodopi, , 2007

ISBN

9789401203791

9401203792

9781429481274

1429481277

Edizione

[1st ed.]

Descrizione fisica

1 online resource (312 p.)

Collana

Language and computers ; ; no. 59

Altri autori (Persone)

HundtMarianne

NesselhaufNadja

BiewerCarolin

Disciplina

410.285

Soggetti

Philology - Data processing

Computational linguistics

Discourse analysis - Data processing

Internet

Conference papers and proceedings.

Lingua di pubblicazione

Inglese

Formato

Materiale a stampa

Livello bibliografico

Monografia

Note generali

In part, contributions to a conference entitled: Corpus Linguistics -- Perspectives for the future, held in Heidelberg in October 2004.

Conference organized by the Internationales Wissenschaftsforum Heidelberg (IWH), Fritz Thyssen Stiftung and Stiftung Universität Heidelberg.

Nota di bibliografia

Includes bibliographical references and index.

Nota di contenuto

Preliminary material / Editors Corpus Linguistics and the Web -- Corpus linguistics and the web / Marianne Hundt , Nadja Nesselhauf and Carolin Biewer -- Using web data for linguistic purposes / Anke Lüdeling , Stefan Evert and Marco Baroni -- Concordancing the web: promise and problems, tools and techniques / William H. Fletcher -- WebCorp: an integrated system for web text search / Antoinette Renouf , Andrew Kehoe and Jayeeta Banerjee -- From web page to mega-corpus: the CNN transcripts / Sebastian Hoffmann -- Constructing a corpus from the web: message boards / Claudia Claridge -- Towards a taxonomy of web registers and text types: a multi-dimensional analysis



/ Douglas Biber and Jerry Kurjian -- New resources, or just better old ones? The Holy Grail of representativeness / Geoffrey Leech -- An under-exploited resource: using the BNC for exploring the nature of language learning / Graeme Kennedy -- Exploring constructions on the web: a case study / Anette Rosenbach -- Determinants of grammatical variation in English and the formation / confirmation of linguistic hypotheses by means of internet data / Günter Rohdenburg -- Recalcitrant problems of comparative alternation and new insights emerging from internet data / Britta Mondorf -- Change and variation in present-day English: integrating the analysis of closed corpora and web-based monitoring / Christian Mair -- The dynamics of inner and outer circle varieties in the South Pacific and East Asia / Marianne Hundt and Carolin Biewer -- ‘He rung the bell’ and ‘she drunk ale’ – non-standard past tense forms in traditional British dialects and on the internet / Lieselotte Anderwald -- Diachronic analysis with the internet? Will and shall in ARCHER and in a corpus of e-texts from the web / Nadja Nesselhauf.

Sommario/riassunto

Using the Web as Corpus is one of the recent challenges for corpus linguistics. This volume presents a current state-of-the-arts discussion of the topic. The articles address practical problems such as suitable linguistic search tools for accessing the www, the question of register variation, or they probe into methods for culling data from the web. The book also offers a wide range of case studies, covering morphology, syntax, lexis, as well as synchronic and diachronic variation in English. These case studies make use of the two approaches to the www in corpus linguistics – web-as-corpus and web-for-corpus-building. The case studies demonstrate that web data can provide useful additional evidence for a broad range of research questions.