top

  Info

  • Utilizzare la checkbox di selezione a fianco di ciascun documento per attivare le funzionalità di stampa, invio email, download nei formati disponibili del (i) record.

  Info

  • Utilizzare questo link per rimuovere la selezione effettuata.
Automated data collection with R : a practical guide to web scraping and text mining / / Simon Munzert [and three others]
Automated data collection with R : a practical guide to web scraping and text mining / / Simon Munzert [and three others]
Autore Munzert Simon
Pubbl/distr/stampa Chichester, England : , : Wiley, , 2015
Descrizione fisica 1 online resource (XXII, 453 p.)
Disciplina 006.3/12
Soggetto topico Data mining
Automatic data collection systems
Social sciences - Research - Data processing
R (Computer program language)
ISBN 1-118-83480-1
1-118-83473-9
1-118-83478-X
Classificazione COM021030
Formato Materiale a stampa
Livello bibliografico Monografia
Lingua di pubblicazione eng
Nota di contenuto Machine generated contents note: Dedication Table of Contents List of Figures List of Tables Preface 1 Introduction 1.1 Case Study: World Heritage Sites in Danger 1.2 Some Remarks on Web Data Quality 1.3 Technologies for Disseminating, Extracting and Storing Web Data 1.3.1 Technologies for disseminating content on the Web 1.4 Structure of the Book Part One A Primer on Web and Data Technologies 2 HTML 2.1 Browser Presentation and Source Code 2.2 Syntax Rules 2.3 Tags and Attributes 2.4 Parsing Summary Further Reading Problems 3 XML and JSON 3.1 A Short Example XML Document 3.2 XML Syntax Rules 3.3 When Is an XML Document Well-formed or Valid? 3.4 XML Extensions and Technologies 3.5 XML and R in Practice 3.6 A Short Example JSON Document 3.7 JSON Syntax Rules 3.8 JSON and R in Practice Summary Further Reading Problems 4 XPath 4.1 XPath - a Querying Language for Web Documents 4.2 Identifying Node Sets with XPath 4.3 Extracting Node Elements Summary Further Reading Problems 5 HTTP 5.1 HTTP Fundamentals 5.2 Advanced Features of HTTP 5.3 Protocols beyond HTTP 5.4 HTTP in Action Summary Further Reading Problems 6 AJAX 6.1 JavaScript 6.2 XHR 6.3 Exploring AJAX with Web Developer Tools Summary Further Reading Problems 7 SQL and Relational Databases 7.1 Overview and Terminology 7.2 Relational Databases 7.3 SQL: a Language to Communicate with Databases 7.4 Databases in Action Summary Further Reading Problems 8 Regular Expressions and String Functions 8.1 Regular Expressions 8.2 String Processing 8.3 A Word on Character Encodings Summary Further Reading Problems Part Two A Practical Toolbox for Web Scraping and Text Mining 9 Scraping the Web 9.1 Retrieval Scenarios 9.2 Extraction Strategies 9.3 Web Scraping: Good Practice 9.4 Valuable Sources of Inspiration Summary Further Reading Problems 10 Statistical Text Processing 10.1 The running example: classifying press releases of the British government 10.2 Processing Textual Data 10.3 Supervised Learning Techniques 10.4 Unsupervised Learning Techniques Summary Further reading 11 Managing Data Projects 11.1 Interacting with the File System 11.2 Processing Multiple Documents/Links 11.3 Organizing Scraping Procedures 11.4 Executing R Scripts on a Regular Basis Part Three A Bag of Case Studies 12 Collaboration Networks in the U.S. Senate 12.1 Information on the Bills 12.2 Information on the Senators 12.3 Analyzing the network structure 12.4 Conclusion 13 Parsing Information from Semi-Structured Documents 13.1 Downloding Data from the FTP Server 13.2 Parsing Semi-Structured Text Data 13.3 Visualizing station and temperature data 14 Predicting the 2014 Academy Awards using Twitter 14.1 Twitter APIs: Overview 14.2 Twitter-based Forecast of the 2014 Academy Awards 14.3 Conclusion 15 Mapping the Geographic Distribution of Names 15.1 Developing a Data Collection Strategy 15.2 Web Site Inspection 15.3 Data Retrieval and Information Extraction 15.4 Mapping Names 15.5 Automating the Process 15.6 Summary 16 Gathering Data on Mobile Phones 16.1 Page Exploration 16.2 Scraping Procedure 16.3 Graphical Analysis 16.4 Data storage 17 Analyzing Sentiments of Product Reviews 17.1 Introduction 17.2 Collecting the data 17.3 Analyzing the Data 17.4 Conclusion References Bibliography Indices General Index Package Index Function Index .
Record Nr. UNINA-9910132342003321
Munzert Simon  
Chichester, England : , : Wiley, , 2015
Materiale a stampa
Lo trovi qui: Univ. Federico II
Opac: Controlla la disponibilità qui
Automated data collection with R : a practical guide to web scraping and text mining / / Simon Munzert [and three others]
Automated data collection with R : a practical guide to web scraping and text mining / / Simon Munzert [and three others]
Autore Munzert Simon
Pubbl/distr/stampa Chichester, England : , : Wiley, , 2015
Descrizione fisica 1 online resource (XXII, 453 p.)
Disciplina 006.3/12
Soggetto topico Data mining
Automatic data collection systems
Social sciences - Research - Data processing
R (Computer program language)
ISBN 1-118-83480-1
1-118-83473-9
1-118-83478-X
Classificazione COM021030
Formato Materiale a stampa
Livello bibliografico Monografia
Lingua di pubblicazione eng
Nota di contenuto Machine generated contents note: Dedication Table of Contents List of Figures List of Tables Preface 1 Introduction 1.1 Case Study: World Heritage Sites in Danger 1.2 Some Remarks on Web Data Quality 1.3 Technologies for Disseminating, Extracting and Storing Web Data 1.3.1 Technologies for disseminating content on the Web 1.4 Structure of the Book Part One A Primer on Web and Data Technologies 2 HTML 2.1 Browser Presentation and Source Code 2.2 Syntax Rules 2.3 Tags and Attributes 2.4 Parsing Summary Further Reading Problems 3 XML and JSON 3.1 A Short Example XML Document 3.2 XML Syntax Rules 3.3 When Is an XML Document Well-formed or Valid? 3.4 XML Extensions and Technologies 3.5 XML and R in Practice 3.6 A Short Example JSON Document 3.7 JSON Syntax Rules 3.8 JSON and R in Practice Summary Further Reading Problems 4 XPath 4.1 XPath - a Querying Language for Web Documents 4.2 Identifying Node Sets with XPath 4.3 Extracting Node Elements Summary Further Reading Problems 5 HTTP 5.1 HTTP Fundamentals 5.2 Advanced Features of HTTP 5.3 Protocols beyond HTTP 5.4 HTTP in Action Summary Further Reading Problems 6 AJAX 6.1 JavaScript 6.2 XHR 6.3 Exploring AJAX with Web Developer Tools Summary Further Reading Problems 7 SQL and Relational Databases 7.1 Overview and Terminology 7.2 Relational Databases 7.3 SQL: a Language to Communicate with Databases 7.4 Databases in Action Summary Further Reading Problems 8 Regular Expressions and String Functions 8.1 Regular Expressions 8.2 String Processing 8.3 A Word on Character Encodings Summary Further Reading Problems Part Two A Practical Toolbox for Web Scraping and Text Mining 9 Scraping the Web 9.1 Retrieval Scenarios 9.2 Extraction Strategies 9.3 Web Scraping: Good Practice 9.4 Valuable Sources of Inspiration Summary Further Reading Problems 10 Statistical Text Processing 10.1 The running example: classifying press releases of the British government 10.2 Processing Textual Data 10.3 Supervised Learning Techniques 10.4 Unsupervised Learning Techniques Summary Further reading 11 Managing Data Projects 11.1 Interacting with the File System 11.2 Processing Multiple Documents/Links 11.3 Organizing Scraping Procedures 11.4 Executing R Scripts on a Regular Basis Part Three A Bag of Case Studies 12 Collaboration Networks in the U.S. Senate 12.1 Information on the Bills 12.2 Information on the Senators 12.3 Analyzing the network structure 12.4 Conclusion 13 Parsing Information from Semi-Structured Documents 13.1 Downloding Data from the FTP Server 13.2 Parsing Semi-Structured Text Data 13.3 Visualizing station and temperature data 14 Predicting the 2014 Academy Awards using Twitter 14.1 Twitter APIs: Overview 14.2 Twitter-based Forecast of the 2014 Academy Awards 14.3 Conclusion 15 Mapping the Geographic Distribution of Names 15.1 Developing a Data Collection Strategy 15.2 Web Site Inspection 15.3 Data Retrieval and Information Extraction 15.4 Mapping Names 15.5 Automating the Process 15.6 Summary 16 Gathering Data on Mobile Phones 16.1 Page Exploration 16.2 Scraping Procedure 16.3 Graphical Analysis 16.4 Data storage 17 Analyzing Sentiments of Product Reviews 17.1 Introduction 17.2 Collecting the data 17.3 Analyzing the Data 17.4 Conclusion References Bibliography Indices General Index Package Index Function Index .
Record Nr. UNINA-9910823262603321
Munzert Simon  
Chichester, England : , : Wiley, , 2015
Materiale a stampa
Lo trovi qui: Univ. Federico II
Opac: Controlla la disponibilità qui
Forschungsinfrastrukturen für die qualitative Sozialforschung / / Denis Huschka, Hubert Knoblauch, Claudia Oellers, und Heike Solga (Hrsg.) [[electronic resource]]
Forschungsinfrastrukturen für die qualitative Sozialforschung / / Denis Huschka, Hubert Knoblauch, Claudia Oellers, und Heike Solga (Hrsg.) [[electronic resource]]
Edizione [1. Auflage.]
Pubbl/distr/stampa Berlin : , : Scivero, , 2013
Descrizione fisica 1 online resource (185 pages ) : illustrations ;
Soggetto topico Qualitative research
Social sciences - Research - Data processing
Social sciences - Information services
ISBN 3-944417-01-1
Formato Materiale a stampa
Livello bibliografico Monografia
Lingua di pubblicazione ger
Record Nr. UNINA-9910134015003321
Berlin : , : Scivero, , 2013
Materiale a stampa
Lo trovi qui: Univ. Federico II
Opac: Controlla la disponibilità qui
Forschungsinfrastrukturen für die qualitative Sozialforschung / / Denis Huschka, Hubert Knoblauch, Claudia Oellers, und Heike Solga (Hrsg.) [[electronic resource]]
Forschungsinfrastrukturen für die qualitative Sozialforschung / / Denis Huschka, Hubert Knoblauch, Claudia Oellers, und Heike Solga (Hrsg.) [[electronic resource]]
Edizione [1. Auflage.]
Pubbl/distr/stampa Berlin : , : Scivero, , 2013
Descrizione fisica 1 online resource (185 pages ) : illustrations ;
Soggetto topico Qualitative research
Social sciences - Research - Data processing
Social sciences - Information services
ISBN 3-944417-01-1
Formato Materiale a stampa
Livello bibliografico Monografia
Lingua di pubblicazione ger
Record Nr. UNISA-996320828303316
Berlin : , : Scivero, , 2013
Materiale a stampa
Lo trovi qui: Univ. di Salerno
Opac: Controlla la disponibilità qui
Multivariate generalized linear mixed models using R / / Damon M. Berridge, Robert Crouchley
Multivariate generalized linear mixed models using R / / Damon M. Berridge, Robert Crouchley
Autore Berridge Damon M.
Pubbl/distr/stampa Boca Raton, Fla. : , : CRC Press, , 2011
Descrizione fisica 1 online resource (284 p.)
Disciplina 003/.35133
Altri autori (Persone) CrouchleyRobert
Soggetto topico Social sciences - Research - Mathematical models
Social sciences - Research - Statistical methods
Social sciences - Research - Data processing
Multivariate analysis
Soggetto genere / forma Electronic books.
ISBN 0-429-19160-X
1-4987-4070-7
1-4398-1327-2
Formato Materiale a stampa
Livello bibliografico Monografia
Lingua di pubblicazione eng
Nota di contenuto Front Cover; Contents; List of Figures; List of Tables; List of Applications; List of Datasets; Preface; Acknowledgments; 1. Introduction; 2.Generalized linear models for continuous/interval scale data; 3. Generalized linear models for other types of data; 4. Family of generalized linear models; 5. Mixed models for continuous/interval scale data; 6. Mixed models for binary data; 7. Mixed models for ordinal data; 8. Mixed models for count data; 9. Family of two-level generalized linear models; 10. Three-level generalized linear models; 11. Models for multivariate data
12. Models for duration and event history data13. Stayers, non-susceptibles and endpoints; 14. Handling initial conditions/state dependence in binary data; 15. Incidental parameters: an empirical comparison of fixed effects and random effects models; A. SabreR installation, SabreR commands, quadrature, estimation, endogenous effects; B. Introduction to R for Sabre; References
Record Nr. UNINA-9910457693303321
Berridge Damon M.  
Boca Raton, Fla. : , : CRC Press, , 2011
Materiale a stampa
Lo trovi qui: Univ. Federico II
Opac: Controlla la disponibilità qui
Multivariate generalized linear mixed models using R / / Damon M. Berridge, Robert Crouchley
Multivariate generalized linear mixed models using R / / Damon M. Berridge, Robert Crouchley
Autore Berridge Damon M.
Pubbl/distr/stampa Boca Raton, Fla. : , : CRC Press, , 2011
Descrizione fisica 1 online resource (284 p.)
Disciplina 003/.35133
Altri autori (Persone) CrouchleyRobert
Soggetto topico R (Computer program language)
Social sciences - Research - Mathematical models
Social sciences - Research - Statistical methods
Social sciences - Research - Data processing
Multivariate analysis
ISBN 0-429-19160-X
1-4987-4070-7
1-4398-1327-2
Formato Materiale a stampa
Livello bibliografico Monografia
Lingua di pubblicazione eng
Nota di contenuto Front Cover; Contents; List of Figures; List of Tables; List of Applications; List of Datasets; Preface; Acknowledgments; 1. Introduction; 2.Generalized linear models for continuous/interval scale data; 3. Generalized linear models for other types of data; 4. Family of generalized linear models; 5. Mixed models for continuous/interval scale data; 6. Mixed models for binary data; 7. Mixed models for ordinal data; 8. Mixed models for count data; 9. Family of two-level generalized linear models; 10. Three-level generalized linear models; 11. Models for multivariate data
12. Models for duration and event history data13. Stayers, non-susceptibles and endpoints; 14. Handling initial conditions/state dependence in binary data; 15. Incidental parameters: an empirical comparison of fixed effects and random effects models; A. SabreR installation, SabreR commands, quadrature, estimation, endogenous effects; B. Introduction to R for Sabre; References
Record Nr. UNINA-9910781583203321
Berridge Damon M.  
Boca Raton, Fla. : , : CRC Press, , 2011
Materiale a stampa
Lo trovi qui: Univ. Federico II
Opac: Controlla la disponibilità qui
Multivariate generalized linear mixed models using R / / Damon M. Berridge, Robert Crouchley
Multivariate generalized linear mixed models using R / / Damon M. Berridge, Robert Crouchley
Autore Berridge Damon M.
Edizione [1st ed.]
Pubbl/distr/stampa Boca Raton, Fla., : CRC Press, c2011
Descrizione fisica 1 online resource (284 p.)
Disciplina 003/.35133
Altri autori (Persone) CrouchleyRobert
Soggetto topico R (Computer program language)
Social sciences - Research - Mathematical models
Social sciences - Research - Statistical methods
Social sciences - Research - Data processing
Multivariate analysis
ISBN 0-429-19160-X
1-4987-4070-7
1-4398-1327-2
Formato Materiale a stampa
Livello bibliografico Monografia
Lingua di pubblicazione eng
Nota di contenuto Front Cover; Contents; List of Figures; List of Tables; List of Applications; List of Datasets; Preface; Acknowledgments; 1. Introduction; 2.Generalized linear models for continuous/interval scale data; 3. Generalized linear models for other types of data; 4. Family of generalized linear models; 5. Mixed models for continuous/interval scale data; 6. Mixed models for binary data; 7. Mixed models for ordinal data; 8. Mixed models for count data; 9. Family of two-level generalized linear models; 10. Three-level generalized linear models; 11. Models for multivariate data
12. Models for duration and event history data13. Stayers, non-susceptibles and endpoints; 14. Handling initial conditions/state dependence in binary data; 15. Incidental parameters: an empirical comparison of fixed effects and random effects models; A. SabreR installation, SabreR commands, quadrature, estimation, endogenous effects; B. Introduction to R for Sabre; References
Record Nr. UNINA-9910812458003321
Berridge Damon M.  
Boca Raton, Fla., : CRC Press, c2011
Materiale a stampa
Lo trovi qui: Univ. Federico II
Opac: Controlla la disponibilità qui
Research strategies in the social sciences [[electronic resource] ] : a guide to new approaches / / edited by Elinor Scarbrough and Eric Tanenbaum
Research strategies in the social sciences [[electronic resource] ] : a guide to new approaches / / edited by Elinor Scarbrough and Eric Tanenbaum
Pubbl/distr/stampa Oxford [England] ; ; New York, : Oxford University Press, 1998
Descrizione fisica 1 online resource (333 p.)
Disciplina 300/.7/2
Altri autori (Persone) ScarbroughElinor
TanenbaumEric
Soggetto topico Social sciences - Methodology
Social sciences - Research - Data processing
Social sciences - Research - Methodology
Soggetto genere / forma Electronic books.
ISBN 0-19-160015-6
0-19-829237-6
9786611970598
1-281-97059-X
Formato Materiale a stampa
Livello bibliografico Monografia
Lingua di pubblicazione eng
Nota di contenuto ""Foreword""; ""Preface""; ""Contents""; ""List of Figures""; ""List of Tables""; ""List of Contributors""; ""1. Introduction""; ""2. Research Strategies in the New Environment""; ""I: ANALYSES ACROSS OBSERVATIONS""; ""3. Ordinary Least Squares and Logistic Regression Analysis""; ""4. Linear Structural Equation Models""; ""5. Categorical Data Analysis: Log-linear and Latent Class Models""; ""6. Modelling Context and Heterogeneity: Applying Multilevel Models""; ""II: ANALYSES ACROSS TIME""; ""7. It's about Time: Modelling Political and Social Dynamics""
""8. Cointegration and Modelling the Long Run""""9. Modelling Space and Time: The Event History Approach""; ""III: NEW PERSPECTIVES""; ""10. The Glass Bead Game: Testing Game-theoretic Models""; ""11. Political Analysis in a World without Foundations""; ""12. Discourse Theory and Political Analysis""; ""References""; ""Index""; ""NAME INDEX""; ""SUBJECT INDEX"";
Record Nr. UNINA-9910454113703321
Oxford [England] ; ; New York, : Oxford University Press, 1998
Materiale a stampa
Lo trovi qui: Univ. Federico II
Opac: Controlla la disponibilità qui
Research strategies in the social sciences [[electronic resource] ] : a guide to new approaches / / edited by Elinor Scarbrough and Eric Tanenbaum
Research strategies in the social sciences [[electronic resource] ] : a guide to new approaches / / edited by Elinor Scarbrough and Eric Tanenbaum
Pubbl/distr/stampa Oxford [England] ; ; New York, : Oxford University Press, 1998
Descrizione fisica xiv, 316 p. : ill
Disciplina 300/.7/2
Altri autori (Persone) ScarbroughElinor
TanenbaumEric
Soggetto topico Social sciences - Methodology
Social sciences - Research - Data processing
Social sciences - Research - Methodology
ISBN 0-19-152196-5
9786611970598
1-281-97059-X
0-19-160015-6
0-19-829237-6
Formato Materiale a stampa
Livello bibliografico Monografia
Lingua di pubblicazione eng
Record Nr. UNINA-9910795707803321
Oxford [England] ; ; New York, : Oxford University Press, 1998
Materiale a stampa
Lo trovi qui: Univ. Federico II
Opac: Controlla la disponibilità qui
Research strategies in the social sciences : a guide to new approaches / / edited by Elinor Scarbrough and Eric Tanenbaum
Research strategies in the social sciences : a guide to new approaches / / edited by Elinor Scarbrough and Eric Tanenbaum
Edizione [1st ed.]
Pubbl/distr/stampa Oxford [England] ; ; New York, : Oxford University Press, 1998
Descrizione fisica xiv, 316 p. : ill
Disciplina 300/.7/2
Altri autori (Persone) ScarbroughElinor
TanenbaumEric
Soggetto topico Social sciences - Methodology
Social sciences - Research - Data processing
Social sciences - Research - Methodology
ISBN 0-19-152196-5
9786611970598
1-281-97059-X
0-19-160015-6
0-19-829237-6
Formato Materiale a stampa
Livello bibliografico Monografia
Lingua di pubblicazione eng
Nota di contenuto Intro -- Foreword -- Preface -- Contents -- List of Figures -- List of Tables -- List of Contributors -- 1. Introduction -- 2. Research Strategies in the New Environment -- I: ANALYSES ACROSS OBSERVATIONS -- 3. Ordinary Least Squares and Logistic Regression Analysis -- 4. Linear Structural Equation Models -- 5. Categorical Data Analysis: Log-linear and Latent Class Models -- 6. Modelling Context and Heterogeneity: Applying Multilevel Models -- II: ANALYSES ACROSS TIME -- 7. It's about Time: Modelling Political and Social Dynamics -- 8. Cointegration and Modelling the Long Run -- 9. Modelling Space and Time: The Event History Approach -- III: NEW PERSPECTIVES -- 10. The Glass Bead Game: Testing Game-theoretic Models -- 11. Political Analysis in a World without Foundations -- 12. Discourse Theory and Political Analysis -- References -- Index -- NAME INDEX -- A -- B -- C -- D -- E -- F -- G -- H -- I -- J -- K -- L -- M -- N -- O -- P -- R -- S -- T -- V -- W -- Y -- Z -- SUBJECT INDEX -- A -- B -- C -- D -- E -- F -- G -- H -- I -- J -- K -- L -- M -- N -- O -- P -- Q -- R -- S -- T -- U -- V -- W.
Record Nr. UNINA-9910818874703321
Oxford [England] ; ; New York, : Oxford University Press, 1998
Materiale a stampa
Lo trovi qui: Univ. Federico II
Opac: Controlla la disponibilità qui