Vai al contenuto principale della pagina

Data mining methods for the content analyst [[electronic resource] ] : an introduction to the computational analysis of content / / Kalev Hannes Leetaru



(Visualizza in formato marc)    (Visualizza in BIBFRAME)

Autore: Leetaru Kalev Visualizza persona
Titolo: Data mining methods for the content analyst [[electronic resource] ] : an introduction to the computational analysis of content / / Kalev Hannes Leetaru Visualizza cluster
Pubblicazione: New York, : Routledge, 2012
New York : , : Routledge, , 2012
Descrizione fisica: 1 online resource (121 p.)
Disciplina: 006.3/12
006.312
Soggetto topico: Data mining
Data mining - Statistical methods
Note generali: Description based upon print version of record.
Nota di bibliografia: Includes bibliographical references (p. [97]-99) and index.
Nota di contenuto: DATA MINING METHODS FOR THE CONTENT ANALYST An Introduction to the Computational Analysis of Content; Copyright; Contents; List of Tables and Figures; Acknowledgments; 1 Introduction; What Is Content Analysis?; Why Use Computerized Analysis Techniques?; Standalone Tools or Integrated Suites; Transitioning from Theory to Practice; Chapter in Summary; 2 Obtaining and Preparing Data; Collecting Data from Digital Text Repositories; Are the Data Meaningful?; Using Data in Unintended Ways; Analytical Resolution; Types of Data Sources; Finding Sources; Searching Text Collections
Sources of IncompletenessLicensing Restrictions and Content Blackouts; Measuring Viewership; Accuracy and Convenience Samples; Random Samples; Multimedia Content; Converting to Textual Format; Prosody; Example Data Sources; Patterns in Historical War Coverage; Competitive Intelligence; Global News Coverage; Downloading Content; Digital Content; Print Content; Preparing Content; Document Extraction; Cleaning; Post Filtering; Reforming/Reshaping; Content Proxy Extraction; Chapter in Summary; 3 Vocabulary Analysis; The Basics; Word Histograms; Readability Indexes; Normative Comparison
Non-word AnalysisColloquialisms: Abbreviations and Slang; Restricting the Analytical Window; Vocabulary Comparison and Evolution/Chronemics; Advanced Topics; Syllables, Rhyming, and "Sounds Like"; Gender and Language; Authorship Attribution; Word Morphology, Stemming, and Lemmatization; Chapter in Summary; 4 Correlation and Co-occurrence; Understanding Correlation; Computing Word Correlations; Directionality; Concordance; Co-occurrence and Search; Language Variation and Lexicons; Non-co-occurrence; Correlation with Metadata; Chapter in Summary; 5 Lexicons, Entity Extraction, and Geocoding
LexiconsLexicons and Categorization; Lexical Correlation; Lexicon Consistency Checks; Thesauri and Vocabulary Expanders; Named Entity Extraction; Lexicons and Processing; Applications; Geocoding, Gazetteers, and Spatial Analysis; Geocoding; Gazetteers and the Geocoding Process; Operating Under Uncertainty; Spatial Analysis; Chapter in Summary; 6 Topic Extraction; How Machines Process Text; Unstructured Text; Extracting Meaning from Text; Applications of Topic Extraction; Comparing/Clustering Documents; Automatic Summarization; Automatic Keyword Generation
Multilingual Analysis: Topic Extraction with Multiple LanguagesChapter in Summary; 7 Sentiment Analysis; Examining Emotions; Evolution; Evaluation; Analytical Resolution: Documents versus Objects; Hand-crafted versus Automatically Generated Lexicons; Other Sentiment Scales; Limitations; Measuring Language Rather Than Worldview; Chapter in Summary; 8 Similarity, Categorization and Clustering; Categorization; The Vector Space Model; Feature Selection; Feature Reduction; Learning Algorithm; Evaluating ATC Results; Benefi ts of ATC over Human Categorization; Limitations of ATC
Applications of ATC
Sommario/riassunto: With continuous advancements and an increase in user popularity, data mining technologies serve as an invaluable resource for researchers across a wide range of disciplines in the humanities and social sciences. In this comprehensive guide, author and research scientist Kalev Leetaru introduces the approaches, strategies, and methodologies of current data mining techniques, offering insights for new and experienced users alike.Designed as an instructive reference to computer-based analysis approaches, each chapter of this resource explains a set of core concepts and analytical data m
Titolo autorizzato: Data mining methods for the content analyst  Visualizza cluster
ISBN: 0-203-14938-6
1-283-84361-7
1-136-51459-7
Formato: Materiale a stampa
Livello bibliografico Monografia
Lingua di pubblicazione: Inglese
Record Nr.: 9910786303103321
Lo trovi qui: Univ. Federico II
Opac: Controlla la disponibilità qui
Serie: Routledge Communication Series