LEADER 05323nam 2200649 450 001 9910132172203321 005 20200520144314.0 010 $a1-118-42200-7 010 $a1-118-42201-5 035 $a(CKB)3710000000167931 035 $a(EBL)1729064 035 $a(SSID)ssj0001262153 035 $a(PQKBManifestationID)11729144 035 $a(PQKBTitleCode)TC0001262153 035 $a(PQKBWorkID)11229813 035 $a(PQKB)10192819 035 $a(OCoLC)875056210 035 $a(MiAaPQ)EBC1729064 035 $a(Au-PeEL)EBL1729064 035 $a(CaPaEBR)ebr10891177 035 $a(PPN)192309064 035 $a(EXLCZ)993710000000167931 100 $a20140717h20142014 uy 0 101 0 $aeng 135 $aur|n|---||||| 181 $ctxt 182 $cc 183 $acr 200 10$aMaking sense of data I $ea practical guide to exploratory data analysis and data mining /$fGlenn J. Myatt, Wayne P. Johnson 205 $aSecond edition. 210 1$aHoboken, New Jersey :$cWiley,$d2014. 210 4$dİ2014 215 $a1 online resource (250 p.) 300 $aDescription based upon print version of record. 311 $a1-118-40741-5 320 $aIncludes bibliographical references and index. 327 $aMaking Sense of Data I; Contents; Preface; 1 Introduction; 1.1 Overview; 1.2 Sources of Data; 1.3 Process for Making Sense of Data; 1.3.1 Overview; 1.3.2 Problem Definition and Planning; 1.3.3 Data Preparation; 1.3.4 Analysis; 1.3.5 Deployment; 1.4 OVERVIEW OF BOOK; 1.4.1 Describing Data; 1.4.2 Preparing Data Tables; 1.4.3 Understanding Relationships; 1.4.4 Understanding Groups; 1.4.5 Building Models; 1.4.6 Exercises; 1.4.7 Tutorials; 1.5 Summary; Further Reading; Exercises; Exercises; Exercises; Exercises; 2 Describing Data; 2.1 Overview; 2.2 Observations and Variables 327 $a2.3 Types of Variables2.4 Central Tendency; 2.4.1 Overview; 2.4.2 Mode; 2.4.3 Median; 2.4.4 Mean; 2.5 Distribution of the Data; 2.5.1 Overview; 2.5.2 Bar Charts and Frequency Histograms; 2.5.3 Range; 2.5.4 Quartiles; 2.5.5 Box Plots; 2.5.6 Variance; 2.5.7 Standard Deviation; 2.5.8 Shape; 2.6 Confidence Intervals; 2.7 Hypothesis Tests; Further Reading; Further Reading; Further Reading; Further Reading; 3 Preparing Data Tables; 3.1 Overview; 3.2 Cleaning the Data; 3.3 Removing Observations and Variables; 3.4 Generating Consistent Scales Across Variables; 3.5 New Frequency Distribution 327 $a3.6 Converting Text to Numbers3.7 Converting Continuous Data to Categories; 3.8 Combining Variables; 3.9 Generating Groups; 3.10 Preparing Unstructured Data; 4 Understanding Relationships; 4.1 Overview; 4.2 Visualizing Relationships Between Variables; 4.2.1 Scatterplots; 4.2.2 Summary Tables and Charts; 4.2.3 Cross-Classification Tables; 4.3 Calculating Metrics About Relationships; 4.3.1 Overview; 4.3.2 Correlation Coefficients; 4.3.3 Kendall Tau; 4.3.4 t-Tests Comparing Two Groups; 4.3.5 ANOVA; 4.3.6 Chi-Square; 5 Identifying and Understanding Groups; 5.1 Overview; 5.2 Clustering 327 $a5.2.1 Overview5.2.2 Distances; 5.2.3 Agglomerative Hierarchical Clustering; 5.2.4 k-Means Clustering; 5.3 Association Rules; 5.3.1 Overview; 5.3.2 Grouping by Combinations of Values; 5.3.3 Extracting and Assessing Rules; 5.3.4 Example; 5.4 Learning Decision Trees from Data; 5.4.1 Overview; 5.4.2 Splitting; 5.4.3 Splitting Criteria; 5.4.4 Example; Exercises; Further Reading; 6 Building Models from Data; 6.1 Overview; 6.2 Linear Regression; 6.2.1 Overview; 6.2.2 Fitting a Simple Linear Regression Model; 6.2.3 Fitting a Multiple Linear Regression Model; 6.2.4 Assessing the Model Fit 327 $a6.2.5 Testing Assumptions6.2.6 Selecting and Assessing Independent Variables; 6.3 Logistic Regression; 6.3.1 Overview; 6.3.2 Fitting a Simple Logistic Regression Model; 6.3.3 Fitting and Interpreting Multiple Logistic Regression Models; 6.3.4 Significance of Model and Coefficients; 6.3.5 Classification; 6.4 k-Nearest Neighbors; 6.4.1 Overview; 6.4.2 Training; 6.4.3 Predicting; 6.5 Classification and Regression Trees; 6.5.1 Overview; 6.5.2 Predicting; 6.5.3 Example; 6.6 Other Approaches; 6.6.1 Neural Networks; 6.6.2 Support Vector Machines; 6.6.3 Discriminant Analysis; 6.6.4 Nai?ve Bayes 327 $a6.6.5 Random Forests 330 $aWith a focus on the needs of educators and students, Making Sense of Data presents the steps and issues that need to be considered in order to successfully complete a data analysis or data mining project. This Second Edition focuses on basic data analysis approaches that are necessary to complete a diverse range of projects. New examples have been added to illustrate the different approaches, and there is considerably more emphasis on hands-on software tutorials to provide real-world exercises. Via the related Web site, the book is accompanied by Traceis software, data sets, a 606 $aData mining 606 $aMathematical statistics 615 0$aData mining. 615 0$aMathematical statistics. 676 $a006.3/12 700 $aMyatt$b Glenn J.$f1969-$0695403 702 $aJohnson$b Wayne P. 801 0$bMiAaPQ 801 1$bMiAaPQ 801 2$bMiAaPQ 906 $aBOOK 912 $a9910132172203321 996 $aMaking sense of data I$92259734 997 $aUNINA