LEADER 05644nam 2200709Ia 450 001 9910139454503321 005 20231102140033.0 010 $a1-283-10931-X 010 $a9786613109316 010 $a1-118-01825-7 010 $a1-118-01824-9 035 $a(CKB)2550000000032660 035 $a(EBL)700442 035 $a(SSID)ssj0000535680 035 $a(PQKBManifestationID)11335075 035 $a(PQKBTitleCode)TC0000535680 035 $a(PQKBWorkID)10546933 035 $a(PQKB)10754645 035 $a(MiAaPQ)EBC700442 035 $a(OCoLC)746324256 035 $a(CaSebORM)9781118018262 035 $a(PPN)170245810 035 $a(EXLCZ)992550000000032660 100 $a20101012d2011 uy 0 101 0 $aeng 135 $aur|n|---||||| 181 $ctxt$2rdacontent 182 $cc$2rdamedia 183 $acr$2rdacarrier 200 10$aData analysis$b[electronic resource] $ewhat can be learned from the past 50 years /$fPeter J. Huber 205 $aFirst edition 210 1$aHoboken, New Jersey :$cWiley,$dc2011. 215 $a1 online resource (235 pages) 225 1 $aWiley series in probability and statistics. 300 $aDescription based upon print version of record. 311 $a1-118-01826-5 311 $a1-118-01064-7 320 $aIncludes bibliographical references and index. 327 $aDATA ANALYSIS: What Can Be Learned From the Past 50 Years; CONTENTS; Preface; 1 What is Data Analysis?; 1.1 Tukey's 1962 paper; 1.2 The Path of Statistics; 2 Strategy Issues in Data Analysis; 2.1 Strategy in Data Analysis; 2.2 Philosophical issues; 2.2.1 On the theory of data analysis and its teaching; 2.2.2 Science and data analysis; 2.2.3 Economy of forces; 2.3 Issues of size; 2.4 Strategic planning; 2.4.1 Planning the data collection; 2.4.2 Choice of data and methods; 2.4.3 Systematic and random errors; 2.4.4 Strategic reserves; 2.4.5 Human factors; 2.5 The stages of data analysis 327 $a2.5.1 Inspection2.5.2 Error checking; 2.5.3 Modification; 2.5.4 Comparison; 2.5.5 Modeling and Model fitting; 2.5.6 Simulation; 2.5.7 What-if analyses; 2.5.8 Interpretation; 2.5.9 Presentation of conclusions; 2.6 Tools required for strategy reasons; 2.6.1 Ad hoc programming; 2.6.2 Graphics; 2.6.3 Record keeping; 2.6.4 Creating and keeping order; 3 Massive Data Sets; 3.1 Introduction; 3.2 Disclosure: Personal experiences; 3.3 What is massive? A classification of size; 3.4 Obstacles to scaling; 3.4.1 Human limitations: visualization; 3.4.2 Human - machine interactions 327 $a3.4.3 Storage requirements3.4.4 Computational complexity; 3.4.5 Conclusions; 3.5 On the structure of large data sets; 3.5.1 Types of data; 3.5.2 How do data sets grow?; 3.5.3 On data organization; 3.5.4 Derived data sets; 3.6 Data base management and related issues; 3.6.1 Data archiving; 3.7 The stages of a data analysis; 3.7.1 Planning the data collection; 3.7.2 Actual collection; 3.7.3 Data access; 3.7.4 Initial data checking; 3.7.5 Data analysis proper; 3.7.6 The final product: presentation of arguments and conclusions; 3.8 Examples and some thoughts on strategy; 3.9 Volume reduction 327 $a3.10 Supercomputers and software challenges3.10.1 When do we need a Concorde?; 3.10.2 General Purpose Data Analysis and Supercomputers; 3.10.3 Languages, Programming Environments and Databased Prototyping; 3.11 Summary of conclusions; 4 Languages for Data Analysis; 4.1 Goals and purposes; 4.2 Natural languages and computing languages; 4.2.1 Natural languages; 4.2.2 Batch languages; 4.2.3 Immediate languages; 4.2.4 Language and literature; 4.2.5 Object orientation and related structural issues; 4.2.6 Extremism and compromises, slogans and reality; 4.2.7 Some conclusions; 4.3 Interface issues 327 $a4.3.1 The command line interface4.3.2 The menu interface; 4.3.3 The batch interface and programming environments; 4.3.4 Some personal experiences; 4.4 Miscellaneous issues; 4.4.1 On building blocks; 4.4.2 On the scope of names; 4.4.3 On notation; 4.4.4 Book-keeping problems; 4.5 Requirements for a general purpose immediate language; 5 Approximate Models; 5.1 Models; 5.2 Bayesian modeling; 5.3 Mathematical statistics and approximate models; 5.4 Statistical significance and physical relevance; 5.5 Judicious use of a wrong model; 5.6 Composite models; 5.7 Modeling the length of day 327 $a5.8 The role of simulation 330 $aThis book explores the many provocative questions concerning the fundamentals of data analysis. It is based on the time-tested experience of one of the gurus of the subject matter. Why should one study data analysis? How should it be taught? What techniques work best, and for whom? How valid are the results? How much data should be tested? Which machine languages should be used, if used at all? Emphasis on apprenticeship (through hands-on case studies) and anecdotes (through real-life applications) are the tools that Peter J. Huber uses in this volume. Concern with specific statistical techniq 410 0$aWiley series in probability and statistics. 606 $aMathematical statistics$xHistory 606 $aMathematical statistics$xPhilosophy 606 $aNumerical analysis$xMethodology 615 0$aMathematical statistics$xHistory. 615 0$aMathematical statistics$xPhilosophy. 615 0$aNumerical analysis$xMethodology. 676 $a519.5 676 $a519.509 700 $aHuber$b Peter J$012086 801 0$bMiAaPQ 801 1$bMiAaPQ 801 2$bMiAaPQ 906 $aBOOK 912 $a9910139454503321 996 $aData analysis$92107790 997 $aUNINA