LEADER 04979nam 2200673Ia 450 001 9911006788503321 005 20240313145636.0 010 $a1-68015-416-8 010 $a1-299-44085-1 010 $a1-78216-265-8 035 $a(CKB)2550000001018294 035 $a(EBL)1126744 035 $a(SSID)ssj0000906520 035 $a(PQKBManifestationID)11536132 035 $a(PQKBTitleCode)TC0000906520 035 $a(PQKBWorkID)10855556 035 $a(PQKB)10328307 035 $a(MiAaPQ)EBC1126744 035 $a(CaSebORM)9781782162643 035 $a(Au-PeEL)EBL1126744 035 $a(CaPaEBR)ebr10682464 035 $a(CaONFJC)MIL475335 035 $a(OCoLC)840899851 035 $a(PPN)228038472 035 $a(OCoLC)852522514 035 $a(OCoLC)ocn852522514 035 $a(EXLCZ)992550000001018294 100 $a20130418d2013 uy 0 101 0 $aeng 135 $aur|n|---||||| 181 $ctxt 182 $cc 183 $acr 200 10$aClojure data analysis cookbook /$fEric Rochester 205 $a1st edition 210 $aBirmingham, UK $cPackt Pub.$dc2013 215 $a1 online resource (342 p.) 300 $aIncludes index. 311 $a1-78216-264-X 327 $aCover; Copyright; Credits; About the Author; About the Reviewers; www.PacktPub.com; Table of Contents; Preface; Chapter 1: Importing Data for Analysis; Introduction; Creating a new project; Reading CSV data into Incanter datasets; Reading JSON data into Incanter datasets; Reading data from Excel with Incanter; Reading data from JDBC databases; Reading XML data into Incanter datasets; Scraping data from tables in web pages; Scraping textual data from web pages; Reading RDF data; Reading RDF data with SPARQL; Aggregating data from different formats; Chapter 2: Cleaning and Validating Data 327 $aIntroductionCleaning data with regular expressions; Maintaining consistency with synonym maps; Identifying and removing duplicate data; Normalizing numbers; Rescaling values; Normalizing dates and times; Lazily processing very large data sets; Sampling from very large data sets; Fixing spelling errors; Parsing custom data formats; Validating data with Valip; Chapter 3: Managing Complexity with Concurrent Programming; Introduction; Managing program complexity with STM; Managing program complexity with agents; Getting better performance with commute; Combining agents and STM 327 $aMaintaining consistency with ensureIntroducing safe side effects into the STM; Maintaining data consistency with validators; Tracking processing with watchers; Debugging concurrent programs with watchers; Recovering from errors in agents; Managing input with sized queues; Chapter 4: Improving Performance with Parallel Programming; Introduction; Parallelizing processing with pmap; Parallelizing processing with Incanter; Partitioning Monte Carlo simulations for better pmap performance; Finding the optimal partition size with simulated annealing; Parallelizing with reducers 327 $aGenerating online summary statistics with reducersHarnessing your GPU with OpenCL and Calx; Using type hints; Benchmarking with Criterium; Chapter 5: Distributed Data Processing with Cascalog; Introduction; Distributed processing with Cascalog and Hadoop; Querying data with Cascalog; Distributing data with Apache HDFS; Parsing CSV files with Cascalog; Complex queries with Cascalog; Aggregating data with Cascalog; Defining new Cascalog operators; Composing Cascalog queries; Handling errors in Cascalog workflows; Transforming data with Cascalog 327 $aExecuting Cascalog queries in the Cloud with PalletChapter 6: Working with Incanter Datasets; Introduction; Loading Incanter's sample datasets; Loading Clojure data structures into datasets; Viewing datasets interactively with view; Converting datasets to matrices; Using infix formulas in Incanter; Selecting columns with ; Selecting rows with ; Filtering datasets with where; Grouping data with group-by; Saving datasets to CSV and JSON; Projecting from multiple datasets with join; Chapter 7: Preparing for and Performing Statistical Data Analysis with Incanter; Introduction 327 $aGenerating summary statistics with rollup 330 $aFull of practical tips, the ""Clojure Data Analysis Cookbook"" will help you fully utilize your data through a series of step-by-step, real world recipes covering every aspect of data analysis.Prior experience with Clojure and data analysis techniques and workflows will be beneficial, but not essential. 606 $aDatabase searching 606 $aClojure (Computer program language) 615 0$aDatabase searching. 615 0$aClojure (Computer program language) 676 $a005.133 700 $aRochester$b Eric$01685079 801 0$bMiAaPQ 801 1$bMiAaPQ 801 2$bMiAaPQ 906 $aBOOK 912 $a9911006788503321 996 $aClojure data analysis cookbook$94393167 997 $aUNINA