LEADER 05531nam 2200721 450 001 9910787882403321 005 20200520144314.0 010 $a1-118-72955-2 010 $a1-118-74209-5 035 $a(CKB)2670000000530787 035 $a(EBL)1637796 035 $a(SSID)ssj0001163870 035 $a(PQKBManifestationID)11664544 035 $a(PQKBTitleCode)TC0001163870 035 $a(PQKBWorkID)11163870 035 $a(PQKB)10188774 035 $a(Au-PeEL)EBL1637796 035 $a(CaPaEBR)ebr10842300 035 $a(CaONFJC)MIL578548 035 $a(OCoLC)876043666 035 $a(CaSebORM)9781118729083 035 $a(MiAaPQ)EBC1637796 035 $a(PPN)179371762 035 $a(EXLCZ)992670000000530787 100 $a20140313h20142014 uy 0 101 0 $aeng 135 $aur|n|---||||| 181 $ctxt 182 $cc 183 $acr 200 00$aMicrosoft big data solutions /$fAdam Jorgensen [and five others] ; executive editor, Robert Elliot ; project editor, Jennifer Lynn ; cover designer, Ryan Sneed 205 $a1st edition 210 1$aIndianapolis, Indiana :$cWiley,$d2014. 210 4$d©2014 215 $a1 online resource (410 p.) 300 $aIncludes index. 311 $a1-118-72908-0 327 $aCover; Title Page; Copyright; Contents; Introduction; Part I What Is Big Data?; Chapter 1 Industry Needs and Solutions; What's So Big About Big Data?; A Brief History of Hadoop; Google; Nutch; What Is Hadoop?; Derivative Works and Distributions; Hadoop Distributions; Core Hadoop Ecosystem; Important Apache Projects for Hadoop; The Future for Hadoop; Summary; Chapter 2 Microsoft's Approach to Big Data; A Story of "Better Together"; Competition in the Ecosystem; SQL on Hadoop Today; Hortonworks and Stinger; Cloudera and Impala; Microsoft's Contribution to SQL in Hadoop; Deploying Hadoop 327 $aDeployment Factors; Deployment Topologies; Deployment Scorecard; Summary; Part II Setting Up for Big Data with Microsoft; Chapter 3 Configuring Your First Big Data Environment; Getting Started; Getting the Install; Running the Installation; On-Premise Installation: Single-Node Installation; HDInsight Service: Installing in the Cloud; Windows Azure Storage Explorer Options; Validating Your New Cluster; Logging into HDInsight Service; Verify HDP Functionality in the Logs; 
Common Post-Setup Tasks; Loading Your First Files; Verifying Hive and Pig; Summary; Part III Storing and Managing Big Data 327 $aChapter 4 HDFS, Hive, HBase, and HCatalog; Exploring the Hadoop Distributed File System; Explaining the HDFS Architecture; Interacting with HDFS; Exploring Hive: The Hadoop Data Warehouse Platform; Designing, Building, and Loading Tables; Querying Data; Configuring the Hive ODBC Driver; Exploring HCatalog: HDFS Table and Metadata Management; Exploring HBase: An HDFS Column-Oriented Database; Columnar Databases; Defining and Populating an HBase Table; Using Query Operations; Summary; Chapter 5 Storing and Managing Data in HDFS; Understanding the Fundamentals of HDFS; HDFS Architecture 327 $aName Nodes and Data Nodes; Data Replication; Using Common Commands to Interact with HDFS; Interfaces for Working with HDFS; File Manipulation Commands; Administrative Functions in HDFS; Moving and Organizing Data in HDFS; Moving Data in HDFS; Implementing Data Structures for Easier Management; Rebalancing Data; Summary; Chapter 6 Adding Structure with Hive; Understanding Hive's Purpose and Role; Providing Structure for Unstructured Data; Enabling Data Access and Transformation; Differentiating Hive from Traditional RDBMS Systems; Working with Hive; Creating and Querying Basic Tables 327 $aCreating Databases; Creating Tables; Adding and Deleting Data; Querying a Table; Using Advanced Data Structures with Hive; Setting Up Partitioned Tables; Loading Partitioned Tables; Using Views; Creating Indexes for Tables; Summary; Chapter 7 Expanding Your Capability with HBase and HCatalog; Using HBase; Creating HBase Tables; Loading Data into an HBase Table; Performing a Fast Lookup; Loading and Querying HBase; Managing Data with HCatalog; Working with HCatalog and Hive; Defining Data Structures; Creating Indexes; Creating Partitions; Integrating HCatalog with Pig and Hive 327 $aUsing HBase or Hive as a Data Warehouse 330 $aTap the power of Big Data with Microsoft 
technologies. Big Data is here, and Microsoft's new Big Data platform is a valuable tool to help your company get the very most out of it. This timely book shows you how to use HDInsight along with Hortonworks Data Platform for Windows to store, manage, analyze, and share Big Data throughout the enterprise. Focusing primarily on Microsoft and Hortonworks technologies but also covering open source tools, Microsoft Big Data Solutions explains best practices, covers on-premises and cloud-based solutions, and features valuable case studies. 606 $aCloud computing 606 $aComputers 606 $aWeb services 615 0$aCloud computing. 615 0$aComputers. 615 0$aWeb services. 676 $a005.74 700 $aJorgensen$b Adam$0865174 701 $aJorgensen$b Adam$0865174 701 $aElliot$b Robert$0264929 701 $aLynn$b Jennifer$01573298 701 $aSneed$b Ryan$01573299 801 0$bMiAaPQ 801 1$bMiAaPQ 801 2$bMiAaPQ 906 $aBOOK 912 $a9910787882403321 996 $aMicrosoft big data solutions$93848980 997 $aUNINA