LEADER 07402nam 22007455 450
001 9910298964403321
005 20200630020923.0
010 $a3-319-25313-1
024 7 $a10.1007/978-3-319-25313-8
035 $a(CKB)3710000000580287
035 $a(EBL)4334061
035 $a(SSID)ssj0001606898
035 $a(PQKBManifestationID)16315225
035 $a(PQKBTitleCode)TC0001606898
035 $a(PQKBWorkID)14896817
035 $a(PQKB)10295383
035 $a(DE-He213)978-3-319-25313-8
035 $a(MiAaPQ)EBC4334061
035 $a(PPN)191699691
035 $a(EXLCZ)993710000000580287
100 $a20160112d2015 u| 0
101 0 $aeng
135 $aur|n|---|||||
181 $ctxt
182 $cc
183 $acr
200 10$aBig-Data Analytics and Cloud Computing $eTheory, Algorithms and Applications /$fedited by Marcello Trovati, Richard Hill, Ashiq Anjum, Shao Ying Zhu, Lu Liu
205 $a1st ed. 2015.
210 1$aCham :$cSpringer International Publishing :$cImprint: Springer,$d2015.
215 $a1 online resource (178 p.)
300 $aDescription based upon print version of record.
311 $a3-319-25311-5
320 $aIncludes bibliographical references at the end of each chapter and index.
327 $aForeword; Preface; Overview and Goals; Organisation and Features; Target Audiences; Suggested Uses; Acknowledgements; Contents; Contributors; Part I Theory; 1 Data Quality Monitoring of Cloud Databases Based on Data Quality SLAs; 1.1 Introduction and Summary; 1.2 Background; 1.2.1 Data Quality in the Context of Big Data; 1.2.2 Cloud Computing; 1.2.3 Data Quality Monitoring in the Cloud; 1.2.4 The Challenge of Specifying a DQSLA; 1.2.5 The Infrastructure Estimation Problem; 1.3 Proposed Solutions; 1.3.1 Data Quality SLA Formalization; 1.3.2 Examples of Data Quality SLAs
327 $a1.3.3 Data Quality-Aware Service Architecture; 1.4 Future Research Directions; 1.5 Conclusions; References; 2 Role and Importance of Semantic Search in Big Data Governance; 2.1 Introduction; 2.2 Big Data: Promises and Challenges; 2.3 Participatory Design for Big Data; 2.4 Self-Service Discovery; 2.5 Conclusion; References; 3 Multimedia Big Data: Content Analysis and Retrieval; 3.1 Introduction; 3.2 The MapReduce Framework and Multimedia Big Data; 
3.2.1 Indexing; 3.2.2 Caveats on Indexing; 3.2.3 Multiple Multimedia Processing; 3.2.4 Additional Work Required?
327 $a5 Integrating Twitter Traffic Information with Kalman Filter Models for Public Transportation Vehicle Arrival Time Prediction; 5.1 Introduction; 5.2 Communication Platform on Twitter; 5.3 Communication for Data Collection on Twitter; 5.4 Event Detection and Analysis: Tweets Relating to Road Incidents; 5.4.1 Twitter Data: Incident Data Set; 5.5 Methodology; 5.5.1 Time Series and Temporal Analysis of Textual Twitter; 5.6 Proposed Refined Kalman Filter (KF) Model-Based System; 5.7 Conclusion; References; 6 Data Science and Big Data Analytics at Career Builder
327 $a6.1 Carotene: A Job Title Classification System; 6.1.1 Occupation Taxonomies; 6.1.2 The Architecture of Carotene; 6.1.2.1 Taxonomy Discovery Using Clustering; 6.1.2.2 Coarse-Level Classification: SOC Major Classifier; 6.1.2.3 Fine-Level Classification: Proximity-Based Classifier; 6.1.3 Experimental Results and Discussion; 6.2 CARBi: A Data Science Ecosystem; 6.2.1 Accessing CB Data and Services Using WebScalding; 6.2.2 ScriptDB: Managing Hadoop Jobs; References; 7 Extraction of Bayesian Networks from Large Unstructured Datasets; 7.1 Introduction; 7.2 Text Mining; 7.2.1 Text Mining Techniques
327 $a7.2.2 General Architecture and Various Components of Text Mining
330 $aThis important and timely text/reference reviews the theoretical concepts, leading-edge techniques and practical tools involved in the latest multi-disciplinary approaches addressing the challenges of big data. Illuminating perspectives from both academia and industry are presented by an international selection of experts in big data science. 
Topics and features: describes the innovative advances in theoretical aspects of big data, predictive analytics and cloud-based architectures; examines the applications and implementations that utilize big data in cloud architectures; surveys the state of the art in architectural approaches to the provision of cloud-based big data analytics functions; identifies potential research directions and technologies to facilitate the realization of emerging business models through big data approaches; provides relevant theoretical frameworks, empirical research findings, and numerous case studies; discusses real-world applications of algorithms and techniques to address the challenges of big datasets. This authoritative volume will be of great interest to researchers, enterprise architects, business analysts, IT infrastructure managers and application developers, who will benefit from the valuable insights offered into the adoption of architectures for big data and cloud computing. The work is also suitable as a textbook for university instructors, with the outline for a possible course structure suggested in the preface. The editors are all members of the Computing and Mathematics Department at the University of Derby, UK, where Dr. Marcello Trovati serves as a Senior Lecturer in Mathematics, Dr. Richard Hill as a Professor and Head of the Computing and Mathematics Department, Dr. Ashiq Anjum as a Professor of Distributed Computing, Dr. Shao Ying Zhu as a Senior Lecturer in Computing, and Dr. Lu Liu as a Professor of Distributed Computing. The other publications of the editors include the Springer titles Guide to Security Assurance for Cloud Computing, Guide to Cloud Computing and Cloud Computing for Enterprise Architectures. 
606 $aMathematical statistics
606 $aComputer communication systems
606 $aComputer simulation
606 $aComputer science--Mathematics
606 $aProbability and Statistics in Computer Science$3https://scigraph.springernature.com/ontologies/product-market-codes/I17036
606 $aComputer Communication Networks$3https://scigraph.springernature.com/ontologies/product-market-codes/I13022
606 $aSimulation and Modeling$3https://scigraph.springernature.com/ontologies/product-market-codes/I19000
606 $aMath Applications in Computer Science$3https://scigraph.springernature.com/ontologies/product-market-codes/I17044
615 0$aMathematical statistics.
615 0$aComputer communication systems.
615 0$aComputer simulation.
615 0$aComputer science--Mathematics.
615 14$aProbability and Statistics in Computer Science.
615 24$aComputer Communication Networks.
615 24$aSimulation and Modeling.
615 24$aMath Applications in Computer Science.
676 $a004
702 $aTrovati$b Marcello$4edt$4http://id.loc.gov/vocabulary/relators/edt
702 $aHill$b Richard$4edt$4http://id.loc.gov/vocabulary/relators/edt
702 $aAnjum$b Ashiq$4edt$4http://id.loc.gov/vocabulary/relators/edt
702 $aZhu$b Shao Ying$4edt$4http://id.loc.gov/vocabulary/relators/edt
702 $aLiu$b Lu$4edt$4http://id.loc.gov/vocabulary/relators/edt
906 $aBOOK
912 $a9910298964403321
996 $aBig-Data Analytics and Cloud Computing$92518758
997 $aUNINA