Vai al contenuto principale della pagina

Entity information life cycle for big data : master data management and information integration / / John R. Talburt, Yinle Zhou



(Visualizza in formato marc)    (Visualizza in BIBFRAME)

Autore: Talburt John R. Visualizza persona
Titolo: Entity information life cycle for big data : master data management and information integration / / John R. Talburt, Yinle Zhou Visualizza cluster
Pubblicazione: Amsterdam, [Netherlands] : , : Morgan Kaufmann, , 2015
©2015
Edizione: 1st edition
Descrizione fisica: 1 online resource (255 p.)
Disciplina: 005.7
Soggetto topico: Big data
Pattern recognition systems
Semantic Web
Data mining
Persona (resp. second.): ZhouYinle
Note generali: Description based upon print version of record.
Nota di bibliografia: Includes bibliographical references and index.
Nota di contenuto: Front Cover; Entity Information Life Cycle for Big Data; Copyright; Contents; Foreword; Preface; THE CHANGING LANDSCAPE OF INFORMATION QUALITY; MOTIVATION FOR THIS BOOK; AUDIENCE; ORGANIZATION OF THE MATERIAL; Acknowledgements; Chapter 1 - The Value Proposition for MDM and Big Data; DEFINITION AND COMPONENTS OF MDM; THE BUSINESS CASE FOR MDM; DIMENSIONS OF MDM; THE CHALLENGE OF BIG DATA; MDM AND BIG DATA - THE N-SQUARED PROBLEM; CONCLUDING REMARKS; Chapter 2 - Entity Identity Information and the CSRUD Life Cycle Model; ENTITIES AND ENTITY REFERENCES; MANAGING ENTITY IDENTITY INFORMATION
ENTITY IDENTITY INFORMATION LIFE CYCLE MANAGEMENT MODELSCONCLUDING REMARKS; Chapter 3 - A Deep Dive into the Capture Phase; AN OVERVIEW OF THE CAPTURE PHASE; BUILDING THE FOUNDATION; UNDERSTANDING THE DATA; DATA PREPARATION; SELECTING IDENTITY ATTRIBUTES; ASSESSING ER RESULTS; DATA MATCHING STRATEGIES; CONCLUDING REMARKS; Chapter 4 - Store and Share - Entity Identity Structures; ENTITY IDENTITY INFORMATION MANAGEMENT STRATEGIES; DEDICATED MDM SYSTEMS; THE IDENTITY KNOWLEDGE BASE; MDM ARCHITECTURES; CONCLUDING REMARKS; Chapter 5 - Update and Dispose Phases - Ongoing Data Stewardship
DATA STEWARDSHIPTHE AUTOMATED UPDATE PROCESS; THE MANUAL UPDATE PROCESS; ASSERTED RESOLUTION; EIS VISUALIZATION TOOLS; MANAGING ENTITY IDENTIFIERS; CONCLUDING REMARKS; Chapter 6 - Resolve and Retrieve Phase - Identity Resolution; IDENTITY RESOLUTION; IDENTITY RESOLUTION ACCESS MODES; CONFIDENCE SCORES; CONCLUDING REMARKS; Chapter 7 - Theoretical Foundations; THE FELLEGI-SUNTER THEORY OF RECORD LINKAGE; THE STANFORD ENTITY RESOLUTION FRAMEWORK; ENTITY IDENTITY INFORMATION MANAGEMENT; CONCLUDING REMARKS; Chapter 8 - The Nuts and Bolts of Entity Resolution; THE ER CHECKLIST
CLUSTER-TO-CLUSTER CLASSIFICATIONSELECTING AN APPROPRIATE ALGORITHM; CONCLUDING REMARKS; Chapter 9 - Blocking; BLOCKING; BLOCKING BY MATCH KEY; DYNAMIC BLOCKING VERSUS PRERESOLUTION BLOCKING; BLOCKING PRECISION AND RECALL; MATCH KEY BLOCKING FOR BOOLEAN RULES; MATCH KEY BLOCKING FOR SCORING RULES; CONCLUDING REMARKS; Chapter 10 - CSRUD for Big Data; LARGE-SCALE ER FOR MDM; THE TRANSITIVE CLOSURE PROBLEM; DISTRIBUTED, MULTIPLE-INDEX, RECORD-BASED RESOLUTION; AN ITERATIVE, NONRECURSIVE ALGORITHM FOR TRANSITIVE CLOSURE; ITERATION PHASE: SUCCESSIVE CLOSURE BY REFERENCE IDENTIFIER
DEDUPLICATION PHASE: FINAL OUTPUT OF COMPONENTSER USING THE NULL RULE; THE CAPTURE PHASE AND IKB; THE IDENTITY UPDATE PROBLEM; PERSISTENT ENTITY IDENTIFIERS; THE LARGE COMPONENT AND BIG ENTITY PROBLEMS; IDENTITY CAPTURE AND UPDATE FOR ATTRIBUTE-BASED RESOLUTION; CONCLUDING REMARKS; Chapter 11 - ISO Data Quality Standards for Master Data; BACKGROUND; GOALS AND SCOPE OF THE ISO 8000-110 STANDARD; FOUR MAJOR COMPONENTS OF THE ISO 8000-110 STANDARD; SIMPLE AND STRONG COMPLIANCE WITH ISO 8000-110; ISO 22745 INDUSTRIAL SYSTEMS AND INTEGRATION; BEYOND ISO 8000-110; CONCLUDING REMARKS
Appendix A - Some Commonly Used ER Comparators
Sommario/riassunto: Entity Information Life Cycle for Big Data walks you through the ins and outs of managing entity information so you can successfully achieve master data management (MDM) in the era of big data. This book explains big data's impact on MDM and the critical role of entity information management system (EIMS) in successful MDM. Expert authors Dr. John R. Talburt and Dr. Yinle Zhou provide a thorough background in the principles of managing the entity information life cycle and provide practical tips and techniques for implementing an EIMS, strategies for exploiting distributed processing to hand
Titolo autorizzato: Entity information life cycle for big data  Visualizza cluster
Formato: Materiale a stampa
Livello bibliografico Monografia
Lingua di pubblicazione: Inglese
Record Nr.: 9910817605603321
Lo trovi qui: Univ. Federico II
Opac: Controlla la disponibilità qui