1.

Record Nr.

UNINA9910427044503321

Autore

Mucchetti Mark

Titolo

BigQuery for data warehousing : managed data analysis in the Google cloud / / Mark Mucchetti

Pubbl/distr/stampa

[Place of publication not identified] : , : Apress, , [2020]

©2020

ISBN

1-4842-6186-0

Edizione

[1st ed. 2020.]

Descrizione fisica

1 online resource (539 pages)

Disciplina

658.40380285574

Soggetti

Data warehousing

Lingua di pubblicazione

Inglese

Formato

Materiale a stampa

Livello bibliografico

Monografia

Nota di contenuto

Part I. Building a Warehouse -- 1. Settling into BigQuery -- 2. Starting Your Warehouse Project -- 3. All My Data -- 4. Managing BigQuery Costs -- Part II. Filling the Warehouse -- 5. Loading Data Into the Warehouse -- 6. Streaming Data Into the Warehouse -- 7. Dataflow -- Part III. Using the Warehouse -- 8. Care and Feeding of Your Warehouse -- 9. Querying the Warehouse -- 10. Scheduling Jobs -- 11. Serverless Functions with GCP -- 12. Cloud Logging -- Part IV. Maintaining the Warehouse -- 13. Advanced BigQuery -- 14. Data Governance -- 15. Adapting to Long-Term Change -- Part V. Reporting On and Visualizing Your Data -- 16. Reporting -- 17. Dashboards and Visualization -- 18. Google Data Studio -- Part VI. Enhancing Your Data's Potential -- 19. BigQuery ML -- 20. Jupyter Notebooks and Public Datasets -- 21. Conclusion -- 22. Appendix A: Cloud Shell and Cloud SDK -- 23. Appendix B: Sample Project Charter.

Sommario/riassunto

Create a data warehouse, complete with reporting and dashboards using Google’s BigQuery technology. This book takes you from the basic concepts of data warehousing through the design, build, load, and maintenance phases. You will build capabilities to capture data from the operational environment, and then mine and analyze that data for insight into making your business more successful. You will gain practical knowledge about how to use BigQuery to solve data challenges in your organization. BigQuery is a managed cloud platform from Google that provides enterprise data warehousing and reporting



capabilities. Part I of this book shows you how to design and provision a data warehouse in the BigQuery platform. Part II teaches you how to load and stream your operational data into the warehouse to make it ready for analysis and reporting. Parts III and IV cover querying and maintaining, helping you keep your information relevant with other Google Cloud Platform services and advanced BigQuery. Part V takes reporting to the next level by showing you how to create dashboards to provide at-a-glance visual representations of your business situation. Part VI provides an introduction to data science with BigQuery, covering machine learning and Jupyter notebooks. You will: Design a data warehouse for your project or organization Load data from a variety of external and internal sources Integrate other Google Cloud Platform services for more complex workflows Maintain and scale your data warehouse as your organization grows Analyze, report, and create dashboards on the information in the warehouse Become familiar with machine learning techniques using BigQuery ML.