1.

Record Nr.

UNINA9910495223203321

Autore

Chen Jeffrey

Titolo

Data Science for Public Policy / / by Jeffrey C. Chen, Edward A. Rubin, Gary J. Cornwall

Pubbl/distr/stampa

Cham : , : Springer International Publishing : , : Imprint : Springer, , 2021

ISBN

3-030-71352-0

Edizione

[1st ed. 2021.]

Descrizione fisica

1 online resource (365 pages)

Collana

Springer Series in the Data Sciences, , 2365-5682

Disciplina

300.2854

Soggetti

Mathematics - Data processing

Mathematical statistics - Data processing

Computational Mathematics and Numerical Analysis

Statistics and Computing

Lingua di pubblicazione

Inglese

Formato

Materiale a stampa

Livello bibliografico

Monografia

Nota di bibliografia

Includes bibliographical references and index.

Nota di contenuto

An Introduction -- The Case for Programming -- Elements of Programming -- Transforming Data -- Record Linkage -- Exploratory Data Analysis -- Regression Analysis -- Framing Classification -- Three Quantitative Perspectives -- Prediction -- Cluster Analysis -- Spatial Data -- Natural Language -- The Ethics of Data Science -- Developing Data Products -- Building Data Teams -- Appendix A: Planning a Data Product -- Appendix B: Interview Questions.

Sommario/riassunto

This textbook presents the essential tools and core concepts of data science to public officials, policy analysts, and economists among others in order to further their application in the public sector. An expansion of the quantitative economics frameworks presented in policy and business schools, this book emphasizes the process of asking relevant questions to inform public policy. Its techniques and approaches emphasize data-driven practices, beginning with the basic programming paradigms that occupy the majority of an analyst’s time and advancing to the practical applications of statistical learning and machine learning. The text considers two divergent, competing perspectives to support its applications, incorporating techniques from both causal inference and prediction. Additionally, the book includes open-sourced data as well as live code, written in R and presented in



notebook form, which readers can use and modify to practice working with data.