Record no.

UNINA9911019092103321

Author

Cady, Field <1984->

Title

The data science handbook / Field Cady

Publication/distribution/printing

Newark : John Wiley & Sons, Incorporated, [2025]

©2025

ISBN

9781394234523

139423452X

9781394234509

1394234503

9781394234516

1394234511

Edition

[2nd edition]

Physical description

1 online resource (xix, 347 pages) : illustrations (some color)

Discipline

005.74

Subjects

Databases

Statistics - Data processing

Big data

Information theory

Bases de dades

Dades massives

Teoria de la informació

Estadística

Processament de dades

Llibres electrònics

Language of publication

English

Format

Printed material

Bibliographic level

Monograph

Bibliography note

Includes bibliographical references and index.

Contents note

Introduction -- The Data Science Road Map -- Programming Languages -- Data Munging: String Manipulation, Regular Expressions, and Data Cleaning -- Visualizations and Simple Metrics -- AI and Machine Learning Overview -- Interlude: Feature Extraction Ideas -- Machine Learning Classification -- Technical Communication and Documentation -- Unsupervised Learning: Clustering and Dimensionality Reduction -- Regression -- Data Encodings and File Formats -- Big Data -- Databases -- Software Engineering Best Practices -- Traditional Natural Language Processing -- Time Series Analysis -- Probability -- Statistics -- Programming Language Concepts -- Performance and Computer Memory -- Computer Memory and Data Structures -- Maximum Likelihood Estimation and Optimization -- Deep Learning and AI -- Stochastic Modeling -- Parting Words: Your Future as a Data Scientist.

Summary/abstract

"The goal of this book is to turn you into a data scientist, and there are two parts to this mission. Firstly there is a set of specific concepts, tools and techniques that you can go out and solve problems with today. They include buzzwords such machine learning, Spark and NLP. They also include concepts that are distinctly less sexy but often more useful, like regular expressions, unit tests and SQL queries. It would be impossible to give an exhaustive list in any single book, but I cast a wide net"--