1.2. Why learn data science from this book?#
This book uses a computational first approach to data science. You will learn how to leverage the power of modern computers and scientific software to visualize, transform, and simulate data. For instance, one of the main approaches used in this book is to conduct statistical tests by carrying out simulations that draw samples from the data being analyzed. This approach has the following benefits:
We start working with real data sets quickly because this approach does not require a lot of mathematical background and the computer does all the mathematical manipulation and plotting.
Simulation models are easy to create and understand. The results do not rely on any arcane formulas but only the ability to build very simple simulations that draw from the experimental data.
This approach is more general than the traditional approach. It does not rely on the data coming from specific probability distributions, and the same simulation can be used to generate frequentist or Bayesian statistics.
Interactive flashcards and self-assessment quizzes are provided on this website and linked using QR codes in the book.
The interactive materials use spaced repetition to help readers retain knowledge as they progress through the book. Starting with Chapter 2, the interactive chapter reviews also give a random subset of review problems from earlier chapters. Research shows that spaced repetition improves the retention of material.
Each chapter of the book ends with “take-aways” that help summarize the important points from the chapter and address issues that may be topics of questions in data science interviews.