Statistics

Statistics Topics for Data Scientists

Statistical Significance

As data scientists, sometimes our analysis requires assessing statistical significance. Maybe it is an A/B test, and we need to determine if one option is actually better at achieving whatever KPI we might be after.

Perhaps the most common test for statistical significance is the Student’s t-test, and it all started at Guinness:

General Topics

This first article is a paper by a statistics professor and a computer science professor about what they consider to be the most important ideas in statistics from the past half-century. Some of these topics will look familiar to data scientists, like regularization. This follows nicely from the previous page, continuing to look at the connections between statistics and machine learning.