Published in

Taylor and Francis Group, The American Statistician, 72(4), p. 382-391, 2018

DOI: 10.1080/00031305.2017.1356747

A Guide to Teaching Data Science

Journal article published in 2016 by Stephanie C. Hicks, Rafael A. Irizarry
This paper is available in a repository.

Preprint: archiving forbidden
Postprint: archiving restricted
Published version: archiving forbidden
Data provided by SHERPA/RoMEO

Abstract

Demand for data science education is surging, and traditional courses offered by statistics departments are not meeting the needs of those seeking this training. This has led to a number of opinion pieces advocating for an update to the statistics curriculum. The unifying recommendation is that computing should play a more prominent role. We strongly agree with this recommendation, but advocate that the main priority is to bring applications to the forefront, as proposed by Nolan and Speed (1999). We also argue that the individuals tasked with developing data science courses should not only have statistical training, but also have experience analyzing data with the main objective of solving real-world problems. Here, we share a set of general principles and offer a detailed guide derived from our successful experience developing and teaching data science courses centered entirely on case studies. We argue for the importance of statistical thinking, as defined by Wild and Pfannkuch (1999), and describe how our approach teaches students three key skills needed to succeed in data science, which we refer to as creating, connecting, and computing. This guide can also be used by statisticians wanting to gain more practical knowledge about data science before embarking on teaching a course.

Comment: 26 pages, 2 tables, 5 figures