Skip to main content

Landscape of Introductory Data Science Courses
(in the United States)

 

 

 

Abbreviations:

  • "DS" = Data Science
  • "IntroDS" = the Introductory Data Science Course (in the U.S.)

This is work by the Motivational Attitudes in Statistics / Data Science Education Research (MASDER) team. Click for more details on the MASDER Project.

This instruments used in this work.

  • WHAT/WHO - An Introductory Data Science Course Survey was created as part of the MASDER Project to collect information about introductory courses taught in the United States. Survey participation was solicited from sampled schools in the stratified sample described here, and via distribution lists (ASA and ACM) and dissemination at professional meetings (JSM, USCOTS, SIGCSE). One hundred forty-four valid response were summarized in the analyses. In order to identify a discipline in which the course is taught or an instructor's identity, the following hierarchy was used. Data Science >> CS / Stat >> Math >> Other. Thus, based on the hierarchy, if statistics AND computer science were indicated as primary disciplines, we categorized it as data science. On the other hand, if statistics AND mathematics were indicated as primary disciplines, we categorized it as statistics.
     
  • WHERE - This is based on a representative sample of 147 schools obtained from the population using a stratified random sampling design. Our sampling frame consisted of 2,751 schools in the United States using the Department of Education College Scorecard and the Carnegie Classification Database. We excluded U.S. Service schools, graduate-only institutions, schools with low enrollment numbers, and some special focus schools. For more information about DS and STAT majors and minors in the U.S., see our sampling paper (forthcoming) under Research.

The syllabus content analysis was done by Dr. Jennifer Broatch and Heike Lieske, also supported by the National Science Foundation (DUE 2423026). Any opinions, findings, and conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of the National Science Foundation. Additionally, the information presented here is not the work of the GAISE committee nor reflects any opinions of the American Statistical Association.