We will discuss why this is a fundamental question in Data Science and how we connected the concept to educational experience of the next generation of Data Scientists.
Last semester in the School of Data Science at the University of Virginia we taught our first introductory class for undergraduates, DS 1001 Foundations of Data Science. In the process of developing the course we explored how to introduce the fundamental questions of the field of Data Science to our students. This lightning talk is all about one of those questions "What do I leave in, and what do I leave out?" or put another way "How do we decide which data to collect?". We will discuss why this is a fundamental question in Data Science and how we connected the concept to educational experience of the next generation of Data Scientists.