Within the framework of a design-based research project, computer science educators and statistics educators at Paderborn University designed a pilot course on the subject of data science and big data. It addresses upper secondary students and was realized by weekly sessions (three hours) over seven months. The whole course that is intended to introduce upper secondary school students to the field of data science consists of four modules. In module 1, the learners are introduced into the basics of statistics and big data and it aims at developing their data competence and data awareness. In the sec- ond module, learners are introduced to machine learning and programming based, among others, on examples from module 1. In the third and fourth module, learners can apply their knowledge gained in modules 1 and 2 and will work in small groups on real and meaningful data science projects. In this paper, we want to concentrate on the statistics components, especially of module 1, and we will present how we develop the data competence and data awareness of upper secondary school students to prepare them to work on data science projects in modules 3 and 4.