page logo
Please note the following credit values:

BSc Course: 4.5 ECTS*
BSc Seminar Course: 9 ECTS
MSc Course: 5 ECTS
MBA Course: 3 ECTS
MBA Workshop: 1 ECTS
Language course: 5 ECTS

*The following BSc courses have a different credit value: 

Business Communication: Theory & Practice: 3 ECTS
Managing your personal performance holistically: 3 ECTS
Harmonizing Leadership with Personal Development: 3 ECTS
Mental Health First Aid: 1,5 ECTS
Understanding your personal performance base: 1,5 ECTS
Workshop Body Language for Women: 1,5 ECTS
Intercultural Competence - Fit for International Collaboration: 1,5 ECTS
Perform Yourself! Media and Presentation Coaching: Personal Presence!: 1,5 ECTS

Workshop: Data Preparation for Analytics (MSc optional course) - Q3

Participation Prerequisites

Prior Python knowledge is not required.

Instructions for installing the required software will be sent out before the course.

Please ensure a working internet connection throughout the course (in order to install additional packages as required).

Course Content

Course Objectives

“The world’s most valuable resource is no longer oil, but data” – The Economist, May 2017.

Data has the potential to create immense business value, to disrupt existing, and to create new business models. But like oil, data needs refinement to fulfill this potential! The amount of time required to bring the data into shape for machine learning and artificial intelligence algorithms or statistical analysis is often underestimated. Furthermore, introductions to data science typically focus on the methods and algorithms and do not cover the required data preparation appropriately.

This workshop aims at enabling students to go beyond the unrealistically clean datasets provided in data science and machine learning tutorials. Instead, students learn how to handle data as they would face it in real-life business situations, where errors, inconsistencies, incompleteness, duplicates and many more problems are commonplace. They learn how to combine data from different sources and how to efficiently perform computations, aggregations, and other typical data preparation steps. Finally, students are introduced to special data preprocessing steps required for machine learning.

Having completed this course will give students an edge in the labor market where most newcomers have little experience with real-life datasets – especially those aiming for a career in consulting or other areas related to data science and artificial intelligence.

This course is also an ideal complement for students taking the courses “Managing Data Science” and “Visual Data Analysis”.

Course Contents

The course covers the typical data preparation techniques required for analytics:

  • Loading and joining data from different types of data sources
  • Data types and conversions
  • Filtering
  • Computations
  • Aggregations
  • Pivoting / reshaping
  • Handling inconsistencies and errors in the data
  • Time series operations
  • Special preprocessing operations for machine learning

We may emphasize or skip topics based on questions or suggestions during the workshop and based on the pace of the group.

Instruction Type

Lecture

Form of Examination

Form of Assessment Weighting
(in %)
Duration of written exam
in minutes
Written Exam    
Oral Examination   -
Written Work (Individual)   -
Written Work (Group)   -
Presentation (Individual)   -
Presentation (Group)   -
Business Simulation   -
Class Participation   -
Answer-Choice-Exam   -
Other assessment format (please specify):   -

Next events

No current events available!

1/4 Elective Th, 05.02.2026 08:45 Uhr 14:30 Uhr E-103 Hörsaal / Lecture Hall
2/4 Elective Fr, 06.02.2026 08:45 Uhr 14:30 Uhr E-103 Hörsaal / Lecture Hall
3/4 Elective We, 11.02.2026 08:45 Uhr 14:30 Uhr E-103 Hörsaal / Lecture Hall
4/4 Elective Fr, 13.02.2026 08:45 Uhr 14:30 Uhr E-103 Hörsaal / Lecture Hall
Show past events

Lecturers

lecturer image
Keller, Tobias
Lecturer

Indicative Student Workload

Self-Study 36 h
Contact Time 24 h
Examination 0 h