I851 – Foundations in Data Science and Engineering

Module
Foundations in Data Science and Engineering
Foundations in Data Science and Engineering

Please note: This module will be offered for the first time in the Summer semester 2025 semester.
Module number
I851
Version: 1
Faculty
Informatics/Mathematics
Level
Master
Duration
1 Semester
Semester
Summer semester
Module supervisor

Prof. Dr.-Ing. Maik Thiele
maik.thiele(at)htw-dresden.de

Lecturer(s)

Prof. Dr.-Ing. Maik Thiele
maik.thiele(at)htw-dresden.de

Course language(s)

English
in "Foundations in Data Science and Engineering"

ECTS credits

3.00 credits

Workload

90 hours

Courses

2.00 SCH (1.00 SCH Lecture | 1.00 SCH Internship)

Self-study time

60.00 hours

Pre-examination(s)
None
Examination(s)

Written examination
Module examination | Examination time: 90 min | Weighting: 100% | tested in English language
in "Foundations in Data Science and Engineering"

Form of teaching

keine Angabe

Media type

Präsenzveranstaltung

Instruction content/structure

The term "Data Science" has become an important buzzword in dealing with big data. Data scientists handle and analyze large amounts of data to generate information and derive recommendations for action that enable organizations to work more efficiently. To achieve this, various analytical tools and methods are used to extract valuable information from confusing data sets, from which hypotheses are subsequently derived.

This lecture provides an end-to-end look at the data science process. In the first part of the lecture, the fundamentals of data management are taught. It will be shown how large dynamic data sets can be integrated, consolidated and finally used for complex analysis tasks. In particular, the topics of data warehousing, multidimensional modeling and relational mapping are covered. Building on this, complex data models, e.g. graphs and visual analysis as a means of data interpretation are discussed. The second part of the lecture covers methods of data analysis. This includes classical statistical methods like classification, clustering and forecasting as well as machine learning methods like neural networks. Besides the methods themselves, modern analysis tools for their application are also presented. The theoretical content of the lectures is complemented by practical exercises.

Qualification objectives
  • Learning data management system foundations
  • Learning practical skills for dealing with error-prone data and its integration
  • Enablement to choose the correct data analysis methods given concrete usage scenarios
  • Building own end-to-end scenarios in Python for analyzing data
Social and personal skills
No information
Special admission requirements
Recommended prerequisites

Grundlagenkenntnisse in Python

Continuation options
Literature
  • Andreas C. Muller, Sarah Guido: Introduction to Machine Learning with Python. 2017, O'Reilly.
  • Wes McKinney: Python for Data Analysis - Data Wrangling with Pandas, NumPy, and Jupyter. 2022, O'Reilly.
  • Charles Wheelan: Naked Statistics.
Current teaching resources
None
Notes
No information