Module 39-Inf-DI Data Integration

Faculty

Person responsible for module

Regular cycle (beginning)

Every summer semester

Credit points and duration

5 Credit points

For information on the duration of the modul, refer to the courses of study in which the module is used.

Competencies

Non-official translation of the module descriptions. Only the German version is legally binding.

Students learn in the lectures and are the exercise courses how to integrate heterogeneous data sources, i.e. how to combine data residing in different data sources to obtain a global view of the data relating to relevant entities, which represents one of the major challenges in data management especially in the Big Data era as this integration is key to addressing the issue of variety. The problem has been considered for decades, and the lectures will cover foundations of data integration as well as algorithmic and system aspects. The module includes an exam at the end of the term.

Content of teaching

Topics covered in this module:

  • Types of data integration and associated architectures of integrating systems
  • Overcoming schematic heterogeneities between integrated data sources (schema/ontology mapping and schema/ontology matching)
  • Data de-duplication, fusion and curation
  • Keeping track of the integration process through data provenance.
  • Ontology-based data integration
  • Entity matching / record linkage
  • Information extraction from text (entity recognition and linking, relation extraction)
  • Information extraction from Web tables

Recommended previous knowledge

Necessary requirements

Explanation regarding the elements of the module

Module structure: 1 bPr 1

Courses

Data Integration
Type lecture
Regular cycle SoSe
Workload5 60 h (30 + 30)
LP 2 [Pr]
Data Integration
Type exercise
Regular cycle SoSe
Workload5 60 h (30 + 30)
LP 2

Examinations

portfolio with final examination
Allocated examiner Teaching staff of the course Data Integration (lecture)
Weighting 1
Workload 30h
LP2 1

Portfolio consisting of per default weekly exercises or programming tasks and final written exam (per default 60 minutes) or final oral exam (per default 15 minutes). The exercises are based on the content of the lecture and enable students to train and further investigate the topics. It is required that a sufficient percentage of the exercises are successfully completed (per default 50% of the total number of points which can be achieved during a semester). The final oral exam concerns both, the content of the lecture as well as the exercises.

The module is used in these degree programmes:

Degree programme Profile Recom­mended start 3 Duration Manda­tory option 4
Data Science / Master of Science [FsB vom 06.04.2018 mit Änderungen vom 01.07.2019, 02.03.2020, 21.03.2023 und 10.12.2024] Variante 1 2. one semester Compul­sory optional subject
Data Science / Master of Science [FsB vom 06.04.2018 mit Änderungen vom 01.07.2019, 02.03.2020, 21.03.2023 und 10.12.2024] Variante 2 2. one semester Compul­sory optional subject

Automatic check for completeness

The system can perform an automatic check for completeness for this module.


Legend

1
The module structure displays the required number of study requirements and examinations.
2
LP is the short form for credit points.
3
The figures in this column are the specialist semesters in which it is recommended to start the module. Depending on the individual study schedule, entirely different courses of study are possible and advisable.
4
Explanations on mandatory option: "Obligation" means: This module is mandatory for the course of the studies; "Optional obligation" means: This module belongs to a number of modules available for selection under certain circumstances. This is more precisely regulated by the "Subject-related regulations" (see navigation).
5
Workload (contact time + self-study)
SoSe
Summer semester
WiSe
Winter semester
SL
Study requirement
Pr
Examination
bPr
Number of examinations with grades
uPr
Number of examinations without grades
This academic achievement can be reported and recognised.