392157 Big Data Analytics (V) (SoSe 2019)

Contents, comment

Kompetenzen:

Die Vorlesung Big Data Analytics entwickelt Kompetenzen bei der Durchführung von Data-Mining-Aufgaben bei sehr großen Datenmengen, die nicht im Hauptspeicher gespeichert werden können. Die Vorlesung liefert die Schlüsselideen der Ähnlichkeitssuche unter Verwendung von Minhashing und lokalitätssensitivem Hashing, der Verarbeitung von Datenströmen, bei denen Daten so schnell ankommen, dass sie sofort verarbeitet werden müssen oder sonst verloren gehen, von Web-bezogenen Algorithmen wie dem Google PageRank, von Algorithmen um häufige Itemsets, Assoziationsregeln und häufige Teilgraphen zu finden, Algorithmen zur Analyse der Struktur von großen Graphen, wie z. B. von sozialen Netzwerkgraphen, und des MapReduce-Prinzips, um parallele Algorithmen zu entwerfen.

The lecture Big Data Analytics develops competencies in performing data mining tasks on very large amounts of data that cannot be stored in main memory. The lecture provides the key ideas of similarity search using minhashing and locality-sensitive hashing, of data stream processing where data arrives so fast that it has to be processed immediately or is otherwise lost, of Web-related algorithms such as Google's PageRank, of algorithms for mining frequent itemsets, association rules and frequent subgraphs, of algorithms to analyze the structure of large graphs such as social network graphs, and of the map-reduce principle to design parallel algorithms.

Lehrinhalte:

Die Vorlesung Big Data Analytics befasst sich mit Methoden und Algorithmen im Kontext der Analyse von Big Data. Insbesondere werden folgende Themen behandelt:

1) Suchen von ähnlichen Objekten
2) Analyse von Datenströmen
3) PageRank
4) MapReduce
5) Suche nach häufigen Teilmengen
6) Suche nach häufigen Teilgraphen
7) Mining von sozialen Netzwerkgraphen
8) Empfehlungssysteme

The lecture Big Data Analytics deals with methods and algorithms in the context of big data analytics. In particular, the following topics are addressed:

1) Finding Similar Items
2) Stream Data Analysis
3) PageRank
4) MapReduce
5) Mining Frequent Itemsets
6) Mining Frequent Subgraphs
7) Mining Social Network Graphs
8) Recommender Systems

Requirements for participation, required level

Kenntnisse aus Datenbanken I (oder vergleichbare Kenntnisse aus anderen Vorlesungen)

Bibliography

A. Silberschatz, H. F. Korth, S. Sudarshan, „Database System Concepts“, 5th edition, McGraw Hill, 2006.

R. Elmasri und S.B. Navathe, „Fundamentals of Database Systems“, 5th edition, Pearson/Addison Wesley, 2007.

William H. Inmon, "Building the Data Warehouse", John Wiley & Sons, 1996.

Jure Leskovec, Anand Rajaraman, Jeffrey David Ullman, "Mining of Massive Datasets", 2nd Edition, Cambridge University Press, 2014.

Tom White, "Hadoop: The Definitive Guide Storage and Analysis at Internet Scale", 3rd edition, O'Reilly.

Viktor Mayer-Schönberger , Kenneth Cukier , " Big Data: A Revolution That Will Transform How We Live, Work and Think", John Murray, 2013.

Eric Redmond , Jim R. Wilson, "Seven Databases in Seven Weeks: A Guide to Modern Databases and the NoSQL Movement", O' Reilly, 2012.

Peter Gulutzan, Trudy Pelzer , "SQL Performance Tuning", Addison Wesley, 2002.

Teaching staff

Dates ( Calendar view )

Frequency Weekday Time Format / Place Period  

Show passed dates >>

Examinations

Date Time Format / Room Comment about examination

Show passed examination dates >>

Subject assignments

Module Course Requirements  
39-Inf-BDA_ver1 Big Data Analytics Big Data Analytics Graded examination
Student information
39-Inf-DB2 Datenbanken II Datenbanken II Student information
- Graded examination Student information

The binding module descriptions contain further information, including specifications on the "types of assignments" students need to complete. In cases where a module description mentions more than one kind of assignment, the respective member of the teaching staff will decide which task(s) they assign the students.


Bestehen der mündlichen oder schriftlichen Prüfung (3 LP) sowie erfolgreiche Bearbeitung der Übungsblätter. Die erfolgreiche Bearbeitung der Übungsblätter beinhaltet die Lösung von mind. 60% der Aufgaben sowie das zweimalige Vorrechnen einer Aufgabe in den Übungsgruppen (2 LP). Diese Leistungen ergeben zusammen 5 LP.

E-Learning Space

A corresponding course offer for this course already exists in the e-learning system. Teaching staff can store materials relating to teaching courses there:

Registered number: 134
This is the number of students having stored the course in their timetable. In brackets, you see the number of users registered via guest accounts.
Address:
SS2019_392157@ekvv.uni-bielefeld.de
This address can be used by teaching staff, their secretary's offices as well as the individuals in charge of course data maintenance to send emails to the course participants. IMPORTANT: All sent emails must be activated. Wait for the activation email and follow the instructions given there.
If the reference number is used for several courses in the course of the semester, use the following alternative address to reach the participants of exactly this: VST_152511391@ekvv.uni-bielefeld.de
Coverage:
66 Students to be reached directly via email
Notes:
Additional notes on the electronic mailing lists
Email archive
Number of entries 0
Open email archive
Last update basic details/teaching staff:
Monday, April 29, 2019 
Last update times:
Friday, July 12, 2019 
Last update rooms:
Friday, July 12, 2019 
Type(s) / SWS (hours per week per semester)
V / 2
Department
Faculty of Technology
Questions or corrections?
Questions or correction requests for this course?
Planning support
Clashing dates for this course
Links to this course
If you want to set links to this course page, please use one of the following links. Do not use the link shown in your browser!
The following link includes the course ID and is always unique:
https://ekvv.uni-bielefeld.de/kvv_publ/publ/vd?id=152511391
Send page to mobile
Click to open QR code
Scan QR code: Enlarge QR code
ID
152511391