Logo ČVUT
CZECH TECHNICAL UNIVERSITY IN PRAGUE
STUDY PLANS
2019/2020

DB Technologies for Big Data

The course is not on the list Without time-table
Code Completion Credits Range Language
BIE-BIG KZ 4 2P+2C
Lecturer:
Josef Gattermayer (guarantor)
Tutor:
Josef Gattermayer (guarantor)
Supervisor:
Department of Software Engineering
Synopsis:

Students are introduced into the field of Big Data. These are data that the standard relational databases cannot process efficientlydue to the size, and at the same time, their real-time processing can provide information that can have key importance for thecompetitiveness of a company or organization. The course is focused practically. Students learn the most important professionaltechnologies, such as Apache Cassandra, Apache Hadoop, Apache Solr, and others. The course brings to students theoreticalfoundation of algorithms used in Big data systems. In the labs, students learn to develop their own applications on topof these technologies.

Requirements:
Syllabus of lectures:

1. Introduction into the Big Data area, motivation, definitions, CAP theorem.

2. Introduction into NoSQL databases, clustering of NoSQL databases.

3. Distributed computing on top of DB using the map-reduce techniques

4. Data model in NoSQL databases.

5. Internal architecture of NoSQL databases.

6. System layers on top of NoSQL databases.

7. Case study

8. 8. Query languages for NoSQL databases.

9. Automatic generation of client application from a DB model.

10. Case study

11. 11. Performance optimization in NoSQL databases.

12. Case study

13. 13. Case study 4.

Syllabus of tutorials:
Study Objective:

Students are introduced into the field of Big Data. These are data that the standard relational databases cannot process efficientlydue to the size, and at the same time, their real-time processing can provide information that can have key importance for thecompetitiveness of a company or organization. The course is focused practically. Students learn the most important professionaltechnologies, such as Apache Cassandra, Apache Hadoop, Apache Solr, and others. The course brings to students theoreticalfoundation of algorithms used in Big data systems. In the labs, students learn to develop their own applications on topof these technologies.

Study materials:

[1] Sadalage, P. J. - Fowler, M.: NoSQL Distilled: A Brief Guide to the Emerging World of Polyglot Persistence, Addison-Wesley, 2012, 978-0133036121,

Note:
Further information:
https://courses.fit.cvut.cz/BI-BIG/
No time-table has been prepared for this course
The course is a part of the following study plans:
Data valid to 2019-10-18
For updated information see http://bilakniha.cvut.cz/en/predmet3464906.html