Logo ČVUT
CZECH TECHNICAL UNIVERSITY IN PRAGUE
STUDY PLANS
2023/2024

Searching the Web and Multimedia Databases

The course is not on the list Without time-table
Code Completion Credits Range Language
BI-VWM.21 Z,ZK 5 2P+1C Czech
Garant předmětu:
Tomáš Skopal
Lecturer:
Tomáš Skopal
Tutor:
Tomáš Skopal
Supervisor:
Department of Software Engineering
Synopsis:

Students get basic overview about search techniques in the web environment that is interpreted as a very large distributed and heterogeneous storage of documents. In particular, students acquire information about search techniques in text and hypertext documents (the web pages themselves) and about feature extraction from web pages. They get detailed knowledge of similarity search in multimedia databases (generally in collections of unstructured data). They also learn techniques for programming web search engines for the mentioned data types (documents).

Requirements:

Basic knowledge and skills in algorithmics, programming, data structures and database technologies.

Syllabus of lectures:

1. Web space, search engines, web retrieval modes.

2. Boolean model of information retrieval.

3. Vector model of information retrieval.

4. Link analysis and the web page ranking.

5. [2] Search engine ranking and optimization (SEO).

7. Semantic web and Linked data.

8. Personalized search and social context.

9. Web data mining.

10. Introduction to similarity search in multimedia databases.

11. Indexing of metric similarity for efficient multimedia retrieval.

12. Approximate similarity search.

13. Similarity queries and multimodal search.

Syllabus of tutorials:

1. Project topic presentation.

2. Group consultations.

3. Group consultations.

4. Individual consultations.

5. Individual consultations.

6. Project presentation.

7. Project presentation.

Study Objective:

This module is recommended for students that are interested in deeper understanding of web search engines. In particular, text, hypertext, and multimedia retrieval techniques are explored. The retrieval techniques are described in three layers: theoretical (model), algorithmical, and application. Then, in experimental projects the students can implement the methods and employ them in various web applications.

Study materials:

1. Baeza-Yates R., Ribeiro-Neto B. : Modern Information Retrieval: The Concepts and Technology behind Search. Addison-Wesley Professional, 2011. ISBN 978-0321416919.

2. Langville A. N., Meyer C. D. : Google’s PageRank and Beyond: The Science of Search Engine Rankings. Princeton University Press, 2012. ISBN 978-0691152660.

3. Kristopher B. J. : Search Engine Optimization: Your Visual Blueprint for Effective Internet Marketing. Visual, 2013. ISBN 978-1118551745.

4. Zezula P., Amato G., Dohnal V., Batko M. : Similarity Search: The Metric Space Approach. Springer, 2005. ISBN 978-0387291468.

5. Aggarwal C. C. : Data Mining: The Textbook. Springer, 2015. ISBN 978-3319141411.

Note:
Further information:
https://moodle-vyuka.cvut.cz/course/search.php?search=BI-VWM
No time-table has been prepared for this course
The course is a part of the following study plans:
Data valid to 2023-06-03
Aktualizace výše uvedených informací naleznete na adrese https://bilakniha.cvut.cz/en/predmet6702406.html