Logo ČVUT
CZECH TECHNICAL UNIVERSITY IN PRAGUE
STUDY PLANS
2024/2025

Searching the Web and Multimedia Databases

The course is not on the list Without time-table
Code Completion Credits Range Language
BI-VWM Z,ZK 5 2P+1C Czech
Garant předmětu:
Lecturer:
Tutor:
Supervisor:
Department of Software Engineering
Synopsis:

Students get basic overview about search techniques in the web environment that is interpreted as a very large distributed and heterogeneous storage of documents. In particular, students acquire information about search techniques in text and hypertext documents (the web pages themselves) and about feature extraction from web pages. They get detailed knowledge of similarity search in multimedia databases (generally in collections of unstructured data). They also learn techniques for programming web search engines for the mentioned data types (documents).

Requirements:

Basic knowledge and skills in algorithmics, programming, data structures and database technologies.

Syllabus of lectures:

1. Web space, search engines, web retrieval modes.

2. Boolean model of information retrieval.

3. Vector model of information retrieval.

4. Link analysis and the web page ranking.

5. [2] Search engine ranking and optimization (SEO).

7. Semantic web and Linked data.

8. Personalized search and social context.

9. Web data mining.

10. Introduction to similarity search in multimedia databases.

11. Indexing of metric similarity for efficient multimedia retrieval.

12. Approximate similarity search.

13. Similarity queries and multimodal search.

Syllabus of tutorials:

1. Project topic presentation.

2. Group consultations.

3. Group consultations.

4. Individual consultations.

5. Individual consultations.

6. Project presentation.

7. Project presentation.

Study Objective:

This module is recommended for students that are interested in deeper understanding of web search engines. In particular, text, hypertext, and multimedia retrieval techniques are explored. The retrieval techniques are described in three layers: theoretical (model), algorithmical, and application. Then, in experimental projects the students can implement the methods and employ them in various web applications.

Study materials:

course slides +

1) Ricardo Baeza-Yates, Berthier Ribeiro-Neto. Modern Information Retrieval: The Concepts and Technology behind Search, 2011, Addison-Wesley Professional, ISBN-10: 0321416910

2) Amy N. Langville, Carl D. Meyer. Google's PageRank and Beyond: The Science of Search Engine Rankings, 2012, Princeton University Press, ISBN-10: 0691152667

3) Kristopher B. Jones. Search Engine Optimization: Your Visual Blueprint for Effective Internet Marketing, 2013, Visual, ISBN-10: 1118551745

4) Pavel Zezula, Giuseppe Amato, Vlastislav Dohnal, Michal Batko. Similarity Search: The Metric Space Approach, 2005, Springer, ISBN-10: 0387291466

Note:
Further information:
https://moodle-vyuka.cvut.cz/course/search.php?search=BI-VWM
No time-table has been prepared for this course
The course is a part of the following study plans:
Data valid to 2024-04-17
Aktualizace výše uvedených informací naleznete na adrese https://bilakniha.cvut.cz/en/predmet1123906.html