CZECH TECHNICAL UNIVERSITY IN PRAGUE
STUDY PLANS
2023/2024

Searching the Web and Multimedia Databases

Code: BI-VWM.21
Completion: Z,ZK
Credits: 5
Range: 2P+1C
Language: Czech
Course guarantor:
Tomáš Skopal
Lecturer:
Tomáš Skopal
Tutor:
Jiří Novák, Tomáš Skopal
Supervisor:
Department of Software Engineering
Synopsis:

Students get a basic overview of search techniques for the web, which is treated as a very large, distributed, and heterogeneous store of documents. In particular, they learn search techniques for text and hypertext documents (the web pages themselves) and methods for extracting features from web pages. They gain detailed knowledge of similarity search in multimedia databases (more generally, in collections of unstructured data). They also learn how to program web search engines for these types of data.

Requirements:

Basic knowledge and skills in algorithmics, programming, data structures and database technologies.

Syllabus of lectures:

1. Web space, search engines, web retrieval modes.

2. Boolean model of information retrieval.

3. Vector model of information retrieval.

4. Link analysis and the web page ranking.

5-6. Search engine ranking and optimization (SEO) (two lectures).

7. Semantic web and Linked data.

8. Personalized search and social context.

9. Web data mining.

10. Introduction to similarity search in multimedia databases.

11. Indexing of metric similarity for efficient multimedia retrieval.

12. Approximate similarity search.

13. Similarity queries and multimodal search.
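
As an illustration of lecture topics 2 and 3 only, the following is a minimal sketch of Boolean retrieval over an inverted index and of TF-IDF ranking with cosine similarity. It is not taken from the course materials: the toy collection DOCS, the whitespace tokenizer, the tf * log(N / df) weighting, and all function names are assumptions chosen for brevity.

```python
import math
from collections import Counter, defaultdict

# Hypothetical toy collection; the texts are made up for illustration.
DOCS = {
    "d1": "web search engines rank web pages",
    "d2": "similarity search in multimedia databases",
    "d3": "metric indexing for similarity search",
}


def tokenize(text):
    """Naive tokenizer (assumption): lowercase and split on whitespace."""
    return text.lower().split()


def build_inverted_index(docs):
    """Map each term to the set of document ids that contain it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for term in tokenize(text):
            index[term].add(doc_id)
    return index


def boolean_and(index, terms):
    """Boolean model: ids of documents that contain every query term."""
    postings = [index.get(term, set()) for term in terms]
    return set.intersection(*postings) if postings else set()


def tfidf_vectors(docs):
    """Vector model: weight each term by tf * log(N / df)."""
    n = len(docs)
    tokens = {doc_id: tokenize(text) for doc_id, text in docs.items()}
    df = Counter(term for toks in tokens.values() for term in set(toks))
    idf = {term: math.log(n / count) for term, count in df.items()}
    vectors = {
        doc_id: {term: tf * idf[term] for term, tf in Counter(toks).items()}
        for doc_id, toks in tokens.items()
    }
    return vectors, idf


def cosine(u, v):
    """Cosine similarity of two sparse vectors stored as dicts."""
    dot = sum(weight * v.get(term, 0.0) for term, weight in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def rank(query, docs):
    """Return (doc_id, score) pairs sorted by descending cosine similarity."""
    vectors, idf = tfidf_vectors(docs)
    q = {t: tf * idf.get(t, 0.0) for t, tf in Counter(tokenize(query)).items()}
    scores = {doc_id: cosine(q, vec) for doc_id, vec in vectors.items()}
    return sorted(scores.items(), key=lambda pair: pair[1], reverse=True)


if __name__ == "__main__":
    index = build_inverted_index(DOCS)
    print(boolean_and(index, ["similarity", "search"]))
    print(rank("multimedia similarity", DOCS))
```

The Boolean query returns an unordered set of matching documents, while the vector model orders all documents by score. That difference is one reason the two models are usually presented separately.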

Syllabus of tutorials:

1. Project topic presentation.

2. Group consultations.

3. Group consultations.

4. Individual consultations.

5. Individual consultations.

6. Project presentation.

7. Project presentation.

Study Objective:

This module is recommended for students who want a deeper understanding of web search engines. It explores text, hypertext, and multimedia retrieval techniques in particular. The techniques are described at three levels: theoretical (model), algorithmic, and application. In experimental projects, students can then implement the methods and use them in various web applications.

Study materials:

1. Baeza-Yates R., Ribeiro-Neto B.: Modern Information Retrieval: The Concepts and Technology behind Search. Addison-Wesley Professional, 2011. ISBN 978-0321416919.

2. Langville A. N., Meyer C. D.: Google's PageRank and Beyond: The Science of Search Engine Rankings. Princeton University Press, 2012. ISBN 978-0691152660.

3. Jones K. B.: Search Engine Optimization: Your Visual Blueprint for Effective Internet Marketing. Visual, 2013. ISBN 978-1118551745.

4. Zezula P., Amato G., Dohnal V., Batko M.: Similarity Search: The Metric Space Approach. Springer, 2005. ISBN 978-0387291468.

5. Aggarwal C. C.: Data Mining: The Textbook. Springer, 2015. ISBN 978-3319141411.

Note:
Further information:
https://moodle-vyuka.cvut.cz/course/search.php?search=BI-VWM
Time-table for winter semester 2023/2024:
Time-table is not available yet
Time-table for summer semester 2023/2024:
Wed, 11:00–12:30: lecture parallel 1, Skopal T., room T9:105, Dejvice (lecture hall)
Wed, 14:30–16:00, even weeks: parallel no. 101 (lecture parallel 1), Novák J., room TH:A-1247, Thákurova 7, FSv building (seminar room)
Wed, 16:15–17:45, even weeks: parallel no. 102 (lecture parallel 1), Novák J., room TH:A-1247, Thákurova 7, FSv building (seminar room)
Wed, 14:30–16:00, odd weeks: parallel no. 103 (lecture parallel 1), Novák J., room TH:A-1247, Thákurova 7, FSv building (seminar room)
Wed, 16:15–17:45, odd weeks: parallel no. 104 (lecture parallel 1), Novák J., room TH:A-1247, Thákurova 7, FSv building (seminar room)
No sessions are listed for Mon, Tue, Thu, or Fri.
The course is a part of the following study plans:
Data valid to 2024-02-23
Updates to the information above can be found at https://bilakniha.cvut.cz/en/predmet6702406.html