Study guide 2023/2024

Course overview

Search engines such as Google and Bing are an integral part of everyday use of the Internet. Most users do not think about the underlying techniques, but a basic understanding of how search engines work is essential for anyone who wants to use search engines in a professional environment. In this course, we will discuss how search engines work, their limitations, and the trends in search engine technologies.

The course covers the following topics:

  • Introduction to search engine technology
  • Representation of documents
  • Ranking documents by relevance to a search query
  • Human interaction and search technology
  • More advanced neural-based ranking approaches
  • Information filtering and recommendation

We conclude the course with a final project in which you build a search engine yourself. All students are given the same data collection and compete on an open leaderboard to achieve the best results.

Study materials

Literature

  • Christopher D. Manning, Prabhakar Raghavan and Hinrich Schütze, Introduction to Information Retrieval, Cambridge University Press. 2008. Read it online

  • W. Bruce Croft, Donald Metzler, Trevor Strohman, Search Engines: Information Retrieval in Practice, Pearson Education, Inc., 2015. Read it online

  • Hang Li, Learning to Rank for Information Retrieval and Natural Language Processing, Morgan & Claypool Publishers, 2011. Read it online

  • Nicola Tonellotto, Lecture Notes on Neural Information Retrieval, (preprint) Read it online

Other

  • Each lecture will be accompanied by lecture notes and/or slides.

     

Learning objectives

  • The student is able to explain and apply the most essential parts of information retrieval systems.
  • The student is able to explain the added value of user interfaces for information retrieval systems.
  • The student is able to explain the potential of information retrieval techniques in modern society.
  • The student is able to develop a prototype of a large-scale information retrieval system using existing software packages.
  • The student can determine the quality of an IR system by means of benchmarks.

Teaching methods

  • Lecture
  • Tutorial
  • Self-study
  • Supervision/feedback session
  • Presentation/symposium

Distribution of learning activities

12 hours of lectures

12 hours of tutorials

3 hours of exam

10 minutes of project presentation (optional)

The remaining time is for self-study and working on the assignments.

Attendance

Programme attendance requirements (OER-B):

  • Attendance is mandatory for practicals and tutorial sessions with assignments. How this attendance requirement is implemented may differ per course and is specified in the study guide. Students who do not meet the attendance requirement cannot complete that component with a passing grade.

Assessment

Component and weight          Details

Final grade
4 (40%)  Final Exam           Must be ≥ 5.5
2 (20%)  Assignments          Must be ≥ 5.5
4 (40%)  Group project        Must be ≥ 5.5

The exam is digital and is taken on Ans. The resit exam will have a similar structure to the final exam, with different questions. Missing an assignment deadline results in a grade of zero for that assignment, with one exception: each student may submit at most one assignment late, and the grade for that assignment is capped.

Your final grade is composed as follows:

  • exam: 40%
  • assignments: 20%
  • group project: 40%
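
The weighting above can be sketched in a few lines of Python. This is only an illustration, not the official grading procedure: the names `WEIGHTS`, `MINIMUM`, `Result`, and `final_grade` are made up for this example, the code assumes all grades are on a 1-10 scale, and it does not round, since the study guide does not state a rounding rule. The study guide also does not say which grade is registered when a component is below 5.5, so the sketch only reports whether every minimum is met.

```python
from dataclasses import dataclass

# Weights and the per-component minimum as listed in the assessment section.
WEIGHTS = {"exam": 0.4, "assignments": 0.2, "project": 0.4}
MINIMUM = 5.5


@dataclass
class Result:
    weighted_grade: float     # unrounded weighted average of the three components
    all_minimums_met: bool    # True only if every component is >= 5.5


def final_grade(exam: float, assignments: float, project: float) -> Result:
    """Combine the three component grades (hypothetical helper, for illustration)."""
    grades = {"exam": exam, "assignments": assignments, "project": project}
    weighted = sum(WEIGHTS[name] * grade for name, grade in grades.items())
    minimums_met = all(grade >= MINIMUM for grade in grades.values())
    return Result(weighted, minimums_met)
```

For example, `final_grade(7.0, 8.0, 6.0)` gives a weighted average of 6.8 (up to floating-point rounding) with all minimums met, while `final_grade(5.0, 9.0, 9.0)` gives a weighted average of 7.4 but reports that the exam minimum is not met.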

Exam review

To request an exam review session, contact your supervisor.

Assignments

The assignments must be completed individually and are graded. Feedback is provided via CodeGrade on Canvas.

Fraud and plagiarism

This course follows the UvA's general 'Fraud and Plagiarism Regulations'. Compliance is checked closely. If fraud or plagiarism is suspected, the programme's Examinations Board is involved. See the UvA Fraud and Plagiarism Regulations: http://student.uva.nl

Weekly schedule

See the syllabus on Canvas.

Timetable

The timetable for this course can be found on DataNose.

Additional information

  • The course will be taught in English.
  • All communication will be in English.
  • Assignments, the final project, and the final exam will also be in English.
  • All communication takes place via Canvas, so keep an eye on Canvas Announcements.

Contact information

Coordinator

  • dr. Mohammad Alian Nejadi

Lecturers

  • dr. M. Alian Nejadi

Teaching assistants

  • P.K. Hager MSc
  • C. Meng MSc
  • Z. Seljee
  • Z. Abbasiantaeb