Skip to content

amartzloff/teaching

 
 

Repository files navigation

Hi there 👋 Welcome to my teaching materials!

I'm working on two Information Retrieval courses at the Vienna University of Technology (TU Wien), mainly focusing on the master-level Advanced Information Retrieval course. I try to create engaging, fun, and invormative lectures and exercises.

  • The Introduction to Information Retrieval course focuses on IR basics and engineering skills of the students.
  • The Advanced Information Retrieval course focuses on machine learning & neural IR techniques and tries to follow the state-of-the-art in IR research.

Please feel free to open up an issue or a pull request if you want to add something, find a mistake, or think something should be explained better!

Advanced Information Retrieval 2021

Information Retrieval is the science behind search technology. Certainly, the most visible instances are the large Web Search engines, the likes of Google and Bing, but information retrieval appears everywhere we have to deal with unstructured data (e.g. free text).

A paradigm shift. Taking off in 2019 the Information Retrieval research field began an enormous paradigm shift towards utilizing BERT-based language models in various forms to great effect with huge leaps in quality improvements for search results using large-scale training data. This course aims to showcase a slice of these advances in state-of-the-art IR research towards the next generation of search engines.

Syllabus The AIR'21 syllabus overview

Lectures

In the following we provide links to recordings, slides, and closed captions for our lectures. (Continuously adding more throughout the semester March-June 2021)

Topic Description Recordings Slides Text
0: Introduction Infos on requirements, topics, organization YouTube PDF Transcript
1: Crash Course IR Fundamentals We explore two fundamental building blocks of IR: indexing and ranked retrieval YouTube PDF Transcript
2: Crash Course IR Evaluation We explore how we evaluate ranked retrieval results and common IR metrics (MRR, MAP, NDCG) YouTube PDF Transcript
3: Crash Course IR Test Collections We get to know existing IR test collections, look at how to create your own, and survey potential biases & their effect in the data YouTube PDF Transcript
4: Word Representation Learning We take a look at word representations and basic word embeddings including a usage example in Information Retrieval YouTube PDF Transcript
5: Sequence Modelling We look at CNNs and RNNs for sequence modelling, including the basics of the attention mechanism. YouTube PDF Transcript
6: Transformer & BERT We study the Transformer architecture; pre-training with BERT, the HuggingFace ecosystem where the community can share models; and overview Extractive Question Answering (QA). YouTube PDF Transcript
7: Introduction to Neural Re‑Ranking We look at the workflow (including training and evaluation) of neural re-ranking models and some basic neural re-ranking architectures. YouTube PDF Transcript
8: Transformer Contextualized Re‑Ranking We learn how to use Transformers (and the pre-trained BERT model) for neural re-ranking - for the best possible results and more efficient approaches, where we tradeoff quality for performance. YouTube PDF Transcript
9: Domain Specific Applications Guest lecture by @sophiaalthammer We learn how about different task settings, challenges, and solutions in domains other than web search. YouTube PDF Transcript

Neural IR & Extractive QA Exercise

In this exercise your group is implementing neural network re-ranking models, using pre-trained extractive QA models, and analyze their behavior with respect to our FiRA data.

📃 To the assignment

About

Information Retrieval Courses @ TU Wien

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 77.3%
  • Jupyter Notebook 16.0%
  • C# 6.7%