Course: NWI-I00041
Information Retrieval
Course code: NWI-I00041
Credits (ECTS): 6
Category: MA (Master)
Language of instruction: English
Offered by: Radboud Universiteit; Faculteit der Natuurwetenschappen, Wiskunde en Informatica; Informatica en Informatiekunde
Lecturers
Coordinator: prof. dr. ir. A.P. de Vries
Lecturer: prof. dr. ir. A.P. de Vries
Course contact person: prof. dr. ir. A.P. de Vries
Academic year: 2016
Period: KW1-KW2 (29-08-2016 through 29-01-2017)
Starting block: KW1
Mode of study: full-time
Remark: -
Registration via OSIRIS: Yes
Registration open to students taking the course as a subsidiary subject: Yes
Pre-registration: No
Waiting list: No
Placement procedure: -
Course objectives

The objective is that participants in the course:

  1. are familiar with the classic retrieval models
  2. understand the limitations and assumptions associated with these models
  3. have insight and proficiency in the design and construction of search engines
  4. are familiar with the standard evaluation methods for IR systems
  5. are familiar with interaction techniques to support searchers in their quest for information
  6. have an understanding of how the searcher's context and behaviour can be used to enhance retrieval effectiveness
  7. have gained familiarity with recent scientific literature in this field
Content
While the rise of the internet has helped strengthen the field of Information Retrieval (IR), the area stretches far beyond plain web search, as a discipline situated between information science and computer science. In 1968, Gerard Salton defined information retrieval as "a field concerned with the structure, analysis, organization, storage, searching, and retrieval of information". Even though the area has seen many changes since that time and has made a tremendous impact (who has never used a search engine?!), that definition is still accurate.
IR takes the notion of "relevance" as its core concept. As the scope of IR is limited to those cases where computers try to identify the relevance of information objects given a user's information need (as opposed to humans doing that, the common scenario in information science), perhaps "Computational Relevance" would have been a better term for the research in this area.
In this course, we cover the following aspects of Information Retrieval:
  1. How do people search for information, and how can this be formalized?
  2. How can we take advantage of term statistics, structure and annotations to capture the meaning of texts?
  3. How can these elements be combined in order to find "relevant" information?
  4. What techniques are necessary to scale to large text collections?
Remarks
This year, the course will be revised and will differ from the Information Retrieval course of previous years.
Topics
The course consists of two main parts:

• Fundamentals
  • The term vocabulary and postings lists, inverted files
  • Stemming, normalization
  • Scoring, term weighting and the vector space model
  • Statistical language models and their application to IR
  • Evaluation
  • Relevance feedback and query expansion
• Exploration of IR application areas
  • Documents and structure
  • Document classification
  • PageRank / anchors
  • Social media and IR / click data
  • Recommender systems
  • User interaction aspects

Guest speakers may be invited to discuss state-of-the-art topics.
Assessment information
Written exam (divided into two parts, a mid-term and a final test), in addition to seminar presentations and practical work.
Prerequisites
Participants in Information Retrieval should have the basic qualifications provided by a bachelor's programme in Computing Science, Information Science or Artificial Intelligence.
Literature
The course uses the following two books:
• C.D. Manning, P. Raghavan, H. Schütze, Introduction to Information Retrieval, Publisher: Cambridge University Press.
Available online at http://nlp.stanford.edu/IR-book/
• W. Bruce Croft, Donald Metzler, Trevor Strohman, Search Engines: Information Retrieval in Practice, Publisher: Pearson.
See also: http://www.search-engines-book.com/
Online version from CIIR: SEIRiP.pdf
Recommended additional literature (new book):
Ryen W. White, Interactions with Search Systems, Publisher: Cambridge University Press.
http://www.cambridge.org/9781107034228
Lecture notes will be made available via Blackboard.
Teaching methods

• 30 hours lecture
• 34 hours problem session
• 104 hours individual study period
Additional information on teaching methods:

• The course is divided into two parts.
• Every week, a lecture discusses one topic in detail.
• Seminar - students present a recently published research paper.
• Practical assignment - students carry out an IR experiment.
Required materials
Book
C.D. Manning, P. Raghavan, H. Schütze, Introduction to Information Retrieval, Publisher: Cambridge University Press. Available online at http://nlp.stanford.edu/IR-book/
Book
W. Bruce Croft, Donald Metzler, Trevor Strohman, Search Engines: Information Retrieval in Practice, Publisher: Pearson. See also: http://www.search-engines-book.com/ Online version from CIIR: SEIRiP.pdf
Recommended materials
Book
Ryen W. White, Interactions with Search Systems, Publisher: Cambridge University Press. http://www.cambridge.org/9781107034228
Lecture notes
Lecture notes will be made available via Blackboard.
Teaching methods
Course event

Lecture

Problem session

Self-study

Tests
Exam
Weight: 1
Opportunities: Block KW2, Block KW3
