Module Description (excerpted from the bulletin): This module discusses the basic concepts and methods of information retrieval including capturing, representing, storing, organizing, and retrieving unstructured or loosely structured information. The most well-known aspect of information retrieval is document retrieval: the process of indexing and retrieving text documents. However, the field of information retrieval includes almost any type of unstructured or semi-structured data, including newswire stories, transcribed speech, email, blogs, images, or video. Therefore, information retrieval is a critical aspect of Web search engines. This module also serves as the foundation for subsequent modules on the understanding, processing and retrieval of particular web media.
N.B. We will be teaching and using the Python programming language throughout this class. We will be using Python 2.6.6 instead of the newer Python 3.x, as the NLTK library that we will also be using is currently incompatible with 3.x.
Note: There will be only five tutorials, held every other week; each tutorial covers a subject related to a homework assignment.
Note to NUS-external visitors: Welcome! If you're a fellow I.R. course instructor looking for lecture material, you can see the syllabus menu item on the left for a preview. Please contact me if you'd like to use any of my material. Thanks!