Resources
(Not yet updated for Fall 2007)
Textbooks
and other collections
There is no required text for this course. The following sources may be
useful. Some are on-line (note in
particular MRS07). Those with a
bold reference are on reserve in the Science Library on campus in the LGRC lowrise.
- [BYRN99] Modern
Information Retrieval, by Baeza-Yates and Ribeiro-Netao, 1999. Published by Addison-Wesley,
ISBN 020139829X. This text does a reasonable job of hitting most of
the topics that we'll discuss in class. It unfortunately has no coverage
of the statistical language modeling technique that we'll spend a fair bit of
time talking about--but no other text does, either. Marti Hearst keeps a companion website
for the book.
- [CVR79] Information Retrieval, Butterworths, by Keith van Rijsbergen, 1979.
Out of print, but available on-line.
- [FBY92] Information Retrieval: Data
Structures and Algorithms, William B. Frakes and Ricardo Baeza-Yates,
Prentice Hall, 1992.
- [GF04] Information Retrieval: Algorithms and
Heuristics, by Grossman and Frieder, 2004. Published by Springer.
- [KM97] Information Storage and
Retrieval Systems: Theory and Implementation , by Gerald J. Kowalski
and Mark T. Maybury, Kluwer Academic Publishers, 2000 (second edition).
An earlier version of this text is Information Retrieval Systems:
Theory and Implementation, Gerald Kowalski, Kluwer Academic Publishers, 1997.
- [MRS07] Introduction to Information Retrieval,
by Manning, Raghavan, and Schütze,Cambridge
University Press, 2007. This new (as yet unpublished) text has
a companion
website that includes chapters of the text as well as supporting slides in
PowerPoint.
- [MS99] Foundations of
Statistical Natural Language Processing, Christopher D. Manning and Hinrich Schütze, MIT Press,
1999.
- [MZH05] “Recommended
reading for IR research students” by Moffat, Zobel,
and Hawking. Appeared in the SIGIR Forum, 39(2), 2005. A list of readings collected by
attendees at the SWIRL
2004 workshop held in Lorne,
Australia. Available on-line
in PDF.
- [S89] Automatic Text Processing,
Gerard Salton, Addison Wesley, 1989.
- [SJW97] Readings in
Information Retrieval, edited by Sparck Jones
and Willet. Published by Morgan Kaufmann, ISBN 1558604545. This is
a collection of "significant" papers in the field of information retrieval,
along with nice overviews of parts of the field to introduce various sections.
It is not a textbook, but is an excellent resource for additional
reading on some topics.
- [WMB99] Managing Gigabytes, 2nd
edition, by Witten, Moffat, and
Bell, 1999. Published by
Morgan Kaufmann, ISBN 1558605703. This is a very well done text that
focuses on the implementation side of information retrieval systems, with
particular focus on compression techniques for dealing with large amounts of
data. The companion website contains errata,
software, and so on.
- A
quite comprehensive list of additional resources has been
collected by Hinrich Schütze of
Stuttgart.