Manmatha's Home Page
I am interested in the areas of Information Retrieval, Computer Vision,
and at their intersection in Image and Video Retrieval. I am also interested
in Document Analysis and Recognition including the recognition and
retrieval of printed material particularly in Indian languages
and the recognition and retrieval of handwritten manuscripts.
I am a Research Associate Professor in the Department of Computer Science and work
with the Multi-media
Indexing and Retrieval (MIR) group at the Center
for Intelligent Information Retrieval (CIIR). The group's aim is to
index non-textual sources of information by either converting them to ASCII
text and using a search engine like INDRI or by directly indexing the
For a current list of publications go
here. This list is automatically generated from the CIIR publications website.
My current work focuses on:
- Automatically Annotating and Retrieving Images. Along
with Victor Lavrenko and my former students Jiwoon Jeon and Shaolei Feng
I have investigated a number of models in this area. These include both discrete
and continuous relevance models (CMRM, CRM, MBRM and NCRM), a maximum entropy
model and an inference network model. My current research in this area is
focused on Markov Random Field models - see the CIVR'08 paper.
See the publication list for papers.
Here is the first paper which started it all
- Distributed Image Search. I am doing this with Tingxin Yan and Deepak
Ganesan. We are investigating how to represent queries and images concisely
so that resource-limited devices (e.g., Imotes) may be used as sensor devices
and can be searched in a distributed manner. Here is our
- Indexing and Retrieving Handwritten Manuscripts. We are
particularly focused on George Washington's manuscripts. This work was
primarily done with Toni Rath my former student and
We use an approach based on relevance models which allow
us to use ASCII queries. We have also used an approach called word spotting
(using word image matching) - the idea being to create automatic indices.
Jamie Rothfeder, Nitin Srimal and I also investigated scale space techniques
for segmenting handwritten manuscripts.
Along with my former student Shaolei Feng and Prof. Nicholas Howe at Smith
College I also investigated some recognition models for such manuscripts.
Go to the publication list below to check out papers.
Check out this
of a handwriting retrieval system based on relevance models
for 1000 pages (8 GB of data) of George Washington's manuscripts based on
text queries. This is the first automatic (does not use manual annotations)
retrieval system for historical manuscripts.
- Alignment Techniques for Printed and Handwritten Documents.
I have investigated a number of techniques for aligning handwritten document
images to transcripts to automatically generate groundtruth. This includes
work with Micah Kornfield and James Allan using dynamic time warping and
with Jamie Rothfeder and Toni Rath using HMMs.
Along with Shaolei Feng, I investigated an automatic technique to align
OCR output for printed books and their electronic versions on Gutenberg.
- Searching Printed Indian Language Documents.
Along with Prof. C. V. Jawahar at IIIT Hyderabad and Anand Kumar, I am
investigating techniques to search printed documents in Indian languages for
which OCR systems are not readily available. This work uses locality-sensitive
hashing for fast search in a book.
My previous work includes:
- Meta Search (or combining the outputs of multiple search engines). This work
is based on modeling the score distributions of relevant documents as Gaussians
and those of non-relevant documents as exponentials. A mixture model can be solved
using Expectation-Maximization to recover the parameters of these distributions
when relevance is unknown.
- I have also worked on image matching under deformations
(affine, similarity), image retrieval using color and appearance,
text detection in images,
and the scale space segmentation of handwritten manuscripts.
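The score-distribution idea in the Meta Search item above can be illustrated with a small sketch. It is not the code used in the papers; it is a minimal example, assuming non-negative scores, a fixed number of EM iterations, and a crude initialization that treats the top 20% of scores as relevant. The function names (fit_score_mixture, probability_relevant) are hypothetical and chosen only for this illustration.

```python
import math
import random


def gaussian_pdf(x, mu, sigma):
    """Density of a normal distribution N(mu, sigma^2) at x."""
    z = (x - mu) / sigma
    return math.exp(-0.5 * z * z) / (sigma * math.sqrt(2.0 * math.pi))


def exponential_pdf(x, lam):
    """Density of an exponential distribution with rate lam at x."""
    return lam * math.exp(-lam * x) if x >= 0.0 else 0.0


def fit_score_mixture(scores, iterations=200):
    """EM for p(s) = pi * N(s; mu, sigma) + (1 - pi) * Exp(s; lam).

    The Gaussian component models relevant documents and the
    exponential component models non-relevant ones.
    """
    n = len(scores)
    ordered = sorted(scores)
    # Assumed initialization: the top 20% of scores seed the Gaussian.
    top = ordered[int(0.8 * n):]
    mu = sum(top) / len(top)
    sigma = max(math.sqrt(sum((s - mu) ** 2 for s in top) / len(top)), 1e-3)
    lam = 1.0 / max(sum(ordered) / n, 1e-9)
    pi = 0.2
    for _ in range(iterations):
        # E-step: posterior probability that each score is "relevant".
        resp = []
        for s in scores:
            g = pi * gaussian_pdf(s, mu, sigma)
            e = (1.0 - pi) * exponential_pdf(s, lam)
            total = g + e
            resp.append(g / total if total > 0.0 else 0.5)
        # M-step: weighted maximum-likelihood updates.
        r_sum = sum(resp)
        nr_sum = n - r_sum
        if r_sum < 1e-9 or nr_sum < 1e-9:
            break  # one component has collapsed; keep the last estimates
        pi = r_sum / n
        mu = sum(r * s for r, s in zip(resp, scores)) / r_sum
        var = sum(r * (s - mu) ** 2 for r, s in zip(resp, scores)) / r_sum
        sigma = max(math.sqrt(var), 1e-3)
        weighted = sum((1.0 - r) * s for r, s in zip(resp, scores))
        lam = nr_sum / max(weighted, 1e-9)
    return {"pi": pi, "mu": mu, "sigma": sigma, "lam": lam}


def probability_relevant(score, params):
    """Posterior probability of relevance for a single score."""
    g = params["pi"] * gaussian_pdf(score, params["mu"], params["sigma"])
    e = (1.0 - params["pi"]) * exponential_pdf(score, params["lam"])
    return g / (g + e) if g + e > 0.0 else 0.5


if __name__ == "__main__":
    random.seed(0)
    # Synthetic scores: 800 non-relevant (exponential), 200 relevant (Gaussian).
    data = [random.expovariate(5.0) for _ in range(800)]
    data += [random.gauss(3.0, 0.5) for _ in range(200)]
    params = fit_score_mixture(data)
    print(params)
    print(probability_relevant(3.0, params))
```

Once each engine's scores are mapped to posterior probabilities of relevance in this way, the outputs of different engines are on a common scale, which is one plausible way to combine them.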
is a list of papers that I have been involved with or associated with.
This is my picture.
I am a co-founder of and technical advisor to SnapTell, a mobile image search company.
I used to write stories. Here are two samples, The
Shadow and Marshall
Teddy, if you have time to kill.
manmatha at cs.umass.edu
Quote for the day.
From Ignazio Silone's Fontamara
At the head of everything is God, the Lord of Heaven.
Everyone knows that.
Then comes Prince Torlonia, lord of the earth.
Then come Prince Torlonia's guards.
Then come Prince Torlonia's guards' dogs.
Then, nothing at all.
Then, nothing at all.
Then, nothing at all.
Then come the peasants. And that's all.
Click here to go to
the CIIR multi-media indexing and retrieval homepage - another old page.