Q: Lemur doesn't seem to have an easy way of retrieving the text of a document given its docID. So, when I get the result of a query, I can either grep in the data file for the docID, or write some code to dump the text associated with a docID (something to do with a DocumentManager). Is the point of this problem set to provide an interface for displaying retrieved documents, or should I just search around in the data file?
A: The point of the exercise is really to come up with the queries, so just searching around will do for now. However, it will be useful to pull up documents in the future, so if you invest the time in doing that, you will be happy that you did.
--
ToniRath - 03 Oct 2003
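For the "search around in the data file" route mentioned above, a small script can stand in for grep. This is only a sketch: it assumes the data file uses TREC-style markup (`<DOC>`, `<DOCNO>`), which has not been confirmed for this collection, and the function names `iter_docs`, `find_doc`, and `find_doc_in_file` are made up for illustration. It does not use Lemur or its DocumentManager at all.

```python
import re
from typing import Iterator, Optional, Tuple

# Assumption: documents are wrapped in TREC-style <DOC>...</DOC> blocks and
# each block carries its docID in a <DOCNO> tag. Adjust if the file differs.
DOC_RE = re.compile(r"<DOC>(.*?)</DOC>", re.DOTALL | re.IGNORECASE)
DOCNO_RE = re.compile(r"<DOCNO>\s*(.*?)\s*</DOCNO>", re.DOTALL | re.IGNORECASE)


def iter_docs(text: str) -> Iterator[Tuple[str, str]]:
    """Yield (docID, block contents) for every <DOC> block that has a <DOCNO>."""
    for match in DOC_RE.finditer(text):
        body = match.group(1)
        docno = DOCNO_RE.search(body)
        if docno:
            yield docno.group(1), body.strip()


def find_doc(text: str, doc_id: str) -> Optional[str]:
    """Return the contents of the block whose docID equals doc_id, else None."""
    for current_id, body in iter_docs(text):
        if current_id == doc_id:
            return body
    return None


def find_doc_in_file(path: str, doc_id: str, encoding: str = "latin-1") -> Optional[str]:
    """Read the whole data file into memory and look up one docID (hypothetical helper)."""
    with open(path, encoding=encoding, errors="replace") as handle:
        return find_doc(handle.read(), doc_id)
```

Calling `find_doc_in_file("govcrawl-tags.docs", some_doc_id)` would then return the raw block for that docID, or `None` if it is not found. Reading the whole file at once keeps the sketch short; a very large collection would call for a streaming version.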
Q: When I indexed govcrawl-tags.docs with Lemur, it reported "15,298 documents processed", but the P2 webpage says that there are 15,556 documents in total in this collection. Am I missing something?
--
ChiragShah - 06 Oct 2003
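One way to narrow down a count mismatch like this is to count the document markers in the data file directly and compare them with the two reported figures. The sketch below assumes TREC-style `<DOC>` and `<DOCNO>` tags, which has not been confirmed for this file, and `summarize` is a hypothetical helper name. It does not explain the difference by itself; it only shows whether blocks lack a docID or whether docIDs repeat.

```python
import re
from collections import Counter
from typing import Dict, List, Union

# Assumption: TREC-style markup. "<DOC>" does not match "<DOCNO>" because the
# pattern requires ">" immediately after "DOC".
DOC_OPEN_RE = re.compile(r"<DOC>", re.IGNORECASE)
DOCNO_RE = re.compile(r"<DOCNO>\s*(.*?)\s*</DOCNO>", re.DOTALL | re.IGNORECASE)


def summarize(text: str) -> Dict[str, Union[int, List[str]]]:
    """Count <DOC> openings, <DOCNO> tags, distinct docIDs, and list repeated docIDs."""
    ids = DOCNO_RE.findall(text)
    repeated = sorted(doc_id for doc_id, n in Counter(ids).items() if n > 1)
    return {
        "doc_tags": len(DOC_OPEN_RE.findall(text)),
        "docnos": len(ids),
        "unique_docnos": len(set(ids)),
        "repeated": repeated,
    }
```

Running `summarize(open("govcrawl-tags.docs", encoding="latin-1").read())` and comparing `doc_tags` and `unique_docnos` against 15,298 and 15,556 would show which of the numbers the file itself supports.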