Mining a Million Books project starts, part of NSF Massive Data program

October 2009

The NSF Massive Data program project “Mining a Million Scanned Books: Linguistic and Structure Analysis, Fast Expanded Search, and Improved OCR” starts. PIs: James Allan, R. Manmatha, and David Smith. NSF funding: $2.1 million, from 10/01/09 to 09/30/15.