Confidence Measures for Information Extraction of Entities, Relations and Object Correspondence

Principal Investigator:

Andrew McCallum, PI
mccallum@cs.umass.edu

Information Extraction and Synthesis Laboratory (IESL)/
Center for Intelligent Information Retrieval (CIIR)
Department of Computer Science
140 Governors Drive
University of Massachusetts
Amherst, MA 01003-9264

In this NSF KDD project, UMass Amherst aims to advance the state of the art in associating confidence measures with information extracted from unstructured text. The team will build on its earlier successful research on probabilistic models for confidence assessment of individual extracted text segments, and will add new capabilities for confidence assessment of object correspondence and of relations between entities. Additional research tasks include digesting large bodies of email text through a combination of extraction and social network analysis, and using confidence estimates to actively direct information gathering.