David A. Smith

Research Assistant Professor (Sept. 2008),
Department of Computer Science
Center for Intelligent Information Retrieval

Department of Computer Science
140 Governors Drive, Box 9264
University of Massachusetts, Amherst
Amherst, MA 01003-9264


Bio: David Smith will receive his Ph.D. this year from Johns Hopkins University's Computer Science Department, where he is a member of the Center for Language and Speech Processing. He received his A.B. in classics from Harvard University. His interests are in machine translation, natural language parsing, and semi-supervised machine learning methods. David was formerly head programmer for the Perseus Digital Library Project at Tufts University, where he strayed from the path of classical philology toward text mining, geocoding, and information extraction.

Current Home Page (Johns Hopkins University)

Research interests:  Machine translation, natural language parsing, semi-supervised machine learning methods, digital libraries

Some Recent Publications:

David A. Smith and Jason Eisner. Bootstrapping feature-rich dependency parsers with entropic priors. In Proceedings of the Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL), pages 667-677, 2007.

David A. Smith and Noah A. Smith. Probabilistic models of nonprojective dependency trees. In Proceedings of the Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL), pages 132-140, 2007.

David A. Smith and Jason Eisner. Quasi-synchronous grammars: Alignment by soft projection of syntactic dependencies. In Proceedings of the HLT-NAACL Workshop on Statistical Machine Translation, pages 23-30, 2006.

David A. Smith and Jason Eisner. Minimum risk annealing for training log-linear models. In Proceedings of the International Conference on Computational Linguistics and the Association for Computational Linguistics, pages 787-794, 2006.

Noah A. Smith, David A. Smith, and Roy W. Tromble. Context-based morphological disambiguation with random fields. In Proceedings of the Conference on Human Language Technology and Empirical Methods in Natural Language Processing, pages 475-482, 2005.

David A. Smith and Noah A. Smith. Bilingual parsing with factored estimation: Using English to parse Korean. In Proceedings of the Conference on Empirical Methods in Natural Language Processing, pages 49-56, 2004.