Home
Research
Vita
Personal
Research

Publications

Code

  • NER is a Java-implemented named entity tagger based on Conditional Random Fields. It recognizes 7 types of entities: Person, Organization, Location, Time, Date, Money and Percent.

    Usage Example

    java -Xmx200m -jar ner.jar inputFile_1 inputFile_2 ... inputFile_n
    It will generate inputFile_1.ner, inputFile_2.ner, ..., inputFile_n.ner for the tagged text.
  • PAM is a C-code implementation of the Pachinko Allocation model. Check out README.txt for usage instructions. This is a very preliminary version. A new version will be posted later.