edu.umass.cs.mallet.share.casutton.ner
Class ConllNer2003Sentence2TokenSequence

java.lang.Object
  extended by edu.umass.cs.mallet.base.pipe.Pipe
      extended by edu.umass.cs.mallet.share.casutton.ner.ConllNer2003Sentence2TokenSequence
All Implemented Interfaces:
java.io.Serializable

public class ConllNer2003Sentence2TokenSequence
extends Pipe

Reads a data file in CoNLL 2003 format and applies some simple transformations. Unlike the version in mccallum.ner, this class does not expect the data file to contain fields for tags and phrases when those features are turned off. It also does not look for a target field if isTargetProcessing() is false.
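The field handling described above can be illustrated with a small standalone sketch. It is not this class's implementation and does not use MALLET. The names ConllLineSketch, Token, and parseSentence are hypothetical, and the sketch assumes that a column is simply absent from the file when its feature is turned off, with the remaining columns keeping the standard CoNLL 2003 order (word, part-of-speech tag, chunk tag, named-entity label).

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration only; not part of MALLET.
class ConllLineSketch {

    /** One parsed token. Fields that were not read stay null. */
    static final class Token {
        final String word;
        final String tag;     // part-of-speech tag, or null if useTags is false
        final String phrase;  // chunk (phrase) tag, or null if usePhrases is false
        final String target;  // named-entity label, or null if targets are not read

        Token(String word, String tag, String phrase, String target) {
            this.word = word;
            this.tag = tag;
            this.phrase = phrase;
            this.target = target;
        }

        @Override
        public String toString() {
            return word + " [tag=" + tag + ", phrase=" + phrase + ", target=" + target + "]";
        }
    }

    /**
     * Parses one sentence (one token per line, whitespace-separated columns).
     * Assumption: columns for disabled features are absent from the input.
     */
    static List<Token> parseSentence(String sentence, boolean useTags,
                                     boolean usePhrases, boolean targetProcessing) {
        int expected = 1 + (useTags ? 1 : 0) + (usePhrases ? 1 : 0) + (targetProcessing ? 1 : 0);
        List<Token> tokens = new ArrayList<>();
        for (String rawLine : sentence.split("\n")) {
            String line = rawLine.trim();
            if (line.isEmpty()) {
                continue; // skip blank lines
            }
            String[] fields = line.split("\\s+");
            if (fields.length < expected) {
                throw new IllegalArgumentException(
                        "Expected at least " + expected + " fields but found "
                        + fields.length + " in line: " + line);
            }
            int i = 0;
            String word = fields[i++];
            String tag = useTags ? fields[i++] : null;
            String phrase = usePhrases ? fields[i++] : null;
            String target = targetProcessing ? fields[i] : null;
            tokens.add(new Token(word, tag, phrase, target));
        }
        return tokens;
    }

    public static void main(String[] args) {
        // All four columns present, all features on.
        String full = "EU NNP I-NP I-ORG\nrejects VBZ I-VP O\nGerman JJ I-NP I-MISC\n";
        for (Token t : parseSentence(full, true, true, true)) {
            System.out.println(t);
        }
        // Words only: tags, phrases, and targets are neither present nor read.
        for (Token t : parseSentence("EU\nrejects\nGerman\n", false, false, false)) {
            System.out.println(t);
        }
    }
}
```

In this sketch the three flags play the roles of useTags, usePhrases, and the value returned by isTargetProcessing(); how the real class lays out or skips columns may differ.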

See Also:
Serialized Form

Constructor Summary
ConllNer2003Sentence2TokenSequence()
           
ConllNer2003Sentence2TokenSequence(boolean useTags, boolean usePhrases)
           
 
Method Summary
 Instance pipe(Instance carrier)
          Process an Instance.
 
Methods inherited from class edu.umass.cs.mallet.base.pipe.Pipe
getDataAlphabet, getInstanceId, getParent, getParentRoot, getTargetAlphabet, isDataAlphabetSet, isTargetProcessing, pipe, readResolve, resolveDataAlphabet, resolveTargetAlphabet, setDataAlphabet, setParent, setTargetAlphabet, setTargetProcessing
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

ConllNer2003Sentence2TokenSequence

public ConllNer2003Sentence2TokenSequence()

ConllNer2003Sentence2TokenSequence

public ConllNer2003Sentence2TokenSequence(boolean useTags,
                                          boolean usePhrases)
Method Detail

pipe

public Instance pipe(Instance carrier)
Description copied from class: Pipe
Process an Instance. This method takes an input Instance, destructively modifies it in some way, and returns it. This is the method by which all pipes are eventually run.

One can create a new concrete subclass of Pipe simply by implementing this method.

Specified by:
pipe in class Pipe
Parameters:
carrier - Instance to be processed.
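
The contract described above (take an Instance, modify it destructively, return it) can be sketched without MALLET on the classpath. SketchInstance and SketchPipe below are hypothetical stand-ins for Instance and Pipe, reduced to the parts needed for the illustration; they are not the real MALLET classes, and WhitespaceSplitPipe is an invented example subclass.

```java
// Hypothetical stand-in for Instance: holds only a mutable data field.
class SketchInstance {
    private Object data;

    SketchInstance(Object data) {
        this.data = data;
    }

    Object getData() {
        return data;
    }

    void setData(Object data) {
        this.data = data;
    }
}

// Hypothetical stand-in for Pipe: a single abstract method to implement.
abstract class SketchPipe {
    abstract SketchInstance pipe(SketchInstance carrier);
}

// A concrete pipe: replaces the carrier's String data with an array of tokens.
class WhitespaceSplitPipe extends SketchPipe {
    @Override
    SketchInstance pipe(SketchInstance carrier) {
        String text = (String) carrier.getData();
        // Destructive modification: the original String is replaced in place.
        carrier.setData(text.trim().split("\\s+"));
        // The same carrier object is returned, not a copy.
        return carrier;
    }
}

class PipeSketchDemo {
    public static void main(String[] args) {
        SketchInstance carrier = new SketchInstance("EU rejects German call");
        SketchInstance result = new WhitespaceSplitPipe().pipe(carrier);
        for (String token : (String[]) result.getData()) {
            System.out.println(token);
        }
    }
}
```

Returning the same carrier rather than a copy mirrors the "destructively modifies it ... and returns it" wording of the description.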