edu.umass.cs.mallet.base.pipe.tsf
Class Target2BIOFormat
java.lang.Object
edu.umass.cs.mallet.base.pipe.Pipe
edu.umass.cs.mallet.base.pipe.tsf.Target2BIOFormat
- All Implemented Interfaces:
- java.io.Serializable
- public class Target2BIOFormat
- extends Pipe
- implements java.io.Serializable
Creates a LabelSequence out of a TokenSequence that is the target of
an Instance. A label is constructed from each Token in the
TokenSequence so that the sequence conforms to BIO format (Begin,
Inside, Outside of a Segment). "B-" is prepended to the tag of a Token
that leaves the background state, and "I-" is prepended to a tag that
is the same as the tag of the previous Token. NOTE: This class assumes
that consecutive identical tags belong to the same Segment, so two
adjacent Segments with the same tag cannot be told apart: the output
is always B I I, never B B I.
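The labelling rule described above can be sketched without MALLET as a plain function over tag strings. This is a minimal illustration, not the class's actual source: the class name BioTagSketch and the method toBio are hypothetical, and it assumes that a non-background tag which differs from the previous tag starts a new Segment (the description does not state this case explicitly).

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical stand-alone helper for illustration; it is not part of MALLET.
final class BioTagSketch {

    // Converts raw per-token tags into BIO labels.
    // Assumption: a non-background tag that differs from the previous tag
    // begins a new Segment and therefore receives "B-".
    static List<String> toBio(List<String> tags, String background) {
        List<String> labels = new ArrayList<>(tags.size());
        // Treat the position before the first token as background.
        String previous = background;
        for (String tag : tags) {
            if (tag.equals(background)) {
                labels.add(tag);          // background tags pass through unchanged
            } else if (tag.equals(previous)) {
                labels.add("I-" + tag);   // same tag as the previous token: inside
            } else {
                labels.add("B-" + tag);   // leaving background (or tag changed): begin
            }
            previous = tag;
        }
        return labels;
    }

    public static void main(String[] args) {
        List<String> tags = Arrays.asList("O", "PER", "PER", "O", "LOC", "LOC", "LOC");
        System.out.println(toBio(tags, "O"));
        // prints [O, B-PER, I-PER, O, B-LOC, I-LOC, I-LOC]
    }
}
```

Because the decision depends only on the previous tag, two distinct PER Segments that sit next to each other come out as B-PER I-PER, which is the limitation the NOTE describes.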
- See Also:
- Serialized Form
Methods inherited from class edu.umass.cs.mallet.base.pipe.Pipe
getDataAlphabet, getInstanceId, getParent, getParentRoot, getTargetAlphabet, isDataAlphabetSet, isTargetProcessing, pipe, readResolve, resolveDataAlphabet, resolveTargetAlphabet, setDataAlphabet, setParent, setTargetAlphabet, setTargetProcessing
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
Target2BIOFormat
public Target2BIOFormat()
Target2BIOFormat
public Target2BIOFormat(java.lang.String background)
- Parameters:
background - represents Tokens that are not part of a target Segment.
pipe
public Instance pipe(Instance carrier)
- Description copied from class: Pipe
- Process an Instance. This method takes an input Instance,
destructively modifies it in some way, and returns it.
This is the method by which all pipes are eventually run.
One can create a new concrete subclass of Pipe simply by
implementing this method.
- Specified by:
pipe in class Pipe
- Parameters:
carrier - Instance to be processed.