edu.umass.cs.mallet.base.extract
Interface TokenizationFilter

All Known Implementing Classes:
BIOTokenizationFilter, ConfidenceTokenizationFilter, DefaultTokenizationFilter, HierarchicalTokenizationFilter

public interface TokenizationFilter

Created: Nov 12, 2004


Method Summary
 LabeledSpans constructLabeledSpans(LabelAlphabet dict, java.lang.Object document, Label backgroundTag, Tokenization input, Sequence seq)
          Converts a sequence of labels into a set of labeled spans.
 

Method Detail

constructLabeledSpans

public LabeledSpans constructLabeledSpans(LabelAlphabet dict,
                                          java.lang.Object document,
                                          Label backgroundTag,
                                          Tokenization input,
                                          Sequence seq)
Converts a sequence of labels into a set of labeled spans. In effect, this turns the per-token output of a sequence labeler into extraction output, that is, labeled spans over the document.
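
The conversion can be illustrated with a minimal, self-contained sketch. It does not use the MALLET classes and is not the implementation of any of the filters listed above: SpanSketch, Span, and toSpans are hypothetical names, labels are plain strings, and the sketch assumes that adjacent tokens with the same non-background label form one span and that background tokens are simply dropped.

```java
import java.util.ArrayList;
import java.util.List;

public class SpanSketch {

    /** Hypothetical stand-in for a labeled span: a label over tokens [start, end). */
    public static final class Span {
        public final String label;
        public final int start; // index of the first token in the span
        public final int end;   // index one past the last token in the span

        public Span(String label, int start, int end) {
            this.label = label;
            this.start = start;
            this.end = end;
        }
    }

    /**
     * Groups runs of identical, non-background labels into spans.
     * labels holds one label per token; background marks tokens outside any span.
     */
    public static List<Span> toSpans(List<String> labels, String background) {
        List<Span> spans = new ArrayList<>();
        String current = background; // label of the run being scanned
        int start = -1;              // token index where that run began

        // Iterate one step past the end so the final run is closed as well.
        for (int i = 0; i <= labels.size(); i++) {
            String tag = i < labels.size() ? labels.get(i) : background;
            if (!tag.equals(current)) {
                if (!current.equals(background)) {
                    spans.add(new Span(current, start, i));
                }
                current = tag;
                start = i;
            }
        }
        return spans;
    }
}
```

Under these assumptions, the label sequence O PER PER O LOC with background label O gives two spans: PER over tokens 1 to 2 and LOC over token 4. An actual filter may group tokens differently, for example by honoring B-/I- prefixes.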

Parameters:
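dict - the label alphabet
document - the source document object
backgroundTag - the label treated as background
input - the tokenization of the document
seq - the sequence of labels to convert
Returns:
the labeled spans constructed from seq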