edu.umass.cs.mallet.base.pipe
Class SelectiveSGML2TokenSequence
java.lang.Object
edu.umass.cs.mallet.base.pipe.Pipe
edu.umass.cs.mallet.base.pipe.SelectiveSGML2TokenSequence
- All Implemented Interfaces:
- java.io.Serializable
- public class SelectiveSGML2TokenSequence
- extends Pipe
- implements java.io.Serializable
Similar to SGML2TokenSequence
, except that only the tags
listed in allowedTags
are converted to Label
s.
- See Also:
- Serialized Form
Methods inherited from class edu.umass.cs.mallet.base.pipe.Pipe |
getDataAlphabet, getInstanceId, getParent, getParentRoot, getTargetAlphabet, isDataAlphabetSet, isTargetProcessing, pipe, readResolve, resolveDataAlphabet, resolveTargetAlphabet, setDataAlphabet, setParent, setTargetAlphabet, setTargetProcessing |
Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait |
SelectiveSGML2TokenSequence
public SelectiveSGML2TokenSequence(CharSequenceLexer lexer,
java.lang.String backgroundTag,
java.util.Set allowed)
- Parameters:
lexer
- to tokenize inputbackgroundTag
- default tag when not in any other tagallowed
- set of tags (Strings) that will be converted to
labels
SelectiveSGML2TokenSequence
public SelectiveSGML2TokenSequence(java.lang.String regex,
java.lang.String backgroundTag,
java.util.Set allowed)
SelectiveSGML2TokenSequence
public SelectiveSGML2TokenSequence(java.util.Set allowed)
SelectiveSGML2TokenSequence
public SelectiveSGML2TokenSequence(CharSequenceLexer lex,
java.util.Set allowed)
pipe
public Instance pipe(Instance carrier)
- Description copied from class:
Pipe
- Process an Instance. This method takes an input Instance,
destructively modifies it in some way, and returns it.
This is the method by which all pipes are eventually run.
One can create a new concrete subclass of Pipe simply by
implementing this method.
- Specified by:
pipe
in class Pipe
- Parameters:
carrier
- Instance to be processed.
toString
public java.lang.String toString()