edu.umass.cs.mallet.base.pipe
Class TokenSequenceRemoveStopwords
java.lang.Object
edu.umass.cs.mallet.base.pipe.Pipe
edu.umass.cs.mallet.base.pipe.TokenSequenceRemoveStopwords
- All Implemented Interfaces:
- java.io.Serializable
- public class TokenSequenceRemoveStopwords
- extends Pipe
- implements java.io.Serializable
Remove tokens from the token sequence in the data field whose text is in the stopword list.
- See Also:
- Serialized Form
Methods inherited from class edu.umass.cs.mallet.base.pipe.Pipe |
getDataAlphabet, getInstanceId, getParent, getParentRoot, getTargetAlphabet, isDataAlphabetSet, isTargetProcessing, pipe, readResolve, resolveDataAlphabet, resolveTargetAlphabet, setDataAlphabet, setParent, setTargetAlphabet, setTargetProcessing |
Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
TokenSequenceRemoveStopwords
public TokenSequenceRemoveStopwords(boolean caseSensitive,
boolean markDeletions)
TokenSequenceRemoveStopwords
public TokenSequenceRemoveStopwords(boolean caseSensitive)
TokenSequenceRemoveStopwords
public TokenSequenceRemoveStopwords()
setCaseSensitive
public TokenSequenceRemoveStopwords setCaseSensitive(boolean flag)
setMarkDeletions
public TokenSequenceRemoveStopwords setMarkDeletions(boolean flag)
pipe
public Instance pipe(Instance carrier)
- Description copied from class:
Pipe
- Process an Instance. This method takes an input Instance,
destructively modifies it in some way, and returns it.
This is the method by which all pipes are eventually run.
One can create a new concrete subclass of Pipe simply by
implementing this method.
- Specified by:
pipe
in class Pipe
- Parameters:
carrier
- Instance to be processed.