edu.umass.cs.mallet.base.pipe
Class CharSequence2CharNGrams

java.lang.Object
  extended by edu.umass.cs.mallet.base.pipe.Pipe
      extended by edu.umass.cs.mallet.base.pipe.CharSequence2CharNGrams
All Implemented Interfaces:
java.io.Serializable

public class CharSequence2CharNGrams
extends Pipe
implements java.io.Serializable

Transforms a character sequence into a token sequence of character n-grams.

See Also:
Serialized Form

Constructor Summary
CharSequence2CharNGrams(int n, boolean distinguishBorders)
           
 
Method Summary
protected  java.lang.String[] ngramify(java.lang.CharSequence s)
           
 Instance pipe(Instance carrier)
          Process an Instance.
 
Methods inherited from class edu.umass.cs.mallet.base.pipe.Pipe
getDataAlphabet, getInstanceId, getParent, getParentRoot, getTargetAlphabet, isDataAlphabetSet, isTargetProcessing, pipe, readResolve, resolveDataAlphabet, resolveTargetAlphabet, setDataAlphabet, setParent, setTargetAlphabet, setTargetProcessing
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

CharSequence2CharNGrams

public CharSequence2CharNGrams(int n,
                               boolean distinguishBorders)

Method Detail

ngramify

protected java.lang.String[] ngramify(java.lang.CharSequence s)
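
This page does not describe how ngramify builds its result. The following is a minimal standalone sketch of character n-gram extraction, not the MALLET implementation. It assumes a sliding window of width n over the input, and it assumes that distinguishing borders means wrapping the text in marker characters before windowing. The class name CharNGramSketch, the three-argument signature, and the '>' and '<' markers are all hypothetical.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of MALLET.
final class CharNGramSketch {

    // Returns every contiguous substring of length n, in order of appearance.
    static String[] ngramify(CharSequence s, int n, boolean distinguishBorders) {
        if (n <= 0) {
            throw new IllegalArgumentException("n must be positive");
        }
        // Assumption: borders are marked by surrounding the text with '>' and '<'.
        CharSequence text = distinguishBorders ? ">" + s + "<" : s;

        List<String> ngrams = new ArrayList<>();
        // Slide a window of width n across the text, one character at a time.
        for (int i = 0; i + n <= text.length(); i++) {
            ngrams.add(text.subSequence(i, i + n).toString());
        }
        return ngrams.toArray(new String[0]);
    }
}
```

Under these assumptions, an input shorter than n yields an empty array, and enabling border markers adds n-grams that record where the sequence starts and ends.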

pipe

public Instance pipe(Instance carrier)
Description copied from class: Pipe
Process an Instance. This method takes an input Instance, destructively modifies it in some way, and returns it. This is the method by which all pipes are eventually run.

One can create a new concrete subclass of Pipe simply by implementing this method.

Specified by:
pipe in class Pipe
Parameters:
carrier - Instance to be processed.
Returns:
the same Instance, after it has been modified.
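
The contract described above (take an Instance, modify it in place, return it) can be illustrated with a small self-contained sketch. SketchPipe and SketchInstance below are hypothetical stand-ins written for this example so that it compiles without MALLET; they are not the real Pipe and Instance classes, and their methods are assumptions. LowercasePipe is likewise an invented example subclass.

```java
import java.util.Locale;

// Hypothetical stand-in for an instance that carries a data object.
final class SketchInstance {
    private Object data;

    SketchInstance(Object data) { this.data = data; }

    Object getData() { return data; }

    void setData(Object data) { this.data = data; }
}

// Hypothetical stand-in for an abstract pipe with a single method to implement.
abstract class SketchPipe {
    abstract SketchInstance pipe(SketchInstance carrier);
}

// A concrete pipe is defined by implementing pipe(): it replaces the
// carrier's data in place and returns the same carrier object.
final class LowercasePipe extends SketchPipe {
    @Override
    SketchInstance pipe(SketchInstance carrier) {
        CharSequence text = (CharSequence) carrier.getData();
        carrier.setData(text.toString().toLowerCase(Locale.ROOT));
        return carrier;
    }
}
```

Because the carrier is modified destructively, a caller that still needs the original data would have to copy it before passing the instance to the pipe.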