edu.umass.cs.mallet.base.extract
Class RegexFieldCleaner
java.lang.Object
edu.umass.cs.mallet.base.extract.RegexFieldCleaner
All Implemented Interfaces:
    FieldCleaner

public class RegexFieldCleaner
extends java.lang.Object
implements FieldCleaner
A field cleaner that removes every match of a given regular expression from a field value.
Created: Nov 26, 2004
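
The removal behavior described above can be illustrated with plain java.util.regex. This is only a sketch of the general technique (replacing every match with the empty string), not MALLET's actual source; the class name RegexRemovalSketch and the helper removeAll are hypothetical names introduced for illustration.

```java
import java.util.regex.Pattern;

// Hypothetical demo class; not part of MALLET.
class RegexRemovalSketch {

    // Deletes every match of the pattern by replacing it with the empty string.
    static String removeAll(Pattern pattern, String raw) {
        return pattern.matcher(raw).replaceAll("");
    }

    public static void main(String[] args) {
        Pattern digits = Pattern.compile("[0-9]+");
        // Prints "Room , Floor " (the digit runs are gone, everything else is kept).
        System.out.println(removeAll(digits, "Room 101, Floor 3"));
    }
}
```

Note that text surrounding a match is left untouched, so removing tokens can leave behind doubled or trailing whitespace.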
Method Summary
java.lang.String cleanFieldValue(java.lang.String rawFieldValue)
    Returns a post-processed version of a field.
Methods inherited from class java.lang.Object
    clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
REMOVE_PUNCT
public static final java.lang.String REMOVE_PUNCT
See Also:
    Constant Field Values
RegexFieldCleaner
public RegexFieldCleaner(java.lang.String regex)
RegexFieldCleaner
public RegexFieldCleaner(java.util.regex.Pattern regex)
cleanFieldValue
public java.lang.String cleanFieldValue(java.lang.String rawFieldValue)
Description copied from interface: FieldCleaner
    Returns a post-processed version of a field.
Specified by:
    cleanFieldValue in interface FieldCleaner
Parameters:
    rawFieldValue - the raw field value to clean
Returns:
    A processed string
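
To show how the two constructors and cleanFieldValue fit together, here is a minimal standalone sketch. It does not use MALLET itself: FieldCleanerSketch and RegexFieldCleanerSketch are hypothetical stand-ins that mirror the documented signatures, and the choice to compile the String form once in the constructor is an assumption, as is the \p{Punct} pattern (it is not claimed to be the value of REMOVE_PUNCT).

```java
import java.util.regex.Pattern;

// Hypothetical stand-in for the FieldCleaner interface, declared here
// only so that the sketch compiles on its own.
interface FieldCleanerSketch {
    String cleanFieldValue(String rawFieldValue);
}

// Hypothetical stand-in mirroring the documented constructors and method;
// this is not MALLET's source.
class RegexFieldCleanerSketch implements FieldCleanerSketch {
    private final Pattern pattern;

    // Assumption: the String form is compiled once, up front.
    RegexFieldCleanerSketch(String regex) {
        this(Pattern.compile(regex));
    }

    RegexFieldCleanerSketch(Pattern regex) {
        this.pattern = regex;
    }

    // Removes every match of the stored pattern from the raw value.
    @Override
    public String cleanFieldValue(String rawFieldValue) {
        return pattern.matcher(rawFieldValue).replaceAll("");
    }
}

class RegexFieldCleanerSketchDemo {
    public static void main(String[] args) {
        // Illustrative pattern only: strips ASCII punctuation.
        FieldCleanerSketch cleaner = new RegexFieldCleanerSketch("\\p{Punct}");
        // Prints "Smith J 2004"
        System.out.println(cleaner.cleanFieldValue("Smith, J. (2004)"));
    }
}
```

Passing a precompiled Pattern is useful when the caller needs flags such as Pattern.CASE_INSENSITIVE, which the String constructor in this sketch does not expose.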