Package com.topologi.diffx.load.text
Interface TextTokenizer
- All Known Implementing Classes:
TokenizerByChar,TokenizerByText,TokenizerByWord
public interface TextTokenizer
An interface for text tokenizers.
Text tokenisers are used to return a list of TextEvent
from a piece of text.
- Version:
- 3 February 2005
-
Method Summary
Modifier and TypeMethodDescriptionReturns the text granularity of this tokenizer.tokenize(CharSequence seq) Returns the list ofTextEventcorresponding to the specified character sequence.
-
Method Details
-
tokenize
Returns the list ofTextEventcorresponding to the specified character sequence.- Parameters:
seq- the character sequence to tokenize.- Returns:
- the corresponding list.
-
granurality
TextGranularity granurality()Returns the text granularity of this tokenizer.- Returns:
- the text granularity of this tokenizer.
-