public class EnglishWordTokenizer extends WordTokenizer
REMOVED_EMOJI| Constructor and Description |
|---|
EnglishWordTokenizer() |
| Modifier and Type | Method and Description |
|---|---|
List<String> |
tokenize(String text)
Tokenizes text.
|
getProtocols, getTokenizingCharacters, isCurrencyExpression, isEMail, isUrl, joinEMails, joinEMailsAndUrls, joinUrls, replaceEmojis, restoreEmojis, splitCurrencyExpressionpublic List<String> tokenize(String text)
tokenize in interface Tokenizertokenize in class WordTokenizertext - String of words to tokenize.