org.apache.lucene.analysis

Class WhitespaceTokenizer


public class WhitespaceTokenizer
extends CharTokenizer

A WhitespaceTokenizer is a tokenizer that divides text at whitespace. Adjacent sequences of non-Whitespace characters form tokens.

Field Summary

Fields inherited from class org.apache.lucene.analysis.Tokenizer

input

Constructor Summary

WhitespaceTokenizer(Reader in)
Construct a new WhitespaceTokenizer.

Method Summary

protected boolean
isTokenChar(char c)
Collects only characters which do not satisfy Character.isWhitespace(char).

Methods inherited from class org.apache.lucene.analysis.CharTokenizer

isTokenChar, next, normalize

Methods inherited from class org.apache.lucene.analysis.Tokenizer

close

Methods inherited from class org.apache.lucene.analysis.TokenStream

close, next

Constructor Details

WhitespaceTokenizer

public WhitespaceTokenizer(Reader in)
Construct a new WhitespaceTokenizer.

Method Details

isTokenChar

protected boolean isTokenChar(char c)
Collects only characters which do not satisfy Character.isWhitespace(char).
Overrides:
isTokenChar in interface CharTokenizer


Copyright © 2000-2005 Apache Software Foundation. All Rights Reserved.