Search/Lucene/Analysis/Analyzer.php
Zend Framework
LICENSE
This source file is subject to the new BSD license that is bundled
with this package in the file LICENSE.txt.
It is also available through the world-wide-web at this URL:
http://framework.zend.com/license/new-bsd
If you did not receive a copy of the license and are unable to
obtain it through the world-wide-web, please send an email
to license@zend.com so we can send you a copy immediately.
- Category
- Zend
- Copyright
- Copyright (c) 2005-2014 Zend Technologies USA Inc. (http://www.zend.com)
- License
- New BSD License
- Package
- Zend_Search_Lucene
- Subpackage
- Analysis
- Version
- $Id$
\Zend_Search_Lucene_Analysis_Analyzer
Package: Zend_Search_Lucene\Analysis
An Analyzer is used to analyze text.
It thus represents a policy for extracting index terms from text.
Note:
The Java Lucene implementation is stream-oriented, which lets it work
efficiently with very large documents (more than 20 MB).
The Zend_Search_Lucene engine itself is not designed for such documents.
The Zend_Search_Lucene analysis API therefore works with data strings and sets (arrays).
- Children
- \Zend_Search_Lucene_Analysis_Analyzer_Common
Properties
\Zend_Search_Lucene_Analysis_Analyzer $_defaultImpl
static
The Analyzer implementation used by default.
Methods
getDefault() : \Zend_Search_Lucene_Analysis_Analyzer
static
Return the default Analyzer implementation used by indexing code.
Returns
Type | Description |
---|---|
\Zend_Search_Lucene_Analysis_Analyzer |
nextToken() : \Zend_Search_Lucene_Analysis_Token | null
abstract
Tokenization stream API. Gets the next token; returns null at the end of the stream.
Tokens are returned in UTF-8 (the internal Zend_Search_Lucene encoding).
Returns
Type | Description |
---|---|
\Zend_Search_Lucene_Analysis_Token | null |
setDefault($analyzer) : void
static
Set the default Analyzer implementation used by indexing code.
Parameters
Name | Type | Description |
---|---|---|
$analyzer | | |
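The static getDefault()/setDefault() pair described above behaves like a small process-wide registry. The following is a minimal sketch of that pattern only, not the Zend source: `SketchAnalyzer`, `LowercaseAnalyzer`, `VerbatimAnalyzer`, and `normalize()` are hypothetical stand-ins, and the lazy fallback inside `getDefault()` is an assumption.

```php
<?php
// Hypothetical stand-in for the documented class; it mirrors only the
// static getDefault()/setDefault() pair and the $_defaultImpl property.
abstract class SketchAnalyzer
{
    /** @var SketchAnalyzer|null The analyzer implementation used by default. */
    private static $defaultImpl = null;

    // Replace the implementation that later getDefault() calls return.
    public static function setDefault(SketchAnalyzer $analyzer): void
    {
        self::$defaultImpl = $analyzer;
    }

    public static function getDefault(): SketchAnalyzer
    {
        if (self::$defaultImpl === null) {
            // Assumption: fall back lazily to a built-in analyzer.
            self::$defaultImpl = new LowercaseAnalyzer();
        }
        return self::$defaultImpl;
    }

    // Hypothetical method, present only to make the two analyzers differ.
    abstract public function normalize(string $term): string;
}

final class LowercaseAnalyzer extends SketchAnalyzer
{
    public function normalize(string $term): string
    {
        return strtolower($term);
    }
}

final class VerbatimAnalyzer extends SketchAnalyzer
{
    public function normalize(string $term): string
    {
        return $term;
    }
}

echo SketchAnalyzer::getDefault()->normalize('Lucene'), PHP_EOL; // prints "lucene"
SketchAnalyzer::setDefault(new VerbatimAnalyzer());
echo SketchAnalyzer::getDefault()->normalize('Lucene'), PHP_EOL; // prints "Lucene"
```

Because the default is shared static state, code that indexes and code that searches both see whichever analyzer was set last, so it is usually safest to set it once during bootstrap.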
setInput(string $data, $encoding = '') : void
Tokenization stream API. Sets the input.
Parameters
Name | Type | Description |
---|---|---|
$data | string | |
$encoding | | |
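The tokenization stream API amounts to a pull loop: call setInput() once, then call nextToken() until it returns null. The sketch below illustrates that loop with hypothetical stand-in classes (`StreamToken`, `StreamAnalyzer`, `WhitespaceAnalyzer`); it is not the Zend implementation. It assumes the input is already UTF-8, stores the `$encoding` argument without converting anything, and splits on whitespace purely for illustration.

```php
<?php
// Hypothetical stand-in for a token: the term text plus its byte offsets.
final class StreamToken
{
    private $text;
    private $start;
    private $end;

    public function __construct(string $text, int $start, int $end)
    {
        $this->text = $text;
        $this->start = $start;
        $this->end = $end;
    }

    public function getTermText(): string { return $this->text; }
    public function getStartOffset(): int { return $this->start; }
    public function getEndOffset(): int { return $this->end; }
}

// Hypothetical stand-in mirroring the documented setInput()/nextToken() pair.
abstract class StreamAnalyzer
{
    protected $input = '';
    protected $encoding = '';

    public function setInput(string $data, string $encoding = ''): void
    {
        // Assumption: $data is already UTF-8; $encoding is stored, not applied.
        $this->input = $data;
        $this->encoding = $encoding;
        $this->reset();
    }

    // Rewind the stream to the start of the current input.
    abstract public function reset(): void;

    // Return the next token, or null at the end of the stream.
    abstract public function nextToken(): ?StreamToken;
}

final class WhitespaceAnalyzer extends StreamAnalyzer
{
    private $position = 0;

    public function reset(): void
    {
        $this->position = 0;
    }

    public function nextToken(): ?StreamToken
    {
        // Find the next run of non-whitespace bytes at or after the cursor.
        if (!preg_match('/\S+/', $this->input, $m, PREG_OFFSET_CAPTURE, $this->position)) {
            return null;
        }
        [$text, $start] = $m[0];
        $this->position = $start + strlen($text);
        return new StreamToken($text, $start, $this->position);
    }
}

$analyzer = new WhitespaceAnalyzer();
$analyzer->setInput('index terms from text');

$terms = [];
while (($token = $analyzer->nextToken()) !== null) {
    $terms[] = $token->getTermText();
}
echo implode('|', $terms), PHP_EOL; // prints "index|terms|from|text"
```

Pulling tokens one at a time keeps the caller in control of the loop, while the whole input still lives in a single string, which matches the string-based design described in the class note.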