Published 2015-04-09 00:52:38 | Source: reader submission
Apache Lucene: a full-text search engine toolkit
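Lucene is an Apache Software Foundation project that began as a subproject of the Jakarta project. It is an open-source full-text search toolkit: not a complete full-text search engine, but a framework for building one. It provides a complete query engine and indexing engine, plus a partial text-analysis engine (covering two Western languages, English and German). Lucene aims to give software developers a simple, easy-to-use toolkit for adding full-text search to a target system, or for building a complete full-text search engine on top of it.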
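Apache Lucene 4.10.3 has been released. Lucene is a high-performance text search engine library written entirely in Java, suitable for nearly any application that needs full-text search. This release mainly fixes 12 bugs.
Bug fixes: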
LUCENE-6019, LUCENE-6117: Remove -Dtests.assert to make IndexWriter infoStream sane.
(Robert Muir, Mike McCandless)
LUCENE-6161: Resolving deletes was failing to reuse DocsEnum, likely causing substantial performance cost for use cases that frequently delete old documents.
(Mike McCandless)
LUCENE-6192: Fix int overflow corruption case in skip data for high frequency terms in extremely large indices.
(Robert Muir, Mike McCandless)
LUCENE-6207: Fixed consumption of several terms enums on the same sorted (set) doc values instance at the same time.
(Tom Shally, Robert Muir, Adrien Grand)
LUCENE-6093: Don't throw NullPointerException from BlendedInfixSuggester for lookups that do not end in a prefix token.
(jane chang via Mike McCandless)
LUCENE-6279: Don't let an abusive leftover _N_upgraded.si in the index directory cause index corruption on upgrade
(Robert Muir, Mike McCandless)
LUCENE-6287: Fix concurrency bug in IndexWriter that could cause index corruption (missing _N.si files) the first time 4.x kisses a 3.x index if merges are also running.
(Simon Willnauer, Mike McCandless)
LUCENE-6205: Fixed intermittent concurrency issue that could cause FileNotFoundException when writing doc values updates at the same time that a merge kicks off.
(Mike McCandless)
LUCENE-6214: Fixed IndexWriter deadlock when one thread is committing while another opens a near-real-time reader and an unrecoverable (tragic) exception is hit.
(Simon Willnauer, Mike McCandless)
LUCENE-6105: Don't cache FST root arcs if the number of root arcs is small, or if the cache would be > 20% of the size of the FST.
(Robert Muir, Mike McCandless)
LUCENE-6001: DrillSideways hits NullPointerException for certain BooleanQuery searches.
(Dragan Jotannovic, jane chang via Mike McCandless)
LUCENE-6306: Merging of doc values and norms now checks whether the merge was aborted so IndexWriter.rollback can more promptly abort a running merge.
(Robert Muir, Mike McCandless)
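
As background for LUCENE-6192 above: the sketch below is not Lucene's code, only a minimal illustration of the general kind of problem an "int overflow" refers to in Java. The class name, method names, and the numbers are all hypothetical. It shows how a 32-bit multiplication can silently wrap to a negative value once the result passes Integer.MAX_VALUE, and two common ways to guard against that.

```java
// Illustrative only: NOT taken from Lucene. All names and values are hypothetical.
class SkipOffsetOverflow {

    // Computes a byte offset in 32-bit arithmetic.
    // The product silently wraps around once it exceeds Integer.MAX_VALUE.
    static int offsetAsInt(int entryIndex, int bytesPerEntry) {
        return entryIndex * bytesPerEntry;
    }

    // Widens one operand to long before multiplying, so the product
    // is computed in 64-bit arithmetic and does not wrap for int inputs.
    static long offsetAsLong(int entryIndex, int bytesPerEntry) {
        return (long) entryIndex * bytesPerEntry;
    }

    // Keeps the int result type but fails loudly instead of wrapping:
    // Math.multiplyExact throws ArithmeticException on overflow.
    static int offsetChecked(int entryIndex, int bytesPerEntry) {
        return Math.multiplyExact(entryIndex, bytesPerEntry);
    }

    public static void main(String[] args) {
        int entryIndex = 300_000_000; // hypothetical: a very large number of entries
        int bytesPerEntry = 8;        // hypothetical entry size

        System.out.println(offsetAsInt(entryIndex, bytesPerEntry));  // -1894967296
        System.out.println(offsetAsLong(entryIndex, bytesPerEntry)); // 2400000000

        try {
            offsetChecked(entryIndex, bytesPerEntry);
        } catch (ArithmeticException e) {
            System.out.println("overflow detected");
        }
    }
}
```

Because the wrapped value is still a valid int, nothing fails at the point of the arithmetic. Bugs of this kind tend to surface only on very large inputs, which fits the changelog's mention of "extremely large indices".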
See the release page for more details.
This release is available for download:
http://lucene.apache.org/core/mirrors-core-latest-redir.html