发布于 2017-01-24 03:33:07 | 118 次阅读 | 评论: 0 | 来源: 网友投递

这里有新鲜出炉的Lucene教程,程序狗速度看过来!

Apache Lucene全文检索引擎工具包

Lucene是apache软件基金会4 jakarta项目组的一个子项目,是一个开放源代码的全文检索引擎工具包,即它不是一个完整的全文检索引擎,而是一个全文检索引擎的架构,提供了完整的查询引擎和索引引擎,部分文本分析引擎(英文与德文两种西方语言)。Lucene的目的是为软件开发人员提供一个简单易用的工具包,以方便的在目标系统中实现全文检索的功能,或者是以此为基础建立起完整的全文检索引擎。


Apache Lucene 6.3.0 发布了。

主要更新内容:

  • Lucene's best efforts to un-map memory mapped files with "MMapDirectory" now work with the latest Java9 early access builds

  • A new similarity "BooleanSimilarity" that gives terms a score that is equal to their query boost

  • The axiomatic family of similarities (6 in total) based on https://www.eecis.udel.edu/~hfang/pubs/sigir05-axiom.pdf

  • A new token filter "SynonymGraphFilter" that outputs a correct graph structure for multi-token synonyms at query time

  • Graph token streams, such as those produced by the "SynonymGraphFilter", are now handled accurately by query parsers

  • A new collector "DocValuesStatsCollector" gives the ability to compute statistics on DocValues field

  • It is now possible to filter "SortedDocValues" and "SortedSetDocValues" terms enum with a compiled automaton

  • The "UnifiedHighlighter" can now highlight fields with queries that don't necessarily refer to that field

  • DrillSideways can now run queries concurrently

  • Index sorting now supports sorting on multi-valued fields using MIN, MAX, etc. selectors

  • Points do not store the implicit split dimension in the 1-dimension case. This saves between 6% memory for the largest types such an InetAddressPoint to 33% for the smaller types such as HalfFloatPoint.

  • The BKD in-memory index for dimensional points now uses a compressed format, using substantially less RAM in some cases

  • The BKD writing now buffers each leaf block in heap before writing to disk, giving a small speedup in points-heavy use cases

  • "TermAutomatonQuery" now rewrites to more efficient queries when possible

更多内容及下载地址:http://lucene.apache.org/



历史版本 :
Java 搜索引擎 Apache Lucene 7.2.0 发布,Bug 修复
Apache Lucene 7.2.0 发布,Java 搜索引擎
Apache Lucene 5.5.5 发布,Java 搜索引擎
Apache Lucene 6.6.2 发布,Java 搜索引擎
Apache Lucene 和 Solr 7.1.0 发布,Java 搜索引擎
Apache Lucene 7.0.1 发布,Java 搜索引擎
Apache Lucene 7.0.0 发布,Java 搜索引擎
Apache Lucene 6.6.1 发布,Java 搜索引擎
LucenePlus 1.4,基于 Lucene 的全文搜索框架
Apache Lucene 6.6.0 发布,Java 搜索引擎
Apache Lucene 6.5.1 发布,Java 搜索引擎
Apache Lucene 6.5.0 发布,Java 搜索引擎
最新网友评论  共有(0)条评论 发布评论 返回顶部

Copyright © 2007-2017 PHPERZ.COM All Rights Reserved   冀ICP备14009818号  版权声明  广告服务