Apache Lucene 4.5.0 发布,Java全文搜索引擎
jopen 11年前
Apache Lucene是一个高性能,全功能的全文搜索引擎库,完全用Java编写。它适用于几乎需要全文搜索功能的任何应用程序,尤其是跨平台的。
Lucene通过一个简单的API,提供了强大的功能:
Scalable, High-Performance Indexing
- over 150GB/hour on modern hardware
- small RAM requirements -- only 1MB heap
- incremental indexing as fast as batch indexing
- index size roughly 20-30% the size of text indexed
Powerful, Accurate and Efficient Search Algorithms
- ranked searching -- best results returned first
- many powerful query types: phrase queries, wildcard queries, proximity queries, range queries and more
- fielded searching (e.g. title, author, contents)
- sorting by any field
- multiple-index searching with merged results
- allows simultaneous update and searching
- flexible faceting, highlighting, joins and result grouping
- fast, memory-efficient and typo-tolerant suggesters
- pluggable ranking models, including the Vector Space Model and Okapi BM25
- configurable storage engine (codecs)
Cross-Platform Solution
- Available as Open Source software under the Apache License which lets you use Lucene in both commercial and Open Source programs
- 100%-pure Java
- Implementations in other programming languages available that are index-compatible
版本说明:基于磁盘的DOC值和改进的过滤器缓存。