开源Python高维数据最近邻快速检索库:Panns

cymt 9年前

Panns=Python Approximate Nearest Neighbor Search,基于索引优化技术,支持欧氏距离和余弦夹角作为相似度测度,支持并行查询。
特性

  • Pure python implementation.
  • Optimized for large and high-dimension dataset (e.g. > 500).
  • Generate small index file with high query accuracy.
  • Support both Euclidean and cosine distance metrics.
  • Support parallel building of indices.
  • Small memory usage and index can be shared among processes.
  • Support raw, csv, numpy and HDF5 datasets.

项目主页:http://www.open-open.com/lib/view/home/1430640043883