Apache OpenNLP 1.6.0 发布,机器学习工具包
OpenNLP 是一个机器学习工具包,用于处理自然语言文本。支持大多数常用的 NLP 任务,例如:标识化、句子切分、部分词性标注、名称抽取、组块、解析等。
Apache OpenNLP 1.6.0 发布,此版本新增了大量的新特性,改进和 bug 修复,同时还优化了 API。
值得关注的改进:
- Added evalutation support to the parser and doccat components
- Added support to Evalita 07/09, Brat and OntoNotes corpus formats
- Now L-BFGS is stable
- Added Snowball to the Stemmer package
- NameFinder now supports a user defined factory
- Added pluggable machine learning support
- Added a lemmatizer module
- Added Cluster, Document Begin and Clark feature generators to the Name
Finder
- Added Liblinear as a Machine Learning addon
- Entity Linker now has a command line interface
- Added sequence classification support
下载:http://opennlp.apache.org/cgi-bin/download.cgi
Maven :http://opennlp.apache.org/maven-dependency.html
--The Apache OpenNLP Team