An open source C++ library providing language analysis services such as tokenization, sentence splitting, morphological analysis, named entity and date/number/currency recognition, PoS tagging, and shallow parsing.
The software is released under the LGPL.
Developed by the Natural Language Research Group, Technical University of Catalonia, Spain.