The Alvis suite
The Alvis suite is a text-mining and knowledge management suite, in use in scientific laboratories.
The Alvis suite has dedicated components:
- AlvisNLP, an automatic corpus processing engine that features a library of more than 50 processing modules: tokenization, POS-tagging, parsing, NER, machine learning, etc.
- AlvisAE, an annotation editor for building training corpora for NER, entity normalization and relation extraction.
- AlvisIR, a semantic search engine builder.
- TyDI, a terminology and ontology editor.
Alvis is developed by the Bibliome team at the French National Institute for Agriculture, Food and Environment (INRAE).
Text mining
Text mining is a mature discipline that aims to extract high-quality, structured information from heterogeneous and unstructured resources. Current developments use Natural Language Processing techniques and Machine Learning algorithms to automate Information Extraction (IE). Text-mining methods make extensive use of shared resources such as terminologies and ontologies. As a result, the gap between the text-mining and Linked Data communities is narrowing.
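
To make the role of a shared terminology concrete, here is a minimal Python sketch of dictionary-based entity annotation: it looks up terms in raw text and returns structured records with character offsets and concept identifiers. This is not how AlvisNLP is implemented. The terminology, the identifiers (`EX:...`) and the `annotate` function are hypothetical and chosen only for illustration.

```python
import re

# Hypothetical toy terminology: surface form -> concept identifier.
# Real terminologies and ontologies are far larger and come from shared resources.
TERMINOLOGY = {
    "bacillus subtilis": "EX:taxon-1",
    "soil": "EX:habitat-1",
}


def annotate(text, terminology):
    """Return sorted (start, end, surface, concept_id) tuples for each term found."""
    annotations = []
    for term, concept_id in terminology.items():
        # Match whole words only, ignoring case.
        pattern = re.compile(r"\b" + re.escape(term) + r"\b", re.IGNORECASE)
        for match in pattern.finditer(text):
            annotations.append(
                (match.start(), match.end(), match.group(0), concept_id)
            )
    return sorted(annotations)


sentence = "Bacillus subtilis is commonly isolated from soil."
annotations = annotate(sentence, TERMINOLOGY)
for start, end, surface, concept_id in annotations:
    print(f"{start}-{end}\t{surface}\t{concept_id}")
```

Each output record links a span of unstructured text to a concept identifier, which is the kind of structured information that can then be indexed by a semantic search engine or aligned with Linked Data resources. A production pipeline would add tokenization, POS-tagging, handling of term variants and machine-learned NER on top of this simple lookup.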