corpus processing engine

AlvisNLP data structure

The AlvisNLP data structure is an object shared by all processing steps. It is passed from one processing step to the next. The data structure contains the document structure and content, as well as annotations produced by successive steps. The understanding of the data structure is crucial to to use AlvisNLP since this object allows the steps to communicate with each other.

The following figure presents an UML-like specification of the AlvisNLP data structure.

Data model