OpenNLPDocumentCategorizer
Synopsis
Categorizes documents with a model trained with OpenNLPDocumentCategorizerTrain .
This module is experimental.
Description
OpenNLPDocumentCategorizer uses a model trained with OpenNLPDocumentCategorizerTrain to categorize unlabeled documents. The documents are specified by documents . The classifier algorithm uses the document content specified by tokens and form .
Snippet
<opennlpdocumentcategorizer class="OpenNLPDocumentCategorizer">
<categoryFeature></categoryFeature>
<model></model>
</opennlpdocumentcategorizer>
Mandatory parameters
categoryFeature
Feature where to store the predicted category.
model
Model file generated with OpenNLPDocumentCategorizerTrain .
Optional parameters
scoreFeature
Feature where to store the score of the predicted category.
scoresFeaturePrefix
Prefix of feature names where to store the score of each category.
documents
Elements to classify. This expression is evaluated from the corpus.
form
Form of the token. This expression is evaluated as a string from the token.
tokens
Tokens of the elements to classify. This expression is evaluated as a list of elements from the element to classify.