AlvisNLP

corpus processing engine

WapitiLabel

Synopsis

Label tokens using a model trained with WapitiTrain .

Description

WapitiLabel labels a sequence of tokens with a CRF model trained with WapitiTrain . The sequences are generated by annotations in tokenLayer segmented with annotations in sentenceLayer .

features must be the same as used in WapitiTrain but must not include the last one (that represents the label). The predicted label will be stored in the token feature labelFeature .

Snippet

<wapitilabel class="WapitiLabel">
    <features></features>
    <labelFeature></labelFeature>
    <modelFile></modelFile>
    <wapitiExecutable></wapitiExecutable>
</wapitilabel>

Mandatory parameters

features

Mandatory

A list of expressions evaluated as strings from the token annotation. The result represents the set of features of each token. For the training phase, the last feature is the label.

labelFeature

Mandatory
Type: String

Feature where to write the predicted label.

modelFile

Mandatory
Type: InputFile

Path to the file trained with WapitiTrain .

wapitiExecutable

Mandatory

Path to the wapiti executable.

Optional parameters

commandLineOptions

Optional
Type: String[]

Additional command line options to pass to wapiti . See the Wapiti manual for the list of options. Note that options mode , -T , -a , -p , -m are set automatically by WapitiLabel .

documentFilter

Default value: `true`
Type: Expression

Only process document that satisfy this expression.

sectionFilter

Default value: `true and layer:words`
Type: Expression

Process only sections that satisfy this expression.

sentenceLayer

Default value: `sentences`
Type: String

Layer containing sentence annotations.

tokenLayer

Default value: `words`
Type: String

Layer containing token annotations.

Deprecated parameters

sentenceLayerName

Deprecated
Type: String

Deprecated alias for sentenceLayer .

tokenLayerName

Deprecated
Type: String

Deprecated alias for tokenLayer .