AlvisNLP

corpus processing engine

TEESTrain

Synopsis

Train a TEES model that can be used to predict binary relations.

Description

Snippet

<teestrain class="TEESTrain">
    <modelTargetDir></modelTargetDir>
    <namedEntityLayer></namedEntityLayer>
    <python2Executable></python2Executable>
    <schema></schema>
    <teesHome></teesHome>
</teestrain>

Mandatory parameters

modelTargetDir

Mandatory
Type: OutputFile

Path to the directory where the trained model is written.

namedEntityLayer

Mandatory
Type: String

Name of the layer containing the named entities.

python2Executable

Mandatory

Path to the Python 2 executable. TEES will fail if run through Python 3.

schema

Mandatory

Schema of the relations to train, for example:

    <schema>
        <Lives_In>Bacteria,Location</Lives_In>
    </schema>


teesHome

Mandatory

Path to the TEES home directory.
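
As a sketch, a module declaration that fills in only the mandatory parameters might look like the following. The file system paths and the layer name `entities` are hypothetical placeholders, not values defined by AlvisNLP or TEES; the schema is the `Lives_In` example given for the schema parameter.

```xml
<teestrain class="TEESTrain">
    <!-- hypothetical directory that receives the trained model -->
    <modelTargetDir>/path/to/models</modelTargetDir>
    <!-- hypothetical name of the layer holding named entities -->
    <namedEntityLayer>entities</namedEntityLayer>
    <!-- hypothetical location of a Python 2 interpreter -->
    <python2Executable>/usr/bin/python2</python2Executable>
    <!-- relation schema taken from the schema parameter example -->
    <schema>
        <Lives_In>Bacteria,Location</Lives_In>
    </schema>
    <!-- hypothetical TEES installation directory -->
    <teesHome>/path/to/TEES</teesHome>
</teestrain>
```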

Optional parameters

constantRelationFeatures

Optional
Type: Mapping

Constant features to add to each relation created by this module.

constantTupleFeatures

Optional
Type: Mapping

Constant features to add to each tuple created by this module.

corpusSetFeature

Default value: `set`
Type: String

UNDOCUMENTED

devSetValue

Default value: `dev`
Type: String

Value of the corpus set feature (see corpusSetFeature) that identifies documents of the dev set.

documentFilter

Default value: `true`
Type: Expression

Only process documents that satisfy this expression.

modelName

Default value: `test-model`
Type: String

Name given to the trained model.

namedEntityTypeFeature

Default value: `ne-type`
Type: String

Name of the feature to access the type of the named entities.

omitSteps

Default value: `GENIA_SPLITTER,BANNER`
Type: String

Comma-separated list of preprocessing steps to skip. Recognized steps: GENIA_SPLITTER, BANNER, BLLIP_BIO, STANFORD_CONVERT, SPLIT_NAMES, FIND_HEADS.
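
As an illustration of the syntax only, the following fragment skips BLLIP_BIO in addition to the two default steps. Whether a given step can safely be skipped depends on the annotations already present in the corpus, so the choice of steps here is an assumption made for the example.

```xml
<!-- goes inside the TEESTrain module element; step names come from the list above -->
<omitSteps>GENIA_SPLITTER,BANNER,BLLIP_BIO</omitSteps>
```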

posFeature

Default value: `pos`
Type: String

Name of the feature containing the POS-tag.

sectionFilter

Default value: `true and layer:words and layer:sentences`
Type: Expression

Process only sections that satisfy this expression.

sentenceLayer

Default value: `sentences`
Type: String

Name of the layer that contains sentence annotations.

testSetValue

Default value: `test`
Type: String

Value of the corpus set feature (see corpusSetFeature) that identifies documents of the test set.

tokenLayer

Default value: `words`
Type: String

Name of the layer that contains tokens.

trainSetValue

Default value: `train`
Type: String

Value of the corpus set feature (see corpusSetFeature) that identifies documents of the train set.
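
The parameters corpusSetFeature, trainSetValue, devSetValue and testSetValue seem intended to be read together. Assuming that corpusSetFeature names a document feature and that the three other parameters give the values this feature takes for each partition (an interpretation of the parameter names, since corpusSetFeature is undocumented on this page), writing out the defaults explicitly would look like this:

```xml
<!-- goes inside the TEESTrain module element; all four values are the documented defaults -->
<corpusSetFeature>set</corpusSetFeature>
<trainSetValue>train</trainSetValue>
<devSetValue>dev</devSetValue>
<testSetValue>test</testSetValue>
```

Under that assumption, a document whose `set` feature equals `dev` would be placed in the dev set.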

Deprecated parameters

namedEntityLayerName

Deprecated
Type: String

Deprecated alias for namedEntityLayer.

sentenceLayerName

Deprecated
Type: String

Deprecated alias for sentenceLayer.

tokenLayerName

Deprecated
Type: String

Deprecated alias for tokenLayer.