AlvisNLP

corpus processing engine

StanfordNER

Synopsis

Applies the Stanford NLP named-entity recognition .

Description

Applies the Stanford NLP named-entity recognition . StanfordNER requires sentence splitting and tokenization.

Snippet

<stanfordner class="StanfordNER">
    <classifierFile></classifierFile>
    <labelFeature></labelFeature>
    <targetLayer></targetLayer>
</stanfordner>

Mandatory parameters

classifierFile

Mandatory
Type: InputFile

Path to the CRF classifier.

labelFeature

Mandatory
Type: String

Name of the feature where to store the named entity label.

targetLayer

Mandatory
Type: String

Name of the layer where to store named-entity annotation.

Optional parameters

constantAnnotationFeatures

Optional
Type: Mapping

Constant features to add to each annotation created by this module.

documentFilter

Default value: `true`
Type: Expression

Only process document that satisfy this expression.

formFeature

Default value: `form`
Type: String

Name of the feature containing the token surface form (ignored if searchInContents is set).

searchInContents

Default value: `false`
Type: Boolean

If true, then search named entities in the section contents.

sectionFilter

Default value: `true and layer:words and layer:sentences`
Type: Expression

Process only sections that satisfy this expression.

sentenceLayer

Default value: `sentences`
Type: String

Name of the layer containing sentence annotations (ignored if searchInContents is set).

wordLayer

Default value: `words`
Type: String

Name of the layer containing token annotations (ignored if searchInContents is set).

Deprecated parameters

formFeatureName

Deprecated
Type: String

Deprecated alias for formFeature .

labelFeatureName

Deprecated
Type: String

Deprecated alias for labelFeature .

sentenceLayerName

Deprecated
Type: String

Deprecated alias for sentenceLayer .

targetLayerName

Deprecated
Type: String

Deprecated alias for targetLayer .

wordLayerName

Deprecated
Type: String

Deprecated alias for wordLayer .