StanfordCoreNLP
Synopsis
Process the documents with Stanford’s CoreNLP .
This module is experimental.
Description
StanfordCoreNLP tokenizes, POS-tags and lemmatizes each section using CoreNLP .
If ner is set then StanfordCoreNLP also performs Named Entity Recognition. Refer to CoreNLP NER for details on methods and Named Entity Types.
If parse is set, then StanfordCoreNLP parses the sentences and creates dependencies tuples in dependencyRelation .
If pretokenized is set, then StanfordCoreNLP will not create annotations for tokens and sentences. Thus the segmentation must be performed beforehand.
Snippet
<stanfordcorenlp class="StanfordCoreNLP">
</stanfordcorenlp>
Mandatory parameters
Optional parameters
constantAnnotationFeatures
Constant features to add to each annotation created by this module.
constantRelationFeatures
Constant features to add to each relation created by this module.
constantTupleFeatures
Constant features to add to each tuple created by this module.
dependencyLabelFeature
Name of the feature where to store the dependency label.
dependencyRelation
Name of the relation where to store dependency tuples.
dependencySentenceRole
Name of the role of the dependency tuple argument that references the parsed sentence.
dependentRole
Name of the role of the dependency tuple argument that references the modifier (dependent) token.
documentFilter
Only process document that satisfy this expression.
headRole
Name of the role of the dependency tuple argument that references the head (governor) token.
lemmaFeature
Feature where to record the lemma.
namedEntityLayer
Layer where to create named entity annotations.
namedEntityTypeFeature
Feature where to record the named entity type.
ner
Perform NER.
parse
Perform dependency parsing.
pipelineProperties
Additional properties to pass to CoreNLP pipeline. See the documentation of each pipeline annotator for available options.
posFeature
Feature where to record the POS tag.
pretokenized
Do not perform tokenization and sentence splitting. Read tokens and sentences generated by previous steps.
sectionFilter
Process only sections that satisfy this expression.
sentenceLayer
Layer where to place (or read if pretokenized is set) sentence annotations.
wordLayer
Layer where to place (or read if pretokenized is set) tokens annotations.