AlvisNLP

corpus processing engine

CCGParser

Synopsis

Syntax parsing with CCG Parser .

Description

CCGParser applies the CCG Parser to sentences specified as annotations from the sentenceLayer layer. Sentence words are specified by annotations in the wordLayer layer. For each sentence, only words entirely included in the sentence will be considered; WoSMig and SeSMig should create these layers with the appropriate annotations. Additionally CCGParser takes advantage of word POS tag specified in the posFeature feature.

CCGParser creates a relation named dependencyRelation in each section and a tuple in this relation for each dependency. This relation is ternary:

  1. dependencySentenceRole : the first argument is the sentence in which the dependency was found;
  2. headRole : the second argument is the head word of the dependency;
  3. dependentRole : the third argument is the dependent word of the dependency.

CCGParser adds to each dependency tuple a feature dependencyLabelFeature with the label of the dependency.

Snippet

<ccgparser class="CCGParser">
    <executable></executable>
    <parserModel></parserModel>
    <superModel></superModel>
</ccgparser>

Mandatory parameters

executable

Mandatory

Path to the CCG Parser executable.

parserModel

Mandatory

Path to the parser model file.

superModel

Mandatory

Path to the CCG supertagger model file.

Optional parameters

constantRelationFeatures

Optional
Type: Mapping

Constant features to add to each relation created by this module.

constantTupleFeatures

Optional
Type: Mapping

Constant features to add to each tuple created by this module.

stanfordMarkedUpScript

Optional
Type: InputFile

Path to the markedup script for Stanford tagset output. See Biomedical parsing for CCG .

stanfordScript

Optional

Post-processing script for Stanford tagset output. See Biomedical parsing for CCG .

dependencyLabelFeature

Default value: `label`
Type: String

Name of the feature where to store the dependency label.

dependencyRelation

Default value: `dependencies`
Type: String

Name of the relation where to store dependency tuples.

dependencySentenceRole

Default value: `sentence`
Type: String

Name of the role of the dependency tuple argument that references the parsed sentence.

dependentRole

Default value: `dependent`
Type: String

Name of the role of the dependency tuple argument that references the modifier (dependent) token.

documentFilter

Default value: `true`
Type: Expression

Only process document that satisfy this expression.

formFeature

Default value: `form`
Type: String

Name of the feature in word annotations that contains the surface form.

headRole

Default value: `head`
Type: String

Name of the role of the dependency tuple argument that references the head (governor) token.

internalEncoding

Default value: `UTF-8`
Type: String

Character encoding to use for CCG input and output files.

lpTransformation

Default value: `false`
Type: Boolean

Either to translate into LP tag-set.

maxRuns

Default value: `1`
Type: Integer

Maximal number of CCG runs.

maxSuperCats

Default value: `500000`
Type: Integer

Maximum number of supercats before the parse explodes (cited from CCG documentation).

posFeature

Default value: `pos`
Type: String

Name of the feature in word annotations where to write POS tags.

sectionFilter

Default value: `true and layer:sentences and layer:words`
Type: Expression

Process only sections that satisfy this expression.

sentenceFilter

Default value: `true`
Type: Expression

Process only sentences that satisfy this filter.

sentenceLayer

Default value: `sentences`
Type: String

Name of the layer containing sentence annotations.

supertagFeature

Default value: `supertag`
Type: String

Name of the feature containing the supertag label.

wordLayer

Default value: `words`
Type: String

Name of the layer containing word annotations.

Deprecated parameters

formFeatureName

Deprecated
Type: String

Deprecated alias for formFeature .

labelFeatureName

Deprecated
Type: String

Deprecated alias for dependencyLabelFeature .

posFeatureName

Deprecated
Type: String

Deprecated alias for posFeature .

relationName

Deprecated
Type: String

Deprecated alias for dependencyRelation .

sentenceLayerName

Deprecated
Type: String

Deprecated alias for sentenceLayer .

supertagFeatureName

Deprecated
Type: String

Deprecated alias for supertagFeature .

wordLayerName

Deprecated
Type: String

Deprecated alias for wordLayer .