KeywordsSelector
Synopsis
Selects the most relevant keywords in documents.
Description
KeywordsSelector selects the most relevant keywords in documents. The candidate keywords are specified by the keywords expression, which is evaluated as a list of elements with the document as the context element. The text of each keyword is specified by keywordForm.
KeywordsSelector ranks the keywords according to scoreFunction, then selects the keywordCount keywords with the highest scores. The selected keywords are stored in the document feature keywordFeature, and the corresponding scores in scoreFeature.
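The rank-then-select step described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the module's implementation: select_keywords and frequency are hypothetical names, the candidate keywords are assumed to be already reduced to their keywordForm strings, and the alphabetical tie-break is an arbitrary choice made to keep the output deterministic.

```python
from collections import Counter

def select_keywords(candidates, score, keyword_count):
    """Rank distinct keyword forms by score and keep the top keyword_count.

    candidates    -- keyword strings of one document (hypothetical input shape)
    score         -- callable(keyword, freq) -> number, stands in for scoreFunction
    keyword_count -- number of keywords to keep, stands in for keywordCount
    """
    freq = Counter(candidates)  # occurrences of each keyword form in the document
    # Highest score first; ties broken alphabetically (an assumption of this sketch).
    ranked = sorted(freq, key=lambda kw: (-score(kw, freq), kw))
    top = ranked[:keyword_count]
    # Two parallel lists, mirroring keywordFeature and scoreFeature.
    return top, [score(kw, freq) for kw in top]

def frequency(kw, freq):
    """Simplest possible score: raw keyword frequency in the document."""
    return freq[kw]

doc = ["gene", "protein", "gene", "cell", "protein", "gene"]
keywords, scores = select_keywords(doc, frequency, 2)
print(keywords, scores)
```

Passing a different callable as score changes the ranking without touching the selection logic, which is the role scoreFunction plays in the description above.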
Snippet
<keywordsselector class="KeywordsSelector">
</keywordsselector>
Mandatory parameters
Optional parameters
keywordFeature
Document feature where to store the selected keywords.
outFile
scoreFeature
Document feature where to store the scores of the selected keywords, as computed by scoreFunction.
charset
documentId
documents
keywordCount
Number of keywords to select.
keywordForm
Text of the keyword. This expression is evaluated as a string with the keyword element as the context.
keywords
Expression evaluated as a list of elements with the document as the context element. Each element represents a keyword of the document.
scoreFunction
Function to use for ranking keywords. Available functions include the keyword frequency, several variants of tf-idf, and Okapi BM25.
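For orientation, two of the scoring families named above can be written out as follows. This is a hedged sketch using common textbook formulations; which exact variants and constants the module applies is not stated here, so the function names, the parameter names, the defaults k1=1.2 and b=0.75, and the non-negative idf form are all assumptions.

```python
import math

def tf_idf(tf, df, n_docs):
    """One classic tf-idf variant: raw term frequency times log inverse document frequency.

    tf     -- occurrences of the keyword in the document
    df     -- number of documents containing the keyword
    n_docs -- total number of documents
    """
    return tf * math.log(n_docs / df)

def bm25(tf, df, n_docs, doc_len, avg_len, k1=1.2, b=0.75):
    """Okapi BM25 term score with a commonly used non-negative idf.

    doc_len and avg_len are the document length and the average document
    length, measured in the same unit (for example tokens).
    """
    idf = math.log((n_docs - df + 0.5) / (df + 0.5) + 1.0)
    # Length-normalised saturation: extra occurrences add less and less.
    norm = tf + k1 * (1.0 - b + b * doc_len / avg_len)
    return idf * tf * (k1 + 1.0) / norm

# A keyword found in every document carries no tf-idf weight.
print(tf_idf(3, 10, 10))
# BM25 grows with term frequency, but less than linearly.
print(bm25(1, 2, 10, 100, 100), bm25(2, 2, 10, 100, 100))
```

Unlike raw frequency, both functions need corpus-level statistics (document frequency and, for BM25, document lengths), so a score of this kind depends on the whole set of documents being processed and not only on the document at hand.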