Udostępnij za pośrednictwem


Term Extraction Transformation Editor (Advanced Tab)

Use the Advanced tab of the Term Extraction Transformation Editor dialog box to specify properties for the extraction such as frequency, length, and whether to extract words or phrases.

To learn more about the Term Extraction transformation, see Term Extraction Transformation.

Options

Noun
Specify that the transformation extracts individual nouns only.

Noun phrase
Specify that the transformation extracts noun phrases only.

Noun and noun phrase
Specify that the transformation extracts both nouns and noun phrases.

Frequency
Specify that the score is the frequency of the term.

TFIDF
Specify that the score is the TFIDF value of the term. The TFIDF score is the product of Term Frequency and Inverse Document Frequency, defined as: TFIDF of a Term T = (frequency of T) * log( (#rows in Input) / (#rows having T) )

Frequency threshold
Specify the number of times a word or phrase must occur before extracting it. The default value is 2.

Maximum length of term
Specify the maximum length of a phrase in words. This option affects noun phrases only. The default value is 12.

Use case-sensitive term extraction
Specify whether to make the extraction case-sensitive. The default is False.

Configure Error Output
Use the Configure Error Output dialog box to specify error handling for rows that cause errors.

See Also

Integration Services Error and Message Reference
Term Extraction Transformation Editor (Term Extraction Tab)
Term Extraction Transformation Editor (Exclusion Tab)
Term Lookup Transformation