Windows.Data.Text Namespace

Reference

Important

Some information relates to prerelease product that may be substantially modified before it’s released. Microsoft makes no warranties, express or implied, with respect to the information provided here.

Edit

Provides support for identifying text segments and for determining the properties of Unicode characters.

Classes

AlternateWordForm	Identifies an alternate form of the word represented by a WordSegment object. For example, this may contain a number in a normalized format.
SelectableWordSegment	Represents a segment from your provided text that includes a word and any trailing whitespace and/or punctuation after this word.
SelectableWordsSegmenter	A segmenter class that is able to segment provided text into units appropriate for selecting text by words. The language supplied when this object is constructed is matched against the languages with word breakers on the system, and the best word segmentation rules available are used. The language need not be one of the app's supported languages. If there are no supported language rules available specifically for that language, the language-neutral rules are used (an implementation of Unicode Standard Annex #29 Unicode Text Segmentation), and the ResolvedLanguage property is set to "und" (undetermined language).
SemanticTextQuery	Permits apps to perform text queries using Advanced Query Syntax (AQS) strings, with the option of providing a language tag to be employed in the query.
TextConversionGenerator	Converts input phonetic characters into a collection of corresponding ideographic characters (Chinese characters). Input of Japanese and Chinese Pinyin is supported.
TextPhoneme	Represents the result of calling the TextReverseConversionGenerator.GetPhonemesAsync method, encapsulating both the original display text and the phonemes that match the original display text.
TextPredictionGenerator	Predicts Japanese words based on a phonetic characters prefix.
TextReverseConversionGenerator	Reverse-converts a Japanese string which is mix of phonetic and ideographic characters, to a string of phonetic characters.
UnicodeCharacters	Provides a way for apps to get information about Unicode characters, per the Unicode Standard Annex #44.
WordSegment	Represents a word from your provided text. Words in this class do not include trailing whitespace or punctuation. This class can also expose alternate forms of words, and normalized numbers, currencies, dates, and times.
WordsSegmenter	A segmenter class that is able to segment provided text into words or word stems (depending on the particular language).

Structs

TextSegment

Identifies a sub-string of a source text string. Your app can use this structure to obtain the segment of your provided text that is identified by AlternateWordForm, SelectableWordSegment, WordSegment, or SemanticTextQuery.

Enums

AlternateNormalizationFormat	Identifies the normalization format of an AlternateWordForm object.
TextPredictionOptions	Defines constants that specify text prediction options.
UnicodeGeneralCategory	Defines the Unicode general category of a character.
UnicodeNumericType	Defines the type of numeral when a Unicode character represents a number.

Delegates

SelectableWordSegmentsTokenizingHandler	Defines the signature of a function that is provided to SelectableWordsSegmenter.Tokenize.
WordSegmentsTokenizingHandler	Defines the signature of a function that is provided to WordsSegmenter.Tokenize.

Share via

Windows.Data.Text Namespace

Classes

Structs

Enums

Delegates

See also

Feedback

Additional resources