Windows.Data.Text Namespace
Important
Some information relates to prerelease product that may be substantially modified before it’s released. Microsoft makes no warranties, express or implied, with respect to the information provided here.
Provides support for identifying text segments and for determining the properties of Unicode characters.
Classes
AlternateWordForm |
Identifies an alternate form of the word represented by a WordSegment object. For example, this may contain a number in a normalized format. |
SelectableWordSegment |
Represents a segment from your provided text that includes a word and any trailing whitespace and/or punctuation after this word. |
SelectableWordsSegmenter |
A segmenter class that is able to segment provided text into units appropriate for selecting text by words. The language supplied when this object is constructed is matched against the languages with word breakers on the system, and the best word segmentation rules available are used. The language need not be one of the app's supported languages. If there are no supported language rules available specifically for that language, the language-neutral rules are used (an implementation of Unicode Standard Annex #29 Unicode Text Segmentation), and the ResolvedLanguage property is set to "und" (undetermined language). |
SemanticTextQuery |
Permits apps to perform text queries using Advanced Query Syntax (AQS) strings, with the option of providing a language tag to be employed in the query. |
TextConversionGenerator |
Converts input phonetic characters into a collection of corresponding ideographic characters (Chinese characters). Input of Japanese and Chinese Pinyin is supported. |
TextPhoneme |
Represents the result of calling the TextReverseConversionGenerator.GetPhonemesAsync method, encapsulating both the original display text and the phonemes that match the original display text. |
TextPredictionGenerator |
Predicts Japanese words based on a phonetic characters prefix. |
TextReverseConversionGenerator |
Reverse-converts a Japanese string which is mix of phonetic and ideographic characters, to a string of phonetic characters. |
UnicodeCharacters |
Provides a way for apps to get information about Unicode characters, per the Unicode Standard Annex #44. |
WordSegment |
Represents a word from your provided text. Words in this class do not include trailing whitespace or punctuation. This class can also expose alternate forms of words, and normalized numbers, currencies, dates, and times. |
WordsSegmenter |
A segmenter class that is able to segment provided text into words or word stems (depending on the particular language). |
Structs
TextSegment |
Identifies a sub-string of a source text string. Your app can use this structure to obtain the segment of your provided text that is identified by AlternateWordForm, SelectableWordSegment, WordSegment, or SemanticTextQuery. |
Enums
AlternateNormalizationFormat |
Identifies the normalization format of an AlternateWordForm object. |
TextPredictionOptions |
Defines constants that specify text prediction options. |
UnicodeGeneralCategory |
Defines the Unicode general category of a character. |
UnicodeNumericType |
Defines the type of numeral when a Unicode character represents a number. |
Delegates
SelectableWordSegmentsTokenizingHandler |
Defines the signature of a function that is provided to SelectableWordsSegmenter.Tokenize. |
WordSegmentsTokenizingHandler |
Defines the signature of a function that is provided to WordsSegmenter.Tokenize. |