Factoids for East Asian Languages

East Asian languages are defined as Japanese, Chinese (Simplified), Chinese (Traditional), and Korean. The formats within the factoids in the following table are specific to each language's recognizer.

For example, the Telephone factoid differs from language to language, and each language's version of a factoid works only with that language's recognizer: only the Japanese Telephone factoid can be used with the Japanese recognizer. In addition to the factoids in the following table, all languages use the factoids listed in Factoids Common Across Languages.

Note

The factoids for East Asian languages are implemented by specifying a list of acceptable Unicode characters. The factoids for Western languages are implemented by using regular expressions that describe the expected input. This is because Western languages are composed of letters that are combined to make words, whereas East Asian languages are character-based.

Recognizers of East Asian characters can combine up to ten factoids. A combination applies a logical OR operator; therefore, the input can match any one of the factoids in the expression.
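
The OR behavior and the ten-factoid limit can be sketched in Python. This is only an illustrative model, not the recognizer API: the function names, the use of "|" as the separator in the expression string, and the rule that every input character must appear in a factoid's character list are assumptions made for the example. The two character lists are copied from the table in this topic.

```python
# Illustrative model only; these names are hypothetical, not part of any SDK.
MAX_COMBINED_FACTOIDS = 10  # limit stated in the text above

# Character lists for two factoids, taken from the table in this topic.
FACTOID_CHARS = {
    "Percent": {"%", "."} | set("0123456789"),
    "UpperChar": {chr(cp) for cp in range(0x41, 0x5B)},  # U+0041..U+005A
}


def matches_factoid(text, name):
    """Assumed rule: non-empty input whose characters are all in the list."""
    allowed = FACTOID_CHARS[name]
    return len(text) > 0 and all(ch in allowed for ch in text)


def matches_any(text, expression):
    """Logical OR: the input may match any one factoid in the expression.

    The "|" separator is an assumption made for this sketch.
    """
    names = expression.split("|")
    if len(names) > MAX_COMBINED_FACTOIDS:
        raise ValueError("at most ten factoids can be combined")
    return any(matches_factoid(text, name) for name in names)


print(matches_any("50%", "Percent|UpperChar"))  # matches Percent
print(matches_any("ABC", "Percent|UpperChar"))  # matches UpperChar
print(matches_any("A5", "Percent|UpperChar"))   # matches neither one alone
```

In this model the input must satisfy one factoid as a whole, so "A5" is rejected even though each of its characters appears in one of the two lists. Whether a real recognizer treats mixed input this way is not stated in this topic.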

Factoid      Description                          Unicode values
OneChar      One character.                       U+0020 through U+007E
Percent      Numbers with a percent symbol.       U+0025, U+002E, U+0030 through U+0039
PostalCode   Numerical postal codes.              U+002D, U+0030 through U+0039
UpperChar    Uppercase Latin script characters.   U+0041 through U+005A
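
As a rough illustration of what a character-list factoid amounts to, the following Python sketch expands the Unicode values from the table above into sets and checks input against them. The helper names are hypothetical, and the check covers only the character lists. It does not model length rules (such as OneChar accepting a single character) or any other recognizer behavior.

```python
def code_points(*ranges):
    """Expand inclusive (first, last) code point ranges into a set of characters."""
    return {chr(cp) for first, last in ranges for cp in range(first, last + 1)}


# Unicode values as listed in the table above.
FACTOIDS = {
    "OneChar": code_points((0x0020, 0x007E)),
    "Percent": code_points((0x0025, 0x0025), (0x002E, 0x002E), (0x0030, 0x0039)),
    "PostalCode": code_points((0x002D, 0x002D), (0x0030, 0x0039)),
    "UpperChar": code_points((0x0041, 0x005A)),
}


def uses_only_listed_characters(text, factoid):
    """Return True if text is non-empty and every character is in the factoid's list."""
    allowed = FACTOIDS[factoid]
    return bool(text) and all(ch in allowed for ch in text)


# Number of characters each factoid lists.
for name, chars in FACTOIDS.items():
    print(name, len(chars))  # OneChar 95, Percent 12, PostalCode 11, UpperChar 26

print(uses_only_listed_characters("98052-6399", "PostalCode"))  # True
print(uses_only_listed_characters("abc", "UpperChar"))          # False
```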

The following topics show the formats supported for each factoid in Japanese, Chinese (Simplified), Chinese (Traditional), and Korean.