2.2.1 Supported Codepage in Windows

Windows assigns an integer, called code page ID, to every supported codepage.

Based on the usage, the codepage supported in Windows can be categorized in the following:

  • ANSI codepage

    Windows codepages are also sometimes referred to as active codepages or system active codepages. Windows always has one currently active Windows codepage. All ANSI Windows functions use the currently active codepage.

    The usual ANSI codepage ID for US English is codepage 1252.

    Windows codepage 1252, the codepage commonly used for English and other Western European languages, was based on an American National Standards Institute (ANSI) draft. That draft eventually became ISO 8859-1, but Windows codepage 1252 was implemented before the standard became final, and is not exactly the same as ISO 8859-1.

  • OEM codepage

  • Extended codepage

    These codepages cannot be used as ANSI codepages, or OEM codepages. Windows can support conversions between Unicode and these codepages. These codepages are generally used for information exchange purpose with international/national standard or legacy systems. Examples are UTF-8, UTF-7, EBCDIC, and Macintosh codepages.

The following table shows all the supported codepages by Windows. The Codepage ID lists the integer number assigned to a codepage. ANSI/OEM codepages are in bold face. The Codepage Description column describes the codepage. The Codepage notes column lists the category of a codepage and the relevant protocol section in this document to find protocol information.

Codepage ID

Codepage descriptions

Codepage notes

37

IBM EBCDIC US-Canada

Extended codepage; for processing rules, see section 3.1.5.1.1.

437

OEM United States

OEM codepage; for processing rules, see section 3.1.5.1.1.

500

IBM EBCDIC International

Extended codepage; for processing rules, see section 3.1.5.1.1.

708

Arabic (ASMO 708)

Extended codepage; for processing rules, see section 3.1.5.1.1.

720

Arabic (Transparent ASMO); Arabic (DOS)

Extended codepage; for processing rules, see section 3.1.5.1.1.

737

OEM Greek (formerly 437G); Greek (DOS)

OEM codepage; for processing rules, see section 3.1.5.1.1.

775

OEM Baltic; Baltic (DOS)

OEM codepage; for processing rules, see section 3.1.5.1.1.

850

OEM Multilingual Latin 1; Western European (DOS)

OEM codepage; for processing rules, see section 3.1.5.1.1.

852

OEM Latin 2; Central European (DOS)

OEM codepage; for processing rules, see section 3.1.5.1.1.

855

OEM Cyrillic (primarily Russian)

OEM codepage; for processing rules, see section 3.1.5.1.1.

857

OEM Turkish; Turkish (DOS)

OEM codepage; for processing rules, see section 3.1.5.1.1.

858

OEM Multilingual Latin 1 + Euro symbol

OEM codepage; for processing rules, see section 3.1.5.1.1.

860

OEM Portuguese; Portuguese (DOS)

OEM codepage; for processing rules, see section 3.1.5.1.1.

861

OEM Icelandic; Icelandic (DOS)

OEM codepage; for processing rules, see section 3.1.5.1.1.

862

OEM Hebrew; Hebrew (DOS)

OEM codepage; for processing rules, see section 3.1.5.1.1.

863

OEM French Canadian; French Canadian (DOS)

OEM codepage; for processing rules, see section 3.1.5.1.1.

864

OEM Arabic; Arabic (864)

OEM codepage; for processing rules, see section 3.1.5.1.1.

865

OEM Nordic; Nordic (DOS)

OEM codepage; for processing rules, see section 3.1.5.1.1.

866

OEM Russian; Cyrillic (DOS)

OEM codepage; for processing rules, see section 3.1.5.1.1.

869

OEM Modern Greek; Greek, Modern (DOS)

OEM codepage; for processing rules, see section 3.1.5.1.1.

870

IBM EBCDIC Multilingual/ROECE (Latin 2); IBM EBCDIC Multilingual Latin 2

Extended codepage; for processing rules, see section 3.1.5.1.1.

874

ANSI/OEM Thai (same as 28605, ISO 8859-15); Thai (Windows)

ANSI codepage; for processing rules, see section 3.1.5.1.1.

875

IBM EBCDIC Greek Modern

Extended codepage; for processing rules, see section 3.1.5.1.1.

932

ANSI/OEM Japanese; Japanese (Shift-JIS)

ANSI/OEM codepage; for processing rules, see section 3.1.5.1.1.

936

ANSI/OEM Simplified Chinese (PRC, Singapore); Chinese Simplified (GB2312)

ANSI/OEM codepage; for processing rules, see section 3.1.5.1.1.

949

ANSI/OEM Korean (Unified Hangul Code)

ANSI/OEM codepage; for processing rules, see section 3.1.5.1.1.

950

ANSI/OEM Traditional Chinese (Taiwan; Hong Kong SAR, PRC); Chinese Traditional (Big5)

ANSI/OEM codepage; for processing rules, see section 3.1.5.1.1.

1026

IBM EBCDIC Turkish (Latin 5)

Extended codepage; for processing rules, see section 3.1.5.1.1.

1047

IBM EBCDIC Latin 1/Open System

Extended codepage; for processing rules, see section 3.1.5.1.1.

1140

IBM EBCDIC US-Canada (037 + Euro symbol); IBM EBCDIC (US-Canada-Euro)

Extended codepage; for processing rules, see section 3.1.5.1.1.

1141

IBM EBCDIC Germany (20273 + Euro symbol); IBM EBCDIC (Germany-Euro)

Extended codepage; for processing rules, see section 3.1.5.1.1.

1142

IBM EBCDIC Denmark-Norway (20277 + Euro symbol); IBM EBCDIC (Denmark-Norway-Euro)

Extended codepage; for processing rules, see section 3.1.5.1.1.

1143

IBM EBCDIC Finland-Sweden (20278 + Euro symbol); IBM EBCDIC (Finland-Sweden-Euro)

Extended codepage; for processing rules, see section 3.1.5.1.1.

1144

IBM EBCDIC Italy (20280 + Euro symbol); IBM EBCDIC (Italy-Euro)

Extended codepage; for processing rules, see section 3.1.5.1.1.

1145

IBM EBCDIC Latin America-Spain (20284 + Euro symbol); IBM EBCDIC (Spain-Euro)

Extended codepage; for processing rules, see section 3.1.5.1.1.

1146

IBM EBCDIC United Kingdom (20285 + Euro symbol); IBM EBCDIC (UK-Euro)

Extended codepage; for processing rules, see section 3.1.5.1.1.

1147

IBM EBCDIC France (20297 + Euro symbol); IBM EBCDIC (France-Euro)

Extended codepage; for processing rules, see section 3.1.5.1.1.

1148

IBM EBCDIC International (500 + Euro symbol); IBM EBCDIC (International-Euro)

Extended codepage; for processing rules, see section 3.1.5.1.1.

1149

IBM EBCDIC Icelandic (20871 + Euro symbol); IBM EBCDIC (Icelandic-Euro)

Extended codepage; for processing rules, see section 3.1.5.1.1.

1200

Unicode UTF-16, little-endian byte order (BMP of ISO 10646); available only to managed applications

Not used in Windows.

1201

Unicode UTF-16, big-endian byte order; available only to managed applications

Not used in Windows.

1250

ANSI Central European; Central European (Windows)

ANSI codepage; for processing rules, see section 3.1.5.1.1.

1251

ANSI Cyrillic; Cyrillic (Windows)

ANSI codepage; for processing rules, see section 3.1.5.1.1.

1252

ANSI Latin 1; Western European (Windows)

ANSI codepage; for processing rules, see section 3.1.5.1.1.

1253

ANSI Greek; Greek (Windows)

ANSI codepage; for processing rules, see section 3.1.5.1.1.

1254

ANSI Turkish; Turkish (Windows)

ANSI codepage; for processing rules, see section 3.1.5.1.1.

1255

ANSI Hebrew; Hebrew (Windows)

ANSI codepage; for processing rules, see section 3.1.5.1.1.

1256

ANSI Arabic; Arabic (Windows)

ANSI codepage; for processing rules, see section 3.1.5.1.1.

1257

ANSI Baltic; Baltic (Windows)

ANSI codepage; for processing rules, see section 3.1.5.1.1.

1258

ANSI/OEM Vietnamese; Vietnamese (Windows)

ANSI codepage; for processing rules, see section 3.1.5.1.1.

1361

Korean (Johab)

Extended codepage; for processing rules, see section 3.1.5.1.1.

10000

MAC Roman; Western European (Mac)

Extended codepage; for processing rules, see section 3.1.5.1.1.

10001

Japanese (Mac)

Extended codepage; for processing rules, see section 3.1.5.1.1.

10002

MAC Traditional Chinese (Big5); Chinese Traditional (Mac)

Extended codepage; for processing rules, see section 3.1.5.1.1.

10003

Korean (Mac)

Extended codepage; for processing rules, see section 3.1.5.1.1.

10004

Arabic (Mac)

Extended codepage; for processing rules, see section 3.1.5.1.1.

10005

Hebrew (Mac)

Extended codepage; for processing rules, see section 3.1.5.1.1.

10006

Greek (Mac)

Extended codepage; for processing rules, see section 3.1.5.1.1.

10007

Cyrillic (Mac)

Extended codepage; for processing rules, see section 3.1.5.1.1.

10008

MAC Simplified Chinese (GB 2312); Chinese Simplified (Mac)

Extended codepage; for processing rules, see section 3.1.5.1.1.

10010

Romanian (Mac)

Extended codepage; for processing rules, see section 3.1.5.1.1.

10017

Ukrainian (Mac)

Extended codepage; for processing rules, see section 3.1.5.1.1.

10021

Thai (Mac)

Extended codepage; for processing rules, see section 3.1.5.1.1.

10029

MAC Latin 2; Central European (Mac)

Extended codepage; for processing rules, see section 3.1.5.1.1.

10079

Icelandic (Mac)

Extended codepage; for processing rules, see section 3.1.5.1.1.

10081

Turkish (Mac)

Extended codepage; for processing rules, see section 3.1.5.1.1.

10082

Croatian (Mac)

Extended codepage; for processing rules, see section 3.1.5.1.1.

12000

Unicode UTF-32, little-endian byte order; available only to managed applications

Not used in Windows.

12001

Unicode UTF-32, big-endian byte order; available only to managed applications

Not used in Windows.

20000

CNS Taiwan; Chinese Traditional (CNS)

Extended codepage; for processing rules, see section 3.1.5.1.1.

20001

TCA Taiwan

Extended codepage; for processing rules, see section 3.1.5.1.1.

20002

Eten Taiwan; Chinese Traditional (Eten)

Extended codepage; for processing rules, see section 3.1.5.1.1.

20003

IBM5550 Taiwan

Extended codepage; for processing rules, see section 3.1.5.1.1.

20004

TeleText Taiwan

Extended codepage; for processing rules, see section 3.1.5.1.1.

20005

Wang Taiwan

Extended codepage; for processing rules, see section 3.1.5.1.1.

20105

IA5 (IRV International Alphabet No. 5, 7-bit); Western European (IA5)

Extended codepage; for processing rules, see section 3.1.5.1.1.

20106

IA5 German (7-bit)

Extended codepage; for processing rules, see section 3.1.5.1.1.

20107

IA5 Swedish (7-bit)

Extended codepage; for processing rules, see section 3.1.5.1.1.

20108

IA5 Norwegian (7-bit)

Extended codepage; for processing rules, see section 3.1.5.1.1.

20127

US-ASCII (7-bit)

Extended codepage; for processing rules, see section 3.1.5.1.1.

20261

T.61

Extended codepage; for processing rules, see section 3.1.5.1.1.

20269

ISO 6937 Non-Spacing Accent

Extended codepage; for processing rules, see section 3.1.5.1.1.

20273

IBM EBCDIC Germany

Extended codepage; for processing rules, see section 3.1.5.1.1.

20277

IBM EBCDIC Denmark-Norway

Extended codepage; for processing rules, see section 3.1.5.1.1.

20278

IBM EBCDIC Finland-Sweden

Extended codepage; for processing rules, see section 3.1.5.1.1.

20280

IBM EBCDIC Italy

Extended codepage; for processing rules, see section 3.1.5.1.1.

20284

IBM EBCDIC Latin America-Spain

Extended codepage; for processing rules, see section 3.1.5.1.1.

20285

IBM EBCDIC United Kingdom

Extended codepage; for processing rules, see section 3.1.5.1.1.

20290

IBM EBCDIC Japanese Katakana Extended

Extended codepage; for processing rules, see section 3.1.5.1.1.

20297

IBM EBCDIC France

Extended codepage; for processing rules, see section 3.1.5.1.1.

20420

IBM EBCDIC Arabic

Extended codepage; for processing rules, see section 3.1.5.1.1.

20423

IBM EBCDIC Greek

Extended codepage; for processing rules, see section 3.1.5.1.1.

20424

IBM EBCDIC Hebrew

Extended codepage; for processing rules, see section 3.1.5.1.1.

20833

IBM EBCDIC Korean Extended

Extended codepage; for processing rules, see section 3.1.5.1.1.

20838

IBM EBCDIC Thai

Extended codepage; for processing rules, see section 3.1.5.1.1.

20866

Russian (KOI8-R); Cyrillic (KOI8-R)

Extended codepage; for processing rules, see section 3.1.5.1.1.

20871

IBM EBCDIC Icelandic

Extended codepage; for processing rules, see section 3.1.5.1.1.

20880

IBM EBCDIC Cyrillic Russian

Extended codepage; for processing rules, see section 3.1.5.1.1.

20905

IBM EBCDIC Turkish

Extended codepage; for processing rules, see section 3.1.5.1.1.

20924

IBM EBCDIC Latin 1/Open System (1047 + Euro symbol)

Extended codepage; for processing rules, see section 3.1.5.1.1.

20932

Japanese (JIS 0208-1990 and 0121-1990)

Extended codepage; for processing rules, see section 3.1.5.1.1.

20936

Simplified Chinese (GB2312); Chinese Simplified (GB2312-80)

Extended codepage; for processing rules, see section 3.1.5.1.1.

20949

Korean Wansung

Extended codepage; for processing rules, see section 3.1.5.1.1.

21025

IBM EBCDIC Cyrillic Serbian-Bulgarian

Extended codepage; for processing rules, see section 3.1.5.1.1.

21027

Ext Alpha Lowercase

Extended codepage; for processing rules, see section 3.1.5.1.1. NOTE: Although this codepage is supported, it has no known use.

21866

Ukrainian (KOI8-U); Cyrillic (KOI8-U)

Extended codepage; for processing rules, see section 3.1.5.1.1.

28591

ISO 8859-1 Latin 1; Western European (ISO)

Extended codepage; for processing rules, see section 3.1.5.1.1.

28592

ISO 8859-2 Central European; Central European (ISO)

Extended codepage; for processing rules, see section 3.1.5.1.1.

28593

ISO 8859-3 Latin 3

Extended codepage; for processing rules, see section 3.1.5.1.1.

28594

ISO 8859-4 Baltic

Extended codepage; for processing rules, see section 3.1.5.1.1.

28595

ISO 8859-5 Cyrillic

Extended codepage; for processing rules, see section 3.1.5.1.1.

28596

ISO 8859-6 Arabic

Extended codepage; for processing rules, see section 3.1.5.1.1.

28597

ISO 8859-7 Greek

Extended codepage; for processing rules, see section 3.1.5.1.1.

28598

ISO 8859-8 Hebrew; Hebrew (ISO-Visual)

Extended codepage; for processing rules, see section 3.1.5.1.1.

28599

ISO 8859-9 Turkish

Extended codepage; for processing rules, see section 3.1.5.1.1.

28603

ISO 8859-13 Estonian

Extended codepage; for processing rules, see section 3.1.5.1.1.

28605

ISO 8859-15 Latin 9

Extended codepage; for processing rules, see section 3.1.5.1.1.

38598

ISO 8859-8 Hebrew; Hebrew (ISO-Logical)

Extended codepage; for processing rules, see section 3.1.5.1.1. Use [CODEPAGEFILES] 28598.txt.

50220

ISO 2022 Japanese with no halfwidth Katakana; Japanese (JIS)

Extended codepage; for processing rules, see section 3.1.5.1.1.

50221

ISO 2022 Japanese with halfwidth Katakana; Japanese (JIS-Allow 1 byte Kana)

Extended codepage; for processing rules, see section 3.1.5.1.2.

50222

ISO 2022 Japanese JIS X 0201-1989; Japanese (JIS-Allow 1 byte Kana - SO/SI)

Extended codepage; for processing rules, see section 3.1.5.1.2.

50225

ISO 2022 Korean

Extended codepage; for processing rules, see section 3.1.5.1.2.

50227

ISO 2022 Simplified Chinese; Chinese Simplified (ISO 2022)

Extended codepage; for processing rules, see section 3.1.5.1.2.

50229

ISO 2022 Traditional Chinese

Extended codepage; for processing rules, see section 3.1.5.1.2.

51949

EUC Korean

Extended codepage; for processing rules, see section 3.1.5.1.2. Use [CODEPAGEFILES] 20949.txt.

52936

HZ-GB2312 Simplified Chinese; Chinese Simplified (HZ)

Extended codepage; for processing rules, see section 3.1.5.1.2.

54936

GB18030 Simplified Chinese (4 byte); Chinese Simplified (GB18030)

Extended codepage; for processing rules, see section 3.1.5.1.3.

57002

ISCII Devanagari

Extended codepage; for processing rules, see section 3.1.5.1.4.

57003

ISCII Bengali

Extended codepage; for processing rules, see section 3.1.5.1.4.

57004

ISCII Tamil

Extended codepage; for processing rules, see section 3.1.5.1.4.

57005

ISCII Telugu

Extended codepage; for processing rules, see section 3.1.5.1.4.

57006

ISCII Assamese

Extended codepage; for processing rules, see section 3.1.5.1.4.

57007

ISCII Odia (was Oriya)

Extended codepage; for processing rules, see section 3.1.5.1.4.

57008

ISCII Kannada

Extended codepage; for processing rules, see section 3.1.5.1.4.

57009

ISCII Malayalam

Extended codepage; for processing rules, see section 3.1.5.1.4.

57010

ISCII Gujarati

Extended codepage; for processing rules, see section 3.1.5.1.4.

57011

ISCII Punjabi

Extended codepage; for processing rules, see section 3.1.5.1.4.

65000

Unicode (UTF-7)

Extended codepage; for processing rules, see section 3.1.5.1.5.

65001

Unicode (UTF-8)

Extended codepage; for processing rules, see section 3.1.5.1.6.