Network Security Internet Technology Development Database Servers Mobile Phone Android Software Apple Software Computer Software News IT Information

In addition to Weibo, there is also WeChat

Please pay attention

WeChat public account

Shulou

How to modify codepage in windows cmd

2025-01-14 Update From: SLTechnology News&Howtos shulou NAV: SLTechnology News&Howtos > Development >

Share

Shulou(Shulou.com)06/03 Report--

This article mainly introduces how to modify the codepage in windows cmd, which has certain reference value. Interested friends can refer to it. I hope you will gain a lot after reading this article. Let Xiaobian take you to understand it together.

Chcp

Displays the number of the active console code page, or changes the console's active console code page. Used without parameters, chcp displays the number of the active console code page.

Syntax

chcp [nnn]

Parameters

nnn : Specifies the code page. The following table lists each code page supported and its country/region or language:

Code page Country/region or language

437 United States

850 Multilingual (Latin I)

852 Slavic (Latin II)

855 Cyrillic (Russian)

857 Turkish

860 Portuguese

861 Icelandic

863 Canadian-French

865 Nordic

866 Russian

869 Modern Greek

What is a code page, how to modify the codepage in windows cmd

If your cmd doesn't display Chinese, or any other characters, use chcp to modify it. The parameter nnn represents 3 digits. Simplified Chinese codepage is: 936 Spanish is:1252

History of Code Page:

1. Definition and History of Codepage

A character code is an internal code used to represent a character. Readers use inner codes when entering and storing documents, and inner codes are divided into

Single-Byte character sets (SBCS) support 256 character codes.

Double-Byte character sets (DBCS) support 65000 character codes. It is mainly used to encode oriental characters with large character sets.

A codepage refers to a selected list of character inner codes arranged in a specific order. For early languages with single-byte inner codes, the inner code order in a codepage allows the system to give a corresponding inner code according to the input value of the keyboard according to this list. For two-byte inner codes, a MultiByte to Unicode mapping table is given, so that characters stored in Unicode form can be converted to the corresponding character inner code, or vice versa. The corresponding functions in the Linux kernel are utf8_mbtowc and utf8_wctomb.

Before 1980, there was no international standard such as ISO-8859 or Unicode defining how to extend US-ASCII encoding for use by non-English speaking users. Many IT vendors have invented their own codes and used hard-to-remember numbers to identify:

For example, 936 stands for Simplified Chinese. 950 stands for Traditional Chinese.

1.1 CJK Codepage

Unlike Extended Unix Coding ( EUC ), all of the Far East codepages below utilize the C1 control code { =80..= 9F } As the first byte, use ASCII values { =40..= 7E {as the second byte, so as to contain up to tens of thousands of double-byte characters, which means that ASCII values less than 3F in this encoding do not necessarily represent ASCII characters.

CP932

Shift-JIS contains Japanese charset JIS X 0201 (one byte per character) and JIS X 0208 (two bytes per character), so JIS X 0201 Hiragana contains one-byte half-width characters, and the remaining 60 bytes are used as the first byte of 7076 Kanji characters and 648 other full-width characters. Shift-JIS differs from EUC-JP in that it does not contain the 5802 characters defined in JIS X 202.

CP936

GBK extends EUC-CN code (GB2312 -80 code, containing 6763 Chinese characters) to 20902 Chinese characters defined in Unicode (GB13000.1-93). Simplified Chinese zh_CN is used in the mainland of China.

CP949

UnifiedHangul (UHC) is a superset of the Korean EUC-KR code (KS C 5601-1992 code, consisting of 2350 Hangul syllables and 4888 Kanji a), containing 8822 additional Hangul syllables (in C1)

CP950

Big5 encoding (13072 Traditional zh_TW Chinese characters) in place of EUC-TW (CNS 11643-1992), these definitions are found in Ken Lunde's CJK.INF or Unicode encoding table.

Note: Microsoft uses the above four Codepages, so the above Codepages are required to access Microsoft's file system.

1.2 IBM's Far Eastern Language Codepage

IBM's Codepages are divided into SBCS and DBCS:

IBM SBCS Codepage

37 *

290 (Japanese) *

833 (Korean) *

836 (Simplified Chinese) *

891 (Korean)

897 (Japanese)

903 (Simplified Chinese)

904 (Traditional Chinese)

IBM DBCS Codepage

300 (Japanese) *

301 (Japanese)

834 (Korean) *

835 (Traditional Chinese) *

837 (Simplified Chinese) *

926 (Korean)

927 (Traditional Chinese)

928 (Simplified Chinese)

Mix SBCS Codepage with DBCS Codepage to get IBM MBCS Codepage

930 (Japanese) (Codepage 300 plus 290) *

932 (Japanese) (Codepage 301 plus 897)

933 (Korean) (Codepage 834 plus 833) *

934 (Korean) (Codepage 926 plus 891)

938 (Traditional Chinese) (Codepage 927 plus 904)

936 (Simplified Chinese) (Codepage 928 plus 903)

5031 (Simplified Chinese) (Codepage 837 + 836) *

5033 (Traditional Chinese) (Codepage 835 + 37) *

* Represents EBCDIC encoding format

Thank you for reading this article carefully. I hope that the article "How to modify the codepage in windows cmd" shared by Xiaobian will be helpful to everyone. At the same time, I hope that everyone will support you a lot and pay attention to the industry information channel. More relevant knowledge is waiting for you to learn!

Welcome to subscribe "Shulou Technology Information " to get latest news, interesting things and hot topics in the IT industry, and controls the hottest and latest Internet news, technology news and IT industry trends.

Views: 0

*The comments in the above article only represent the author's personal views and do not represent the views and positions of this website. If you have more insights, please feel free to contribute and share.

Share To

Development

Wechat

© 2024 shulou.com SLNews company. All rights reserved.

12
Report