Network Security Internet Technology Development Database Servers Mobile Phone Android Software Apple Software Computer Software News IT Information

In addition to Weibo, there is also WeChat

Please pay attention

WeChat public account

Shulou

How many bytes are needed to store the internal code of a Chinese character?

2025-01-16 Update From: SLTechnology News&Howtos shulou NAV: SLTechnology News&Howtos > Internet Technology >

Share

Shulou(Shulou.com)06/02 Report--

This article mainly introduces how many bytes are needed to store the internal code of a Chinese character, which can be used for reference by friends who need it. I hope you will learn a lot after reading this article. Next, let the editor take you to learn about it.

The internal code of a Chinese character needs 2 bytes to be stored. In the popular Chinese character system in China, the internal code of a Chinese character accounts for 2 bytes. Because the Chinese character processing system needs to ensure the compatibility of Chinese and Western languages, ambiguity will occur when there are both ASCII code and Chinese character GB code in the system. For this reason, the internal code of the Chinese character machine should be properly processed and transformed.

The internal code of a Chinese character needs 2 bytes to be stored.

In May 1981, the National Bureau of Standards issued the basic set of coded Chinese characters for Information Exchange, code name GB2312-80, which encodes 6763 Chinese characters and 682graphic characters. The coding principle is that Chinese characters are represented by two bytes.

In principle, two bytes can represent 256 × 256 characters 65536 different symbols, which is feasible as the basis of Chinese character coding representation. However, considering the relationship between Chinese character coding and other international common codes, such as ASCII Western character coding, the National Bureau of Standards of China has adopted a modified two-byte Chinese character coding scheme, using only two bytes of low 7 bits.

This scheme can accommodate 128x128116384 different Chinese characters, but in order to be compatible with standard ASCII codes, 32 control function codes, spaces with code values of 32 and 127op codes can no longer be used in each byte. So there are only 94 encodings per byte. In this way, the actual number of words that can be represented by double seven is 94 × 94 × 8836.

Thank you for reading this article carefully. I hope it is helpful for everyone to share the internal code of a Chinese character to store the content. At the same time, I also hope that you can support it, pay attention to the industry information channel, and find out if you encounter problems. Detailed solutions are waiting for you to learn!

Welcome to subscribe "Shulou Technology Information " to get latest news, interesting things and hot topics in the IT industry, and controls the hottest and latest Internet news, technology news and IT industry trends.

Views: 0

*The comments in the above article only represent the author's personal views and do not represent the views and positions of this website. If you have more insights, please feel free to contribute and share.

Share To

Internet Technology

Wechat

© 2024 shulou.com SLNews company. All rights reserved.

12
Report