Network Security Internet Technology Development Database Servers Mobile Phone Android Software Apple Software Computer Software News IT Information

In addition to Weibo, there is also WeChat

Please pay attention

WeChat public account

Shulou

How many bytes does an English letter occupy in the ascii code file in the computer network

2025-01-19 Update From: SLTechnology News&Howtos shulou NAV: SLTechnology News&Howtos > Internet Technology >

Share

Shulou(Shulou.com)06/01 Report--

This article mainly introduces how many bytes an English letter occupies in the ascii code file in the computer network, which has certain reference value. Interested friends can refer to it. I hope you will have a lot of harvest after reading this article. Let Xiaobian take you to understand it together.

An English letter occupies one byte and a Chinese character occupies two bytes in an ascii file;ascii codes use specified combinations of 7 or 8 binary numbers to represent 128 or 256 possible characters, and bytes are units of binary data, usually 8 bits long.

Operating environment: Windows 7 system, Dell G3 computer.

How many bytes does an English letter occupy in an ASCII file?

In ASCII code, an English letter (case-insensitive) occupies one byte of space, and a Chinese character occupies two bytes of space.

ASCII :

ASCII (American Standard Code for Information Interchange) is a computer coding system based on the Latin alphabet, primarily used to display modern English and other Western European languages. It is the most common standard for information exchange and is equivalent to ISO/IEC 646. ASCII was first published as a standard in 1967 and last updated in 1986, with 128 characters defined to date.

ASCII codes use specified combinations of 7 or 8 binary digits to represent 128 or 256 possible characters. Standard ASCII, also known as base ASCII, uses 7 binary digits (the remaining 1 binary digit is 0) to represent all upper and lower case letters, numbers 0 to 9, punctuation marks, and special control characters used in American English.[1] Of which:

0~31 and 127(33 in total) are control characters or communication-specific characters (the rest are displayable characters), such as control characters: LF (line feed), CR (carriage return), FF (page change), DEL (delete), BS (backspace), BEL (bell), etc.; communication-specific characters: SOH (head), EOT (tail), ACK (acknowledgement), etc.;ASCII values of 8, 9, 10 and 13 are converted into backspace, tab, line feed and carriage return characters respectively. They do not have a specific graphical display, but have different effects on text display depending on the application.[1]

32 to 126(95 in total) are characters (32 is a space), of which 48 to 57 are ten Arabic numerals from 0 to 9.

65~90 are 26 capital English letters, 97~122 are 26 lowercase English letters, and the rest are some punctuation marks, operational symbols, etc.

Also note that in standard ASCII, the most significant bit (b7) is used as a parity bit. The so-called parity check refers to a method used to check whether there is an error in the code transmission process, generally divided into odd check and even check. Odd parity stipulates that the number of 1 in a byte of the correct code must be odd, if not odd, add 1 to the highest bit b7; even parity stipulates that the number of 1 in a byte of the correct code must be even, if not even, add 1 to the highest bit b7.

The last 128 are called extended ASCII codes. Many x86-based systems support the use of extended (or "high") ASCII. Extended ASCII allows the 8th bit of each character to be used to identify additional 128 special symbol characters, loanword letters, and graphic symbols.

Byte:

A byte is a unit of binary data. A byte is usually 8 bits long. However, some older computer architectures use different lengths. To avoid confusion, the word byte is used instead of byte in most international literature. In most computer systems, a byte is an 8-bit unit of data, and most computers use a byte to represent a character, number, or other character. A byte can also represent a series of binary bits. In some computer systems, four bytes represent a word, which is the unit of data that the computer can efficiently process when executing instructions. Some language descriptions require two bytes to represent a character, which is called a two-byte character set. Some processors are capable of processing two-byte or single-byte instructions. Byte is usually abbreviated as "B," while bit is usually abbreviated as lowercase "B," and the size of computer memory is usually expressed in bytes.

Thank you for reading this article carefully. I hope Xiaobian will share the article "How many bytes does an English letter occupy in an ascii code file in a computer network?" This article is helpful to everyone. At the same time, I hope you will support it a lot. Pay attention to the industry information channel. More relevant knowledge is waiting for you to learn!

Welcome to subscribe "Shulou Technology Information " to get latest news, interesting things and hot topics in the IT industry, and controls the hottest and latest Internet news, technology news and IT industry trends.

Views: 0

*The comments in the above article only represent the author's personal views and do not represent the views and positions of this website. If you have more insights, please feel free to contribute and share.

Share To

Internet Technology

Wechat

© 2024 shulou.com SLNews company. All rights reserved.

12
Report