The relationship between URL encoding and character encodings (UTF-8, GBK)

clipboard.png

clipboard.png

Figure 1 says that the UTF-8 encoding of "春节" (Spring Festival) is E6 98 A5 E8 8A 82.

Then I converted "春节" to UTF-8 with a tool, and the result I got was 春节.

Could you tell me what is going on here? Or is there something wrong with my understanding?

Mar.11,2021

Character: 春节 (Spring Festival)
Unicode code point (decimal): 26149 33410
Unicode code point (hexadecimal): \u6625 \u8282
Unicode code point (binary): 0110 0110 0010 0101 1000 0010 1000 0010
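
The values above can be reproduced with a few lines of Python. This is only a small sketch; the helper name `code_points` is made up for illustration.

```python
def code_points(text: str) -> list[tuple[str, int, str, str]]:
    """Return (character, decimal, hexadecimal, 16-bit binary) for each character."""
    rows = []
    for ch in text:
        cp = ord(ch)  # Unicode code point as an integer
        rows.append((ch, cp, f"\\u{cp:04x}", f"{cp:016b}"))
    return rows


for row in code_points("春节"):
    print(*row)
```

Note that the 16-bit binary formatting only suits characters up to U+FFFF, which covers both characters in this example.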

Unicode to UTF-8 conversion rules

<table>
<thead>
<tr><th>Unicode</th><th>UTF-8</th></tr>
</thead>
<tbody>
<tr><td>U+0000 ~ U+007F</td><td>0XXXXXXX</td></tr>
<tr><td>U+0080 ~ U+07FF</td><td>110XXXXX 10XXXXXX</td></tr>
<tr><td>U+0800 ~ U+FFFF</td><td>1110XXXX 10XXXXXX 10XXXXXX</td></tr>
<tr><td>U+10000 ~ U+10FFFF</td><td>11110XXX 10XXXXXX 10XXXXXX 10XXXXXX</td></tr>
</tbody>
</table>

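Use the hexadecimal code point to select the matching row of the table (both U+6625 and U+8282 fall in the U+0800 ~ U+FFFF row).
Then fill the binary digits of the code point, in order, into the X positions of that row.

This gives the UTF-8 encoding: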

binary
11100110 10011000 10100101
11101000 10001010 10000010

hexadecimal
E6 98 A5
E8 8A 82

Final URL encoding (each byte is written as % followed by its two hexadecimal digits)
%E6%98%A5%E8%8A%82
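
The steps above can be sketched in Python as follows. The function names `utf8_encode_code_point` and `percent_encode` are hypothetical helpers written for this illustration (real code would simply call `str.encode("utf-8")` and `urllib.parse.quote`), and the sketch does not reject surrogate code points.

```python
from urllib.parse import quote


def utf8_encode_code_point(cp: int) -> bytes:
    """Encode one Unicode code point by filling the bit templates from the table."""
    if cp <= 0x7F:        # 0XXXXXXX
        return bytes([cp])
    if cp <= 0x7FF:       # 110XXXXX 10XXXXXX
        return bytes([0b11000000 | (cp >> 6),
                      0b10000000 | (cp & 0x3F)])
    if cp <= 0xFFFF:      # 1110XXXX 10XXXXXX 10XXXXXX
        return bytes([0b11100000 | (cp >> 12),
                      0b10000000 | ((cp >> 6) & 0x3F),
                      0b10000000 | (cp & 0x3F)])
    if cp <= 0x10FFFF:    # 11110XXX 10XXXXXX 10XXXXXX 10XXXXXX
        return bytes([0b11110000 | (cp >> 18),
                      0b10000000 | ((cp >> 12) & 0x3F),
                      0b10000000 | ((cp >> 6) & 0x3F),
                      0b10000000 | (cp & 0x3F)])
    raise ValueError("code point out of Unicode range")


def percent_encode(data: bytes) -> str:
    """Write every byte as % followed by two uppercase hex digits."""
    return "".join(f"%{b:02X}" for b in data)


text = "春节"
manual_bytes = b"".join(utf8_encode_code_point(ord(ch)) for ch in text)
print(manual_bytes.hex())            # hex digits of the UTF-8 bytes
print(percent_encode(manual_bytes))  # URL-encoded form
print(quote(text))                   # standard library result, for comparison
```

`percent_encode` escapes every byte, which is fine here because all six bytes are non-ASCII; `urllib.parse.quote` leaves unreserved ASCII characters unescaped.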


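Your sentence that displays as "then I used a tool to convert 春节 to UTF-8 and found that it was 春节" was actually written as "then I used a tool to convert 春节 to UTF-8 and found that it was &#x6625;&#x8282;". The browser automatically recognizes these references and renders them as the corresponding Unicode characters. In other words, what the tool gave you is not UTF-8: &#x6625;&#x8282; is an HTML numeric character reference that spells out the Unicode code points (6625 and 8282) directly, whereas UTF-8 is the byte sequence E6 98 A5 E8 8A 82 derived from those code points.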
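
The difference between the two notations can be checked with a short Python sketch. It uses only the standard library; the variable names are arbitrary.

```python
import html

entity_text = "&#x6625;&#x8282;"

# A browser (or html.unescape) turns numeric character references
# back into the characters with those Unicode code points.
decoded = html.unescape(entity_text)
print(decoded)

# The UTF-8 bytes are a different thing: they are computed from the code points.
utf8_hex = " ".join(f"{b:02X}" for b in decoded.encode("utf-8"))
print(utf8_hex)
```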

https://www.zhihu.com/questio.

On the difference and relationship between UTF-8 and Unicode:
one is an encoding rule, the other is a character set.

Unicode: a coding specification that uses a single code space to include all the characters in the world.
UTF-8: one implementation (encoding) of Unicode. UTF-16 is another, similar one.
GBK: the full name of GBK is "Chinese Internal Code Extension Specification" (GBK comes from the first letters of the Hanyu Pinyin words for "national standard" and "extension"; English name: Chinese Internal Code Specification). It was formulated by the National Information Technology Standardization Technical Committee of the People's Republic of China on December 1, 1995. On December 15, 1995, the Standardization Department of the State Bureau of Technical Supervision and the Department of Science, Technology and Quality Supervision of the Ministry of Electronic Industry jointly designated it a technical specification guidance document, in the form of a Technical Supervision standard letter (1995). It is used within China.
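
To tie this back to URL encoding: the percent-encoded form depends on which character encoding produced the bytes. A minimal sketch, assuming Python's built-in `gbk` codec is an acceptable stand-in for the GBK specification:

```python
from urllib.parse import quote, unquote

text = "春节"

utf8_url = quote(text, encoding="utf-8")  # 3 bytes per character here
gbk_url = quote(text, encoding="gbk")     # 2 bytes per character here

print(utf8_url)
print(gbk_url)

# Decoding only round-trips when the same encoding is used on both sides.
print(unquote(gbk_url, encoding="gbk"))
# Decoding GBK bytes as UTF-8 (the default) produces replacement characters.
print(unquote(gbk_url))
```

So a URL containing the same two characters can look completely different depending on whether the page or tool used UTF-8 or GBK.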