Python 编码进阶

如果想把内存中的数据通过网络传输，存储等在Python 中转为非Unicode 编码方式：

数据类型转换为 (bytes)

b = 'hello'
b = b'hello' 	#type(b) = class bytes	字符串的常用方法 bytes都有

s1 = '熊猫'
s2 = s1.encode('utf-8')	#中文不能通过b的方法转换 需要使用encode()方法

bytes转换为其他编码：

b1 = '熊猫'
b1 = b1.encode('utf-8')  #type(b1) = bytes
s1 = b1.decode('utf-8')  #type(s1) = utf-8

GBK 如何转换为 UTF-8？

所有编码都和Unicode 有关通过先转为Unicode 再转为需要的编码

s1 = '熊猫'
s1 = b'xd6xd0'
s2 = s1.decode('gbk')
b2 = s2.encode('utf-8')

bytes为什么存在？ str --> bytes (Unicode --> 非 Unicode)

gbk <--> utf-8