Python 字串編碼

'''
對於單個字元的編碼，python提供了ord()函式獲取字元的整數表示，chr()函式把編碼轉換為對應的字元
'''print(ord('a')) # 97
print(ord('a')) # 65
print(ord('0')) # 48
print(chr(97)) # 'a'
print(chr(65)) # 'a'
print(chr(48)) # '0'

# 計算 str 的字元數
print(len('acb')) # 3
print(len('中國')) # 2
# 計算 bytes 的位元組數
print(len('abc'.encode('utf-8'))) # 3，經過utf-8編碼後，1個英文本元占用1個位元組
print(len('中文'.encode('utf-8'))) # 6，經過utf-8編碼後，1個中文字元占用3個位元組

'''
以unicode表示的str通過encode()方法可以編碼為指定的bytes
-- 純英文的str可以用ascii或者utf-8編碼為bytes，結果是一樣的，因為utf-8相容ascii
-- 含有中文的str可以用utf-8編碼為bytes
-- 含有中文的str不可以用ascii編碼，因為中文編碼的範圍超過了ascii編碼的範圍，python會報錯
'''s1 = 'abc'
s2 = '大中國'
s1.encode('utf-8')
s1.encode('ascii')
s2.encode('utf-8')
# s2.encode('ascii') # 報錯

'''
bytes.decode('utf-8')
'''s1 = 'abc'
s2 = '大中國'
b1 = s1.encode('utf-8')
b2 = s1.encode('ascii')
b3 = s2.encode('utf-8')
s1 = b1.decode('utf-8')
s2 = b2.decode('ascii')
s3 = b3.decode('utf-8')

Python字串編碼

在python中有些特殊的地方是存在兩種字串，分別為str和unicode字串，他們都繼承自basestring。如 s hello world s為str us u hello world us為unicode。使用help str 和help unicode 可以檢視各自說明，他們都有decod...

python字串編碼

常見字元編碼型別 ascii 美國資訊交換標準碼，是目前計算機中最廣泛使用的字符集編碼。每個 ascii 碼以 1 個位元組儲存，例如數字字元 0 的 ascii 碼是 0110000，十進位制表示為 48。unicode 為解決世界上上百種語言帶來混合衝突，各國有各國的標準，顯示很容易出現亂碼...

Python字串編碼

字串str 作用用來記錄文字資訊字面值表示方法用引號括起來的部分都是字串單引號雙引號三單引號三雙引號空字串字串的字面值表示方式 hello hello hello hello 單引號和雙引號的區別單引號內的雙引號不算結束符雙引號內的單引號不算結束符三引號的作用三引號內可以包...

Python 字串編碼

Python字串編碼

python字串編碼

Python字串 編碼

相關推薦

Python字串編碼