Solving Chinese encoding problems in Python 2.7

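First, a rough introduction to encodings. Think of bytes as facing the computer and characters as facing people; converting between the two is encoding (characters to bytes) and decoding (bytes to characters). Among the various encodings, ASCII is a 7-bit code: it uses 7 bits, less than a full byte, per character, so it can represent at most 128 characters. ISO 8859-1 uses one full byte, 8 bits, per character and can represent up to 256 values. GB2312 uses 2 bytes, 16 bits, per Chinese character and covers more than 7,000 characters.

Unicode (formally the "Universal Multiple-Octet Coded Character Set", UCS for short, commonly called "Unicode") contains the characters of all the encodings above and gives every character a unique code point. With the rapid growth of the Internet, the transmission format had to be standardized, which is the job of UTF (UCS Transformation Format): UTF-8 works in 8-bit units, and UTF-16 appeared later. Converting GBK bytes to unicode is called decode, and converting unicode to GBK bytes is called encode. Let's see what this looks like in Python.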
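As a quick check of the byte counts and the decode/encode direction described above, here is a minimal sketch. The helper name `encoded_lengths` and the sample character are illustrative assumptions, not part of the original post; the code is written so that it should run unchanged on Python 2.7 and Python 3.

```python
# -*- coding: utf-8 -*-
# Illustrative sketch: how many bytes one Chinese character takes in
# different encodings, and which direction is "encode" vs "decode".

def encoded_lengths(text, encodings=('gbk', 'utf-8', 'utf-16-le')):
    """Return {encoding name: number of bytes} for a unicode string."""
    return dict((name, len(text.encode(name))) for name in encodings)

# The character "zhong" (U+4E2D), written as an escape so the file stays ASCII.
sample = u'\u4e2d'

lengths = encoded_lengths(sample)

# unicode -> bytes is "encode"; bytes -> unicode is "decode".
gbk_bytes = sample.encode('gbk')
round_trip = gbk_bytes.decode('gbk')

# Decoding with the wrong codec is where garbled text or errors come from:
# these GBK bytes are not valid UTF-8.
try:
    gbk_bytes.decode('utf-8')
    wrong_codec_failed = False
except UnicodeDecodeError:
    wrong_codec_failed = True
```

For this character, GBK needs 2 bytes and UTF-8 needs 3, which is why bytes written under one encoding cannot simply be read back under the other.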

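# -*- coding: utf-8 -*-
# Read one line from the console and save it to test.txt as UTF-8 (Python 2.7)
import sys

# Encodings the console uses for input and output
print sys.stdin.encoding
print sys.stdout.encoding

# Encode the prompt for the console, then read the raw bytes the user typed
input_str = raw_input(u'輸入：'.encode(sys.stdout.encoding))

# Decode the input to unicode, re-encode it as UTF-8, and write it out
f = open('test.txt', 'w')
f.write(input_str.decode(sys.stdin.encoding).encode('utf-8'))
f.close()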

By detecting the encodings the system uses for input and output, garbled Chinese text can be avoided.
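One caveat worth guarding against: in Python 2, the `encoding` attribute of `sys.stdin` or `sys.stdout` may be `None` when the stream is redirected to a file or pipe, in which case passing it to `encode()` or `decode()` fails. The sketch below shows one possible fallback. The helper names `stream_encoding` and `to_utf8` and the UTF-8 default are assumptions made for illustration, not something the original post prescribes.

```python
# -*- coding: utf-8 -*-
import sys

def stream_encoding(stream, default='utf-8'):
    """Return the stream's declared encoding, or `default` if it has none."""
    return getattr(stream, 'encoding', None) or default

def to_utf8(raw, source_encoding):
    """Turn console input into UTF-8 bytes.

    Byte strings (what raw_input returns in Python 2) are first decoded
    with source_encoding; unicode strings are encoded directly.
    """
    if isinstance(raw, bytes):
        raw = raw.decode(source_encoding)
    return raw.encode('utf-8')

# Detected (or assumed) console encodings for this process.
in_enc = stream_encoding(sys.stdin)
out_enc = stream_encoding(sys.stdout)
```

With these helpers, the write step could become `f.write(to_utf8(input_str, in_enc))`, so the script still works when its input comes from a pipe, provided the piped data really is in the assumed default encoding.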
