Python Crawler: Scraping Baidu Translate and Extracting Data with json

Tools: Python 3.6.5, the PyCharm IDE, Windows 10

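Description: This example is a small program that translates Chinese input into English. It is aimed at beginners learning Python web scraping; if you are interested, you can extend it to translate English into Chinese, add a word-lookup feature, and so on. Chrome or Firefox is recommended for inspecting page elements. Before running it, install the requests module: pip install requests. The json module is part of the Python standard library and does not need to be installed.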

Data extraction method: json

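1. JSON is a data-interchange format; it looks like a string representation of Python types (lists, dicts)

2. Import json before using it

3. json.loads

(1) Converts a JSON string into a Python type

(2) json.loads(json_string)

4. json.dumps

(1) Converts a Python type into a JSON string

(2) json.dumps({})

(3) json.dumps(ret1, ensure_ascii=False, indent=2)

ensure_ascii=False

Outputs Chinese characters as-is instead of as \uXXXX escapes; indent=2 pretty-prints the result with two-space indentation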
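
The points above can be sketched as a short round trip. This is a minimal illustration only: the sample JSON string is made up for the example, and ret1 simply reuses the variable name from item 4 (3).

```python
import json

# A JSON string such as might arrive in a response body (made-up sample data).
json_str = '{"word": "你好", "meanings": ["hello", "hi"]}'

# json.loads: JSON string -> Python object (a dict here).
ret1 = json.loads(json_str)

# json.dumps: Python object -> JSON string.
# By default, non-ASCII characters are written as \uXXXX escapes.
escaped = json.dumps(ret1)
# ensure_ascii=False keeps the Chinese characters; indent=2 pretty-prints.
readable = json.dumps(ret1, ensure_ascii=False, indent=2)

print(type(ret1))
print(escaped)
print(readable)
```

Both strings decode back to the same dict; ensure_ascii only changes how the text is written out, not the data it represents.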

Code:

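import requests
import json

url = ""  # request URL (missing from the original listing)
query_str = input("Enter the Chinese text to translate: ")

data = {}  # form data (contents missing from the original listing)
headers = {}  # request headers (contents missing from the original listing)

response = requests.post(url, data=data, headers=headers)
html_str = response.content.decode()  # JSON string

# json is a data-interchange format; import it before use
# Convert the JSON string into a Python type
dict_ret = json.loads(html_str)
# print(dict_ret)
# print(type(dict_ret))
ret = dict_ret["trans"][0]["dst"]
print("The translation is:", ret)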
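
The lookup dict_ret["trans"][0]["dst"] raises an exception if the response does not have the expected shape, for example when the server returns an error payload. Below is a minimal sketch of a more defensive version. The helper name extract_translation and both sample payloads are hypothetical; they are modeled only on the key path used in the code above, not on a documented response format.

```python
import json


def extract_translation(html_str):
    """Return the first translated string, or None if the payload has another shape."""
    try:
        dict_ret = json.loads(html_str)
        return dict_ret["trans"][0]["dst"]
    except (ValueError, KeyError, IndexError, TypeError):
        # ValueError covers invalid JSON (json.JSONDecodeError is a subclass);
        # the others cover missing keys, an empty list, or a non-dict payload.
        return None


# Hypothetical payloads for illustration only.
sample = '{"trans": [{"src": "你好", "dst": "Hello"}]}'
result = extract_translation(sample)
error_result = extract_translation('{"errno": 1}')
bad_result = extract_translation("not json")

print("The translation is:", result)
```

Returning None lets the caller decide how to report a failed lookup instead of crashing on a KeyError.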
