爬取58二手房的放原標題

import requests
from bs4 import beautifulsoup
import re
from lxml import etree
import time
# 需求:爬取58二手房的**資訊
if __name__ ==
"__main__"
: headers =
# 爬取到頁面原始碼資料
url =
''page_text = requests.get(url=url, headers=headers)
.text
# 資料解析
tree = etree.html(page_text)
# 儲存li標籤物件
li_list = tree.xpath(
'//ul[@class="house-list-wrap"]/li'
) fp =
open
('58.txt'
,'w'
, encoding=
'utf-8'
)for li in li_list:
# 從當前li標籤中解析出a標籤
title = li.xpath(
'./div[2]/h2/a/text()')[
0]# ./標識的就是li.xpath裡的li(整個頁面原始碼的區域性內容),也就是表示從當前的li開始為根節點進行搜尋
print
(title)
# 資料持久化儲存
fp.write(title+
'\n'
)

Python爬取58同城二手房資訊的標題名稱

今天，我們用python來爬取58同城頁面二手房資訊的資料。首先開啟爬取頁面原始碼資料 page text requests.get url url,headers headers text 資料解析 tree etree.html page text 儲存li標籤物件 li list tree....

爬取二手房資訊

開源到github了專案位址基於springboot,idea 匯入依賴 org.jsoupgroupid jsoupartifactid 1.10.2version dependency 資料放入redis中,引人redis org.springframework.bootgroupid sp...

09 58二手房標題

import requests from lxml import etree url headers response requests.get url url,headers headers page text response.text tree etree.html page text nam...

爬取58二手房的放原標題

Python爬取58同城二手房資訊的標題名稱

爬取二手房資訊

09 58二手房標題

相關推薦