網路爬蟲之BeautifulSoup入門（三）

3.點屬性、find、find_all

我們稱乙個tag的子節點指其包含的多個字串或其他的tag，如上篇1中的例子（不清楚的可以點此檢視），我們可以使用獲取tag的name的方法來獲取資料，如：

soup.head
# the dormouse's storytitle>
head>

這種點屬性操作可以進行多次操作呼叫，如：

soup.body

.b#the dormouse's story

通過點屬性的方法只可以獲取當前名字的第乙個tag值，其用法類似於find，如下：

soup.a
# elsiea>
soup.find('a')
# elsiea>

當然了，有只能查第乙個的，就有能查全部的，使用find_all,其返回值是乙個列表，用法如下：

soup.find_all('a')
# [elsiea>,
# laciea>,
# tilliea>]

有麼有查詢特定的呢？當然是有的了，接下來我們看有條件限制的

4.帶更多引數的find_all

這是官方文件給出的find_all方法的所有引數：find_all( name , attrs , recursive , string , **kwargs )，具體引數介紹及使用可檢視官方文件

在此我們僅給出部分常用的引數示例

soup.find_all("a", limit=2)
# [elsiea>,
# laciea>]

可以看出，僅僅返回了兩個，這是因為我們新增了限制返回數量為2。

soup.find_all(id='link2')
# ["sister" href="" id="link2">lacie]

以上為查詢id為『link2』的tag並返回，我們亦可使用下面的形式查詢含有id屬性的tag：

soup.find_all(id=true)
# [elsiea>,
# laciea>,
# tilliea>]

soup.find_all("a", class_="sister")
# [elsiea>,
# laciea>,
# tilliea>]

python 網路爬蟲 beautifulsoup

1.安裝beautifulsoup 2.使用beautifulsoup快速建立格式 from bs4 import beautifulsoup html 名字年齡性別地點小一 28 北京 soup beatifulsoup html print soup.prettify 3.使用beaut...

從零開始學網路爬蟲之BeautifulSoap

之前我們介紹了正規表示式，可能有的小夥伴也對寫正規表示式的用法還不夠熟練，沒關係，我們還有乙個更強大的工具，叫beautiful soup，它可以與requests配合使用，在獲得網頁原始碼後進行分析，實在是很方便。這一節就讓我們一就一起來學習一下beautiful soup。beautiful s...

爬蟲處理資料的方式（三）BeautifulSoup

使用beautifulsoup提取資料 from bs4 import beautifulsoup html html soup beautifulsoup html,lxml 建立乙個物件，接受html和解析方式 soup.a 拿到a標籤所有的內容，包括 soup.a.string 拿到a標籤裡面...

網路爬蟲之BeautifulSoup入門（三）

python 網路爬蟲 beautifulsoup

從零開始學網路爬蟲之BeautifulSoap

爬蟲處理資料的方式（三）BeautifulSoup

相關推薦