Beautiful Soup - unwrap() 方法



方法描述

unwrap() 方法與 wrap() 方法相反。它用標籤內部的內容替換標籤。它從元素中刪除標籤並返回它。

語法

unwrap()

引數

此方法不需要任何引數。

返回型別

unwrap() 方法返回已刪除的標籤。

示例 1

在下面的示例中,從 html 字串中刪除了<b> 標籤。

html = '''
<p>The quick, <b>brown</b> fox jumps over a lazy dog.</p>
'''
from bs4 import BeautifulSoup

soup = BeautifulSoup(html, "html.parser")
tag1 = soup.find('b')
newtag = tag1.unwrap()

print (soup)

輸出

<p>The quick, brown fox jumps over a lazy dog.</p>

示例 2

下面的程式碼列印 unwrap() 方法的返回值。

html = '''
<p>The quick, <b>brown</b> fox jumps over a lazy dog.</p>
'''
from bs4 import BeautifulSoup

soup = BeautifulSoup(html, "html.parser")
tag1 = soup.find('b')
newtag = tag1.unwrap()

print (newtag)

輸出

<b></b>

示例 3

unwrap() 方法對於去除標記很有用,如下面的程式碼所示:

html = '''
<html>
   <body>
      <p>The quick, brown fox jumps over a lazy dog.</p>
      <p>DJs flock by when MTV ax quiz prog.</p>
      <p>Junk MTV quiz graced by fox whelps.</p>
      <p>Bawds jog, flick quartz, vex nymphs.</p>
   </body>
</html>
'''
from bs4 import BeautifulSoup

soup = BeautifulSoup(html, "html.parser")
#print (soup.unwrap())
for tag in soup.find_all():
   tag.unwrap()
print (soup)

輸出

The quick, brown fox jumps over a lazy dog.
DJs flock by when MTV ax quiz prog.
Junk MTV quiz graced by fox whelps.
Bawds jog, flick quartz, vex nymphs.
廣告