如何在 python 中使用 beautifulsoup4 来抓取标签中的内容

如题所述

举报该文章

相关建议 2017-03-25

æºä»£ç
from bs4 import BeautifulSoup
html_doc = '''
<div class="line-title">


111

ï¼222ï¼





ç¼è¾

</div>
'''
soup = BeautifulSoup(html_doc, "html.parser")
# åçº§ç
didi = soup.b.next_element.strip()
invest = soup.b.span.next_element.strip()
# è¿é¶ç
didi, invest = soup.b.stripped_strings

温馨提示：内容为网友见解，仅供参考

当前网址：https://aolonic.com/aa/a5gaakgk5dd4n13kkkw.html

其他看法

第1个回答 2017-03-25

因为你的html不是合法的xml格式，标签没有成对出现，只能用html解析器 from bs4 import BeautifulSoups

相似回答

大家正在搜