当前位置：首页 > news >正文

网站开发可演示的版本中国装饰网

news 2025/11/15 6:06:35

网站开发可演示的版本,中国装饰网,适合网站设计的gif图片,金种子酒业网站建设bs4模块与案例使用指南 bs4#xff0c;全称BeautifulSoup 4#xff0c;是Python中一个强大的网页解析库#xff0c;它可以帮助我们方便地从网页中提取数据。bs4将复杂HTML文档转换成树形结构#xff0c;每个节点都是Python对象#xff0c;所有对象可以归纳为4种#xf…bs4模块与案例使用指南 bs4全称BeautifulSoup 4是Python中一个强大的网页解析库它可以帮助我们方便地从网页中提取数据。bs4将复杂HTML文档转换成树形结构每个节点都是Python对象所有对象可以归纳为4种TagNavigableStringBeautifulSoupComment。安装bs4 在Python环境下我们可以使用pip命令进行安装 pip install beautifulsoup4基本使用 1导入bs4库 from bs4 import BeautifulSoup2创建BeautifulSoup对象我们可以使用以下方式创建一个BeautifulSoup对象 soup BeautifulSoup(html_doc, html.parser)这里html_doc’是HTML文档字符串html.parser’是解析器。 3访问标签我们可以使用soup.tag_name访问HTML中的标签。例如 print(soup.title)输出 titleThe Dormouses story/title4获取标签的属性我们可以使用tag[‘attr’]或者tag.get(‘attr’)来获取标签的属性。例如 print(soup.a[href])输出 http://example.com/elsie5获取标签的内容我们可以使用tag.string或者tag.text来获取标签的内容。例如 print(soup.title.string)输出 The Dormouses story搜索文档树 bs4提供了多种方法来搜索文档树例如.find_all().find()等。例如 html_doc htmlheadtitleThe Dormouses story/title/head body p classtitlebThe Dormouses story/b/psoup BeautifulSoup(html_doc, html.parser) title_tag soup.find_all(title) print(title_tag)输出 [titleThe Dormouses story/title]网易云音乐案例获取并解析流行歌曲标题 import requests from bs4 import BeautifulSoupresponse requests.get(url https://music.163.com/discover/playlist/?cat%E6%B5%81%E8%A1%8C,headers {User-Agent:Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/120.0.0.0 Safari/537.36 Edg/120.0.0.0,Referer:https://music.163.com/} ) soup BeautifulSoup(response.text,featureshtml.parser) tag soup.find(nameul , attrs{id : m-pl-container}) # 定位到ul标签 for li in tag.find_all(recursiveFalse): # 遍历所有li标签不进行递归遍历a li.find(name a,attrs {class:msk}) # 获取a标签print(a.attrs[title]) # 通过attrs拿到标题属性也可直接使用a[title]来获取标题结果

查看全文

http://www.zqtcl.cn/news/550836/