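I recently tried out BeautifulSoup, a library for parsing HTML, and was impressed by how powerful it is. Just a few lines of code are enough to scrape the iPhone 4S information from Apple's official Hong Kong store, haha: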
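# Python 2, BeautifulSoup 3
from BeautifulSoup import BeautifulSoup
import urllib

# Download the iPhone 4S page from Apple's Hong Kong store.
webpage = urllib.urlopen(r"http://store.apple.com/hk-zh/browse/home/shop_iphone/family/iphone/iphone4s")
soup = BeautifulSoup(webpage.read())

# Find the <ul> that lists all models (matched on its exact class string).
tags = soup('ul', {'class': 'selection-options all-models'})

# Within it, keep <span> tags whose only attribute is one of the wanted classes.
tags = tags[0](lambda tag: len(tag.attrs) == 1 and tag.name in ['span'] and
               tag.get('class') in ['shipping', 'price', 'color', 'title'])

for tag in tags:
    print tag.text
    print '-' * 30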
Output (the shipping line, 估計付運時間:暫無供應, means "Estimated shipping time: currently unavailable"):
16GB1
------------------------------
black
------------------------------
HK$ 5,088
------------------------------
估計付運時間:暫無供應
------------------------------
32GB1
------------------------------
black
------------------------------
HK$ 5,888
------------------------------
估計付運時間:暫無供應
------------------------------
64GB1
------------------------------
black
------------------------------
HK$ 6,688
------------------------------
估計付運時間:暫無供應
------------------------------
16GB1
------------------------------
white
------------------------------
HK$ 5,088
------------------------------
估計付運時間:暫無供應
------------------------------
32GB1
------------------------------
white
------------------------------
HK$ 5,888
------------------------------
估計付運時間:暫無供應
------------------------------
64GB1
------------------------------
white
------------------------------
HK$ 6,688
------------------------------
估計付運時間:暫無供應
------------------------------
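The script above targets Python 2 and BeautifulSoup 3. As a rough sketch only, the same filter could be written for Python 3 with the `beautifulsoup4` package along the following lines. The HTML here is a small hypothetical sample modeled on the class names used in the script, not a copy of the real store page, and the names `SAMPLE_HTML`, `WANTED`, `is_wanted_span`, and `extract_fields` are made up for illustration.

```python
from bs4 import BeautifulSoup

# Hypothetical sample markup, loosely modeled on the class names used in the
# script above; it is NOT a copy of the real store page.
SAMPLE_HTML = """
<ul class="selection-options all-models">
  <li>
    <span class="title">16GB</span>
    <span class="color">black</span>
    <span class="price">HK$ 5,088</span>
    <span class="shipping">n/a</span>
    <span class="price" id="x">ignored: has two attributes</span>
  </li>
</ul>
"""

WANTED = {"shipping", "price", "color", "title"}


def is_wanted_span(tag):
    """True for <span> tags whose only attribute is one of the wanted classes."""
    if tag.name != "span" or len(tag.attrs) != 1:
        return False
    # In beautifulsoup4, "class" is a list of class names, not a string.
    classes = tag.get("class", [])
    return len(classes) == 1 and classes[0] in WANTED


def extract_fields(html):
    """Return the text of every wanted <span> inside the model list."""
    soup = BeautifulSoup(html, "html.parser")
    # CSS selector: a <ul> carrying both classes, in any order.
    container = soup.select_one("ul.selection-options.all-models")
    if container is None:
        return []
    return [tag.get_text(strip=True) for tag in container.find_all(is_wanted_span)]


if __name__ == "__main__":
    for text in extract_fields(SAMPLE_HTML):
        print(text)
        print("-" * 30)
```

The main differences from the original are the import path (`bs4`), `print` as a function, and the fact that `tag["class"]` is a list in `beautifulsoup4`, so the membership test has to look at the list's elements. To run it against a live page, the HTML string could be fetched with `urllib.request.urlopen(url).read()`. The URL and markup in the original post may no longer exist.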
For more about BeautifulSoup, see http://www.crummy.com/software/BeautifulSoup/documentation.zh.html. Come and join the ranks of Pythonistas, haha.
My Weibo: http://weibo.com/lei6744