[Question] BeautifulSoup usage question

Original poster: gigigigi (gigigigi)   2017-10-16 02:03:36
<html>
<body>
<div class="section-content-row">
<h2 class="rt-goods-list-item-name">
<a class="rt-goods-list-item-name-link"
href="http://goods.ruten.com.tw/item/show?21716971337096" title="前往Lowe
Alpine Strike 24 运动背包/都会日用后背包 FDP55 黑商品页面" target="_blank">
<span class="rt-goods-list-item-name-text"
itemprop="name">Lowe Alpine Strike 24 运动背包/都会日用后背包 FDP55 黑</span>
</a>
</h2>
<div class="rt-goods-list-item-price-sell">
<div class="rt-goods-list-item-price">
<strong class="item-price-symbol rt-text-larger rt-text-price"
itemprop="offers" itemscope="" itemtype="http://schema.org/Offer">2,363<meta
itemprop="price" content="2363"><meta itemprop="priceCurrency" content="TWD">
</strong>
</div>
</div>
</div>
</body>
</html>
I want to extract the item name, URL, and price.
from bs4 import BeautifulSoup

# html_doc holds the HTML shown above
soup = BeautifulSoup(html_doc, 'html.parser')
for each_div in soup.find_all('div', {'class': 'section-content-row'}):
    # Item page URLs
    urls = each_div.find_all('a', {'class': 'rt-goods-list-item-name-link'})
    url = [a['href'] for a in urls]
    print(url)
    # Item names
    titles = each_div.find_all('span', {'class': 'rt-goods-list-item-name-text'})
    title = [t.get_text() for t in titles]
    print(title)
    # Prices
    prices = each_div.find_all('strong', {'class': 'rt-text-price'})
    price = [p.get_text() for p in prices]
    print(price)
But writing the code this way feels clumsy. Is there a more concise way to do it?
My test code is below:
https://gist.github.com/anonymous/f0fd45ee679e72efadaa979ab168c182
Thanks
Author: chuink (九墨)   2017-10-16 02:58:00
Item name: each_div.h2.span.text
Item URL: each_div.h2.a['href']
Price: each_div.div.div.strong.text
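
For anyone who wants to try the dotted access suggested above, here is a minimal sketch, not a definitive version. It assumes the markup looks exactly like the sample in the question; `html_doc` is a trimmed copy of that sample (title attribute left out), and `extract_items` is a hypothetical helper name that does not appear in the thread. Dotted access returns only the first matching descendant, and it raises AttributeError if a tag along the chain is missing, so it depends on the page layout staying the same. `get_text(strip=True)` is used for the price because the `<strong>` tag's text ends with a newline.

```python
from bs4 import BeautifulSoup

# Trimmed copy of the HTML from the question (title attribute omitted).
html_doc = """
<div class="section-content-row">
<h2 class="rt-goods-list-item-name">
<a class="rt-goods-list-item-name-link"
href="http://goods.ruten.com.tw/item/show?21716971337096" target="_blank">
<span class="rt-goods-list-item-name-text"
itemprop="name">Lowe Alpine Strike 24 运动背包/都会日用后背包 FDP55 黑</span>
</a>
</h2>
<div class="rt-goods-list-item-price-sell">
<div class="rt-goods-list-item-price">
<strong class="item-price-symbol rt-text-larger rt-text-price"
itemprop="offers" itemscope="" itemtype="http://schema.org/Offer">2,363<meta
itemprop="price" content="2363"><meta itemprop="priceCurrency" content="TWD">
</strong>
</div>
</div>
</div>
"""


def extract_items(html):
    """Return one dict per item row (hypothetical helper, not from the thread)."""
    soup = BeautifulSoup(html, 'html.parser')
    items = []
    for each_div in soup.find_all('div', class_='section-content-row'):
        items.append({
            # First <span> inside the first <h2> of this row
            'name': each_div.h2.span.get_text(strip=True),
            # href of the first <a> inside the first <h2>
            'url': each_div.h2.a['href'],
            # First <strong> two <div> levels down; strip the trailing newline
            'price': each_div.div.div.strong.get_text(strip=True),
        })
    return items


items = extract_items(html_doc)
for item in items:
    print(item['name'], item['url'], item['price'])
```

If the page layout is likely to change, keeping the class-based `find_all` lookups from the question is the safer choice, since they do not depend on the nesting order of the tags.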
