[问题] 网页爬虫多个class疑问 sgps50601 PTT批踢踢实业坊

[问题] 网页爬虫多个class疑问

楼主: sgps50601 (é˜¿æ´‹) 2017-08-26 13:23:22

最近练习使用beautiful soup爬公司营收网页，发现这个表格里有两个class，一个是正数t3n1，另一个t3r1来区分负数，于是尝试用find_all再加上("td",{"class":"t3n1","class":"t3r1"})来抓取，却都只能抓到正数t3n1，还在思考是否少了什么，想询问一下高手的见解，以下检附程式码。
(网页片段)
...
<td class="t3n0">102.3Q</td>
<td class="t3n1">162,577</td>
<td class="t3n1">83,636</td>
<td class="t3n1">78,941</td>
<td class="t3n1">48.56%</td>
<td class="t3n1">59,618</td>
<td class="t3n1">36.67%</td>
<td class="t3r1">-268</td>
<td class="t3n1">59,350</td>
<td class="t3n1">51,952</td>
...
(Python程式码)
url = 'http://fund.bot.com.tw/Z/ZC/ZCE/ZCE_2330.DJHTM'
response = rq.get(url)
html_doc = response.text # text 属性就是 html 档案
soup = BeautifulSoup(response.text, "lxml") # 指定 lxml 作为解析器
revence_id =[] #营收单位
revence_date =[] #日期
revence_main =[] #金额
price =soup.find_all("td",{"class":"t3n1","class":"t3r1"})
print(price)

作者: wennie0433 2017-08-26 21:15:00

findAll('td', {'class':['t3n1','t3r1']})

楼主: sgps50601 (é˜¿æ´‹) 2017-08-26 22:12:00

感谢wennie大，可以抓到正负了，看来用字典似乎抓不到。

继续阅读

[问题] 如何把读wav档并转成向量？wallrose [问题] python 连 SQL serverf496328mm [问题] swapping elements of list in forloopdazmin [问题] 用keras模组时，要怎么用cuda和cudnnssd860505da [问题] str转换hiddenwind [问题] 关于重复搜寻paulluopaull [问题] python爬虫抓取资料问题blue14753 Re: [问题] 排列组合只取一半XII [闲聊] 专案诚征开发伙伴(非视觉资讯探究开发)TYHTW Re: [问题] 排列组合只取一半hutdris