[问题] 求救 ...困在编码问题 imimi PTT批踢踢实业坊

[问题] 求救 ...困在编码问题

楼主: imimi (im) 2016-08-30 22:03:44

想请教各位高手
我有个input.txt 我很确定他是用utf-8储存的
但是用以下的beautiful soup 处理 strip tag
却还是乱码想请问是不是哪里写错了谢谢
import requests
from bs4 import BeautifulSoup
res = requests.get('http://localhost/input.txt',verify=False)
obj=bytes(res.text, "utf-8")
UTF8str=obj.decode('utf-8', 'ignore')
soup = BeautifulSoup(UTF8str, "html.parser")
text = soup.getText()
print(text)
但是出来还是乱码
我是照著书打的实在不知道问题出在哪
感谢各位高手 ....orz

作者: Neverfor (八卦蟑螂) 2016-08-30 22:19:00

讨厌win10记事本

作者: enjoyloli (M) 2016-08-31 00:07:00

#-*- coding: utf-8 -*- 有用吗?

作者: uranusjr (â†é€™äººæ˜¯è¶…ç´šç¬¨è›‹) 2016-08-31 05:50:00

既然文字内容就是 UTF-8, 你为什么要 encode 又 decode直接把 res.text 拿去 parse 不就好了？

继续阅读

[问题] 在Windows下使用 Caffe 深度学习套件hardman1110 [问题] CentOS下python runtimeErrorsavageg [问题] Django新手连结MySQL数据库supershi [问题] 1.real 发生 SyntaxErrorpython27 [问题] sqlite存放图片问题hdin [问题] Atom与virtualenvchaostorm 八月份的 Taipei.pytimTan [资讯] pyclub - Taipei.py 8/27(六 sprint 活动keitheis [问题] 如何取得LP_c_char_p 型态?druid0214 [问题] 为什么一直import error.....andrew5106