阅读背景:

如何在阅读HTML文档中翻译/转换unicode转义?

来源:互联网 

When I read some (but not all) HTML files in python using a urllib2 opener, on some files I'm getting text filled with lots of backslashes and the unicode 003c strings. I'm sending this text into BeautifulSoup and am having trouble finding what I'm looking for with findAll(), and I'm now thinking it's due to all these unicode strings.When I read some (but not all) HTML files in py




你的当前访问异常,请进行认证后继续阅读剩余内容。

分享到: