阅读背景:

如何刮掉格式错误的HTML

来源:互联网 

I'm trying to scrape a really really old page that looks like it was built with FrontPage or even just pasted from a Word document. It's full of font tags that can spontaneously stop and start in the middle of a word, or similar elements at randomly different tree depths.I'm trying to scrape a really really old page t




你的当前访问异常,请进行认证后继续阅读剩余内容。

分享到: