Extracting Web Page Data and Saving It as a CSV File

Source: Internet
import requests
import pandas as pd
from bs4 import BeautifulSoup

# Fetch the page (URL kept as given in the source).
r = requests.get('https://www.baidu.com')
soup = BeautifulSoup(r.text, 'html.parser')

# Each record is expected to sit in a <span class="short-desc"> element.
results = soup.find_all('span', attrs={'class': 'short-desc'})

records = []
for result in results:
    # Drop the last character of the <strong> text, then append the year.
    date = result.find('strong').text[0:-1] + ',2017'
    # Second child node of the span, minus its first character and last two.
    lie = result.contents[1][1:-2]
    # Link text without its first and last characters.
    explanation = result.find('a').text[1:-1]
    url = result.find('a')['href']
    records.append((date, lie, explanation, url))

# Build a DataFrame, parse the dates, and write the CSV file.
df = pd.DataFrame(records, columns=['date', 'lie', 'explanation', 'url'])
df['date'] = pd.to_datetime(df['date'])
df.to_csv('trump_lies.csv', index=False, encoding='utf-8')
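To try the same extract-then-save steps without network access or third-party packages, here is a minimal sketch that uses only the Python standard library. It swaps BeautifulSoup for `html.parser` and pandas for `csv`, so it is a stand-in for the approach above, not the same implementation. `SAMPLE_HTML` and its contents are made-up placeholder markup, assumed to match the shape the selectors imply, and the names `ShortDescParser` and `to_csv_text` are hypothetical.

```python
import csv
import io
from datetime import datetime
from html.parser import HTMLParser

# Placeholder markup (an assumption): one <span class="short-desc"> per record.
SAMPLE_HTML = (
    '<span class="short-desc"><strong>Jan. 21 </strong>'
    '"Sample claim." <a href="https://example.com/a">(Sample note.)</a></span>'
    '<span class="short-desc"><strong>Jan. 23 </strong>'
    '"Another claim." <a href="https://example.com/b">(Another note.)</a></span>'
)


class ShortDescParser(HTMLParser):
    """Collects (date, lie, explanation, url) tuples from short-desc spans."""

    def __init__(self):
        super().__init__()
        self.records = []
        self._row = None    # fields of the span being read, or None outside one
        self._field = None  # field that incoming text is appended to

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "span" and attrs.get("class") == "short-desc":
            self._row = {"date": "", "lie": "", "explanation": "", "url": ""}
            self._field = None
        elif self._row is not None and tag == "strong":
            self._field = "date"
        elif self._row is not None and tag == "a":
            self._field = "explanation"
            self._row["url"] = attrs.get("href", "")

    def handle_endtag(self, tag):
        if self._row is None:
            return
        if tag == "strong":
            self._field = "lie"   # text after </strong> is the second child node
        elif tag == "a":
            self._field = None    # ignore anything after the link
        elif tag == "span":
            row = self._row
            # Same slicing as the original code, then parse the date.
            raw_date = row["date"][0:-1] + ",2017"
            date = datetime.strptime(raw_date, "%b. %d,%Y").date().isoformat()
            self.records.append(
                (date, row["lie"][1:-2], row["explanation"][1:-1], row["url"])
            )
            self._row = None

    def handle_data(self, data):
        if self._row is not None and self._field is not None:
            self._row[self._field] += data


def to_csv_text(records):
    """Serializes the records to CSV text with a header row."""
    buf = io.StringIO()
    writer = csv.writer(buf, lineterminator="\n")
    writer.writerow(["date", "lie", "explanation", "url"])
    writer.writerows(records)
    return buf.getvalue()


parser = ShortDescParser()
parser.feed(SAMPLE_HTML)
csv_text = to_csv_text(parser.records)
print(csv_text)
```

To write a file instead of an in-memory string, the `io.StringIO()` buffer could be replaced with `open('trump_lies.csv', 'w', newline='', encoding='utf-8')`.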


