Python爬虫基础（三）--将爬虫获取到的数据写入到csv

爬虫需要将网页获取的结果保存下来，现在先学习csv保存数据；

最终实现代码：

import requests
from bs4 import BeautifulSoup
import csv

def db():
    url = "https://www.douban.com/group/"
    headers = {
        "User-Agent":"Mozilla/5.0",
        "Cookie":''
    }
    ret = requests.get(url,headers = headers)
    return ret.content

#解析网页，并获取帖子的url、标题
def get_data(lst,html_data):
    soup = BeautifulSoup(html_data,"html.parser")
    for i in soup.find_all("a",attrs="title"):
        lst.append([i.attrs["href"],i.attrs["title"]])

#保存url、标题到csv文件中
def save_to_csv(lst):
    with open('test.csv','w',newline='',encoding='utf-8')as f:
        f_csv = csv.writer(f)
        for data in lst:
            f_csv.writerow(data)

def main():
    Html = db()
    lst = []
    get_data(lst,Html)
    save_to_csv(lst)

main()

python爬虫

Python爬虫基础（三）--将爬虫获取到的数据写入到csv

爬虫需要将网页获取的结果保存下来，现在先学习csv保存数据；

最终实现代码：

相关

python爬虫

Python爬虫--beautifulsoup 4 用法

python爬虫笔记

python爬虫入门之 requests 模块

Python爬虫基础（二）--beautifulsoup-美丽汤框架介绍

Python爬虫-解析网页之Re库

【Python爬虫】新发现一个高质量跳舞视频网站爬一下试试，男生都喜欢

python爬虫执行js代码-execjs

python爬虫-京东商品爬取

python爬虫（爬取图片）

python爬虫关于Max retries exceeded with url 的错误

Python爬虫实战一之爬取糗事百科段子

标签