日韩性视频-久久久蜜桃-www中文字幕-在线中文字幕av-亚洲欧美一区二区三区四区-撸久久-香蕉视频一区-久久无码精品丰满人妻-国产高潮av-激情福利社-日韩av网址大全-国产精品久久999-日本五十路在线-性欧美在线-久久99精品波多结衣一区-男女午夜免费视频-黑人极品ⅴideos精品欧美棵-人人妻人人澡人人爽精品欧美一区-日韩一区在线看-欧美a级在线免费观看

歡迎訪問 生活随笔!

生活随笔

當前位置: 首頁 > 编程语言 > python >内容正文

python

Python 爬取斗图啦图片

發布時間:2024/4/13 python 31 豆豆
生活随笔 收集整理的這篇文章主要介紹了 Python 爬取斗图啦图片 小編覺得挺不錯的,現在分享給大家,幫大家做個參考.

斗圖啦

requests
BeautifulSoup4
代碼
# -*- coding:utf-8 -*- # pip install requests 框架 import requests # pip install beautifulsoup4 框架 # pip install lxml 解析器 from bs4 import BeautifulSoup import osclass doutuSpider(object):headers = {"user-agent": "Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/52.0.2743.116 Safari/537.36"}def get_url(self, url):data = requests.get(url, headers=self.headers)soup = BeautifulSoup(data.content, 'lxml')totals = soup.findAll("a", {"class": "list-group-item"})for one in totals:sub_url = one.get('href')global pathpath = 'E:\\img' + '\\' + sub_url.split('/')[-1]os.mkdir(path)try:self.get_img_url(sub_url)except:passpasspassdef get_img_url(self, url):data = requests.get(url, headers = self.headers)soup = BeautifulSoup(data.content, 'lxml')totals = soup.findAll('div', {'class': 'artile_des'})for one in totals:img = one.find('img')try:sub_url = img.get('src')except Exception as e:raise efinally:urls = sub_urltry:self.get_img(urls)except:print urlspasspasspassdef get_img(self, url):filename = url.split('/')[-1]global pathimg_path = path + '\\' + filenameimg = requests.get(url, headers = self.headers)try:with open(img_path, 'wb') as f:f.write(img.content)except:passpassdef create(self):for count in range(1,10):url = 'https://www.doutula.com/article/list/?page={}'.format(count)print 'download {} page'.format(count)self.get_url(url)passpassif __name__ == '__main__':doutu = doutuSpider()doutu.create() 超強干貨來襲 云風專訪:近40年碼齡,通宵達旦的技術人生

總結

以上是生活随笔為你收集整理的Python 爬取斗图啦图片的全部內容,希望文章能夠幫你解決所遇到的問題。

如果覺得生活随笔網站內容還不錯,歡迎將生活随笔推薦給好友。