How to get the profile information of QQ group members (web scraper)
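1. Prerequisite: you must be a member of the group.
First, copy the address below:
https://qun.qq.com/member.html#gid=######
Replace ***######*** with the number of the QQ group you want to scrape.
Then open the completed link, for example https://qun.qq.com/member.html#gid=123456
2. Open PyCharm and paste in the following code.
Pay attention to the formatting (Python is indentation-sensitive).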
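The address can also be assembled in code rather than by hand. This is a minimal sketch that assumes the URL pattern shown above; `member_page_url` is a hypothetical helper name and is not part of the original script:

```python
def member_page_url(group_id):
    """Return the member-list URL for a QQ group number (hypothetical helper)."""
    gid = str(group_id).strip()
    # Reject anything that is not purely numeric, e.g. a leftover "######" placeholder
    if not gid.isdigit():
        raise ValueError("group number must be numeric: %r" % (group_id,))
    return "https://qun.qq.com/member.html#gid=" + gid


print(member_page_url(123456))
```

The returned string is what you would pass to `browser.get(...)` in the script from step 2.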
from time import sleep

import openpyxl
from selenium import webdriver
from selenium.webdriver.chrome.service import Service
from selenium.webdriver.common.by import By

# 1. Create the Chrome browser object; this opens a new browser window on your computer.
# Selenium 4 style. On Selenium 3, pass executable_path=... directly to webdriver.Chrome().
browser = webdriver.Chrome(service=Service(r"G:\chromedownload\chromedriver"))  # path to the driver

# 2. Have the browser request the URL from the server.
browser.get("https://qun.qq.com/member.html#gid=522311269")  # the URL that loads the group member list
sleep(20)  # fixed wait so the page can finish loading (log in during this time if prompted)

# Each XPath selects one column of the member table (the element with class "list").
all_number_nickname = browser.find_elements(By.XPATH, '//*[@class="list"]/tr/td[3]/span[1]')
all_number_name = browser.find_elements(By.XPATH, '//*[@class="list"]/tr/td[4]/span[1]')
all_number_order = browser.find_elements(By.CLASS_NAME, 'td-no')
all_number_qq = browser.find_elements(By.XPATH, '//*[@class="list"]/tr/td[5]')
all_number_sex = browser.find_elements(By.XPATH, '//*[@class="list"]/tr/td[6]')
all_number_qqage = browser.find_elements(By.XPATH, '//*[@class="list"]/tr/td[7]')
all_number_intime = browser.find_elements(By.XPATH, '//*[@class="list"]/tr/td[8]')
all_number_marks = browser.find_elements(By.XPATH, '//*[@class="list"]/tr/td[9]')
all_number_lastsaytime = browser.find_elements(By.XPATH, '//*[@class="list"]/tr/td[10]')

# Optional: uncomment to print every scraped column for checking.
# for i in [all_number_qq, all_number_nickname, all_number_name, all_number_order, all_number_sex,
#           all_number_qqage, all_number_intime, all_number_marks, all_number_lastsaytime]:
#     for j in i:
#         print(j.text)

# Build one row per member: [QQ number, gender].
rows = []
for k in range(len(all_number_qq)):
    rows.append([all_number_qq[k].text, all_number_sex[k].text])


def write_excel_xlsx(path, sheet_name, value):
    """Write a list of rows to a new .xlsx file, one list per spreadsheet row."""
    workbook = openpyxl.Workbook()
    sheet = workbook.active
    sheet.title = sheet_name
    for i in range(len(value)):
        for j in range(len(value[i])):
            # openpyxl rows and columns are 1-based
            sheet.cell(row=i + 1, column=j + 1, value=str(value[i][j]))
    workbook.save(path)
    print("Data written to the xlsx file successfully!")


book_name_xlsx = 'python群6666.xlsx'
sheet_name_xlsx = 'python群6666'
write_excel_xlsx(book_name_xlsx, sheet_name_xlsx, rows)
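The row-building loop in the script keeps only the QQ number and gender. If you want more of the scraped columns in the spreadsheet, the same idea extends naturally with `zip`. Below is a minimal sketch on plain strings (in the real script each value would come from an element's `.text`); `build_rows` and the header labels are hypothetical names chosen for illustration:

```python
def build_rows(header, *columns):
    """Combine parallel column lists into rows, with the header as the first row."""
    if len(header) != len(columns):
        raise ValueError("need exactly one header label per column")
    rows = [list(header)]
    # zip stops at the shortest column, so a partially loaded table cannot raise IndexError
    for values in zip(*columns):
        rows.append(list(values))
    return rows


# Stand-in data; in the scraper these might be [e.text for e in all_number_qq], etc.
qq = ["10001", "10002"]
sex = ["male", "female"]
joined = ["2019/01/01", "2020/05/06"]

for row in build_rows(["QQ", "Gender", "Joined"], qq, sex, joined):
    print(row)
```

The resulting list of rows has the shape that `write_excel_xlsx` expects for its `value` argument.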
3. Once the code's formatting (indentation) has been fixed, run it.
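The script waits a fixed 20 seconds before reading the table. If the page is slow, or you need longer to log in, that may not be enough. One alternative is to poll until the rows appear and give up after a timeout. Selenium ships `WebDriverWait` for this purpose; the sketch below shows the same idea in plain Python so it can be tried without a browser. `wait_until` is a hypothetical helper, not something from the original script:

```python
import time


def wait_until(condition, timeout=20.0, interval=0.5):
    """Call condition() repeatedly until it returns a truthy value.

    Returns that value, or raises TimeoutError once `timeout` seconds have passed.
    """
    deadline = time.monotonic() + timeout
    while True:
        result = condition()
        if result:
            return result
        if time.monotonic() >= deadline:
            raise TimeoutError("condition not met within %.1f seconds" % timeout)
        time.sleep(interval)


# In the scraper, `condition` could be a function that queries the page for the
# 'td-no' cells and returns the list (an empty list means "not ready yet").

# Browser-free demonstration: the condition becomes true on the third call.
calls = []


def ready():
    calls.append(1)
    return len(calls) >= 3


print(wait_until(ready, timeout=5, interval=0.01))
```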
4. For questions or discussion, contact me on QQ: 641616154