Batch-renaming PDF papers and building a table of contents with Python

Published: 2024/7/5

Related article, batch statistics processing of PDF files with Python: https://www.jianshu.com/p/1ec8f4314611
Step 1: Batch-read the PDF files

# -*- coding: utf-8 -*-
"""
Created on Sat Jun 8 15:30:22 2019
@author: Administrator
"""
import os
from io import StringIO

# process_pdf is provided by the pdfminer3k distribution of pdfminer.
from pdfminer.converter import TextConverter
from pdfminer.layout import LAParams
from pdfminer.pdfinterp import PDFResourceManager, process_pdf
from docx import Document
from docx.shared import Pt
from docx.shared import RGBColor

# Folder holding the PDF papers; a raw string keeps the backslashes literal.
PDF_DIR = r'G:\嬰兒識別項目\TEMP'

# Publisher boilerplate lines that are left out of the catalog.
SKIP_LINES = ('', 'Contents lists available at ScienceDirect',
              'International Journal of Refrigeration ',
              'journal homepage: www.elsevier.com/locate/ijrefrig ',
              'a , ', 'b , ', 'c , ', 'd , ', 'e , ')


def read_pdf(pdf):
    """Extract the text of an open PDF file and return it as a list of lines."""
    # resource manager
    rsrcmgr = PDFResourceManager()
    retstr = StringIO()
    laparams = LAParams()
    # device
    device = TextConverter(rsrcmgr, retstr, laparams=laparams)
    process_pdf(rsrcmgr, device, pdf)
    device.close()
    content = retstr.getvalue()
    retstr.close()
    # split the extracted text into lines
    return str(content).split("\n")


def add_text(paragraph, text):
    """Append text to the paragraph as a 12 pt Arial run and return the run."""
    run = paragraph.add_run(text)
    run.font.size = Pt(12)
    run.font.name = u'Arial'
    return run


if __name__ == '__main__':
    article_name = sorted(os.listdir(PDF_DIR))
    test = Document()
    p = test.add_paragraph(u'目錄')  # "Table of Contents"
    for article in article_name:
        # Entry heading: the file name without its ".pdf" extension, in bold blue.
        run = add_text(p, article[:len(article)-4])
        run.font.color.rgb = RGBColor(0, 0, 255)
        run.bold = True
        p.add_run('\n')
        # Open by full path; a bare file name only works when run from inside PDF_DIR.
        with open(os.path.join(PDF_DIR, article), "rb") as my_pdf:
            lines = read_pdf(my_pdf)
        # The first line is skipped here and appended after the others below.
        for line in lines[1:]:
            if line == u'?':  # stop marker, kept as it appears in the source
                break
            if line in SKIP_LINES:
                continue
            add_text(p, line)
        p.add_run('\n')
        add_text(p, lines[0])
        p.add_run('\n')
        p.add_run('\n')
    test.save(u'C:/Users/Administrator/Desktop/目錄.docx')

#%% Debug cell: read a single PDF to inspect its extracted lines
# with open(os.path.join(PDF_DIR, article_name[2]), "rb") as my_pdf:
#     lines = read_pdf(my_pdf)
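The line-filtering part of the script above is easier to check when it is pulled out into a small pure function that needs neither pdfminer nor python-docx. The following is only a sketch under a few assumptions: the names `extract_entry_lines` and `DEFAULT_SKIP` are hypothetical, and unlike the script it strips surrounding whitespace before comparing, so the boilerplate strings are listed without their trailing spaces.

```python
from typing import FrozenSet, Iterable, List

# Assumed defaults, taken from the literals in the script above with the
# trailing spaces removed; adjust them for other journals.
DEFAULT_SKIP: FrozenSet[str] = frozenset({
    "",
    "Contents lists available at ScienceDirect",
    "International Journal of Refrigeration",
    "journal homepage: www.elsevier.com/locate/ijrefrig",
    "a ,", "b ,", "c ,", "d ,", "e ,",
})


def extract_entry_lines(lines: Iterable[str],
                        skip: FrozenSet[str] = DEFAULT_SKIP,
                        stop_marker: str = "?") -> List[str]:
    """Return the text lines to place under one table-of-contents entry.

    Follows the order used by the script: the first extracted line is held
    back and appended last, boilerplate lines are dropped, and reading stops
    at the first line equal to stop_marker.
    """
    lines = list(lines)
    if not lines:
        return []
    kept = []
    for line in lines[1:]:
        text = line.strip()  # assumption: compare on stripped text
        if text == stop_marker:
            break
        if text in skip:
            continue
        kept.append(text)
    kept.append(lines[0].strip())
    return kept
```

With a function like this, the list returned by `read_pdf` can be filtered first and the surviving lines handed to `add_run` one by one, and the skip set can be tuned on sample text before any Word document is written.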

Step 2: Batch-rename the files

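# -*- coding: utf-8 -*-
"""
Created on Wed Jun 5 14:37:46 2019
@author: Administrator
"""
import os

folder = './July'
# Sort so the numbering is reproducible; os.listdir() returns names in arbitrary order.
pdf_names = sorted(os.listdir(folder))
for i, old_name in enumerate(pdf_names, start=1):
    new_name = '2019 Jul-' + str(i) + '.pdf'
    os.rename(os.path.join(folder, old_name), os.path.join(folder, new_name))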

See also:
https://www.jianshu.com/p/1ec8f4314611
