當前位置：首頁 > 编程语言 > python >内容正文

python

html表格转换为csv,python实现将html表格转换成CSV文件的方法

發布時間：2023/12/3 python 22 豆豆

生活随笔收集整理的這篇文章主要介紹了 html表格转换为csv,python实现将html表格转换成CSV文件的方法小編覺得挺不錯的,現在分享給大家,幫大家做個參考.

python實現將html表格轉換成CSV文件的方法

發布于 2015-11-18 16:53:39 | 155 次閱讀 | 評論: 0 | 來源: 網友投遞

Python編程語言Python 是一種面向對象、解釋型計算機程序設計語言，由Guido van Rossum于1989年底發明，第一個公開發行版發行于1991年。Python語法簡潔而清晰，具有豐富和強大的類庫。它常被昵稱為膠水語言，它能夠把用其他語言制作的各種模塊(尤其是C/C++)很輕松地聯結在一起。

這篇文章主要介紹了python實現將html表格轉換成CSV文件的方法,涉及Python操作csv文件的相關技巧,需要的朋友可以參考下

本文實例講述了python實現將html表格轉換成CSV文件的方法。分享給大家供大家參考。具體如下：

使用方法：python html2csv.py *.html

這段代碼使用了 HTMLParser 模塊

#!/usr/bin/python

# -*- coding: iso-8859-1 -*-

# Hello, this program is written in Python - http://python.org

programname = 'html2csv - version 2002-09-20 - http://sebsauvage.net'

import sys, getopt, os.path, glob, HTMLParser, re

try: import psyco ; psyco.jit() # If present, use psyco to accelerate the program

except: pass

def usage(progname):

''' Display program usage. '''

progname = os.path.split(progname)[1]

if os.path.splitext(progname)[1] in ['.py','.pyc']: progname = 'python '+progname

return '''%s

A coarse HTML tables to CSV (Comma-Separated Values) converter.

Syntax : %s source.html

Arguments : source.html is the HTML file you want to convert to CSV.

By default, the file will be converted to csv with the same

name and the csv extension (source.html -> source.csv)

You can use * and ?.

Examples : %s mypage.html

: %s *.html

This program is public domain.

Author : Sebastien SAUVAGE

http://sebsauvage.net

''' % (programname, progname, progname, progname)

class html2csv(HTMLParser.HTMLParser):

''' A basic parser which converts HTML tables into CSV.

Feed HTML with feed(). Get CSV with getCSV(). (See example below.)

All tables in HTML will be converted to CSV (in the order they occur

in the HTML file).

You can process very large HTML files by feeding this class with chunks

of html while getting chunks of CSV by calling getCSV().

Should handle badly formated html (missing

, , ,

extraneous , ...).

This parser uses HTMLParser from the HTMLParser module,

not HTMLParser from the htmllib module.

Example: parser = html2csv()

parser.feed( open('mypage.html','rb').read() )

open('mytables.csv','w+b').write( parser.getCSV() )

This class is public domain.

Author: Sébastien SAUVAGE

http://sebsauvage.net

Versions:

2002-09-19 : - First version

2002-09-20 : - now uses HTMLParser.HTMLParser instead of htmllib.HTMLParser.

- now parses command-line.

To do:

- handle

總結

以上是生活随笔為你收集整理的html表格转换为csv,python实现将html表格转换成CSV文件的方法的全部內容，希望文章能夠幫你解決所遇到的問題。

如果覺得生活随笔網站內容還不錯，歡迎將生活随笔推薦給好友。

上一篇： cad复制对象快捷键(cad复制快捷键命
下一篇： basemap安装_Python画地图逃