Welcome folks today in this post we will be converting html tables from website url
to csv files using python and htmltable2csv
library. All the full source code of the application is given below.
Get Started
In order to get started you need to install the following library using the pip
command as shown below
pip install htmltable2csv
After installing this library make an app.py
file and copy paste the following code
Usage
1. Using as a package inside a project.
a) Fetch HTML content from a Web Page URL.
1 2 3 4 |
from htmltable2csv.htmltable2csv import Table2CSV Table2CSV('http://example.com/page.html', '/path/to/destination') |
It will parse all the HTML tables in http://example.com/page.html and store theme in specified destination folder.
b) Read HTML content from a html file in the specified path.
1 2 3 4 |
from htmltable2csv.htmltable2csv import Table2CSV Table2CSV('/home/aqib/source.html', '/path/to/destination') |
It will parse all the HTML tables in /home/aqib/source.html file and store theme in specified destination folder.
2. Using as a utility from command line.
a) Fetch HTML content from a Web Page URL.
1 2 3 |
python3 -m htmltable2csv.htmltable2csv 'http://example.com/page.html' '/path/to/destination' |
b) Read HTML content from a html file in the specified path.
1 2 |
python3 -m htmltable2csv.htmltable2csv '/path/to/source.html' '/path/to/destination' |