On the web you can find countless of tables. Those tables (and any webpage) is defined in HTML. So you need to parse HTML right?
Not exactly, there's a module called Pandas which parses the data for you. That data is then stored in a data structure named data frame.
Say you grab the table from https://www.fdic.gov/bank/individual/failed/banklist.html
#!/usr/bin/python3 import pandas as pd import numpy as np url ='https://www.fdic.gov/bank/individual/failed/banklist.html' res2=pd.read_html(url) print(res2) print("+"*50) print(res2["Bank Name"])
So the line
gets the whole table and puts it in a pandas data frame. That easy!
This line shows the whole table
for a specific column
So you can easily grab data from a webpage, without having to parse html language yourself.