Beautiful Soup - HTML and XML Parsing Library in python

#python #webscraping #beautifulsoup #programming

Beautiful Soup is a Python library designed for parsing HTML and XML documents. It makes it easy to navigate, search, and modify the parse tree of web pages. Beautiful Soup is widely used for web scraping, data extraction, and cleaning HTML content from websites. It works well with other libraries like requests to fetch web pages and provides a simple, Pythonic interface to handle complex HTML structures.

Installation:

pip install beautifulsoup4

Example usage:

from bs4 import BeautifulSoup

html_doc = "<html><body><h1>Hello World</h1></body></html>"
soup = BeautifulSoup(html_doc, "html.parser")
print(soup.h1.text)