EmilStenstrom/justhtml: A pure Python HTML5 parser that just works. No C extensions to compile. No system dependencies to install. No complex API to learn.
lxml - Processing XML and HTML with Python
jaypyles/Scraperr: Self-hosted web scraper.
PyPI: beautifulsoup4
Beautiful Soup: Homepage