jsoup is a Java library that makes it easy to work with real-world HTML and XML. It offers an easy-to-use API for URL fetching, data parsing, extraction, and manipulation using DOM API methods, CSS, ...
If you're reading this on github.com, please note that this is the readme for the development version and that some features described here might not yet have been ...