I am using JSoup, which is an HTML parser library in pure Java, to pull and parse pure HTML pages (well, with very little JS) off the internet. So far, everything is working fine if the data I want to scrape is attached to a very verbose HTML element (like I am using JSoup, which is an HTML parser libra