Nightmare.js, horsemanjs for node any of the headless browsers can also be used https://github.com/dhamaniasad/HeadlessBrowsers
Apart from these https://scrapy.org/ provides some good tools for the above purpose. Auto web extractor library helps you to parse any web page into structured data. You can find more details about this library on https://github.com/indix/web-auto-extractor