© 2022 Hashnode
#web-scraping
What will be scraped Preparation First, we need to create a Node.js* project and add npm packages puppeteer, puppeteer-extra and puppeteer-extra-plugin-stealth to control Chromium (or Chrome, or Fi…
A proxy can hide your IP, but what happens when that gets banned? You would need a new IP. Or you could maintain a list of them and rotate proxies for each request. The final option would be to use Sm…
What will be scraped Preparation First, we need to create a Node.js* project and add npm packages cheerio to parse parts of the HTML markup, and axios to make a request to a website. To do this, in…
Introduction The term "Web Scraping" refers to the extraction of data from a website quickly and accurately. The web scraping software may directly access the World Wide Web using the Hypertext Transf…
What will be scraped Prerequisites Basic knowledge scraping with CSS selectors If you haven't scraped with CSS selectors, there's a dedicated blog post of mine about how to use CSS selectors when web…
What will be scraped 📌Note: the knowledge graph has different layouts, so the code I provide works with this layout as shown in the screenshot. Preparation First, we need to create a Node.js* proj…
In order to protect themselves from credentials stuffing, SoundCloud is using Human antibot protection, formerly known as WhiteOps. But is it 100% bulletproof ? In this article, I will show you how to bypass this antibot and how to attempt …
Introduction Reddit.com is a social news website and forum where content is socially curated and promoted by site members through voting. In this article, you will read about the easiest way to scrape…
❗❗❗ This blog post was written for an old Google Play page design. The code is currently broken due to a redesign of the Google Play website, we're currently working on a fix. What will be scraped P…