Tag feed

#web-scraping

1,080 posts102 followers

Explore Hashnode

Alternatives

Trending tags this week

PUProxy Universeproxyuniverse.hashnode.dev1d ago · 4 min read

Stop Finding Out From a Timeout: Monitor Your Proxy Provider's Uptime in ~40 Lines of Python

Subtitle: After the 2026 outage wave (9Proxy, IP2World, PIA S5), I refuse to learn about provider downtime from a failed job. Here's the tiny health-check monitor that pings me on Telegram before my p

0

RRoamProxyroamproxy.hashnode.dev1d ago · 4 min read

Playwright + Proxies: Sticky Sessions, Rotation, and the SOCKS5 Trap

You wired a proxy into requests in one line. Then you tried the same thing with Playwright and got a soup of ERR_TUNNEL_CONNECTION_FAILED, mystery CAPTCHAs, and a bandwidth bill that made no sense. Be

0

CTChandler Thompsonchandlerthompson.hashnode.dev2d ago · 4 min read

The state of web scraping in the AI era: still relevant or a relic?

Every few months someone declares web scraping dead. APIs cover more ground than they did, and the models can read a page and hand you structured data, so why keep writing parsers? I had the same thou

0

RRoamProxyroamproxy.hashnode.dev3d ago · 3 min read

We Sent 10,000 Requests Through a Residential Proxy Network. Here's the Raw Data.

Most proxy benchmarks you'll find are vendor marketing: a success rate with no methodology, no failure counts, no raw data. We run a residential proxy network, and we wanted numbers we could actually

0

RRoamProxyroamproxy.hashnode.dev5d ago · 4 min read

How to Avoid Getting Blocked While Web Scraping: 7 Rules That Actually Matter

Most "how to not get blocked" advice is a list of 30 tips where 25 barely move the needle. After running scrapers in production for a while, here are the seven that account for nearly all of the diffe

0

ATAethyn Teamaethyn-io.hashnode.devJul 12 · 7 min read

One Identity Per Task: Keeping an AI Agent's Browser Session Coherent

Most proxy advice was written for scrapers, and if you copy it into an agent you will break your agent in a way that's hard to debug. The advice goes: rotate your IP on every request. For a scraper, t

1

O

RRoamProxyroamproxy.hashnode.devJul 12 · 3 min read

Rotating vs. Sticky Proxies: A Practical Guide for Web Scraping

When you route a scraper through a proxy network, one decision quietly determines whether your job succeeds or gets blocked in the first ten minutes: should each request get a fresh IP, or should a ba

0

ATAethyn Teamaethyn-io.hashnode.devJul 10 · 4 min read

Why Your Scraper Works Locally but Dies in Production (and How to Actually Fix It)

Same code, same site — green on your laptop, 403 on AWS. It's almost never your code. It's your network. You built a scraper. It runs beautifully on your laptop. You deploy it to AWS / GCP / a VPS,

1

O

TWTony Wangcrawlora.hashnode.devJul 10 · 9 min read

AI vs Traditional Web Scraping: Which Wins, When

Key takeaways Traditional scraping (CSS/XPath selectors) is fast, cheap, and near-100% accurate on stable pages — but brittle: by one industry estimate, 10–15% of crawlers need maintenance every week

1

O

TWTony Wangcrawlora.hashnode.devJul 10 · 11 min read

Best SERP APIs in 2026 for Rank Tracking and Search Data

Key takeaways Google has no official SERP API — the Custom Search JSON API closed to new customers in 2025 and shuts down January 1, 2027 — so production rank tracking runs on third-party SERP APIs.

0

#web-scraping

Search Hashnode

#web-scraping

Explore Hashnode

Trending tags this week

Stop Finding Out From a Timeout: Monitor Your Proxy Provider's Uptime in ~40 Lines of Python

Playwright + Proxies: Sticky Sessions, Rotation, and the SOCKS5 Trap

The state of web scraping in the AI era: still relevant or a relic?

We Sent 10,000 Requests Through a Residential Proxy Network. Here's the Raw Data.

How to Avoid Getting Blocked While Web Scraping: 7 Rules That Actually Matter

One Identity Per Task: Keeping an AI Agent's Browser Session Coherent

Rotating vs. Sticky Proxies: A Practical Guide for Web Scraping

Why Your Scraper Works Locally but Dies in Production (and How to Actually Fix It)

AI vs Traditional Web Scraping: Which Wins, When

Best SERP APIs in 2026 for Rank Tracking and Search Data