DEV Community

Scraping

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Regex broke my scraper: Using LLMs for robust data extraction

Regex broke my scraper: Using LLMs for robust data extraction

2
Comments
5 min read
I Built an API That Turns Any Website Into JSON Using Just CSS Selectors

I Built an API That Turns Any Website Into JSON Using Just CSS Selectors

Comments
2 min read
Rate Limits & Anti-Bots in Agentic Scraping

Rate Limits & Anti-Bots in Agentic Scraping

1
Comments
5 min read
The Anti-Bot Detection Checklist I Use Before Every Scraping Project

The Anti-Bot Detection Checklist I Use Before Every Scraping Project

Comments
4 min read
My Apify Promotion Filter: Scale Clean APIs, Hold Back Noisy Demand

My Apify Promotion Filter: Scale Clean APIs, Hold Back Noisy Demand

Comments
3 min read
My web scraping nightmare ended when I let an LLM read the HTML

My web scraping nightmare ended when I let an LLM read the HTML

Comments
5 min read
I Thought I Knew Web Scraping — Until I Hit JavaScript

I Thought I Knew Web Scraping — Until I Hit JavaScript

Comments
4 min read
Why I Gave Up on Regex and Started Using AI for Web Scraping

Why I Gave Up on Regex and Started Using AI for Web Scraping

Comments
5 min read
I Spent a Weekend Fighting Flaky Scrapers — Here’s What Finally Worked

I Spent a Weekend Fighting Flaky Scrapers — Here’s What Finally Worked

Comments
5 min read
Advanced Headless Browser Anti-Bot Techniques: TLS & Canvas

Advanced Headless Browser Anti-Bot Techniques: TLS & Canvas

Comments
6 min read
Optimizing Chunking and Data Extraction for Zero-Hallucination RAG

Optimizing Chunking and Data Extraction for Zero-Hallucination RAG

Comments
4 min read
Scraping Real Estate & Job Data with Python: Zillow, Indeed & More (2026)

Scraping Real Estate & Job Data with Python: Zillow, Indeed & More (2026)

Comments
14 min read
Track YC Demo Day Companies in Real Time (with code)

Track YC Demo Day Companies in Real Time (with code)

Comments
5 min read
I spent 3 days scraping a site until I tried LLMs for data extraction

I spent 3 days scraping a site until I tried LLMs for data extraction

2
Comments 2
6 min read
Architecture of a Rental Aggregator: Scraping and Normalizing 90+ Sources

Architecture of a Rental Aggregator: Scraping and Normalizing 90+ Sources

Comments
4 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.