Firecrawl
Firecrawl turns websites into clean, LLM-ready markdown — a powerful scraping companion for AI workflows and RAG pipelines built in n8n.
Some links on n8n.school are affiliate links. If you sign up or purchase through them, we may earn a commission at no extra cost to you. We only recommend tools we genuinely believe help you automate better.
Best use case with n8n
Scraping and crawling websites into clean markdown for AI workflows — feeding RAG pipelines, research agents, and content monitors in n8n.
Firecrawl makes web data AI-ready. Instead of wrestling with messy HTML, you get clean markdown (or structured JSON) you can drop straight into an LLM. In n8n, it's the natural front-end for research agents and RAG pipelines.
The four core endpoints
| Endpoint | What it does | Use it for |
|---|---|---|
| Scrape | One URL → clean markdown/JSON | Single pages, articles |
| Crawl | Follows links across a site | Whole docs sites, blogs |
| Map | Lists all URLs on a domain | Discovery before crawling |
| Extract | LLM pulls structured fields | Prices, contacts, specs |
Why it's great for n8n AI workflows
A typical Retrieval-Augmented Generation (RAG) pipeline in n8n:
- Firecrawl crawls your docs/site → clean markdown.
- n8n chunks the text and creates embeddings with OpenAI.
- Vectors are stored in Supabase (pgvector) or Pinecone.
- An AI agent retrieves relevant chunks to answer questions accurately.
Map then crawl
Use Map first to see every URL on a domain, filter to the sections you care about, then Crawl only those — it saves credits and keeps your RAG index clean.
Using it in n8n
Firecrawl has a clean REST API you call with n8n's HTTP Request node (or the community Firecrawl node). Send a URL, get back markdown — ready to embed or summarize.
Scrape responsibly
Always honor robots.txt and the terms of service of the sites you crawl, and respect rate limits.
Firecrawl vs Apify
- Firecrawl — best for clean, LLM-ready output from arbitrary pages and AI/RAG ingestion.
- Apify — best for pre-built, site-specific scrapers and massive, scheduled scraping jobs with proxy rotation.
Feed clean web data to your AI workflows
Firecrawl turns any site into LLM-ready markdown for n8n.
Affiliate link — we may earn a commission at no extra cost to you.
Key features
- Scrape — single URL to clean markdown or structured JSON
- Crawl — follow links across an entire site
- Map — discover every URL on a domain fast
- Extract — LLM-powered structured data extraction
- Handles JavaScript-rendered pages & PDFs
- Simple REST API for n8n's HTTP Request node
Pros & cons
Pros
- LLM-ready markdown output out of the box
- Handles modern, JS-heavy sites and PDFs
- Built specifically for AI/RAG ingestion
- Clean API that pairs with n8n AI nodes
Cons
- Credit usage scales with pages crawled
- Less suited to massive site-specific scraping (use Apify)
- Respect target sites' terms of service
Alternatives
Apify is a web scraping and automation platform with thousands of ready-made 'Actors' that plug into n8n workflows for data extraction at scale.
Pricing: Free $5 monthly usage credit; Starter ~$39/mo; Scale ~$199/mo; pay for compute + proxies