FireCrawl - Distributed Web Crawling System

FireCrawl is a powerful API service that takes URLs, crawls them, and converts them into clean markdown or structured data. It crawls all accessible subpages and provides clean data for each, with no sitemap required.

Features

Scraping: Scrapes URLs and gets content in LLM-ready format (markdown, structured data, screenshot, html)
Crawling: Scrapes all URLs of a web page and returns content in LLM-ready format
Mapping: Input a website and get all website URLs - extremely fast
Extraction: Get structured data from single page, multiple pages or entire websites with AI

Services

This template deploys the following services:

1. Playwright Service

Handles web scraping and browser automation
Supports proxy configuration
Can block media content for faster scraping

2. API Service

Main API endpoint handling requests
Manages job queues and processing
Integrates with Redis for caching and rate limiting

3. Worker Service

Processes background jobs
Handles queue tasks
Manages crawling operations

4. Redis

Handles caching and rate limiting
Manages job queues
Provides data persistence

Getting Started

Deploy this template on Zeabur
Configure the required environment variables
Access your API endpoint and start crawling!

For more detailed documentation, visit FireCrawl Documentation

Issues

For questions and support, please visit the GitHub Issues

FireCrawl

A distributed web crawling system with Playwright service

FireCrawl - Distributed Web Crawling System

Features

Services

1. Playwright Service

2. API Service

3. Worker Service

4. Redis

Getting Started

Issues

模版内的服务

playwright-service

api

worker

redis