crawl4ai

crawl4ai

官方网站https://crawl4ai.com
GIT地址https://github.com/unclecode/crawl4ai
GIT Star数26958
开发语言Python
GIT信息最后更新日期2025/01/24 01:42
许可Apache-2.0
简介Crawl4AI 是当前 GitHub 上排名第一的趋势仓库,由一个充满活力的社区积极维护。它提供了超高速的、适用于大语言模型(LLM)、AI 代理和数据管道的 AI 准备型网页爬取能力。

安装手册

official website readme

🐳 Option 1: Docker Hub (Recommended)

Choose the appropriate image based on your platform and needs:

For AMD64 (Regular Linux/Windows):

# Basic version (recommended)
docker pull unclecode/crawl4ai:basic-amd64
docker run -p 11235:11235 unclecode/crawl4ai:basic-amd64

# Full ML/LLM support
docker pull unclecode/crawl4ai:all-amd64
docker run -p 11235:11235 unclecode/crawl4ai:all-amd64

# With GPU support
docker pull unclecode/crawl4ai:gpu-amd64
docker run -p 11235:11235 unclecode/crawl4ai:gpu-amd64

For ARM64 (M1/M2 Macs, ARM servers):

# Basic version (recommended)
docker pull unclecode/crawl4ai:basic-arm64
docker run -p 11235:11235 unclecode/crawl4ai:basic-arm64

# Full ML/LLM support
docker pull unclecode/crawl4ai:all-arm64
docker run -p 11235:11235 unclecode/crawl4ai:all-arm64

# With GPU support
docker pull unclecode/crawl4ai:gpu-arm64
docker run -p 11235:11235 unclecode/crawl4ai:gpu-arm64

Need more memory? Add --shm-size:

docker run --shm-size=2gb -p 11235:11235 unclecode/crawl4ai:basic-amd64