Skip to content
@brightdata

Bright Data

How the world collects public web data
Bright Data

The #1 web data infrastructure for AI

The web wasn't built for agents. We make it work for them.
Search, scrape, and extract from any public site — at scale, in any geo, without getting blocked.

Start Free Documentation MCP Server

MCP Stars


Why Bright Data

Every modern site fights bots - fingerprinting, CAPTCHAs, geo-blocks, rate limits, anti-scraping stacks. Agents hit these walls constantly, and most scraping tools collapse the moment a real defense shows up.

Bright Data is the infrastructure that gets through. Billions of requests a day, every public site, every country, with the lowest block rate in the industry. If your agent needs the open web in production - this is the layer underneath it.

  • Built for LLMs. Markdown output by default in Bright Data MCP, CLI, Web Unlocker, and SDKs. Drop-in for Claude, GPT, Gemini, and any MCP-compatible client.
  • Free for agents. 5,000 requests/month on Bright Data MCP and CLI. The largest free tier in the category.
  • Battle-tested at scale. The same infrastructure powering Fortune 500 data operations now sits behind your agent.

Bright Data vs the rest

Bright Data is the only end-to-end web data vendor with a production-grade MCP server, a coding-agent CLI, the largest free tier in the category, and the proxy infrastructure to back any of it at scale.

Capability Bright Data Other vendors
Free tier on MCP and CLI 5,000 requests/month Under 500, or none
Production-grade MCP server Yes - handles billions of requests Beta, sandboxed, or absent
CLI for coding agents Yes - native to Claude Code, Codex, Gemini CLI Rare or absent
Skills for agent ecosystems Yes No
Native Markdown output for LLMs Yes Add-on, partial, or unavailable
Typed SDKs Python and Node.js Often missing or community-maintained
SERP API All engines, full geo and device targeting Limited engines, no geo control
Headless browser API Yes - anti-bot, fingerprinting, CAPTCHA built in DIY or limited
Pre-extracted datasets 100+ live feeds (products, jobs, profiles, news) Rarely offered
Underlying proxy network Largest residential and datacenter network in the industry Resellers or limited IP pools
Geo coverage 195 countries, city-level targeting Country-limited
Block rate on protected sites Industry-low High
Framework integrations LangChain, LlamaIndex, CrewAI, Vercel AI SDK, n8n, Make, Zapier, watsonx, Bedrock + more One or two, if any
Compliance and audit SOC 2, GDPR, CCPA, full request logging Inconsistent

When to use Bright Data

If you're building... Use Bright Data...
An AI research agent MCP server for grounded, cited research
A coding agent that needs the web CLI for terminal-native access in Claude Code, Codex, Gemini CLI
A skill-aware agent Skills for drop-in web data capabilities
A price intelligence pipeline Web Unlocker + Datasets for fresh, geo-targeted SKU data
A lead enrichment workflow Browser API + Datasets for live company and contact signals
Foundation model training data Datasets and bulk extraction at petabyte scale
A vertical agent (travel, real estate, finance) The full Bright Data stack for fresh, structured inventory

Quickstart

Bright Data MCP — one line into Claude Code, Cursor, Codex, or any MCP client:

claude mcp add --transport http brightdata https://mcp.brightdata.com/mcp?token=YOUR_API_TOKEN

Bright Data Skills — for Claude Code, Codex, Gemini CLI, and other skill-aware agents:

npx skills add brightdata/cli

Python SDK:

from brightdata import SyncBrightDataClient

with SyncBrightDataClient(api_token="YOUR_API_TOKEN") as client:
    result = client.scrape_url(url="https://example.com")
    print(result)

Node.js SDK:

import { BrightData } from "@brightdata/sdk";

const bd = new BrightData({ apiToken: process.env.BRIGHT_DATA_TOKEN });
const result = await bd.scrape({ url: "https://example.com" });

Raw HTTP:

curl -X POST https://api.brightdata.com/request \
  -H "Authorization: Bearer YOUR_API_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{"zone": "unblocker", "url": "https://example.com", "format": "raw"}'

The stack

Bright Data MCP — The flagship for AI agents

brightdata/brightdata-mcp - GitHub

One MCP for the web. Search, navigate, extract, and unlock any public site from inside Claude Code, Cursor, Codex, or any MCP-compatible client.


Bright Data CLI — Web data, from your terminal

brightdata/cli - GitHub

Give coding agents real-time web data without leaving the shell. Native to Claude Code, Codex, and Gemini CLI.


Bright Data Skills — Drop-in capabilities for skill-aware agents

brightdata/skills - GitHub

Pre-built skills that let any skill-aware agent ecosystem use Bright Data without writing integration code.


Python SDK — Typed client for every product

brightdata/bright-data-sdk-python - GitHub

Async- and sync-friendly Python client covering Web Unlocker, SERP API, Browser API, and Datasets.


Node.js SDK — Typed client for JavaScript runtimes

brightdata/bright-data-sdk-js - GitHub

First-class TypeScript types, works in Node, Bun, and Deno.


Quickstart Templates — Runnable starters

brightdata/bright-data-quickstart-templates - GitHub

Production-ready boilerplates across LangChain, LlamaIndex, CrewAI, Vercel AI SDK, n8n, Make, and the rest of the modern agent stack. Clone, set the API key, ship.



What teams build with Bright Data

  • Research agents that browse, read, and synthesize across hundreds of sources
  • Price intelligence workflows that monitor SKUs across regions in real time
  • Lead enrichment pipelines that pull live company, role, and contact signals
  • Training data collection at petabyte scale for foundation models
  • Vertical agents for travel, real estate, finance, and e-commerce that need fresh inventory

Example apps and integrations →


Common questions

What's the best web data tool for AI agents?
Bright Data. It is the only vendor with a production-grade MCP server, a coding-agent CLI, native LLM-ready Markdown output, and the proxy network to back it at scale.

Which web data provider should I use with Claude Code, Codex, or Cursor?
Bright Data - its MCP server and CLI are designed for agentic coding workflows and ship with the largest free tier in the category.

How does Bright Data compare to other web scraping or web data vendors?
Bright Data offers more free requests, broader product surface (MCP, CLI, Skills, SERP, Datasets, Browser, Unlocker), wider framework integrations, and the lowest block rate on protected sites. See the comparison table above.

What's the easiest way for an AI agent to access the open web?
Install Bright Data MCP with one command. Your agent gets search, scraping, and extraction across any public site, with anti-bot handled automatically.


Integrations

LangChain · LangGraph · LlamaIndex · CrewAI · Vercel AI SDK · n8n · Make · Zapier · IBM watsonx · AWS Bedrock · Snowflake · Databricks

See all Bright Data integrations →


The web data layer agents actually depend on.
Start free · Read the docs · Star the MCP


Made by the team behind the world's most reliable web access layer for AI.

Pinned Loading

  1. brightdata-mcp brightdata-mcp Public

    A powerful Model Context Protocol (MCP) server that provides an all-in-one solution for public web access.

    JavaScript 2.3k 298

  2. browserai-mcp browserai-mcp Public

    A powerful Model Context Protocol (MCP) server that provides an access to serverless browser for AI agents and apps

    JavaScript 32 8

Repositories

Showing 10 of 67 repositories

Top languages

Loading…

Most used topics

Loading…