Product

Structured Web Data, Simplified: Bright Data’s Web Scraper Extension Lands on Dify Marketplace

Dify has partnered with Bright Data to launch the new Bright Data Web Scraper plugin, enabling direct access to real-time, structured web data through the Dify Marketplace and enriching the Knowledge Pipeline with external knowledge sources.

Aileen Li

Growth Marketer

Written on

Oct 21, 2025

We’re thrilled to welcome Bright Data as the newest partner in the Dify Marketplace! Known as the world’s leading web data infrastructure platform, Bright Data offers enterprise-grade solutions to access structured, real-time information from e-commerce sites, social media, and search engines. Its arrival enriches the Marketplace ecosystem, empowering enterprises to seamlessly bring external web data into the Knowledge Pipeline. Together, we enable builders ranging from individuals to enterprises to unlock richer knowledge sources and build more powerful, reliable agentic workflows.

Empower your knowledge pipeline with Extensions in Dify Marketplace

The Knowledge Pipeline is our newest RAG engineering workflow that makes the entire context-building path visible and controllable. Inheriting Dify Workflow's canvas, it turns fragmented, unstructured data — PDFs, PPTs, Excel, HTML, and more — into reliable, model-ready knowledge. Each step is a node, from source connection and document parsing to chunking and embedding, where builders can choose the right plugin for text, images, tables, or scans. 
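To make the "each step is a node" idea concrete, here is a minimal Python sketch of a parse, chunk, and embed chain. It is an illustration only and not Dify's implementation: the function names, the `Chunk` type, the word-count chunking rule, and the placeholder "embedding" are all assumptions made for this example.

```python
import re
from dataclasses import dataclass
from typing import List


@dataclass
class Chunk:
    text: str
    vector: List[float]


def parse_html(raw: str) -> str:
    """Parsing node (toy): drop tags and collapse whitespace."""
    no_tags = re.sub(r"<[^>]+>", " ", raw)
    return re.sub(r"\s+", " ", no_tags).strip()


def chunk_text(text: str, max_words: int = 5) -> List[str]:
    """Chunking node (toy): fixed-size windows of max_words words."""
    words = text.split()
    return [" ".join(words[i:i + max_words]) for i in range(0, len(words), max_words)]


def embed(text: str) -> List[float]:
    """Embedding node (placeholder): [length, vowel count], not a real model."""
    return [float(len(text)), float(sum(c in "aeiou" for c in text.lower()))]


def run_pipeline(raw: str) -> List[Chunk]:
    """Run the nodes in order: parse, then chunk, then embed each chunk."""
    parsed = parse_html(raw)
    return [Chunk(text=c, vector=embed(c)) for c in chunk_text(parsed)]


chunks = run_pipeline("<h1>Price update</h1><p>The widget now costs 19 dollars per unit.</p>")
for chunk in chunks:
    print(chunk.text, chunk.vector)
```

Because every stage is a separate function, any one of them can be swapped out (for example, a table-aware parser or a real embedding model) without touching the others, which is the property the node-based canvas is meant to provide.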

Since launch, the Dify Marketplace has been warmly embraced by builders worldwide. It has quickly grown into a thriving ecosystem, now hosting 500+ plugins — from Models and Tools to Agent Strategies, Extensions, and Data Sources. 

The Dify Marketplace empowers developers to build, customize, and scale innovative AI solutions with speed and flexibility. With it, builders can assemble knowledge pipelines like building blocks, enrich content with models, apply rule-based cleaning with code, and create transparent, tunable flows that solve RAG's toughest pain points: fragmented sources, parsing loss, and black-box processing.

Among these plugins, the Bright Data Web Scraper joins as a powerful new data source plugin, enabling your workflow to capture real-time, structured information from across the web and enrich your Knowledge Pipeline with fresh external knowledge.
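As a rough illustration of what "rule-based cleaning with code" can look like for scraped web text, the Python sketch below normalizes whitespace, drops lines that match a few boilerplate rules, and removes duplicates. The patterns and the `clean_lines` helper are hypothetical examples, not part of Dify or any plugin; real rules would depend on the pages being collected.

```python
import re
from typing import List

# Hypothetical boilerplate rules; adjust to the pages you actually collect.
BOILERPLATE_PATTERNS = [
    re.compile(r"^share to \w+", re.IGNORECASE),
    re.compile(r"^(accept|manage) cookies$", re.IGNORECASE),
]


def clean_lines(lines: List[str]) -> List[str]:
    """Normalize whitespace, drop boilerplate, and keep the first copy of duplicates."""
    cleaned: List[str] = []
    seen = set()
    for line in lines:
        text = re.sub(r"\s+", " ", line).strip()
        if not text:
            continue  # skip empty lines
        if any(p.search(text) for p in BOILERPLATE_PATTERNS):
            continue  # skip lines matching a boilerplate rule
        key = text.lower()
        if key in seen:
            continue  # skip case-insensitive duplicates
        seen.add(key)
        cleaned.append(text)
    return cleaned


sample = ["  Hello   world ", "Share to Twitter", "hello world", "", "Accept cookies", "Pricing: $19"]
print(clean_lines(sample))
```

Keeping the rules in a plain list makes the cleaning step easy to inspect and tune, in contrast to black-box processing.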

Quick Start: Integrate Bright Data with Dify

It’s easy to get started — follow these quick steps to connect Bright Data with Dify and start using real-time web data in your Knowledge Pipeline.

Step 1: Install the Extension

 Go to the Dify Marketplace and install the Bright Data Web Scraper plugin.

Step 2: Confirm Installation

 After installation, make sure the plugin appears in your Installed Extensions list.

Step 3: Set Up Your Bright Data Account

Sign in to your Bright Data account, copy your API key, and configure your data collection settings.

  1. Go to your Account Settings.

  2. In the API key section, click the Add API key button (top right).

  3. Set the user, permissions, and expiration date (or choose 'Unlimited'), then click Save.

  4. Your API key will be shown only once, so make sure to copy and save it securely.

*You can find more details in the Bright Data official documentation: How to generate an API key.
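Since the key is displayed only once, it helps to keep it out of source files and logs. The Python sketch below shows one common approach for your own scripts: read the key from an environment variable and mask it before printing. The variable name `BRIGHT_DATA_API_KEY` and both helper functions are assumptions made for this example; they are not defined by Bright Data or Dify.

```python
import os


def load_api_key(env_var: str = "BRIGHT_DATA_API_KEY") -> str:
    """Read the key from the environment; the variable name is a hypothetical choice."""
    key = os.environ.get(env_var, "").strip()
    if not key:
        raise RuntimeError(f"{env_var} is not set")
    return key


def mask(key: str, visible: int = 4) -> str:
    """Hide all but the last `visible` characters so the key is safe to log."""
    if len(key) <= visible:
        return "*" * len(key)
    return "*" * (len(key) - visible) + key[-visible:]


if __name__ == "__main__":
    # Demo value only; in practice, export the variable in your shell or secret manager.
    os.environ.setdefault("BRIGHT_DATA_API_KEY", "demo-key-1234")
    print(mask(load_api_key()))
```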

Step 4: Integrate Bright Data with Dify

In Dify’s Settings → Configuration, enter your Bright Data API key to authorize the connection between the two platforms.

Step 5: Build Your Knowledge Pipeline

You're all set!

Create a Knowledge Pipeline in Dify, add a Bright Data data source, and start collecting structured, real-time data to enrich your AI workflows.

About Bright Data

Bright Data is the global leader in web data collection, providing businesses and developers with powerful, reliable tools to access real-time, structured information from e-commerce sites, social media, search engines, and more. Its enterprise-grade scraping infrastructure and 72M+ proxy network enable customers to gather critical external data at scale while ensuring accuracy and compliance.

Trusted by over 20,000 organizations worldwide — including Fortune 500 companies, academic institutions, and startups — Bright Data powers data-driven decisions across industries from market research and e-commerce to finance and brand monitoring. For more information, visit: www.brightdata.com.

About Dify.AI

Dify.AI is revolutionizing AI-native application development by providing an open-source platform that simplifies the entire lifecycle of AI application creation, deployment, and management. With its extensible plugin ecosystem, Dify.AI enables developers and businesses to seamlessly integrate AI capabilities, customize workflows, and accelerate innovation. By lowering the barriers to AI adoption, Dify.AI empowers users to build intelligent applications with greater efficiency and flexibility.

Website | GitHub | Docs | X | Discord | LinkedIn | YouTube
