n8n

How to Automate Bright Data Research Summaries?

Get fast research briefs from public web pages. The setup collects live data through a scrape, cleans the text, and creates a short, clear summary. It fits marketing teams that need quick insights for content and campaigns.

The run starts by clicking Test. A request is sent to Bright Data to launch a scrape with a target site and a prompt. The flow stores the snapshot id, waits, and checks status in a loop until data is ready or an error is found. Once ready, it downloads the JSON, turns messy HTML into plain text using an information extractor powered by Google Gemini, then builds a concise summary with a second Gemini model. Chunking is included to handle long text. The final output is posted to a webhook so other tools can catch it.

Add your Bright Data API key and Google Gemini credentials. Replace the sample webhook URL with your own endpoint. Expect research time to go from hours to minutes, with consistent summaries you can paste into briefs, ads, or reports. Good for competitor reviews, product comparisons, and quick topic overviews.

What are the key features?

  • On demand start with a manual trigger for controlled runs
  • Bright Data dataset trigger sends a POST request with url, prompt, and country
  • Snapshot id capture and loop with If and Wait nodes to poll until ready
  • Snapshot download with header authentication and a set timeout
  • HTML to text conversion using an information extractor powered by Google Gemini
  • Summarization chain uses a Gemini model to produce a clear, short brief
  • Recursive text splitter with overlap to handle long content safely
  • Webhook notification posts the final summary as a JSON field to your endpoint

What are the benefits?

  • Reduce manual research from 60 minutes to under 10 minutes per topic
  • Automate web data collection and status checks to cut 80% of repetitive work
  • Deliver consistent summaries by standardizing extraction and prompts
  • Connect Bright Data, Google Gemini, and your webhook without custom code
  • Handle more topics per day by running on demand with reliable polling

How do you set it up?

  1. Import the template into n8n: Create a new workflow in n8n > Click the three dots menu > Select 'Import from File' > Choose the downloaded JSON file.
  2. You'll need accounts with Bright Data, Google Gemini and Webhook.site. See the Tools Required section above for links to create accounts with these services.
  3. In the n8n credentials manager, create a Header Auth credential for Bright Data. If unsure, double click the Perplexity Search Request node, choose Credential to connect with, click Create new credential, and follow the on screen steps to add your API key as a header.
  4. Open the Perplexity Search Request node. Adjust the JSON body to set your target site and your prompt. Keep the method as POST and leave authentication as the header auth you created.
  5. Check the Set Snapshot Id node. Confirm the snapshot_id field maps from the Bright Data trigger response so later nodes can use it.
  6. Open the Check Snapshot Status node and keep the default endpoint from the template. Make sure it uses the same Bright Data header auth. This node should read the snapshot status using the stored id.
  7. Review the If and Wait nodes. The Wait node is set to 30 seconds. Increase this if your dataset takes longer to build or if you see rate limit errors.
  8. Configure Google Gemini credentials. Double click both Gemini nodes, choose Credential to connect with, click Create new credential for Google Gemini, and paste your API key from the Google AI API page.
  9. Update the Webhook Notifier node with your own URL. Replace the sample Webhook.site link with a URL from your app, Zapier, Make, or your own webhook server.
  10. Click Test workflow. Watch the execution log. Confirm the snapshot status turns ready, the JSON is downloaded, text is extracted and summarized, and the response arrives at your webhook endpoint.
  11. Troubleshoot if needed. If no snapshot_id appears, check Bright Data credentials and request body. If runs loop forever, increase the wait time or verify the If conditions. If text is too long, adjust the text splitter overlap or reduce the prompt scope.

Tools Required

$24 / mo or $20 / mo billed annually to use n8n in the cloud. However, the local or self-hosted n8n Community Edition is free.

Bright Data

Sign up

Pay as you go: $1.5 per 1K records (Web/LinkedIn Scraper API)

Google Gemini

Sign up

Free tier: $0 via Gemini API; e.g., Gemini 2.5 Flash-Lite free limits 1,000 requests/day (15 RPM, 250k TPM). Paid from $0.10/1M input tokens and $0.40/1M output tokens.

Webhook.site

Sign up

Free tier: $0, public API available; free URLs expire after 7 days and accept up to 100 requests

Similar Templates

Join Futurise to access 1,200+ automation templates

Get instant access to ready-made automation workflows for n8n, Make.com, AI agents, and more. Download, customise, and deploy in minutes.