Collect website content on demand and send it to your team or system. An AI agent picks the matching scraping tool, pulls the page as markdown or HTML, and saves a file copy for your records. It suits marketing research, SEO checks, and content operations.
You start the run with a manual trigger. The flow sets a URL and an output format, keeps short-term memory, and uses Google Gemini to interpret the request text. The agent then calls Bright Data tools through MCP to scrape the page as markdown or HTML. The result is posted to a webhook endpoint in a field named response, and the same content is written to disk as a file. A second path shows a direct tool call without the agent, which is useful for testing and learning.
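The delivery step described above (one webhook post with a response field, plus a file copy) can be sketched outside n8n as follows. This is a minimal illustration, not the workflow's actual node configuration: the function names, the file stem, and the extension mapping are hypothetical, and the webhook URL is a placeholder you would replace with your own.

```python
import json
import tempfile
import urllib.request
from pathlib import Path

# Hypothetical mapping from the chosen format to a file extension.
EXTENSIONS = {"markdown": ".md", "html": ".html"}


def build_payload(content: str) -> dict:
    """Wrap scraped content in the single `response` field the webhook receives."""
    return {"response": content}


def save_copy(content: str, fmt: str, directory: Path, stem: str = "scrape") -> Path:
    """Write the same content to disk; the file name is an illustrative choice."""
    if fmt not in EXTENSIONS:
        raise ValueError(f"unsupported format: {fmt!r}")
    directory.mkdir(parents=True, exist_ok=True)
    path = directory / f"{stem}{EXTENSIONS[fmt]}"
    path.write_text(content, encoding="utf-8")
    return path


def post_to_webhook(webhook_url: str, payload: dict, timeout: float = 10.0) -> int:
    """POST the payload as JSON and return the HTTP status code."""
    request = urllib.request.Request(
        webhook_url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=timeout) as reply:
        return reply.status


if __name__ == "__main__":
    scraped = "# Example page\n\nBody text."  # stand-in for the agent's output
    with tempfile.TemporaryDirectory() as tmp:
        saved = save_copy(scraped, "markdown", Path(tmp))
        print(saved.name, json.dumps(build_payload(scraped)))
    # post_to_webhook("<your webhook URL>", build_payload(scraped))  # needs a real endpoint
```

Keeping the payload builder separate from the network call makes it easy to check the response field and the saved file before pointing the flow at a live endpoint.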
Setup requires a self-hosted n8n instance with the community MCP client node, a Google Gemini API key, and a unique webhook URL. Expect faster collection, fewer copy-paste errors, and a clear audit trail of each pull. Use it to capture competitor pages, refresh content briefs, or archive landing pages for review. You can switch formats to suit your next step, such as sending markdown to writers or HTML to an internal parser.
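Switching formats comes down to calling a different scraping tool. The sketch below shows how a format choice could map to an MCP tools/call request in JSON-RPC form. The tool names scrape_as_markdown and scrape_as_html and the url argument are assumptions, so confirm them against the tool list your MCP client reports. Inside n8n, the MCP client node builds this request for you.

```python
import json

# Assumed tool names; confirm them with the MCP client's "list tools" output.
TOOL_BY_FORMAT = {"markdown": "scrape_as_markdown", "html": "scrape_as_html"}


def build_tool_call(url: str, fmt: str, request_id: int = 1) -> dict:
    """Build a JSON-RPC 2.0 `tools/call` request for the chosen format."""
    try:
        tool = TOOL_BY_FORMAT[fmt.lower()]
    except KeyError:
        raise ValueError(f"unsupported format: {fmt!r}") from None
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": tool, "arguments": {"url": url}},
    }


if __name__ == "__main__":
    print(json.dumps(build_tool_call("https://example.com", "markdown"), indent=2))
```

A lookup table keeps the format decision in one place, so adding another output format later means adding one entry.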