Build a clean research library from any website sitemap and save it as markdown files in Google Drive. Ideal for content teams that need quick audits, topic research, or backups without manual copy and paste.
Here is how it works. You start the run manually. A sitemap URL is set, then a web request pulls the XML and converts it to JSON. The list of pages is split out, optional filters keep only the pages you care about, and a limit controls how many links to process per run. Each URL is fetched through a readable page endpoint, a small code step extracts the title and markdown, and each page is stored as a text file in Google Drive. A wait step can slow the loop to respect rate limits.
Setup is simple. You only need a Google Drive connection in n8n Cloud, a valid sitemap URL, and a folder to store files. Expect faster content reviews, organized files named by page title, and less time spent copying text from the browser. Great for SEO snapshots, competitor scans, and internal documentation backups.