Need a fast way to find files and links on your site? This setup reads a sitemap URL, turns it into a clean list, and filters only the items you care about like PDFs. It helps marketing and SEO teams build content inventories and spot resources for campaigns without manual copy and paste.
Here is how it runs from start to finish. A manual trigger starts the run. A Set block holds one field called sitemapUrl. The HTTP Request block pulls the sitemap file from the web. The XML block converts the XML into JSON and normalizes the keys so they are easy to read. Split Out breaks the urlset.url array into single items. The Filter block returns only links that match your rules. Out of the box it focuses on .pdf links, which is great for asset audits.
Getting started is simple. Add your sitemap address to the Set block and adjust the Filter rules for your link patterns. Expect a big time savings on content audits and fewer mistakes from manual scans. Common uses include collecting gated resources, mapping old files for redirects, and tracking compliance documents across large sites.