Turn a site crawl export into a clean text catalog that highlights the best pages for AI and SEO work. Ideal for content and SEO teams that want a fast way to prepare curated links with titles and short descriptions.
The flow starts with a simple form where you enter the site name, a short summary, and upload a CSV from your crawler. The file is parsed, then mapped to seven key fields like URL, title, description, status, indexability, content type, and word count. A filter keeps only pages that return 200, are indexable, and are text HTML. You can also enable an AI step with OpenAI to classify pages as useful content or other content. Each page is formatted as a simple line, then all lines are combined and saved as a downloadable text file. You can swap the last node to upload the file to cloud storage.
Use a CSV export that includes internal URLs, ideally the internal HTML version. The mapping handles multiple languages, so non English exports still work. Expect big time savings by moving from manual sorting to a guided flow. Teams can build a clean list in minutes and reuse the same steps for new sites or larger crawls.