Turn scattered API docs into clean, reusable schemas. This build researches each service, extracts real API operations, and outputs a structured schema file. It suits product and engineering teams that need fast vendor due diligence, partner onboarding, or an internal API catalog.
The run starts from a list of services in Google Sheets. For each service, it calls Apify to search the web for official documentation and scrape the matching pages. The scraped text is split into chunks, embedded with Google Gemini, and stored in Qdrant for fast similarity search. The flow then identifies each service’s products, retrieves the matching documentation from the vector store, and uses Gemini to extract endpoints and methods. Results are written back to Google Sheets. A code step then builds a custom JSON schema and uploads it to Google Drive. Event routing, batching, and wait steps coordinate the three stages (research, extraction, and schema generation) end to end.
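As a rough illustration of the schema-building code step, the sketch below groups flat extraction rows into a nested JSON document. It is not the template's actual code: the function name `build_schema`, the row fields (`resource`, `method`, `path`, `description`), the output layout, and the sample service `ExampleBilling` are all assumptions made for the example.

```python
import json
from collections import defaultdict


def build_schema(service, operations):
    """Group flat extraction rows into a nested schema (hypothetical layout)."""
    grouped = defaultdict(list)
    for op in operations:
        grouped[op["resource"]].append(
            {
                "method": op["method"].upper(),  # normalize "get" -> "GET"
                "path": op["path"],
                "description": op.get("description", ""),  # optional field
            }
        )
    return {
        "service": service,
        "resources": [
            {
                "name": name,
                # Sort so repeated runs produce a stable, diff-friendly file.
                "operations": sorted(ops, key=lambda o: (o["path"], o["method"])),
            }
            for name, ops in sorted(grouped.items())
        ],
    }


# Sample rows standing in for what the extraction stage writes to the sheet.
rows = [
    {"resource": "invoices", "method": "post", "path": "/invoices",
     "description": "Create an invoice"},
    {"resource": "invoices", "method": "get", "path": "/invoices",
     "description": "List invoices"},
    {"resource": "customers", "method": "get", "path": "/customers/{id}"},
]

schema = build_schema("ExampleBilling", rows)
print(json.dumps(schema, indent=2))
```

Sorting resources and operations keeps the uploaded file stable between runs, which makes it easier to compare schema versions for the same service.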
You will need Google Sheets and Drive access, an Apify token, a Google Gemini API key, and a Qdrant URL and API key. Expect research time to drop from hours to minutes per service, with more consistent results. The output is useful for partner evaluations, integration planning, and building a private API knowledge base. Start with a few services in the sheet, run a test, and confirm that each stage writes its results before scaling up.