Turn bank statements into usable data without manual typing. The flow pulls a PDF from Google Drive, converts pages to images, transcribes each page to markdown with a vision model, and extracts only the deposit rows. It is built for finance teams that need fast, accurate data for reconciliation and reporting.
The run starts on click. A Google Drive node downloads the PDF. An HTTP request sends the file to a PDF service that returns a zip of images. The zip is extracted, images are listed and sorted by file name to keep page order, then resized to speed up AI processing. A language model converts each page image to markdown, the pages are combined, and an information extractor pulls structured deposit rows. This design handles scanned PDFs and messy layouts better than basic OCR.
Use a Google Drive account and a Google Gemini API key. For privacy, point the HTTP request to your own PDF service. Expect a clean table of deposits ready for review or export, saving hours of manual work. Ideal for monthly bookkeeping, cash posting, and audit support.