n8n

How to Connect Baserow and OpenAI for PDF Data Entry?

Turn PDF files into structured table data without manual typing. When someone updates a row or adds a new field, the system reads the PDF and fills the right cells based on simple prompts. Ideal for teams that collect forms, contracts, or reports in Baserow.

An incoming webhook listens for Baserow events like row updates and field changes. The flow pulls the table schema to read field descriptions, which act as prompts. It downloads the file from the row, extracts the PDF text, and sends the text with each prompt to OpenAI to generate answers. Only empty or changed fields are processed, and updates run one row at a time for fast feedback. There are two paths. One path updates only the affected rows, and the other can refresh all rows when a field is created or changed.

To run it, you need a Baserow table with a file column and clear descriptions on the fields you want filled. Connect your Baserow API token and OpenAI key, publish the webhook, then point Baserow events to it. Expect less data entry, quicker turnarounds, and consistent values across many rows.

What are the key features?

  • Webhook listener captures Baserow row updates and field changes.
  • Switch router separates row events from field events to limit or expand updates.
  • Table schema fetch reads field descriptions and treats them as dynamic prompts.
  • List and filter steps pull only valid rows and skip items without a file.
  • PDF extractor converts uploaded documents into text for the model.
  • OpenAI node generates values per field from the PDF context and prompts.
  • Split in batches processes one row at a time for faster visible updates.
  • HTTP PATCH writes results back to the same rows using user field names.

What are the benefits?

  • Reduce manual work from hours per batch to minutes
  • Automate up to 80 percent of document data entry
  • Improve data quality by 30 percent by using clear prompts and skipping filled cells
  • Handle thousands of rows with controlled batch updates
  • Connect Baserow and OpenAI in one flow with no context switching
  • Add new fields without rebuilding. Field descriptions become prompts

How do you set it up?

  1. Import the template into n8n: Create a new workflow in n8n > Click the three dots menu > Select 'Import from File' > Choose the downloaded JSON file.
  2. You'll need accounts with Baserow and OpenAI. See the Tools Required section above for links to create accounts with these services.
  3. In your Baserow account, create a personal API token with access to the target database and table. Copy the token.
  4. In the n8n credentials manager, create a new HTTP Header Auth credential for Baserow. Add a header named Authorization with the value Token your_api_token. Save the credential.
  5. In your OpenAI account, create an API key from the API settings page. In n8n, create a new OpenAI credential and paste the key. Save the credential.
  6. Open the HTTP Request nodes that call Baserow, such as Table Fields API, List Table API, Get Row, and Update Row. In the Credential to connect with field, select the Baserow credential you created.
  7. Open the OpenAI Chat Model nodes. Choose your model, set temperature if needed, and select your OpenAI credential.
  8. Ensure your Baserow table has a file column. The workflow expects a field named File. If your column name is different, update the Get File Data nodes to match your field name.
  9. In Baserow, write clear prompts in the Description of each field you want the model to fill. Short and direct prompts work best.
  10. Publish the n8n workflow. Copy the Production URL from the Webhook node and keep it handy.
  11. In your Baserow table settings, add a webhook that sends row_updated, field_created, and field_updated events to the n8n Webhook URL. Save the webhook.
  12. Test the setup by uploading a PDF to the File column on a row or by editing a field description. Check n8n executions for results and confirm the row values update.
  13. Troubleshoot common issues. If no updates occur, verify the file field name mapping, run the Table Fields API node to confirm credentials, check OpenAI quota, and ensure your PDFs are not too large. Adjust pagination settings if you have very large tables.

Tools Required

$24 / mo or $20 / mo billed annually to use n8n in the cloud. However, the local or self-hosted n8n Community Edition is free.

Baserow

Sign up

Cloud Free plan: $0 / mo; API access via database tokens; 3,000 rows/workspace and 2GB storage/workspace

OpenAI

Sign up

Pay-as-you-go: GPT-5 at $1.25 per 1M input tokens and $10 per 1M output tokens

Credits:
YouTube video

Similar Templates

Join Futurise to access 1,200+ automation templates

Get instant access to ready-made automation workflows for n8n, Make.com, AI agents, and more. Download, customise, and deploy in minutes.