Extract text from PDF and image using Vertex AI (Gemini) into CSV

Target Audience

Finance Professionals: Those who need to extract transaction data from bank statements or invoices.
- Data Analysts: Individuals looking to automate the extraction of data from various document formats.
- Small Business Owners: Entrepreneurs who want to streamline their financial reporting and record-keeping processes.
- Developers: Tech-savvy users who wish to integrate AI capabilities into their applications for document processing.
- Students and Researchers: Anyone needing to analyze financial documents for projects or studies.

Problem Solved

This workflow automates the extraction of text from PDFs and images, converting the data into a structured CSV format. It eliminates the need for manual data entry, saving time and reducing errors in financial documentation. Users can quickly categorize transactions, making data analysis more efficient.

Workflow Steps

Step 1: Trigger the Workflow - The workflow starts when a new PDF or image file is uploaded to a specified Google Drive folder.
- Step 2: Identify File Type - The workflow determines whether the uploaded file is a PDF or an image using a routing node.
- Step 3: Download File - Depending on the file type, the workflow downloads the relevant file from Google Drive.
- Step 4: Extract Data - For PDFs, it extracts text data using a dedicated extraction node. For images, it sends the image to Vertex AI for text recognition.
- Step 5: Process Data with AI - The extracted text data is sent to an AI model for processing, where it categorizes transactions and formats them into CSV data.
- Step 6: Convert to CSV Format - The structured data is converted into CSV format for easy use in spreadsheets or databases.
- Step 7: Upload CSV to Google Drive - Finally, the generated CSV file is uploaded back to a specified Google Drive folder for storage and access.

Customization Guide

Changing the Google Drive Folder: Users can modify the folder ID in the Get PDF or Images node to point to a different location in their Google Drive.
- Adjusting AI Model Parameters: Users can customize the AI model used in the Send data to A.I. node by changing the model name or parameters to fit their specific needs.
- Modifying CSV Output: The output format in the Convert to CSV nodes can be adjusted to include additional columns or change the headers based on user requirements.
- Adding More File Types: Users can extend the workflow to handle additional file types by modifying the Route based on PDF or Image node to include more rules.