Google Drive to Vector Database RAG Workflow in 5 Minutes

This is a plug-and-play workflow that allows you to call it from any of your main workflows to build your RAG. You can use it multiple times, allowing you to build multiple vector tables, whilst still linking them all together. Just point it to a Google Drive and it recursively pulls everything down into a vector database categorising all your data. It supports most files including images and can extract table data from images. It takes 5 minutes to set up.

I sometimes refer to the database as a RAG in the video, but to be technically precise: it’s a vector database (or RAG database) (and the table database together). The RAG is actually the entire process - retrieving information from the database AND using it to generate responses with the AI agent.

It’s a long video sorry, so I’ve broken it down into chapters so you can just jump through sections.

:fire: What it Does

Point it at ANY Google Drive folder → and It automatically:

  • :magnifying_glass_tilted_left: Recursively scans ALL subfolders and files
  • :page_facing_up: Intelligently processes PDFs, Google Docs, Sheets, even IMAGES
  • :brain: Creates vector embeddings using OpenAI
  • :card_index_dividers: Auto-categorizes everything based on folder structure
  • :high_voltage: Detects changes and only updates what’s new/modified
  • :robot: Includes a basic AI agent with connection settings so that it can answer questions from your data

Result? A fully searchable AI knowledge base that knows EXACTLY where every piece of information came from!

:gem_stone: KEY HIGHLIGHTS

:video_game: Plug & Play

  • Zero configuration headaches - just update a few variables and you’re LIVE
  • Works with ANY Google Drive structure - your folders become categories automatically
  • Drop into any workflow - designed as a reusable sub-workflow MULTIPLE times

:brain: AI-Powered Content Processing

  • Multi-format support: PDFs, Google Docs, Sheets, Images
  • GPT-4 Vision extracts text from images and screenshots
  • Smart Excel processing converts tables into searchable text
  • Maintains source attribution for every piece of information

:bar_chart: Hierarchical Organization System

  • 4-level categorization automatically built from folder structure
  • Parent categories let you connect multiple workflows/vector tables
  • Relationship mapping between documents and categories
  • Cross-workflow vector table relationships

:high_voltage: Smart Change Detection

  • Only processes new/modified files - saves time and API costs
  • Automatic cleanup of deleted files from database
  • Folder rename detection maintains data integrity
  • Recursive lookups through entire folder structures

:bullseye: USE CASES

:books: Educational Institutions: Transform course materials into AI tutors, generate assignments, mark assignments and more.

:office_building: Corporate Knowledge: Make company documents instantly searchable such as case studies, policies. Cross reference documents related to HR, Finance, IT etc.

:open_book: Research Teams: Turn research papers into queryable databases

:briefcase: Consultancies: Create client-specific knowledge bases

:hospital: Healthcare: Organize medical documents with AI-powered retrieval

:rocket: GET STARTED IN 5 MINUTES

  1. Import the workflow (JSON attached)
  2. Update ‘Set Database Names’ node
  3. Run the setup nodes (creates tables automatically)
  4. Update 4 variables in “Set Globals” (Google Drive folder ID, Database table names, Parent category name)
  5. Update Vector Table Name Node (for AI Agent)
  6. Setup Authentication (Google, Supabase, Cohere)
  7. Execute and watch the magic happen!

Workflow is available for download in the loom summary page.

Overview video here

Full tutorial here

2 Likes