PDF to vector database workflow with tables and images (n8n, embeddings, RAG, Qdrant/Weaviate, Supabase)

pochta.butin · November 13, 2025, 3:49am

How can all the information from a PDF, including text, tables, and images, be saved to a vector database, such as supabase vector, so that all data can be read in the future without exception?

Help me find a suitable template.

achamm · January 19, 2026, 10:48pm

To save all information from a PDF, including text, tables, and images, to a vector database like Supabase, you can use a combination of OCR (Optical Character Recognition) for text and tables, and embeddings for images. Check out the “[Build a PDF Search System with Mistral OCR and Weaviate DB](Build a PDF search system with Mistral OCR and Weaviate DB | n8n workflow template)” template on n8n, which uses Mistral OCR for text extraction and Weaviate for vector storage. You can adapt this workflow to use Supabase instead of Weaviate by modifying the vector storage node.

DudaNogueira · February 5, 2026, 6:02pm

hi @pochta.butin !! Duda from Weaviate here

One option is using ColPali with Weaviate. But note: This is really new territory, so not yet integrated with n8n.

If you want to give it a try using python, here we have some code:

By the way, I will keep pushing Weaviate N8N integration, so stay tuned for some cool features