Item Duplication in Pinecone using n8n with Supabase & OpenAI Embeddings

Hi everyone,

I’m new to n8n and automation in general, and I’m running into an item duplication issue. I’m trying to create a Pinecone vector database for my courses. My n8n workflow pulls course data from Supabase (5 courses), generates embeddings with OpenAI, and stores them in Pinecone. Instead of 5 items, I’m getting 93 in Pinecone (as shown in the screenshot). The duplication seems to happen between Supabase and OpenAI, where the item count jumps from 5 to 15. To make things worse, the AI agent simply responds with “I don’t know.”
Any help would be greatly appreciated!

Information on my n8n setup

  • n8n version: 1.64.3
  • Database (default: SQLite):
  • n8n EXECUTIONS_PROCESS setting (default: own, main):
  • Running n8n via: npm
  • Operating system: Windows 10 (64-bit)

Hi @MohammadFarouk1971

Thanks for posting here and welcome to the community! :cake:

Unfortunately, working with vector databases is not as straightforward as working with traditional databases. The duplication stems from Pinecone being a completely different system from Supabase: Supabase is a Postgres-based vector store, whereas Pinecone is built around a collections approach.
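
For what it's worth, one common reason an item count grows between the data source and the embedding step is that a text splitter breaks each record into several chunks, and each chunk gets its own embedding. I can't confirm that this is what happens in your workflow, so treat the following as a rough sketch only. The course data, the `split_into_chunks` helper, and the chunk size are all made up for illustration; this is plain Python, not n8n's splitter.

```python
def split_into_chunks(text: str, chunk_size: int, overlap: int = 0) -> list[str]:
    """Split text into fixed-size character chunks with optional overlap."""
    if chunk_size <= 0 or not 0 <= overlap < chunk_size:
        raise ValueError("chunk_size must be positive and overlap smaller than chunk_size")
    step = chunk_size - overlap
    return [text[i:i + chunk_size] for i in range(0, len(text), step)]


# Hypothetical source rows: 5 courses, each with a 250-character description.
courses = [{"id": n, "description": "x" * 250} for n in range(1, 6)]

# Each description is split into 100-character chunks before embedding,
# so every course yields 3 items instead of 1.
items = [
    {"course_id": course["id"], "chunk_index": index, "text": chunk}
    for course in courses
    for index, chunk in enumerate(split_into_chunks(course["description"], 100))
]

print(len(courses), len(items))  # 5 15
```

If something like this is going on, the 15 items are chunks rather than true duplicates, and the number depends on the splitter's chunk size and overlap settings.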

I think it would be quite difficult to build a workflow that “transfers” vector store data from one to the other while adhering to the right structures and configurations. You would be better off isolating your vector store embeddings from your source data.

Is there a reason why you can’t “feed” Pinecone directly from your source data (i.e. human-readable text)?
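
If you do feed Pinecone from the source text, it may also help to give every record a stable ID. Pinecone's upsert overwrites a record whose ID already exists, so re-running a workflow with the same IDs should not pile up copies, whereas random IDs add new records on every run. Below is a minimal sketch of that idea. `InMemoryIndex` is a hypothetical stand-in for a vector index (not the Pinecone client), `fake_embed` is a placeholder for the OpenAI embedding call, and the course data is invented.

```python
import hashlib
import uuid


class InMemoryIndex:
    """Hypothetical stand-in for a vector index: upsert overwrites by ID."""

    def __init__(self) -> None:
        self.vectors: dict[str, dict] = {}

    def upsert(self, records: list[dict]) -> None:
        for record in records:
            self.vectors[record["id"]] = record


def fake_embed(text: str) -> list[float]:
    """Placeholder for a real embedding call; derives 4 numbers from a hash."""
    digest = hashlib.sha256(text.encode("utf-8")).digest()
    return [byte / 255 for byte in digest[:4]]


def build_records(courses: list[dict], stable_ids: bool) -> list[dict]:
    """Build one record per course, with either stable or random IDs."""
    return [
        {
            "id": f"course-{c['id']}" if stable_ids else str(uuid.uuid4()),
            "values": fake_embed(c["description"]),
            "metadata": {"title": c["title"]},
        }
        for c in courses
    ]


# Hypothetical source rows standing in for the 5 courses.
courses = [
    {"id": n, "title": f"Course {n}", "description": f"Description of course {n}"}
    for n in range(1, 6)
]

random_index, stable_index = InMemoryIndex(), InMemoryIndex()
for _ in range(2):  # simulate running the workflow twice
    random_index.upsert(build_records(courses, stable_ids=False))
    stable_index.upsert(build_records(courses, stable_ids=True))

print(len(random_index.vectors), len(stable_index.vectors))  # 10 5
```

If the records are chunked, the chunk index would need to be part of the ID as well. I believe the Pinecone Vector Store node in n8n also offers an option to clear the namespace before inserting, which would be another way to avoid leftovers from earlier test runs, but please check that against your version.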
