The problem statement
I want to create and maintain a vector database (Qdrant) based on all files present in a SharePoint folder and its subfolders. Every day, I want to run the following steps:
- Drop the existing content of the vector DB
- Insert the files found in SharePoint into the vector DB. This is done by:
  - Listing all items in the SharePoint folder using the “Get many items” SharePoint node available in n8n
  - Looping over each item and using a “Download item” SharePoint node available in n8n to download each file
  - Embedding each file and storing it in the DB
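To make the intended daily flow concrete, here is a minimal Python sketch of the steps above. It is not n8n code: `RemoteFile`, `rebuild_index`, and the five callables passed into it are hypothetical names introduced only for illustration, standing in for the SharePoint listing and download steps, the embedding model, and the Qdrant operations.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, List


@dataclass
class RemoteFile:
    """Hypothetical description of one SharePoint file (not an n8n type)."""
    item_id: str
    name: str


def rebuild_index(
    reset_store: Callable[[], None],
    list_files: Callable[[], Iterable[RemoteFile]],
    download: Callable[[RemoteFile], bytes],
    embed: Callable[[bytes], List[float]],
    upsert: Callable[[str, List[float], dict], None],
) -> int:
    """Drop the store, then re-insert every file. Returns the number of files indexed."""
    reset_store()                      # drop the existing content of the vector DB
    count = 0
    for remote in list_files():        # list all items in the folder
        content = download(remote)     # download each file
        vector = embed(content)        # embed the file content
        upsert(remote.item_id, vector, {"name": remote.name})  # store it in the DB
        count += 1
    return count
```

Passing the steps in as callables keeps the sketch independent of any particular SharePoint or Qdrant client, so each step can be swapped for the matching n8n node or API call.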
The issue
The problems I’m encountering are:
- The “Get many items” SharePoint node returns an “id” that is not the official item ID that SharePoint wants us to use in the “Download item” SharePoint node. It also returns the “webUrl” of the SharePoint item, but that is not an accepted input parameter of the “Download item” node either.
- Additionally, the “Download item” SharePoint node expects the “Parent Folder ID” as an input parameter, which the “Get many items” SharePoint node does not return either.
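For comparison, when files are listed through the Microsoft Graph API directly, each driveItem carries its own `id` plus a `parentReference` object holding the parent folder's `id` and the `driveId`. The sketch below assumes such a driveItem JSON object as input. The helper names `drive_item_ids` and `content_url` are hypothetical, and whether the n8n “Download item” node accepts these exact values is an assumption that would need checking.

```python
from typing import Optional, Tuple

GRAPH_BASE = "https://graph.microsoft.com/v1.0"


def drive_item_ids(item: dict) -> Tuple[str, Optional[str], Optional[str]]:
    """Return (item id, parent folder id, drive id) from a Graph driveItem dict."""
    parent = item.get("parentReference", {})
    return item["id"], parent.get("id"), parent.get("driveId")


def content_url(drive_id: str, item_id: str) -> str:
    """Build the Graph URL that serves the file content of a driveItem."""
    return f"{GRAPH_BASE}/drives/{drive_id}/items/{item_id}/content"
```

With an item such as `{"id": "01ABC", "parentReference": {"id": "01PARENT", "driveId": "b!drive"}}`, the first helper yields the three IDs and the second builds the download URL from the drive ID and item ID.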
Can someone help me and tell me how I could build such a RAG pipeline starting from documents in SharePoint?