I’d like to request a feature to enhance n8n’s support for PPTX and DOCX file handling. Currently, working with these formats in n8n is somewhat limited, and having built-in nodes or improved functionality to read, modify, and generate DOCX/PPTX files would be extremely valuable for automation workflows.
Has anyone found alternatives for extracting data from PPTX files? I am currently using an API in the background to convert the PPTX files to PDF, and then trying to use the PDFs for my RAG setup.
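One option that avoids the PDF round trip: a .pptx file is a ZIP archive of XML parts, and slide text lives in `<a:t>` elements inside `ppt/slides/slideN.xml`. Below is a minimal sketch using only the Python standard library. The function name `extract_pptx_text` is hypothetical, and it assumes you can get the file bytes out of n8n into wherever Python runs. It orders slides by file number, which usually, but not always, matches the presentation order, and it ignores speaker notes, charts, and embedded images.

```python
import io
import re
import zipfile
import xml.etree.ElementTree as ET

# DrawingML namespace used for text runs inside slides.
A_NS = "{http://schemas.openxmlformats.org/drawingml/2006/main}"
SLIDE_RE = re.compile(r"ppt/slides/slide(\d+)\.xml")


def extract_pptx_text(data: bytes) -> list[str]:
    """Return one string per slide, ordered by slide file number.

    Assumption: file-number order is good enough. The true order is
    defined in ppt/presentation.xml, which this sketch does not read.
    """
    with zipfile.ZipFile(io.BytesIO(data)) as zf:
        slides = []
        for name in zf.namelist():
            match = SLIDE_RE.fullmatch(name)
            if match:
                slides.append((int(match.group(1)), name))

        texts = []
        for _, name in sorted(slides):
            root = ET.fromstring(zf.read(name))
            paragraphs = []
            # Each <a:p> is a paragraph; its <a:t> children hold the text runs.
            for para in root.iter(f"{A_NS}p"):
                line = "".join(run.text or "" for run in para.iter(f"{A_NS}t"))
                if line.strip():
                    paragraphs.append(line)
            texts.append("\n".join(paragraphs))
        return texts
```

Each returned string can then be chunked and embedded like any other text for a RAG pipeline.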
Are there any good options here that don't involve another paid subscription? I'm going to try using MS Graph to export the PPTX as a PDF and then extract the text from that. I think Microsoft's base Microsoft 365 subscription includes this ability, but I'm not 100% sure yet.
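For reference, the Graph conversion is a single request: `GET /drives/{drive-id}/items/{item-id}/content?format=pdf`, which responds with a redirect to the converted file. Here is a rough sketch, assuming the file already sits in OneDrive or SharePoint and that you have a valid access token with file read permission. The helper names `pdf_export_url` and `download_as_pdf` are made up for illustration, and nothing here settles the licensing question.

```python
import urllib.parse
import urllib.request

GRAPH_BASE = "https://graph.microsoft.com/v1.0"


def pdf_export_url(drive_id: str, item_id: str) -> str:
    """Build the Graph URL that returns a drive item converted to PDF."""
    drive = urllib.parse.quote(drive_id, safe="")
    item = urllib.parse.quote(item_id, safe="")
    return f"{GRAPH_BASE}/drives/{drive}/items/{item}/content?format=pdf"


def download_as_pdf(drive_id: str, item_id: str, access_token: str) -> bytes:
    """Fetch the converted PDF bytes.

    Assumption: access_token is a valid bearer token obtained elsewhere.
    urllib follows the redirect to the download URL automatically.
    """
    request = urllib.request.Request(
        pdf_export_url(drive_id, item_id),
        headers={"Authorization": f"Bearer {access_token}"},
    )
    with urllib.request.urlopen(request) as response:
        return response.read()
```

In n8n itself, the same URL should work from an HTTP Request node with Microsoft OAuth2 credentials, so the Python wrapper is only needed if you run the conversion outside the workflow.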
I was able to get data from XLSX and DOCX files into a vector database using existing nodes. It's cumbersome but doable. There isn't much out there on how to make that happen; the guides I found are almost exclusively about Google Docs.