Processing Multiple Document Types for LLM Input in n8n
Hello, everyone! I’m completely begginer in n8n.
I’m working on a workflow to process documents of various formats and feed them to an LLM model. Before I build my implementation, I’d like to get advice from the community on the best approach.
Describe the problem/error/question
I need to create a workflow that:
- Accepts multiple documents of different types (doc, docx, xml, xlsx, txt, html, odf, xls)
- Processes these documents to extract text content
- Combines all extracted text
- Sends the combined text to an LLM for analysis
My main challenge is implementing the loop to process each document and then merging all the extracted text together. I’m not sure how to structure this workflow efficiently.
What is the error message (if any)?
No specific error, I’m seeking guidance on implementation.
Share the output returned by the last node
Empty. I wish to get llm answer.
Information on your n8n setup
- n8n version: 1.91.2
- Database (default: SQLite): Postgres
- n8n EXECUTIONS_PROCESS setting (default: own, main): default
- Running n8n via (Docker, npm, n8n cloud, desktop app): docker compose starter-kit
- Operating system: Ubuntu