How to handle large PDF file (6M characters) on Cloud

I need to ingest an entire book in my RAG. The PDF is 18 MB.
If I handle it like any other PDF, n8n (Cloud) runs out of memory. So I have to do something offline, before uploading it.
If I extract the text I get a 6MB txt file. It can’t be converted to GDocs (too much text), but it should use all the memory in n8n.

Is there a better way to do this?

Hey @albertocv hope all is well.

Two things come to mind, you could either try to compress the pdf to make it smaller or you could split it up.

See if this service can help:
Compress or Split.

This topic was automatically closed 90 days after the last reply. New replies are no longer allowed.