Extract from File node – PDF content gets jumbled for one format

Hi everyone,

I’m using the Extract from File node in n8n to retrieve content from PDF files.

For most PDF formats, it’s working great and maintains the structure well.
However, for one particular format that i used in the flow, the extracted data gets jumbled and out of order, meaning the text flow is not preserved properly.

Could you please help me with resolving this issue?
Are there any recommended workarounds or alternative approaches
that provide better accuracy and preserve text order?

Could you please include an example of a PDF which doesn’t want to return the text properly?

This topic was automatically closed 90 days after the last reply. New replies are no longer allowed.