How to extract information from downloaded email pdfs?

I am using the Email Trigger (IMAP) to download the content from emails containing resume pdfs. I see that the attachments from each email are being output by the node in the “Binary” section, but I cannot find a way to pass them to other nodes for extraction of information.

Any help would be greatly appreciated!

  • Alain Fornes

Hi Alain,
pls. use Read PDF node to extract the text from the binary attachment. Then you should use the Code node to parse the text with regex JS.
Best regards, Kurt

I am getting this error when using that node:
ERROR: Cannot read properties of undefined (reading ‘id’)

Maybe I am using the wrong input for the “Binary Property” field?

Thank you for your help.

  • Alain

Asuming you have only one PDF attachment in the email message, you could use “attachment_0” in the field “Binary Property”

If your email messages contain more than 1 attachment use “Split in Batches” node set to batchsize 1 to process each attachment individually.

  • Kurt
2 Likes