Accept multi-modal payloads to ChatGPT AI Agent nodes

From what I’ve found the past few hours working on this, I hit a limitation. I’m trying to pass a binary file from a parent workflow to a sub-workflow. The file comes through, but when I try to pass it to my AI agent, it’s not working.

The idea is:

OpenAI’s latest model n8n AI Agent node should accept multi-modal binary file payloads.

My use case:

I need to pass a file (pdf, docx or image) to my AI agent as, or included in, my prompt.

I think it would be beneficial to add this because:

Everyone would love this.

Any resources to support this?

GPT-4o can now directly accept files (PDF, DOCX, images) with the /v1/chat/completions endpoint, using tool_choice: auto.

Are you willing to work on this?

If i got paid to.