Using Ollama vision models

It would help if there was a node for:

Ollama vision models

My use case:

Currently, image analysis is only available through the AI vendor nodes (Gemini, Anthropic, OpenAI). However, Ollama supports vision models that can analyze images. I’d like to use local Ollama vision models for image analysis.

Any resources to support this?

Are you willing to work on this?

Yes I have actually worked on this but I need some help before continuing

My question:

I’ve implemented this by adding Ollama as a new AI vendor (similar to Gemini/Anthropic/OpenAI), which works for vision tasks.

However, I’m questioning this approach because Ollama is not really like the other vendors. The vendor interface also feels empty because Ollama does not have that many capabilities. The other vendors have features such as image generation, which Ollama does not (yet)

Should I continue with Ollama as a separate AI vendor? Or should I change the AI agent node to allow image upload when using an Ollama chat model?

Here is my current implementation:

I have included screenshots.

I’d love to have this as I’m trying to setup an image classifier for all my photography work.

Ideally I’d have n8n running a local file search on my server, picking up on the images from multiple folders, doing some local AI classification and image keyword generation, slapping the whole lot into a vector DB (again all local) and potentially sorting into folders by key classification and highlighting potential duplicates.

Looking to use the llava:7b model, Qdrant, n8n and openwebui/ollama all on the homelab to do this.

Got the tools and the back end, just not the nodes it seems…

Keeping an eye on this request :).

S.

New version [email protected] got released which includes the GitHub PR 19371.