The idea is:
Ollama provides a keep_alive parameter in its API, but the n8n Ollama node does not support it. By default, Ollama unloads a model and its weights after 5 minutes of inactivity, so the next request has to reload the model first. As a user of Continue with the Ollama LLM serving backend, I frequently experience these long delays in my workflow. Ollama recently added support for the keep_alive parameter in requests, which can prevent unloading or make the model's in-memory persistence configurable. Please add support for configuring keep_alive and including it in the inference requests sent to the Ollama backend. The parameter was added in Ollama 0.1.23 through this merged pull request: add keep_alive to generate/chat/embedding api endpoints by pdevine · Pull Request #2146 · ollama/ollama · GitHub
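To illustrate, here is a minimal sketch of what the underlying request could look like, assuming a local Ollama instance on the default port 11434. This is not taken from the n8n node code; the model name and keep_alive value are placeholders.

```typescript
// Minimal sketch (not the n8n node implementation): calling Ollama's
// /api/generate endpoint with keep_alive set. Host, model, and keep_alive
// value are example assumptions.
async function generateWithKeepAlive(prompt: string): Promise<string> {
  const response = await fetch("http://localhost:11434/api/generate", {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify({
      model: "llama2",   // example model name
      prompt,
      stream: false,     // return a single JSON object instead of a stream
      // keep_alive accepts a duration string ("30m"), a number of seconds,
      // 0 to unload immediately, or a negative value to keep the model
      // loaded indefinitely.
      keep_alive: "30m",
    }),
  });
  const data = await response.json();
  return data.response as string;
}
```

Exposing keep_alive as an option on the Ollama node (or at least honoring a configured value like the one above) would let workflows keep the model resident in memory between executions.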
My use case:
I use Ollama to run local AI models in my workflows, but I run into long delays whenever the model has been unloaded and must be reloaded for the next request.
I think it would be beneficial to add this because:
For privacy reasons, running LLM models locally is often the expected setup, so keeping local inference responsive matters.
Any resources to support this?
The Ollama pull request linked above (PR #2146, merged in Ollama 0.1.23) documents the keep_alive parameter.
Are you willing to work on this?
Yes, I'm able to help with testing anytime.