The Summarization Chain currently sends multiple requests in parallel, which can cause timeouts with local models such as Ollama that can only process requests sequentially. This has been reported in this thread:
To address this, I propose adding a concurrency limit option to the Summarization Chain. This would let users cap the number of parallel requests, preventing timeouts with models that cannot handle high parallelism; a rough sketch of the idea follows.
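
A minimal sketch of how such an option could work, independent of the chain's internals. The `max_concurrency` parameter and the `summarize_chunk` helper are hypothetical names used for illustration, not part of the existing API; the point is that a semaphore caps how many summarization requests are in flight at once.

```python
import asyncio
from typing import Awaitable, Callable, List


async def summarize_all(
    chunks: List[str],
    summarize_chunk: Callable[[str], Awaitable[str]],
    max_concurrency: int = 1,
) -> List[str]:
    """Run the map step over `chunks`, never exceeding `max_concurrency`
    in-flight model calls. With max_concurrency=1 the calls are effectively
    sequential, which matches what single-request local models need."""
    semaphore = asyncio.Semaphore(max_concurrency)

    async def bounded(chunk: str) -> str:
        # Wait for a free slot before issuing the request to the model.
        async with semaphore:
            return await summarize_chunk(chunk)

    # gather still schedules everything up front, but the semaphore
    # ensures only `max_concurrency` requests run concurrently.
    return await asyncio.gather(*(bounded(c) for c in chunks))
```

With `max_concurrency=1`, requests to a local model like Ollama would be issued one at a time, trading throughput for reliability; users with backends that handle parallelism well could raise the limit and keep the current behavior.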