I’m regularly getting 503s as soon as there is a bit of traffic on the n8n instance. I have hundreds of Workflows, but the load is pretty low (10-30 consurrent Executions). How can I approach debugging this?
Helm chart:
global:
autoscaling:
enabled: true
minReplicas: 5
n8n:
n8n:
concurrency: 30
scaling:
webhook:
count: 10
worker:
concurrency: 15
count: 12
webhookResources:
limits:
cpu: 2
memory: 1Gi
requests:
cpu: 200m
workerResources:
limits:
cpu: 2
memory: 3Gi
requests:
cpu: 1
I have very long workflows (1 hour +) that calls others workflows.
Here are 30mn of n8n logs while the instance was 503ing:
https://file.io/tuoSnpuxhvgw
Thanks a lot