Hi everyone
I’m running a multi-tenant n8n system with queues and retries, and I’m trying to design a better strategy for jobs that keep failing. Webhook → Queue → Worker → External API
The problem is that some jobs fail repeatedly because of:
Invalid data
Expired credentials
Third-party API issues
Business logic errors
After several retries, I don’t want these jobs clogging the queue forever.
I’m considering:Queue → Retry → Dead Letter Queue
where permanently failed jobs are moved to a separate queue for investigation.
For teams running n8n at scale:
Do you implement a Dead Letter Queue pattern?
How do you decide when a job should stop retrying?
Describe the problem/error/question
What is the error message (if any)?
Please share your workflow
(Select the nodes on your canvas and use the keyboard shortcuts CMD+C/CTRL+C and CMD+V/CTRL+V to copy and paste the workflow.)
Share the output returned by the last node
Information on your n8n setup
- n8n version:
- Database (default: SQLite):
- n8n EXECUTIONS_PROCESS setting (default: own, main):
- Running n8n via (Docker, npm, n8n cloud, desktop app):
- Operating system: