Is n8n fault-tolerant, and does it resume workflows after a failure?

I’m researching whether n8n fits our requirements. What we want from n8n is fault-tolerant workflows: if an execution is interrupted (pod restart, pod termination, etc.), it should resume from the last executed step.

We will run n8n in worker (queue) mode.

Does n8n have this functionality? I’m confused because the documentation says the following:

If your instance is down or restarting, missed executions (for example, Cron or Webhook nodes) during this time aren't recoverable. If it's important for you to maintain 100% uptime, you need to build another proxy in front of it which caches the data.

The documentation also says there is an EXECUTIONS_DATA_SAVE_ON_PROGRESS variable that saves the progress of an execution. My question is: does setting it automatically enable a mechanism for resuming stale jobs, or is it only for monitoring running executions?
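For concreteness, here is a rough sketch of the environment I have in mind for the main and worker containers. The host names are placeholders I made up for illustration, not our real values, and I’m only assuming this is the relevant set of variables. Whether EXECUTIONS_DATA_SAVE_ON_PROGRESS does anything for resuming is exactly what I’m asking.

```shell
# Sketch only: shared environment for the main instance and the workers.
# Host names below are hypothetical placeholders.

# Run executions through the queue so that workers pick them up.
export EXECUTIONS_MODE=queue

# Redis backing the queue (placeholder host).
export QUEUE_BULL_REDIS_HOST=redis.example.internal

# PostgreSQL as the database (placeholder host).
export DB_TYPE=postgresdb
export DB_POSTGRESDB_HOST=postgres.example.internal

# Save execution data after each node; unclear to me whether this
# also enables resuming an interrupted execution.
export EXECUTIONS_DATA_SAVE_ON_PROGRESS=true

# Workers would then be started with the `n8n worker` command.
```

With this in place, the scenario I care about is a worker pod being killed in the middle of an execution.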

Information on your n8n setup

  • n8n version: latest (1.98)
  • Database (default: SQLite): PostgreSQL
  • n8n EXECUTIONS_PROCESS setting (default: own, main): ?
  • Running n8n via (Docker, npm, n8n cloud, desktop app): Docker
  • Operating system:
