Need Generic Catch Error Mechanism within Main Workflows

For any node in any workflow, we need a generic mechanism to catch and handle errors within the workflow – rather than wiring up a separate “error workflow” for these cases.

Specifically, separate error workflows are great when you want to have a generic notification mechanism for alerting on any/all workflow errors (like Send Alert to Slack).

But separate error workflows are bad when you want to catch specific errors and handle them in a unique way. Why? Because how you handle these errors likely depends on which node in your workflow threw the error. And if that handling logic is divorced from the originating workflow, then, as a developer, you have to constantly flip between the “main” workflow and the “error” workflow in order to handle unique errors on a node-by-node basis.

From a quality-of-life perspective, it’s much easier to handle node-specific errors when you are editing the main workflow and not as a separate activity later in the development cycle.

For comparison, Make.com offers the following types of error-handler directives:

  • Ignore
  • Resume
  • Commit
  • Rollback
  • Break

It would be very useful if n8n had any sort of equivalent capabilities within the main workflow (not as a separate error workflow).

For example, let’s say a node throws a very specific error message that you want to catch and handle in a very specific way. It doesn’t feel like that’s currently possible within the original execution of the workflow. Why does that matter? Because I want to reference state information from earlier nodes that did execute correctly, and I can’t do that if the error handling lives in an entirely different execution.
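
As an aside, reaching back to earlier node state is straightforward when you stay inside the same execution. Here is a minimal sketch in an n8n Code node, assuming a node literally named “Airtable” ran earlier in the workflow (the node and field names are placeholders):

```js
// Code node in the MAIN workflow: reach back to a node that already ran
// in this same execution. 'Airtable' is a placeholder node name.
const record = $('Airtable').first().json;

return [
  {
    json: {
      recordId: record.id,
      prompt: record.fields?.Prompt, // hypothetical field name
    },
  },
];
```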

A couple of insights per @jan:

  • You can make a workflow handle its own errors by having a separate pipeline tied to the Error Trigger.
  • The only catch is that those errors will be handled in a separate execution.

My thoughts:

  • Okay, so if the error is handled in a separate execution and you know the ID of the execution that threw the error, then you can essentially pull up the state of that previous execution using an internal n8n API call (see the sketch after this list).
  • I’m currently looking at the feasibility of this approach.
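
For what it’s worth, here is a rough sketch of that lookup, assuming the instance’s public REST API exposes GET /api/v1/executions/{id} with an includeData flag and an X-N8N-API-KEY header, and assuming per-node data sits under data.resultData.runData (verify both against your n8n version’s API docs):

```js
// Sketch (Node 18+): pull the state of a failed execution by its ID.
// The endpoint, query flag, and response shape are assumptions to verify.
const N8N_URL = process.env.N8N_URL ?? 'http://localhost:5678';
const N8N_API_KEY = process.env.N8N_API_KEY ?? '';

async function getExecutionState(executionId) {
  const res = await fetch(
    `${N8N_URL}/api/v1/executions/${executionId}?includeData=true`,
    { headers: { 'X-N8N-API-KEY': N8N_API_KEY } },
  );
  if (!res.ok) {
    throw new Error(`Failed to fetch execution ${executionId}: ${res.status}`);
  }
  const execution = await res.json();
  // runData is keyed by node name and holds each node's input/output items.
  return execution.data?.resultData?.runData;
}
```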

Okay, I hope this illustrates how awkward custom error handling currently is within n8n workflows.

Here’s an example of a workflow that:

  • Takes an Airtable record’s field value
  • Feeds the value into OpenAI as a prompt
  • Updates a corresponding field value in the same Airtable record with the result

Now, whenever the data from Airtable triggers OpenAI’s content filtering logic, the Basic LLM Chain throws a 400 error.

We then “catch” this error in the Error Trigger workflow, verify that the error is actually a content-filtering violation, and then update an “error” field within the same Airtable record.

But in order to do this, we have to explicitly pull the entire state of the previous execution into the Error Trigger workflow. On top of that, any other errors we didn’t account for get silently dropped for now, which isn’t ideal.
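
For reference, here is a minimal sketch of that check in a Code node placed right after the Error Trigger, assuming the trigger’s usual payload shape (execution.id, execution.error.message, execution.lastNodeExecuted) and treating the string matches as stand-ins for however OpenAI actually phrases its content-filter rejections:

```js
// Code node in the ERROR workflow, directly after the Error Trigger.
// $json is the Error Trigger payload: { execution: {...}, workflow: {...} }.
const { execution } = $json;
const message = execution?.error?.message ?? '';

// Assumption: content-filter rejections are identifiable from the message text.
const isContentFilter =
  message.includes('content_filter') ||
  message.toLowerCase().includes('content management policy');

return [
  {
    json: {
      executionId: execution.id,        // needed to pull state from the failed run
      failedNode: execution.lastNodeExecuted,
      isContentFilter,
      errorMessage: message,
    },
  },
];
```

An IF node can then route on isContentFilter: the true branch updates the Airtable “error” field, and the false branch can go to a generic alert instead of being dropped silently.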

That’s not the only option, though. You can also handle errors via the node’s own options:

  • Continue is a fairly old option that has existed since the 0.x versions.
  • Continue (using error output) is much newer, and it really helps with handling specific errors.

I have some workflows that use that approach.
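
For illustration, here is a hedged sketch of a Code node wired to the error output of the Basic LLM Chain, assuming the error-branch items expose the failure details under json.error (check the actual item shape in your version; the “Airtable” node name is a placeholder):

```js
// Code node attached to the ERROR output of the Basic LLM Chain
// (node option "On Error" set to "Continue (using error output)").
// Assumption: error-branch items carry the failure details under json.error.
const item = $input.first().json;
const message =
  typeof item.error === 'string' ? item.error : item.error?.message ?? '';

// Because this runs in the SAME execution, earlier node state is still reachable,
// e.g. the original Airtable record:
const record = $('Airtable').first().json;

return [
  {
    json: {
      recordId: record.id,
      isContentFilter: message.includes('content_filter'),
      errorMessage: message,
    },
  },
];
```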


Hey @barn4k, thanks for the clarification. I guess it would help a ton if the On Error mechanism were supported generically across all node types rather than having to build custom error handling on a per-node-type basis. Does that make sense?

Oh, maybe the On Error mechanism does exist across all node types? I’m going to go through and check to be sure…

Okay, this is really weird… I tried setting On Error to Continue (using error output), and I’m now seeing these sorts of executions:

[screenshot of the execution]

In this example, the Basic LLM Chain clearly threw an error, but still evaluated the Success path… huh?

I submitted a ticket tracking this issue here: