When execution concurrency increases, I start seeing random workflow failures that don't follow any consistent node pattern. I'm running in queue mode with Redis. Could this be worker memory pressure, or jobs stalling because the event loop is blocked?
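Yeah, stalled jobs from event loop blocking are a likely cause, but memory pressure on the workers can look the same from the outside, so check both: look through the worker logs for errors mentioning stalled jobs and for out-of-memory restarts around the time of the failures. What's your per-worker concurrency set to? The default is 10, which can be too much when the workflows are CPU- or memory-heavy, so try lowering it to around 5 and adding more workers instead. I wouldn't go much lower than that with a lot of workers, though, since the n8n docs warn that low concurrency across many workers can exhaust the database connection pool. Also make sure you're on Postgres and not SQLite. SQLite isn't recommended for queue mode, and concurrent writes from several processes tend to end in lock errors and failed executions.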
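If you want to test the event loop theory before changing settings, here's a rough sketch of the usual timer-drift check: schedule a timer and see how late it fires. The helper names `measureMaxLoopLag` and `blockFor` are made up for illustration and aren't part of n8n. It only tells you about the Node.js process it runs in, so treat it as a way to see the effect, not as a ready-made worker diagnostic.

```typescript
// Hypothetical helper (not an n8n API): schedules `samples` timers, each
// `intervalMs` apart, and resolves with the largest delay (in ms) between
// when a timer was due and when it actually ran.
function measureMaxLoopLag(intervalMs: number, samples: number): Promise<number> {
  return new Promise<number>((resolve) => {
    let maxLag = 0;
    let remaining = samples;
    let expected = Date.now() + intervalMs;

    const tick = (): void => {
      // A late timer means something kept the event loop busy.
      const lag = Math.max(0, Date.now() - expected);
      maxLag = Math.max(maxLag, lag);
      remaining -= 1;
      if (remaining <= 0) {
        resolve(maxLag);
        return;
      }
      expected = Date.now() + intervalMs;
      setTimeout(tick, intervalMs);
    };

    setTimeout(tick, intervalMs);
  });
}

// Hypothetical helper: busy-waits to simulate synchronous, CPU-bound work
// such as a heavy loop over a large dataset.
function blockFor(ms: number): void {
  const end = Date.now() + ms;
  while (Date.now() < end) {
    // spin on purpose; nothing else can run on this thread meanwhile
  }
}
```

Calling `measureMaxLoopLag(20, 5)` and then `blockFor(200)` right after should report a lag far above the 20 ms interval, because the first timer can't fire until the busy loop finishes. If a worker sees lag like that for long enough, it can miss renewing its job lock, which is how a job ends up marked as stalled.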