K8s horizontal scaling and non-HTTP triggers

Hello everyone,

I’m planning to run a number of workflows in a Kubernetes cluster, all of them triggered by Kafka events. We usually prefer to scale out with K8s horizontal scaling, but I understand this would mean every n8n instance listens for the same events and gets triggered, duplicating executions.

I was wondering whether it would be reasonable to use queue mode and still take advantage of the K8s HPA, for example by keeping a single instance of the n8n main process and configuring the HPA only for the worker instances, so that they are spawned automatically based on CPU/memory consumption. My understanding is that triggers would remain on the main instance, avoiding duplicate executions.
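
To make the idea concrete, here is a rough sketch of how I understand the HPA would size the worker pool while the main instance stays pinned at one replica. It uses the replica formula from the Kubernetes HPA documentation (desired = ceil(current * currentMetric / targetMetric), skipped when the ratio is within a 10% tolerance). The function name, the 60% CPU target, and the min/max bounds are only my assumptions for illustration, not anything n8n prescribes.

```python
import math

# Hypothetical setup: the main process is never autoscaled, only workers are.
MAIN_REPLICAS = 1


def desired_worker_replicas(current_replicas, current_cpu_pct, target_cpu_pct,
                            min_replicas=1, max_replicas=10, tolerance=0.1):
    """Approximate the HPA decision for the worker Deployment.

    current_cpu_pct is the average CPU utilization across worker pods,
    target_cpu_pct is the utilization the HPA tries to maintain.
    """
    ratio = current_cpu_pct / target_cpu_pct
    # The HPA does not scale when the ratio is close enough to 1.0.
    if abs(ratio - 1.0) <= tolerance:
        return current_replicas
    desired = math.ceil(current_replicas * ratio)
    # Clamp to the configured minReplicas / maxReplicas bounds.
    return max(min_replicas, min(max_replicas, desired))


if __name__ == "__main__":
    # Two workers averaging 90% CPU against a 60% target.
    workers = desired_worker_replicas(2, 90, 60)
    print(f"main={MAIN_REPLICAS} workers={workers}")
```

In this picture the Kafka trigger lives only on the single main instance, and the HPA changes nothing but the number of workers pulling executions from the queue.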

Would that be a sensible approach? Are there more reasonable ways of combining Kafka triggers with the K8s HPA? Or should I just avoid the HPA and add workers manually?

Thank you very much