Hi everyone,
Iβm working on a workflow where I need to log in to multiple B2B supplier portals and download a small number of documents (mostly PDFs).
Context:
-
Around 15β20 different supplier websites
-
Mostly B2B energy providers
-
Access is via private / reserved areas (username + password)
-
No public APIs available (or only enterprise-level integrations)
-
Execution frequency is very low:
1β2 runs per month per site -
Each run usually means:
-
login
-
navigate to a specific section
-
download 1β5 documents
-
-
No aggressive scraping, no parallel requests, human-like behavior
The goal is simply to replicate what a human operator does manually today, but in an automated and repeatable way.
My current questions
-
From your experience, what is the best technical approach in n8n for this kind of scenario?
-
HTTP Request node with cookies/session handling?
-
Browser automation (Playwright / Puppeteer)?
-
External services integrated via Execute Command / Code node?
-
-
For sites with:
-
heavy JS
-
dynamic DOM
-
protected download buttons
is Playwright the most reliable solution in your opinion?
-
-
Any best practices to avoid account blocking, given:
-
very low frequency
-
sequential requests
-
realistic delays between actions?
-
-
Would you recommend:
-
one generic Playwright workflow parametrized per site
-
or one workflow per supplier?
-
Constraints / notes
-
Using code is not a problem (JS / TS / Python are fine)
-
Credentials will be handled securely via n8n credentials
-
No CAPTCHA solving services planned (if CAPTCHA appears β manual fallback is acceptable)
-
Main priority: stability over performance
What Iβm looking for
Real-world advice from people who:
-
already automated login + download flows
-
used Playwright with n8n
-
dealt with B2B portals without APIs
Any architectural suggestions, node recommendations, or links to similar implementations are very welcome.
Thanks in advance ![]()