Set user agent in HTTP node to avoid 403 forbidden error while scraping

chrisgereina · July 29, 2021, 3:10pm

Hi everyone, I’m currently working through a CSV list with a website URL on each line. For each one I run a GET request to fetch the page content, then look for certain keywords with the HTML Extract node. I’ve noticed a small problem: some sites that work perfectly fine when visited from my browser throw errors in the HTTP Request node.

That got me curious about what could be happening so I looked up the error code plus scraping and found this:

This is probably because of mod_security or some similar server security feature which blocks known spider/bot user agents (urllib uses something like python urllib/3.3.0 , it’s easily detected). Try setting a known browser user agent with:

I wonder if there is any plan to add the ability to set user agent properties on the HTTP Request node for scraping use cases? I actually need this myself, so I may just contribute this feature. If anyone could point me to the right area in the project, I can give it a crack.

harshil1712 · July 29, 2021, 3:34pm

Hey @chrisgereina, thank you for creating this feature request!

I’ve not used the HTTP Request extensively for scraping and hence didn’t come across the exact same issues. However, there were cases where I needed to specify the user-agent. I passed the user-agent in the Header parameters and it worked for me. Did you try passing the User-Agent header via the Header parameters?

RicardoE105 · July 29, 2021, 3:38pm

As @harshil1712 already mentioned, you should be able to accomplish this by passing the User-Agent in the header.
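
For anyone who wants to check the same idea outside n8n, here is a minimal Python sketch of sending a browser-like User-Agent as a request header. The helper names `build_request` and `fetch`, the example URL, and the user agent string are illustrative assumptions, not part of n8n; in the HTTP Request node the equivalent is a header named User-Agent with the same kind of value.

```python
import urllib.request

# Assumption: any current browser string can go here; this one is only a placeholder.
BROWSER_USER_AGENT = (
    "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 "
    "(KHTML, like Gecko) Chrome/103.0.0.0 Safari/537.36"
)


def build_request(url, user_agent=BROWSER_USER_AGENT):
    """Build a GET request that carries an explicit User-Agent header."""
    return urllib.request.Request(url, headers={"User-Agent": user_agent})


def fetch(url, user_agent=BROWSER_USER_AGENT):
    """Send the request and return the raw response body (needs network access)."""
    with urllib.request.urlopen(build_request(url, user_agent), timeout=10) as response:
        return response.read()


if __name__ == "__main__":
    request = build_request("https://example.com/")
    # urllib stores header names capitalized, so the lookup key is "User-agent".
    print(request.get_header("User-agent"))
```

A site may still answer 403 for reasons unrelated to the user agent, so this only rules out one possible cause.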

rafuru · August 3, 2022, 1:55pm

Does passing the User-Agent work properly? Is there any difference between CURL and HTTP Request in n8n?

I have a case where this curl command works perfectly fine:

curl 'https://www.mytheresa.com/sitemaps/sitemap.xml' \
  -H 'authority: www.mytheresa.com' \
  -H 'user-agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/103.0.0.0 Safari/537.36' \
  --compressed

But when using the HTTP Request node I get a 403.

RicardoE105 · August 4, 2022, 11:35pm

Yes, I just tested it, and it works fine. If you want to try, just make an HTTP request to something like https://webhook.site/ where you can explore the request.
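
If you would rather not send requests to an external service, a local stand-in can do the same job. The sketch below is an assumption-laden illustration, not something n8n provides: it starts a throwaway HTTP server on 127.0.0.1 that replies with the headers it received, sends one request with a chosen User-Agent, and returns what the server saw. The names `EchoHeadersHandler` and `fetch_echoed_headers` are made up for this example.

```python
import json
import threading
import urllib.request
from http.server import BaseHTTPRequestHandler, HTTPServer


class EchoHeadersHandler(BaseHTTPRequestHandler):
    """Reply to any GET with the request headers that arrived, as JSON."""

    def do_GET(self):
        body = json.dumps(dict(self.headers.items())).encode("utf-8")
        self.send_response(200)
        self.send_header("Content-Type", "application/json")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

    def log_message(self, format, *args):
        pass  # keep the console quiet


def fetch_echoed_headers(user_agent):
    """Start a local echo server, send one request, return the headers it saw."""
    server = HTTPServer(("127.0.0.1", 0), EchoHeadersHandler)  # port 0 = any free port
    threading.Thread(target=server.serve_forever, daemon=True).start()
    try:
        url = f"http://127.0.0.1:{server.server_address[1]}/"
        request = urllib.request.Request(url, headers={"User-Agent": user_agent})
        # Empty ProxyHandler so a configured system proxy is not used for localhost.
        opener = urllib.request.build_opener(urllib.request.ProxyHandler({}))
        with opener.open(request, timeout=5) as response:
            return json.loads(response.read().decode("utf-8"))
    finally:
        server.shutdown()
        server.server_close()


if __name__ == "__main__":
    for name, value in fetch_echoed_headers("MyScraper/1.0").items():
        print(f"{name}: {value}")
```

Pointing the HTTP Request node at a server like this (or at webhook.site) shows exactly which headers leave n8n, which makes it easier to compare against what curl sends.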

Is there any difference between CURL and HTTP Request in n8n?

Yes, you can do more with cURL probably. But, for what you are trying to do, there should not be any problems.

But when using the HTTP Request node I get a 403.

It works for me when changing the user agent or not sending it at all. It’s something to do with how the site handles sessions.

automat0r · October 15, 2024, 9:30am

By the way, for those running into the issue that the added header is only being sent in lowercase: go to Options > Lowercase Headers, select it, then disable it, so that the header is sent in whatever case you specified. Otherwise it will always be sent in lowercase.
