I’m trying to build a site scraper in n8n using DataForSEO api to extract sitemap URLs and their names from a given sitemap (e.g., example.com/sitemap).
Basically now it just one node ‘Http request’.
Extracting URLs with names works fine, but the challenge is that the data is nested in multiple arrays, making it hard to map them out etc.
I tried using split or set nodes to map the data, but the output is unclear.
What’s the best approach after the HTTP node to extract clean, structured data?
I can achieve this easily with Serper or Firecrawl, (they kind a do this automatically), but they’re more expensive, so I’m trying to use DataForSEO.
Any ideas?