How can I scrape infinite scroll content?

I am using the HTTP Request node for web scraping, but it does not return infinite scroll content. How can I scrape that?

It looks like your topic is missing some important information. Could you provide the following if applicable.

  • n8n version:
  • Database (default: SQLite):
  • n8n EXECUTIONS_PROCESS setting (default: own, main):
  • Running n8n via (Docker, npm, n8n cloud, desktop app):
  • Operating system:

Sure, here are the details:

  • n8n version: 1.76.3
  • Database: SQLite
  • n8n EXECUTIONS_PROCESS setting (default: own, main): own
  • Running n8n via: npm
  • Operating system: Ubuntu 20.0

We’ve created a new category for help with designing workflows, and I’ve moved your question there: Help me Build my Workflow.

You cannot scrape an infinite scroll website with the HTTP Request node alone, because HTTP works on a one-request, one-response basis. The response you receive contains only the initial HTML. The content that appears as you scroll is loaded by JavaScript running in the browser, and the HTTP Request node does not execute JavaScript.

The solution is to use a headless browser, such as Puppeteer or Cloudflare Browser Rendering. These tools behave like a real browser: they execute JavaScript and can perform actions such as auto-scrolling to load additional content. From n8n, you call the headless browser's API; it scrolls the page, waits for the content to load, and sends the resulting HTML back to your workflow for further processing.
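To make the auto-scrolling step concrete, here is a minimal sketch of the usual loop: scroll to the bottom, wait, and stop once the page height no longer grows. It is only an illustration, not a ready-made n8n integration. The `ScrollablePage` interface, its two method names, and the option names are hypothetical; with Puppeteer they could be thin wrappers around `page.evaluate()`, as noted in the comments.

```typescript
// Minimal view of a browser page. The two method names are hypothetical.
// With Puppeteer they could wrap page.evaluate(), for example:
//   getScrollHeight: () => page.evaluate(() => document.body.scrollHeight)
//   scrollToBottom:  () => page.evaluate(() => window.scrollTo(0, document.body.scrollHeight))
interface ScrollablePage {
  getScrollHeight(): Promise<number>;
  scrollToBottom(): Promise<void>;
}

interface ScrollOptions {
  maxRounds: number; // hard cap so a truly endless feed still terminates
  settleMs: number;  // how long to wait for new items after each scroll
}

const sleep = (ms: number): Promise<void> =>
  new Promise((resolve) => setTimeout(resolve, ms));

// Scrolls until the page height stops growing or maxRounds is reached.
// Returns the number of scrolls that actually loaded new content.
async function scrollUntilStable(
  page: ScrollablePage,
  options: ScrollOptions = { maxRounds: 20, settleMs: 1000 },
): Promise<number> {
  let productiveScrolls = 0;
  let previousHeight = await page.getScrollHeight();

  for (let round = 0; round < options.maxRounds; round++) {
    await page.scrollToBottom();
    await sleep(options.settleMs);

    const currentHeight = await page.getScrollHeight();
    if (currentHeight <= previousHeight) {
      break; // nothing new was appended, so assume the end of the feed
    }
    previousHeight = currentHeight;
    productiveScrolls++;
  }
  return productiveScrolls;
}
```

Once the loop finishes, the fully rendered HTML can be read from the browser (in Puppeteer, `page.content()`) and returned to n8n. Comparing page heights is a simple heuristic: a slow site may need a larger `settleMs`, and some pages load content inside a scrollable container rather than the document body, in which case the wrappers would need to target that element instead.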
