n8n Community Node: Firecrawl

I’ve just created a new n8n node, originally for my own use case, that lets you call the Firecrawl API. I’m sharing it here in case it’s useful to someone else.

What is Firecrawl?

Firecrawl is a web scraping API that allows you to extract data from any website. It’s a great tool for automating data extraction from websites that don’t have an API, especially when you need the data for LLM systems, data analysis, or any other use case.

Operations

Scrape

Scrapes a website and returns the HTML content.

Crawl

Crawls a website and returns the HTML content of all the pages.

Map

Maps the links extracted from a website.
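
For readers who want to see how the three operations might translate into raw API calls, here is a minimal sketch. It is not the node's actual code. It assumes the Firecrawl v1 base URL `https://api.firecrawl.dev/v1` and the endpoint paths `/scrape`, `/crawl` and `/map`, and it sends only a `url` field; `build_request` is a hypothetical helper name. Check the Firecrawl API reference before relying on any of this.

```python
# Assumed Firecrawl v1 base URL; verify against the official API reference.
API_BASE = "https://api.firecrawl.dev/v1"

# Assumed mapping from the node's operations to endpoint paths.
ENDPOINTS = {
    "scrape": "/scrape",  # single page
    "crawl": "/crawl",    # all pages reachable from the start URL
    "map": "/map",        # list of links found on the site
}


def build_request(operation, url):
    """Return (endpoint_url, json_body) for one of the three operations.

    Hypothetical helper: it only builds the request, it does not send it.
    """
    if operation not in ENDPOINTS:
        raise ValueError(f"Unknown operation: {operation!r}")
    return API_BASE + ENDPOINTS[operation], {"url": url}
```

Each request would then be sent as a POST with an `Authorization: Bearer <API key>` header.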

Hope you find it useful! Any bug reports or feature requests are welcome.

Thanks!


Hi, it seems that the node does not work with the Firecrawl v1 API.

I confirm this. The v1 API does not work.

I’m attempting to gather all car listings from this page:
https://es.wallapop.com/search?source=search_box&category_id=100&keywords=coches

However, I’m encountering the issue of the “Load More” button, which makes it nearly impossible to retrieve all the listings in one go.

Has anyone managed to scrape this page or have any suggestions on how to tackle this challenge?


Dear Mario,

To scrape a website with dynamic client-side data (for instance, one where you must press a “Load more” button, or enter a city, a search radius, or other input before the content loads), you need to use Firecrawl’s actions capability or another scraping tool such as Playwright to mimic the data input and retrieve the data you want. Regards and good luck!
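
To make the actions idea concrete, here is a rough sketch of a scrape request that clicks a “Load more” button a few times before returning the page. It rests on several assumptions: that the v1 `/scrape` endpoint accepts an `actions` list with `wait` and `click` entries shaped as below, that the request body uses `url` and `formats`, and that you know the button's CSS selector (the `#load-more` selector in the usage note is made up). `build_load_more_payload` and `scrape` are hypothetical helper names, not part of any library.

```python
import json
import urllib.request

# Assumed Firecrawl v1 scrape endpoint; verify against the API reference.
FIRECRAWL_SCRAPE_URL = "https://api.firecrawl.dev/v1/scrape"


def build_load_more_payload(url, button_selector, clicks=3, wait_ms=2000):
    """Build a request body that clicks a button `clicks` times.

    A wait is placed before the first click and after every click so the
    page has time to render the newly loaded items.
    """
    actions = [{"type": "wait", "milliseconds": wait_ms}]
    for _ in range(clicks):
        actions.append({"type": "click", "selector": button_selector})
        actions.append({"type": "wait", "milliseconds": wait_ms})
    return {"url": url, "formats": ["html"], "actions": actions}


def scrape(payload, api_key):
    """POST the payload to Firecrawl and return the decoded JSON response."""
    request = urllib.request.Request(
        FIRECRAWL_SCRAPE_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read().decode("utf-8"))
```

You would call it roughly as `scrape(build_load_more_payload(page_url, "#load-more", clicks=5), api_key)`. A fixed number of clicks will not guarantee every listing is loaded, so for very long result lists a Playwright loop that clicks until the button disappears may be the better fit.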
