HTML Extract Node Enhancement: XPath Support

hellotimking · September 6, 2021, 2:08am

I love that n8n contains the HTML Extract node, incredibly useful indeed, but I’d love to see an implementation of XPath added to the node to make it easier to target specific on-page elements.

This is particularly useful to extract elements which are housed in deep XML hierarchies or on pages which reuse classes in odd ways.

Resources:
Mozilla MDN Web Docs - Xpath
NPMJS - XPath Module

artick · March 19, 2022, 9:29pm

Yes I also need it
I am making RSS feeds using html scraping and this would be lifechanging

sherppard · September 8, 2022, 9:59am

I also need it too!!!
I come from huginn, huginn has the website agent ,and have the XPath featrue.

rafuru · November 7, 2022, 8:02pm

@jan
In 2019 had been case for adding xpath functionality to html extract node. Then it didn’t happen.
Is there any chance to get this feature as an enhancement?

I found some packages in npm database that work on html:

mmac · May 29, 2023, 4:02am

Another vote for this. CSS selectors aren’t enough for some dynamic page types, this would be a big help.

Ruan17 · January 10, 2024, 11:15pm

One more vote, it will be of great help

MindTrick_Radio · February 2, 2024, 11:32am

Another vote for this, really need that to work with dynamic pages