Node postlight-parser

Kool_Baudrillard · November 2, 2022, 4:59pm

It would help if there was a node for:

Parsing scraped web-content into a more readable format.

My use case:

My use case is scraping jobs, while path, id, classes etc are not reliable enough it would be nice to grab the whole page, parse it and use further down in the workflow.

Any resources to support this?

Are you willing to work on this?

I would like, but not experienced enough with javascript.

hismayilov · July 23, 2023, 11:29am

I tried to do by adding external library.

      - NODE_FUNCTION_ALLOW_EXTERNAL=* #moment,sugar-date,sugar-core,@postlight/parser

example code block

it seems ‘@babel/runtime-corejs2/regenerator’ is for parser, and babel is also located under the folder.