HTML Extract keep empty or missing

RedPacketSec · June 7, 2022, 1:51pm

When i am doing my scraping and extracting HTML, sometimes there is a “missing” or non existent bit of CSS

What I need for consistency is for that to continue and add NULL or N/A or something, otherwise all the results get out of order. Whats the best way to do this?

I’ve manually copied and pasted to work out where the issue is

but need to find a way to automate keeping the blanks if they are not found when scraping HTML

any ideas?

MutedJam · June 8, 2022, 7:46am

Hey @RedPacketSec, so you mean each of your arrays returns a different number results (meaning the 10th item in your victim_name field might not correspond to the 10th item in your victim_website field)?

If so, you might want to check if you can first extract each individual dataset, for example by selecting all div.cards. So you have one array element for each dataset.

Then split up that array into individual items (using the Item Lists node) and only then extract the individual fields. These individual fields would then no longer be arrays with differing lengths, they’d just have a value or not.

RedPacketSec · June 8, 2022, 8:20am

interesting… i’ll have a play doing it that way.

RedPacketSec · June 9, 2022, 12:52pm

yes this worked very well and now something I will adopt across some of my other flows! so simple when you think about it, and annoying I didn’t think about it lol. cheers

MutedJam · June 9, 2022, 12:53pm

Awesome, glad to hear this works for you. Many thanks for confirming!

Topic		Replies	Views
HTML Extractor Default value Questions data-transformation	3	69	October 1, 2024
HTML Extraction / cheerio.extract Questions	1	5069	November 15, 2023
Extracting elements from HTML dom with optional tags Questions data-transformation	3	392	June 15, 2025
How to remove the first empty key and value from the HTML Extract return Questions data-transformation , node , expressions , if	9	1387	May 3, 2022
Html extract Nodes	8	1870	November 25, 2020

HTML Extract keep empty or missing

Related topics