Thanks Mohamed, and thanks for the warm welcome! I am not trying to be difficult, but is there a way to extract the emails in the HTTP node already? I am doing this up to 40k times across companies and individuals, so if I go with your proposed solution, the process will take exponentially longer, as it's extracting the entire HTML via the HTTP node and then doing a match across the entire HTML. (Again, sorry, not trying to be ungrateful, but hoping for a more efficient solution, if one exists.)
I actually inspected the website’s network traffic before replying to see if there was a hidden API or a background XHR request loading the emails, but unfortunately I didn’t find any.
It appears that the emails are hardcoded directly into the HTML source, so you’ll need to make an HTTP request to get the HTML and then scrape the data you need.
Regarding performance: I don’t think extracting text from the HTML will make it noticeably slower.
The extraction step takes only about 1 ms per item.
So with 40k calls, that adds up to roughly 40 seconds in total (40,000 × 1 ms).
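If you want to sanity-check that number yourself, here is a rough sketch in Python (you could adapt it for a Code node or just run it locally). It is not what I ran in the workflow: the regex is a deliberately loose pattern, `extract_emails` and `average_ms` are names I made up for illustration, and the sample page is synthetic, so your timing will differ.

```python
import re
import time

# Loose email pattern (an assumption for illustration, not an RFC 5322 validator).
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")


def extract_emails(html: str) -> list[str]:
    """Return unique, lowercased email addresses in order of first appearance."""
    # dict.fromkeys keeps insertion order while dropping duplicates.
    return list(dict.fromkeys(m.lower() for m in EMAIL_RE.findall(html)))


def average_ms(html: str, runs: int = 50) -> float:
    """Average wall-clock time of one extraction over `runs` repetitions, in ms."""
    start = time.perf_counter()
    for _ in range(runs):
        extract_emails(html)
    return (time.perf_counter() - start) * 1000 / runs


if __name__ == "__main__":
    # Synthetic page of roughly 100 KB with two addresses in it (made-up data).
    filler = "<div>lorem ipsum</div>" * 5000
    page = (
        filler
        + '<a href="mailto:info@example.com">info@example.com</a>'
        + "<p>sales@example.org</p>"
    )

    per_page = average_ms(page)
    print(extract_emails(page))
    print(f"about {per_page:.2f} ms per page")
    # Extrapolate to 40k pages; this covers extraction only, not the HTTP requests.
    print(f"about {per_page * 40_000 / 1000:.0f} s for 40k pages")
```

Whatever the exact figure on your machine, the extraction cost grows linearly with the number of pages (per-page time × 40,000), not exponentially.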
Heya! Thanks for looking under the hood at the network traffic beforehand, and for the great explanation! I will go with the solution you kindly provided. Thanks a lot again!