Gmail get many messages node cant read HTML as text

The text attribute from my Get many messages node sometimes just says: “Your E-Mail program doesnt support HTML E-Mails, you can find the newsletter online here: …”
With other newsletter, the text gets “extracted”/shown correctly.
Why is this the case and what could be a possible fix?
Of course, the html code is shown correctly in the html attribute, but I need to feed the “plain” text to a LLM and not the entire html code.
Is there a workaround for this “bug”?

Hi @Lucas_Flores

Please the check possibility to scrape HTML with HTML node (Extract HTML content) for this kind of email contents.

Several ways:

  1. Use AI to generate a code and use it in Code node that can remove all html tag to get the text.
  2. Use a smaller AI model to get rid of html tag first. Then send the plain text to your main LLM.
1 Like