I am not a programmer and not an native english speaker but i will do my best to explain:
(IN SHORT) I need to remove html tags from a text - : <a href <p …and so one
LONG DESCRIPTION) I retrieve the feed from an Rss agregator. I extract the content with n8n and i translate it with deepl in order to post the content to a blog.
The result of the feed extraction contain only HTML code with all the formating, achors so on. Also the output / translated content of the Deepl API is also full of HTML tags .
I need to have only the formatted text or at least ( but not so good) the plain text.
I tried almost every solution in n8n.
I am only a casual coder in C# but i know how to parse the html text and to get ride of the unvanted html tags.
Java script i do not know , as i saw that is used in the n8n. But maibe i could implement a JS code.
And directions or sugestions please
I run desktop app of N8n
no errors, just output text is not clean, but full of original anchors ant nasty tags
I tried also HTML EXTRACT node, but maybe i was something wrong -there i have different errors depending on the internal confirgurations .
one is ERROR: No property named " some text here " exists!