The output html file contains the content twice somehow.
After the clean, parsable HTML content, the input “string” appears once more on a single line (see screenshot).
Information on your n8n setup
n8n version: 0.214.2
Database you’re using (default: SQLite): postgres
Running n8n with the execution process [own(default), main]: default
Running n8n via [Docker, npm, n8n.cloud, desktop app]: Docker
Hi @sgw, I am sorry for the trouble. I wouldn’t be super surprised if writing HTML into an HTML spreadsheet has unforeseen side-effects, though I was unable to reproduce the problem. Perhaps you can share an example workflow allowing me to see this first hand?
Unfortunately, I don’t have a great solution for this. The library powering the Spreadsheet node just seems to struggle with HTML content in an HTML table.
I also can’t think of a manual way of wrapping your HTML into an HTML table that would preserve the full original HTML structure. Your data includes an html tag, for example, which defines the root of an HTML document. Nesting this inside another HTML document would make that document invalid.
Perhaps you want to convert your existing HTML data into a different format before creating your spreadsheet?
The HTML should be piped into a converter to PDF. My workflow currently uses Gotenberg in a Docker container for that conversion, and its API expects a file named index.html.
So my approach was to write my HTML object(?) to a file, name it accordingly, and then send it to the API via an HTTP request.
If there is another way to give the file that name, that is fine too.
Otherwise I would have to look for another PDF converter that I can run on-premises in my own environment.
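For the Gotenberg route described above, here is a minimal Python sketch of the idea outside n8n. It shows that the name index.html only has to appear as the filename of the uploaded multipart part, so the HTML string never needs to pass through a spreadsheet or a file on disk. The URL, the /forms/chromium/convert/html route and the form field name files are assumptions based on Gotenberg 7 and later; older versions use a different route, so check the documentation for the version you run. The helper names build_multipart and html_to_pdf are made up for this example.

```python
import urllib.request
import uuid
from typing import Tuple

# Assumption: Gotenberg 7+ reachable locally; adjust host, port and route to your setup.
GOTENBERG_URL = "http://localhost:3000/forms/chromium/convert/html"


def build_multipart(html: str, filename: str = "index.html") -> Tuple[bytes, str]:
    """Return (body, content_type) for a single-file multipart/form-data upload."""
    boundary = uuid.uuid4().hex
    head = (
        f"--{boundary}\r\n"
        # Assumption: "files" is the form field name Gotenberg reads uploads from.
        f'Content-Disposition: form-data; name="files"; filename="{filename}"\r\n'
        "Content-Type: text/html\r\n\r\n"
    )
    tail = f"\r\n--{boundary}--\r\n"
    body = head.encode("utf-8") + html.encode("utf-8") + tail.encode("utf-8")
    return body, f"multipart/form-data; boundary={boundary}"


def html_to_pdf(html: str, url: str = GOTENBERG_URL) -> bytes:
    """POST the HTML string as index.html and return the response body (the PDF)."""
    body, content_type = build_multipart(html)
    request = urllib.request.Request(
        url, data=body, headers={"Content-Type": content_type}, method="POST"
    )
    with urllib.request.urlopen(request) as response:
        return response.read()
```

Calling html_to_pdf("<html><body>Hello</body></html>") against a running Gotenberg container should return the PDF bytes, which can then be written to disk or passed on. The same principle applies inside a workflow: what matters is the file name attached to the uploaded data, not how the data was produced.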