Sorry, I don’t speak English.
While parsing HTML, I found a bug in the HTML (Extract HTML Content) node — it unexpectedly adds line breaks (\n) where there were none.
For example, the original HTML was:
<p>NOVOS is at the forefront of longevity science, offering cutting-edge solutions designed to help people live healthier, longer lives. Rooted in research and developed by scientists, NOVOS provides innovative supplements and lifestyle insights aimed at optimizing aging at the cellular level. With a commitment to science-backed wellness, NOVOS empowers individuals to take control of their health and longevity.</p>
But the output becomes:
NOVOS is at the forefront of longevity science, offering cutting-edge solutions\ndesigned to help people live healthier, longer lives. Rooted in research and\ndeveloped by scientists, NOVOS provides innovative supplements and lifestyle\ninsights aimed at optimizing aging at the cellular level. With a commitment to\nscience-backed wellness, NOVOS empowers individuals to take control of their\nhealth and longevity.
This issue was tested and reproduced on versions 1.113.3 and 1.116.2.
Could someone please check and confirm this bug? Thank you!
PS. I couldn’t attach the test workflow because the forum reports that the post exceeds the character limit.
The URL being parsed: https://wordpress.org/showcase/novos/
Css selector: .entry-content.wp-block-post-content.is-layout-constrained.wp-block-post-content-is-layout-constrained p

