Scrape Reddit's Voice of Customer Data and turn them into AEO Blogs with n8n

Turning Voice of Customer Questions into SEO Blogs with n8n + AI

Most people treat Reddit as a content pitstop. We turned it into an inbound engine.

In this post, I’ll show you how we built a workflow that scrapes Reddit questions, filters relevant ones, and turns them into fully formatted blog posts using OpenAI—all on autopilot using n8n.

No more blank pages. Just real user questions → clean content → instant publishing.

Let’s break it down.


:brain: The Problem

Our team wanted a better way to create blog content that actually answers what users are asking online.

Manual scraping, sorting through forums, and brainstorming headlines? Not scalable.

We needed a system that could:

  • Find high-intent questions from Reddit
  • Turn them into clear, SEO-friendly blog posts
  • Do it all without human bottlenecks

:hammer_and_wrench: The Workflow

Here’s the full breakdown of the workflow, built in n8n:

1. Collect Questions from Reddit

We start with a Reddit node that fetches all posts from the r/n8n subreddit.

2. Filter for Real Questions

A Code node filters titles to keep only those that:

  • End in a ?, or
  • Start with common question words like “how,” “why,” or “can”

3. Store in Google Sheets

We save the filtered questions and their descriptions into a Google Sheet. This acts as a central database and lets us track what’s been used.

4. Loop Through Each Question

Using SplitInBatches, we run the content generation process one question at a time.

5. Rephrase the Question

We use the LangChain AI Agent node to rewrite the question slightly—making it more readable while keeping the intent.

6. Generate Blog Content (The Factory)

This is where the real magic happens. For each rephrased question, we:

  • :label: Create an SEO slug
  • :writing_hand: Generate an intro
  • :receipt: Write a detailed step-by-step guide
  • :brain: Conclude the article

Each section uses its own AI node (powered by GPT-4o-mini) and includes memory handling via LangChain’s SimpleMemory to preserve context.

7. Assemble and Publish

The final blog content (intro, steps, conclusion, slug, and title) gets written back to a second Google Sheet, ready for publishing on our CMS.


:exploding_head: Why This Works

  • Reddit is full of user questions: We’re tapping into real demand instead of guessing what to write about.
  • AI handles the heavy lifting: Every article is generated using structured prompts and fine-tuned memory handling.
  • It’s always on: Once set up, the system runs daily and keeps generating high-quality, SEO-friendly content on autopilot.

:light_bulb: Bonus Tips

  • Want to target a different audience? Just change the subreddit.
  • Want different blog formats? Adjust the AI prompts inside each article section.
  • Add a webhook to auto-publish to CMS (e.g. Dorik, Webflow, WordPress).

This workflow turns Reddit noise into a blog goldmine.
And it all runs on a single, visual automation canvas—thanks to n8n.

If you’re stuck creating content manually or looking for ways to scale organic traffic with AI, give this setup a try.

2 Likes