Convert PDF to HTML

Hi Team,

I am in the process of creating a RAG workflow by transferring PDF files from Google Drive to the Supabase database. The PDF files need to be converted to HTML and then converted to Markdown to save the titles, subtitles, and tables.
Please, I need your help to achieve this.

Describe the problem/error/question

What is the error message (if any)?

Please share your workflow

(Select the nodes on your canvas and use the keyboard shortcuts CMD+C/CTRL+C and CMD+V/CTRL+V to copy and paste the workflow.)

Share the output returned by the last node

Information on your n8n setup

  • n8n version:
  • Database (default: SQLite):
  • n8n EXECUTIONS_PROCESS setting (default: own, main):
  • Running n8n via (Docker, npm, n8n cloud, desktop app):
  • Operating system:

Hey @fedora hope all is good.

Could you provide an example of the PDF you with to convert to markdown?

Than you for your response Jabbson.
I am building a workflow template.
I want the workflow to work with any PDF file.
For scanned PDF files, I plan to make another workflow.

This topic was automatically closed 90 days after the last reply. New replies are no longer allowed.