Hello everyone,
I’m running into an issue with the Google Gemini node (Transcribe a recording) when trying to transcribe audio messages from Instagram, and I’m hoping to get some advice from the community.
My Workflow Setup
My workflow is triggered by the WhatsApp Cloud webhook and handles messages from both WhatsApp and Instagram. For audio messages, the flow is:
- Webhook receives the message with a URL to the audio file.
- An
HTTP Requestnode downloads the file from the Meta URL (lookaside.fbsbx.com/...). This step works correctly, and I get the binary data. - The binary data is passed to a
Google Gemininode with the “Audio” resource selected to transcribe it.
The Problem
The process works perfectly for WhatsApp messages.
- The
HTTP Requestnode downloads a file withMime Type: audio/ogg. - The Gemini node receives this
.oggfile and transcribes it without any issues.
However, the process fails for Instagram audio messages.
- The
HTTP Requestnode successfully downloads the file, but I’ve noticed the file format is different:- File Name:
audioclip-....mp4 - Mime Type:
video/mp4(even though it’s just an audio message)
- File Name:
- When this
.mp4binary data is passed to the Gemini node, the node fails with the following error:
{
"error": {
"code": 500,
"message": "Failed to convert server response to JSON",
"status": "INTERNAL"
}
}
My Hypothesis
My guess is that the Gemini node’s audio transcription endpoint cannot process a video/mp4 container, even if it only contains an audio track. It expects a pure audio format like the audio/ogg it receives from WhatsApp. The error message seems generic, but the root cause appears to be the file format incompatibility.
My Question for the Community
- Has anyone else encountered this issue with Instagram audio messages?
- Is there a recommended best practice for handling Instagram audio transcription in n8n?
Information on your n8n setup
- n8n version: 1.104.0
- Database (default: SQLite): Postgres
- n8n EXECUTIONS_PROCESS setting (default: own, main): default
- Running n8n via (Docker, npm, n8n cloud, desktop app): self-hosted
- Operating system: Ubuntu 24.10 VPS