(I generated this with the N8N AI Assistant that helped me with my solution attempts.)
I am using the “@n8n/n8n-nodes-langchain.openAi” node (version 1.8) (aka OpenAI - Transcribe a Recording) in n8n version 1.88.0 to transcribe an audio file. The node is configured to use the “transcribe” operation with the binaryPropertyName set to “data”. The audio file I am working with has the extension .oga and the MIME type audio/ogg;codecs=opus.
The previous node in my workflow provides the following data for the file:
I have confirmed that the file is accessible and playable in n8n as a binary file, and the MIME type appears to be correct. The OpenAI credentials are set up and working.
Hi @mohamed3nan, thanks for your answer. I thought that “data” is the name of the file. I believe I can change that in the previous get request as well.
@Grot also got back to me via DM saying that the whisper API doesn’t support .oga or .ogg which I was able to confirm. But I saw similar workflows using telegrams voice messages, which have the same format without converting to other filetypes.
I am now looking for a way to convert the .oga to .mpa in the cloud version.
I did that know with exactly the voice message in question. Unfortunately for me it works without issue. The filetype .ogg works even if the documentations says something else, but the file downloaded from drive has .ogg as file extension and audio/ogg instead of audio/ogg;codecs=opus but also a file name.
It seems there is no way to receive the file differently than with the wrong mime type. It seems Gdrive optimizes the file and takes the opus encoding out. Is there any way to do that in n8n? I have been looking at cloudconvert but it is not optimal. Are there any alternatives maybe even something else if it is on the hosted version instead?
I’m having the same issue, I can’t work out how to fix from your description, do you think you could describe or put a screenshot of the fix please? Where did you put this: "$input.item.binary.data.fileName = ‘myNewFilename.ogg’