Good morning,
We are looking for an experienced n8n developer to design and implement a complete automation workflow for a video dubbing platform.
This is a freelance, results-oriented project. The objective is to deliver a fully functional, documented, and production-ready workflow.
The project involves building a modular and scalable dubbing system inside n8n that manages every stage of the video dubbing process, from audio extraction, speech recognition, and translation to voice synthesis, audio mixing, and final video generation.
The workflow should integrate both local tools and cloud APIs (Azure, OpenAI, Google Cloud), and support batch processing for large-scale operations.
Workflow Scope:
- Audio Extraction and Preparation
  - Separate voice, background, and sound effects using FFmpeg or equivalent tools.
  - Handle mono, stereo, and multi-channel formats.
  - Prepare clean tracks for speech recognition.
- Automatic Speech Recognition (ASR)
  - Local models: Whisper, Faster-Whisper, SpeechBrain.
  - Cloud APIs: Azure Speech-to-Text, Google Speech API, OpenAI Whisper API.
- Translation
  - Local tools: Argos Translate, M2M100, or similar.
  - Cloud APIs: Azure Translator, Google Translate, OpenAI GPT Translation.
  - Maintain time alignment between source and translated text.
- Text-to-Speech (TTS)
  - Local voices: Bark, Coqui TTS, XTTS, or similar.
  - Cloud TTS: Azure Neural Voices, Google Cloud TTS, OpenAI Voice.
  - Control voice parameters such as tone, pace, and expression.
- Audio Mixing and Video Assembly
  - Combine generated voice with background audio or effects.
  - Adjust synchronization, timing, and volume balance.
  - Export final video using FFmpeg or similar frameworks.
- Batch Processing
  - Implement automation for high-volume dubbing.
  - Include queue management, concurrency control, and progress tracking.
- Workflow Configuration and Control
  - Support both local and cloud execution modes.
  - Use environment variables for API credentials and model paths.
  - Include detailed logging, error handling, and retry mechanisms.
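To illustrate what we mean by execution modes, environment-based configuration, retries, concurrency control, and progress tracking, here is a minimal Python sketch. It is only an illustration under stated assumptions, not a prescribed design: the variable names `DUBBING_MODE`, `DUBBING_MAX_WORKERS`, and `DUBBING_MAX_RETRIES`, and the helpers `load_config`, `with_retry`, and `run_batch`, are hypothetical, and `process` stands in for whatever step (ASR, translation, TTS, mixing) the workflow calls.

```python
import os
import time
from concurrent.futures import ThreadPoolExecutor, as_completed


def load_config(env=None):
    """Read execution settings from environment variables (names are hypothetical)."""
    env = os.environ if env is None else env
    mode = env.get("DUBBING_MODE", "local")
    if mode not in ("local", "cloud"):
        raise ValueError(f"unsupported DUBBING_MODE: {mode!r}")
    return {
        "mode": mode,
        "max_workers": int(env.get("DUBBING_MAX_WORKERS", "2")),
        "max_retries": int(env.get("DUBBING_MAX_RETRIES", "3")),
    }


def with_retry(task, max_retries, base_delay=1.0, sleep=time.sleep):
    """Call task(); on failure wait base_delay * 2**attempt, then try again."""
    for attempt in range(max_retries + 1):
        try:
            return task()
        except Exception:
            if attempt == max_retries:
                raise  # retries exhausted: surface the last error
            sleep(base_delay * 2 ** attempt)


def run_batch(jobs, process, config, sleep=time.sleep):
    """Run process(job, mode) for each job with bounded concurrency.

    Returns {job: ("ok", result)} or {job: ("failed", error message)}.
    Jobs must be hashable, for example file paths.
    """
    results = {}
    with ThreadPoolExecutor(max_workers=config["max_workers"]) as pool:
        futures = {
            pool.submit(
                with_retry,
                lambda j=job: process(j, config["mode"]),
                config["max_retries"],
                1.0,
                sleep,
            ): job
            for job in jobs
        }
        # Report progress as each job finishes, successful or not.
        for done, future in enumerate(as_completed(futures), start=1):
            job = futures[future]
            try:
                results[job] = ("ok", future.result())
            except Exception as exc:
                results[job] = ("failed", str(exc))
            print(f"progress: {done}/{len(futures)} ({job}: {results[job][0]})")
    return results
```

A call such as `run_batch(["a.mp4", "b.mp4"], process, load_config())` would process both files with the worker and retry limits taken from the environment. Inside n8n the same ideas would more likely map to credentials, sub-workflows, and queue mode, so treat this as a description of expected behavior rather than of the final implementation.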
Requirements:
- Proven experience developing advanced workflows with n8n.
- Strong knowledge of REST APIs, OAuth2, and JSON data handling.
- Practical experience with FFmpeg, SoX, or other multimedia automation tools.
- Familiarity with AI-based ASR, translation, and TTS systems (both local and cloud).
- Experience with batch and asynchronous task orchestration.
- Ability to deliver a robust, production-ready workflow with clear documentation.
Deliverables:
- Fully functional n8n workflow implementing the complete dubbing process.
- Hybrid setup supporting both local and cloud services.
- Batch processing capability for large-scale dubbing.
- Documentation covering architecture, configuration, and dependencies.
- Final validation and workflow testing.
Other Details:
This is a remote project with flexible timing. What matters most is technical precision and a working final delivery.
If you have solid experience in multimedia automation and n8n workflow development, please reply with a short summary of your background and examples of relevant work.