OpenSource-Hub

OpenMontage

命令行工具

calesthio/OpenMontage

全球首个开源 Agent 视频制作系统。

项目简介

OpenMontage 可将 AI 编程助手变为完整视频工作室,提供 12 条流水线和 52 个工具,支持研究、脚本、素材生成和编辑,兼容免费与付费工作流。

README 预览

\n  \n\n\nOpenMontage\n\nThe first open-source, agentic video production system.\n\n\n  Paste A Video  · \n  Quick Start  · \n  Try These Prompts  · \n  Pipelines  · \n  How It Works  · \n  Providers  · \n  Agent Guide\n\n\n\n  \n\n\nFollow The Build\n\n\n  \n  \n  \n\n\n---\n\nTurn your AI coding assistant into a full video production studio. Describe what you want in plain language — your agent handles research, scripting, asset generation, editing, and final composition.\n\n**Important distinction:** OpenMontage can make image-based videos, but it can also make a real **video video** for free/open-source workflows: the agent builds a corpus from free stock footage and open archives, retrieves actual motion clips, edits them into a timeline, and renders a finished piece. That is not the usual "animate a handful of stills and call it video" trick.\n\n\n  \n\n\n> **"SIGNAL FROM TOMORROW"** — a cinematic sci-fi trailer fully produced through OpenMontage: concept, script, scene plan, Veo-generated motion clips, soundtrack, and Remotion composition.\n\n\n  \n\n\n> **"THE LAST BANANA"** — a 60-second Pixar-style animated short about a lonely banana who finds friendship with a kiwi. 6 Kling v3-generated motion clips (via fal.ai), Google Chirp3-HD narration, royalty-free piano music, TikTok-style word-level captions, and Remotion composition. Total cost: **$1.33**.\n\n\n  \n\n\n> **"VOID — Neural Interface"** — a product ad produced with just one API key (OpenAI). 4 AI-generated images (gpt-image-1), TTS narration, auto-sourced royalty-free music, word-level subtitles via WhisperX, and Remotion data visualizations. Total cost: **$0.69**. Zero manual asset work.\n\n\n  \n\n\n> **"Afternoon in Candyland"** — a Ghibli-style anime animation. A little girl's whimsical afternoon adventure through candy gates, gumdrop rivers, and lollipop gardens. 12 FLUX-generated images with multi-image crossfade, cinematic camera mot