Sana

Q: 如何将自定义训练的基础扩散模型转换为diffusers格式，以便用于Dreambooth和LoRA微调？

使用提供的脚本：`python tools/convert_sana_to_diffusers.py`。该脚本将非diffusers权重转换为预期的diffusers状态字典格式，使模型与Dreambooth和LoRA训练脚本兼容。

框架

NVlabs/Sana

高效高分辨率图像与视频合成，基于线性扩散Transformer。

访问仓库项目主页

项目简介

SANA 是一个面向高效高分辨率图像与视频生成的开源代码库，提供完整的训练与推理流程。基于线性扩散 Transformer，支持文生图、文生视频以及世界模型等任务。

README 预览

\n  \n\n\n\n📚 Docs | SANA | SANA-1.5 | SANA-Sprint | SANA-Video | SANA-WM | Sol-RL\n  \nDemo | 🤗 HuggingFace | ComfyUI | SGLang | Cosmos-RL\n\n\n\n\n   &ensp;\n   &ensp;\n   &ensp;\n   &ensp;\n   &ensp;\n\n\n\n   &ensp;\n   &ensp;\n\n\nICLR 2025 Oral | ICML 2025 | ICCV 2025 Highlight | ICLR 2026 Oral \n\n**SANA** is an efficiency-oriented codebase for high-resolution image and video generation, providing complete training and inference pipelines. This repository contains code for [SANA](https://nvlabs.github.io/Sana/), [SANA-1.5](https://nvlabs.github.io/Sana/Sana-1.5/), [SANA-Sprint](https://nvlabs.github.io/Sana/Sprint/), [SANA-Video](https://nvlabs.github.io/Sana/Video/), [SANA-WM](https://nvlabs.github.io/Sana/WM/), and [Sol-RL](https://nvlabs.github.io/Sana/Sol-RL/). More details can be found in our [📚 documentation](https://nvlabs.github.io/Sana/docs/).\n\nJoin our [Discord](https://discord.gg/rde6eaE5Ta) to engage in discussions with the community! If you have any questions, run into issues, or are interested in contributing, don't hesitate to reach out!\n\n\n  \n\n\n## News\n\n- 🔥 [2026/05] 🌍 **SANA-WM: 2.6B Controllable World Model** is released! Supports 720p, 1-min video generation with 6-DoF camera control. A new baseline for World Modeling and Embodied AI. See [Project](https://nvlabs.github.io/Sana/WM/) | [Paper](https://huggingface.co/papers/2605.15178).\n- 🔥 [2026/04] ⚡ **Sol-RL: NVFP4 Rollout, BF16 Training RL** is available! All training recipes for **SANA**, **FLUX.1**, and **SD3.5-L**, together with bundled post-training datasets, are released. See [Sol-RL doc](https://nvlabs.github.io/Sana/docs/sol_rl/) | [Page](https://nvlabs.github.io/Sana/Sol-RL/) | [Paper](https://arxiv.org/abs/2604.06916).\n- 🔥 [2026/03] 📺 **SANA-Video 720p model with LTX-VAE** is released. Use it with LTX2 Refiner to upscale the videos to 2K resolution! See [Model Zoo](https://nvlabs.github.io/Sana/docs/model_zoo/#sana-video), [SANA-Video doc](https://nvlabs.github

常见问题 (2)

故障排除

如何将自定义训练的基础扩散模型转换为diffusers格式，以便用于Dreambooth和LoRA微调？

使用提供的脚本：python tools/convert_sana_to_diffusers.py。该脚本将非diffusers权重转换为预期的diffusers状态字典格式，使模型与Dreambooth和LoRA训练脚本兼容。

来源 Issue #278

故障排除

为什么LongSanaVideoPipeline从diffusers中缺失，以及如何运行LongSana？

LongSanaVideoPipeline在diffusers库中尚未实现（即使在最新的开发版本中）。Hugging Face模型页面显示其状态为“即将推出”。作为替代方案，请克隆官方的Sana仓库：git clone https://github.com/nvlabs/Sana.git，并按照其推理指南操作。如果您需要视频生成，也可以尝试已有的SANA-Video pipeline。

来源 Issue #354

Sana

项目简介

README 预览

常见问题 (2)

同类型项目

superpowers

everything-claude-code

flutter

langflow