# EchoVideo

**Repository Path**: ByteDance/EchoVideo

## Basic Information

- **Project Name**: EchoVideo
- **License**: Apache-2.0
- **Default Branch**: main
- **Created**: 2025-02-28
- **Last Updated**: 2026-03-12

## README

# EchoVideo: Identity-Preserving Human Video Generation by Multimodal Feature Fusion

This repo contains PyTorch model definitions, pre-trained weights, and inference code for our video generation model, EchoVideo.

> [**EchoVideo: Identity-Preserving Human Video Generation by Multimodal Feature Fusion**](https://arxiv.org/abs/2501.13452)

# News

**[2025.02.27]** We released the inference code and model weights of EchoVideo. [Download](ckpts/README.md)

# Introduction

EchoVideo generates a personalized video from a single photo and a text description. It addresses the "semantic conflict" and "copy-paste" problems and demonstrates state-of-the-art performance.

# Gallery

**We strongly recommend visiting** [this link](https://bytedance.github.io/EchoVideo/) **for more results.**

## 1. Text-to-Video Generation

| Face-ID Preserving | Full-Body Preserving |
| ---- | ---- |
|