Wan2.1-T2V-1.3B

Maintained By
Wan-AI

Wan2.1-T2V-1.3B

PropertyValue
Parameter Count1.3 Billion
Model TypeText-to-Video Generation
ArchitectureDiffusion Transformer with T5 Encoder
LicenseApache 2.0
VRAM Required8.19GB

What is Wan2.1-T2V-1.3B?

Wan2.1-T2V-1.3B is a revolutionary text-to-video generation model that makes high-quality video generation accessible to users with consumer-grade GPUs. As part of the Wan2.1 suite, this 1.3B parameter model can generate 480P videos efficiently while maintaining impressive quality comparable to some closed-source solutions.

Implementation Details

The model utilizes a sophisticated architecture combining a T5 Encoder for multilingual text processing with a Diffusion Transformer featuring 1536 dimensions, 12 heads, and 30 layers. It employs a novel 3D causal VAE (Wan-VAE) for efficient video processing, with the ability to handle both Chinese and English text generation.

  • Model Dimension: 1536
  • Number of Heads: 12
  • Number of Layers: 30
  • Feedforward Dimension: 8960

Core Capabilities

  • 480P video generation from text descriptions
  • Efficient operation on consumer GPUs (RTX 4090 generates 5-second videos in ~4 minutes)
  • Multilingual text generation support
  • High-quality video synthesis with temporal consistency
  • Prompt extension capabilities for enhanced detail

Frequently Asked Questions

Q: What makes this model unique?

This model stands out for its ability to generate high-quality videos on consumer-grade GPUs with minimal VRAM requirements (8.19GB), making it accessible to a wider audience while maintaining competitive performance.

Q: What are the recommended use cases?

The model is ideal for creative teams needing video generation capabilities, academic researchers with limited computing resources, and developers looking to integrate video generation into their applications. It's particularly effective for generating 480P videos from text descriptions.

🍰 Interesting in building your own agents?
PromptLayer provides Huggingface integration tools to manage and monitor prompts with your whole team. Get started here.