Wan2.1-Fun-1.3B-Control

Maintained By
alibaba-pai

Wan2.1-Fun-1.3B-Control

PropertyValue
Model Size1.3B parameters
Storage Space19.0 GB
LicenseApache License 2.0
Authoralibaba-pai

What is Wan2.1-Fun-1.3B-Control?

Wan2.1-Fun-1.3B-Control is a specialized video control model designed for advanced video manipulation and generation. It represents a significant advancement in controlled video synthesis, offering support for multiple control conditions and resolutions.

Implementation Details

The model is trained on 81 frames at 16 frames per second, supporting multiple resolutions including 512x512, 768x768, and 1024x1024. It incorporates various control mechanisms and supports multilingual prediction capabilities.

  • Multi-resolution support (512, 768, 1024)
  • Training configuration: 81 frames at 16fps
  • Multiple control condition support
  • Memory-efficient options including model_cpu_offload and qfloat8 quantization

Core Capabilities

  • Control condition support (Canny, Depth, Pose, MLSD)
  • Trajectory control functionality
  • Multi-resolution video prediction
  • Multilingual prediction support
  • Memory optimization options for different GPU configurations

Frequently Asked Questions

Q: What makes this model unique?

The model's ability to handle multiple control conditions while supporting various resolutions and trajectory control makes it particularly versatile for video generation tasks. Its memory optimization options also make it accessible for different hardware configurations.

Q: What are the recommended use cases?

The model is ideal for controlled video generation tasks requiring specific visual elements like edge detection (Canny), depth information, pose estimation, or line detection (MLSD). It's particularly useful for applications requiring precise control over video generation parameters.

🍰 Interesting in building your own agents?
PromptLayer provides Huggingface integration tools to manage and monitor prompts with your whole team. Get started here.