SkyReels-A2

Maintained By
Skywork

SkyReels-A2

PropertyValue
AuthorSkywork
Model URLhttps://huggingface.co/Skywork/SkyReels-A2
Video ResolutionUp to 720x1080 (Infinity version)
Publication Year2025

What is SkyReels-A2?

SkyReels-A2 is an advanced video diffusion transformer framework designed to compose and generate high-quality video content. It introduces a novel dual-branch encoding system that processes both spatial and semantic features from reference images to create coherent and contextually relevant video outputs.

Implementation Details

The model employs two distinct processing branches: a spatial feature branch utilizing a fine-grained VAE encoder, and a semantic feature branch leveraging CLIP vision encoding with MLP projection. These features are integrated through diffusion transformer blocks, with semantic features incorporated via cross-attention layers.

  • Supports multiple model variants including A2-Wan2.1-14B-Preview and upcoming Infinity version
  • Processes videos at resolutions of 81x480x832 and higher
  • Implements advanced diffusion transformer architecture
  • Features dual-branch encoding system for comprehensive feature extraction

Core Capabilities

  • High-resolution video generation
  • Semantic-aware content composition
  • Flexible video length handling (including infinite length in Infinity version)
  • Reference image-based composition control

Frequently Asked Questions

Q: What makes this model unique?

SkyReels-A2's distinctive feature is its dual-branch encoding system that processes both spatial and semantic features independently, allowing for more nuanced and controlled video generation.

Q: What are the recommended use cases?

The model is particularly suited for video composition tasks requiring high-quality output, especially when specific reference images need to guide the generation process. It's ideal for creative content generation and video synthesis applications.

🍰 Interesting in building your own agents?
PromptLayer provides Huggingface integration tools to manage and monitor prompts with your whole team. Get started here.