dc-ae-f128c512-mix-1.0

Maintained By
mit-han-lab

Deep Compression Autoencoder (DC-AE)

PropertyValue
Parameter Count1.12B
Tensor TypeF32
PaperarXiv:2410.10733
AuthorMIT-HAN-Lab

What is dc-ae-f128c512-mix-1.0?

DC-AE is an innovative autoencoder model designed specifically for accelerating high-resolution diffusion models. It achieves an impressive 128x spatial compression while maintaining high reconstruction quality, a significant improvement over traditional approaches that typically max out at 8x compression.

Implementation Details

The model implements two key technical innovations: Residual Autoencoding and Decoupled High-Resolution Adaptation. It processes images through a space-to-channel transformation approach, allowing for efficient processing of high-resolution images while maintaining quality.

  • Achieves 19.1x inference speedup on H100 GPU
  • Provides 17.9x training speedup while maintaining or improving FID scores
  • Supports efficient text-to-image generation on laptop hardware

Core Capabilities

  • High spatial compression (up to 128x) with quality preservation
  • Efficient processing of 512x512 resolution images
  • Seamless integration with existing diffusion models
  • Optimized for both training and inference workflows

Frequently Asked Questions

Q: What makes this model unique?

DC-AE's ability to maintain high reconstruction accuracy at extreme compression ratios (128x) sets it apart from traditional autoencoders, which typically struggle beyond 8x compression.

Q: What are the recommended use cases?

The model is ideal for high-resolution image generation tasks, particularly when computational efficiency is crucial. It's especially suitable for deployment in resource-constrained environments like laptops while maintaining high-quality output.

🍰 Interesting in building your own agents?
PromptLayer provides Huggingface integration tools to manage and monitor prompts with your whole team. Get started here.