wd-1-5-beta3

Maintained By
waifu-diffusion

WD 1.5 Beta 3

PropertyValue
LicenseFair AI Public License 1.0-SD
Downloads1,243
FrameworkDiffusers
TaskText-to-Image

What is wd-1-5-beta3?

WD 1.5 Beta 3 is an advanced text-to-image generation model that comes in five distinct variants: Base, Radiance, Ink, Mofu, and Illusion. Each variant is optimized for different aesthetic styles in anime-style image generation. The model utilizes the StableDiffusionPipeline architecture and includes safetensors implementation.

Implementation Details

The model employs the same VAE as WD 1.4 (specifically kl-f8-anime2.ckpt) and is built upon the successful foundation of previous WD versions. The Base model serves as a foundation for training, while the aesthetic models (Radiance, Ink, Mofu, and Illusion) are optimized for specific generation tasks.

  • Implements StableDiffusionPipeline architecture
  • Uses safetensors for model weight storage
  • Compatible with Inference Endpoints
  • Includes specialized VAE from WD 1.4

Core Capabilities

  • High-quality anime-style image generation
  • Multiple aesthetic variants for different artistic styles
  • Fine-tuning compatibility with Base model
  • Specialized image generation through aesthetic models

Frequently Asked Questions

Q: What makes this model unique?

This model's uniqueness lies in its multiple specialized variants, each optimized for different aesthetic styles in anime art generation. The Base model serves as a foundation for custom training, while the aesthetic models offer ready-to-use solutions for specific styles.

Q: What are the recommended use cases?

The Base model is recommended for training custom fine-tunes and LoRA models. For direct image generation, users should utilize one of the aesthetic models (Radiance, Ink, Mofu, or Illusion) based on their desired style output.

The first platform built for prompt engineering