WD 1.5 Beta 3

Property	Value
License	Fair AI Public License 1.0-SD
Downloads	1,243
Framework	Diffusers
Task	Text-to-Image

What is wd-1-5-beta3?

WD 1.5 Beta 3 is an advanced text-to-image generation model that comes in five distinct variants: Base, Radiance, Ink, Mofu, and Illusion. Each variant is optimized for different aesthetic styles in anime-style image generation. The model utilizes the StableDiffusionPipeline architecture and includes safetensors implementation.

Implementation Details

The model employs the same VAE as WD 1.4 (specifically kl-f8-anime2.ckpt) and is built upon the successful foundation of previous WD versions. The Base model serves as a foundation for training, while the aesthetic models (Radiance, Ink, Mofu, and Illusion) are optimized for specific generation tasks.

Implements StableDiffusionPipeline architecture
Uses safetensors for model weight storage
Compatible with Inference Endpoints
Includes specialized VAE from WD 1.4

Core Capabilities

High-quality anime-style image generation
Multiple aesthetic variants for different artistic styles
Fine-tuning compatibility with Base model
Specialized image generation through aesthetic models

Frequently Asked Questions

Q: What makes this model unique?

This model's uniqueness lies in its multiple specialized variants, each optimized for different aesthetic styles in anime art generation. The Base model serves as a foundation for custom training, while the aesthetic models offer ready-to-use solutions for specific styles.

Q: What are the recommended use cases?

The Base model is recommended for training custom fine-tunes and LoRA models. For direct image generation, users should utilize one of the aesthetic models (Radiance, Ink, Mofu, or Illusion) based on their desired style output.

wd-1-5-beta3