T2I-Adapter-Sketch-SDXL

Property	Value
Parameter Count	77M
Base Model	Stable Diffusion XL 1.0
License	Apache 2.0
Paper	arXiv:2302.08453

What is t2i-adapter-sketch-sdxl-1.0?

T2I-Adapter-Sketch-SDXL is an adapter network developed by TencentARC that conditions Stable Diffusion XL on sketches. Its sketch conditioning is built around PidiNet edge detection, and the adapter passes that structure to the base SDXL model as an additional input alongside the text prompt, giving tighter control over the layout of the generated image.

Implementation Details

The model is implemented as a lightweight adapter network with 77M parameters that plugs into the much larger 2.6B-parameter SDXL base model. It was trained on 3M high-resolution image-text pairs from LAION-Aesthetics V2 for 20,000 steps, with a batch size of 256 and a constant learning rate of 1e-5.
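To put those figures in proportion, the arithmetic can be sketched as below. It uses only the rounded numbers quoted above (77M, 2.6B, 3M, 20,000, 256), so the results are approximate; the function names are illustrative and not part of any library.

```python
# Rounded figures quoted in the description above.
ADAPTER_PARAMS = 77_000_000
BASE_PARAMS = 2_600_000_000
DATASET_PAIRS = 3_000_000
TRAIN_STEPS = 20_000
BATCH_SIZE = 256


def adapter_overhead(adapter_params: int, base_params: int) -> float:
    """Adapter size as a fraction of the base model size."""
    return adapter_params / base_params


def samples_seen(steps: int, batch_size: int) -> int:
    """Total training samples processed over the whole run."""
    return steps * batch_size


overhead = adapter_overhead(ADAPTER_PARAMS, BASE_PARAMS)
seen = samples_seen(TRAIN_STEPS, BATCH_SIZE)
passes = seen / DATASET_PAIRS  # approximate passes over the dataset

print(f"adapter overhead: {overhead:.2%}")
print(f"samples seen:     {seen:,}")
print(f"dataset passes:   {passes:.2f}")
```

On these numbers the adapter adds roughly 3% to the parameter count, and the run processes about 5.1M samples, which is a little under two passes over the 3M pairs.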

Trained on sketches produced with PidiNet edge detection
Accepts hand-drawn monochrome images with white outlines on a black background
Trained with fp16 mixed precision
Uses EulerAncestralDiscreteScheduler for generation
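The pieces listed above can be wired together roughly as follows. This is a minimal sketch, assuming the Hugging Face diffusers and controlnet_aux packages and a CUDA device. The Hub repository IDs, the SketchGenConfig class, its default values, and the generate_from_image helper are assumptions introduced for illustration and are not stated in the description above.

```python
from dataclasses import dataclass

# Hub repository IDs: assumptions, not given in the text above.
ADAPTER_ID = "TencentARC/t2i-adapter-sketch-sdxl-1.0"
BASE_MODEL_ID = "stabilityai/stable-diffusion-xl-base-1.0"
ANNOTATOR_ID = "lllyasviel/Annotators"  # assumed source of PidiNet weights


@dataclass
class SketchGenConfig:
    """Illustrative sampling settings; the defaults are assumptions, not tuned values."""
    num_inference_steps: int = 30
    guidance_scale: float = 7.5
    adapter_conditioning_scale: float = 0.9
    resolution: int = 1024


def generate_from_image(source_image, prompt, config=None, device="cuda"):
    """Extract a PidiNet sketch from a PIL image, then condition SDXL on it."""
    # Heavy imports are deferred so that importing this module stays cheap.
    import torch
    from controlnet_aux.pidi import PidiNetDetector
    from diffusers import (
        EulerAncestralDiscreteScheduler,
        StableDiffusionXLAdapterPipeline,
        T2IAdapter,
    )

    config = config or SketchGenConfig()

    # Load the adapter and attach it to the SDXL base pipeline in fp16.
    adapter = T2IAdapter.from_pretrained(ADAPTER_ID, torch_dtype=torch.float16)
    scheduler = EulerAncestralDiscreteScheduler.from_pretrained(
        BASE_MODEL_ID, subfolder="scheduler"
    )
    pipe = StableDiffusionXLAdapterPipeline.from_pretrained(
        BASE_MODEL_ID,
        adapter=adapter,
        scheduler=scheduler,
        torch_dtype=torch.float16,
        variant="fp16",
    ).to(device)

    # PidiNet turns the source image into a white-on-black edge sketch.
    pidinet = PidiNetDetector.from_pretrained(ANNOTATOR_ID).to(device)
    sketch = pidinet(
        source_image,
        detect_resolution=config.resolution,
        image_resolution=config.resolution,
        apply_filter=True,
    )

    result = pipe(
        prompt=prompt,
        image=sketch,
        num_inference_steps=config.num_inference_steps,
        guidance_scale=config.guidance_scale,
        adapter_conditioning_scale=config.adapter_conditioning_scale,
    )
    return result.images[0]
```

A call such as generate_from_image(pil_image, "a watercolor fox") downloads several gigabytes of weights on first use. A drawing that is already white-on-black could in principle skip the PidiNet step and be passed to the pipeline directly, though that path is not shown here.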

Core Capabilities

Convert simple sketches into detailed, photorealistic images
Maintain structural fidelity to input sketches
Support high-resolution image generation
Control how strongly the sketch steers generation through the adapter_conditioning_scale parameter
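The effect of adapter_conditioning_scale can be illustrated with a toy example. It assumes the adapter's features are multiplied by the scale and then added to the base model's encoder features, which is the general T2I-Adapter scheme. Real pipelines do this on multi-scale tensors, and the apply_adapter function and the numbers below are purely illustrative.

```python
def apply_adapter(unet_features, adapter_features, adapter_conditioning_scale=1.0):
    """Add scaled adapter features to base-model features, element by element."""
    if len(unet_features) != len(adapter_features):
        raise ValueError("feature lists must have the same length")
    return [
        u + adapter_conditioning_scale * a
        for u, a in zip(unet_features, adapter_features)
    ]


# Made-up feature values standing in for one encoder feature map.
unet = [0.5, -1.0, 2.0]
adapter = [1.0, 1.0, -2.0]

for scale in (0.0, 0.5, 1.0):
    print(scale, apply_adapter(unet, adapter, scale))
```

At a scale of 0.0 the base features pass through unchanged, so the sketch has no influence. Larger values pull the features further toward the sketch's structure, which trades prompt freedom for structural fidelity.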

Frequently Asked Questions

Q: What makes this model unique?

This model is designed specifically for the SDXL architecture. It adds sketch-to-image control with only 77M additional parameters, roughly 3% of the 2.6B-parameter base model, and leaves the base model's weights unchanged.

Q: What are the recommended use cases?

The model excels at converting rough sketches into detailed images, making it ideal for artists, designers, and creators who want to quickly visualize their concepts. It's particularly useful for concept art, architectural visualization, and rapid prototyping of visual ideas.