Gemma-3-Glitter-12B
Property | Value |
---|---|
Base Model | Gemma 3 12B IT |
Author | allura-org |
Model URL | HuggingFace |
What is Gemma-3-Glitter-12B?
Gemma-3-Glitter-12B is a specialized creative writing model that combines roleplay instruction and storytelling capabilities. It's built through a 50/50 merge of two distinct training approaches: an instruct-based RP training (~13.5M tokens) and long-form creative writing completion training (~20M tokens). The model maintains vision capabilities from its base architecture while adding enhanced creative writing features.
Implementation Details
The model implements a unique training methodology combining two separate training sets: ToastyPigeon/g3-12b-rp-system-v0.1 and ToastyPigeon/g3-12b-storyteller-v0.2-textonly. The architecture supports both standard Gemma2/3 instruct format and an optional system role, providing flexibility in implementation.
- RP Training: ~13.5M tokens with 2:1 human to synthetic ratio
- Storyteller Training: ~20M tokens including 1.6M synthetic from R1
- Vision Capability: Fully supported
- Custom Instruct Format Support
Core Capabilities
- Creative Writing and Storytelling
- Roleplay System Integration
- Vision Processing
- Flexible Prompt Formatting
- System Role Implementation
Frequently Asked Questions
Q: What makes this model unique?
The model's unique strength lies in its balanced fusion of roleplay and storytelling capabilities, combined with maintained vision features. The dual training approach ensures both structured instruction following and creative freedom.
Q: What are the recommended use cases?
The model excels in creative writing tasks, storytelling, roleplay scenarios, and applications requiring both visual and narrative capabilities. It's particularly suited for interactive storytelling and creative content generation with visual context.