Gemma-3-Starshine-12B

Maintained By
ToastyPigeon

Gemma-3-Starshine-12B

PropertyValue
AuthorToastyPigeon
Model Size12B parameters
Base ArchitectureGemma-3
Model TypeCreative Writing / Storytelling
SourceHuggingFace

What is Gemma-3-Starshine-12B?

Gemma-3-Starshine-12B is a specialized creative writing model that combines the best aspects of instruction-tuned and base-model capabilities. It represents a strategic merger of two models: Gemma-3-Glitter-12B (instruction-tuned) and Gemma-3-Confetti-12B (base model with adventure data), resulting in a balanced system that excels at storytelling while maintaining instruction-following capabilities.

Implementation Details

The model employs a linear merge methodology with equal weights (0.5) between its parent models. It utilizes the Gemma2/3 instruction format and includes optional system role support, though it may perform better without it. The implementation includes vision tower capabilities and maintains compatibility with traditional Gemma instruction patterns.

  • Linear merge configuration with 50-50 weight distribution
  • Built-in vision tower support
  • Flexible instruction format with optional system role
  • Enhanced storytelling capabilities with novel-like prose generation

Core Capabilities

  • Advanced creative writing and storytelling
  • Character impersonation and dialogue generation
  • Instruction following while maintaining creative freedom
  • Visual processing capabilities
  • Balanced between structured and free-form responses

Frequently Asked Questions

Q: What makes this model unique?

This model uniquely combines instruction-following capabilities with creative freedom, effectively reducing the hesitancy often found in instruction-tuned models while maintaining coherent storytelling abilities.

Q: What are the recommended use cases?

The model is particularly well-suited for creative writing tasks, storytelling, scenario generation, and character-based interactions. It excels in generating novel-like prose and can effectively impersonate user characters in storytelling contexts.

🍰 Interesting in building your own agents?
PromptLayer provides Huggingface integration tools to manage and monitor prompts with your whole team. Get started here.