Gemma-2-Ataraxy-9B

Property	Value
Parameter Count	10.2B
Model Type	Text Generation
Architecture	Gemma 2
License	Gemma License
Tensor Type	BF16

What is Gemma-2-Ataraxy-9B?

Gemma-2-Ataraxy-9B is a merged language model that combines princeton-nlp/gemma-2-9b-it-SimPO and nbeerbower/gemma2-gutenberg-9B using a SLERP (spherical linear interpolation) merge. It has ranked #1 on eqbench.com's creative writing benchmark with a score of 82.64.

Implementation Details

The model uses a SLERP merge configuration with separately tuned interpolation parameters for the self-attention and MLP layers. The interpolation value varies across components, from 0.0 to 1.0, and controls how much each base model contributes to a given part of the merged network.

SLERP merge with per-component layer filtering (self-attention and MLP)
Comprehensive benchmark performance across multiple evaluation metrics
Available in various quantized versions (GGUF/EXL2) for different deployment scenarios
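
As a rough illustration of the idea described above, the following is a minimal Python sketch of SLERP applied to two flattened weight vectors, plus a helper that spreads a short list of interpolation values across layer depth. It is not the tooling or the configuration used to build this model: the function names `slerp` and `layer_t`, the anchor values, and the layer count of 42 are all assumptions chosen for illustration.

```python
import math


def slerp(t, v0, v1, eps=1e-8):
    """Spherical linear interpolation between two flat weight vectors.

    t = 0.0 returns v0, t = 1.0 returns v1 (up to floating-point error).
    """
    norm0 = math.sqrt(sum(x * x for x in v0))
    norm1 = math.sqrt(sum(x * x for x in v1))
    if norm0 < eps or norm1 < eps:
        # A zero vector has no direction: fall back to linear interpolation.
        return [(1 - t) * a + t * b for a, b in zip(v0, v1)]

    # Angle between the two vectors, taken from their unit directions.
    dot = sum(a * b for a, b in zip(v0, v1)) / (norm0 * norm1)
    dot = max(-1.0, min(1.0, dot))
    omega = math.acos(dot)
    sin_omega = math.sin(omega)
    if abs(sin_omega) < eps:
        # Nearly parallel vectors: linear interpolation is numerically safer.
        return [(1 - t) * a + t * b for a, b in zip(v0, v1)]

    w0 = math.sin((1 - t) * omega) / sin_omega
    w1 = math.sin(t * omega) / sin_omega
    return [w0 * a + w1 * b for a, b in zip(v0, v1)]


def layer_t(anchors, layer, num_layers):
    """Linearly interpolate a short list of anchor values across layer depth."""
    if num_layers == 1 or len(anchors) == 1:
        return anchors[0]
    pos = layer / (num_layers - 1) * (len(anchors) - 1)
    lo = int(math.floor(pos))
    hi = min(lo + 1, len(anchors) - 1)
    frac = pos - lo
    return (1 - frac) * anchors[lo] + frac * anchors[hi]


# Hypothetical anchor values and layer count, for illustration only.
attn_anchors = [0.0, 0.5, 0.3, 0.7, 1.0]
num_layers = 42
schedule = [layer_t(attn_anchors, i, num_layers) for i in range(num_layers)]

# Blend two toy "weight" vectors using the value for the middle of the stack.
merged = slerp(schedule[num_layers // 2], [1.0, 0.0], [0.0, 1.0])
```

In this sketch, a different anchor list per component (for example one for self-attention and another for MLP weights) would give each component its own blend at each depth, which is the kind of variation the configuration above describes.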

Core Capabilities

Top-tier creative writing performance (82.64 on Creative Writing V2)
Scores of 30.09 on IFEval and 42.03 on BBH
More human-like prose, attributed to the Gutenberg-finetuned parent model
Balanced performance across technical and creative tasks

Frequently Asked Questions

Q: What makes this model unique?

This model merges a SimPO finetune with a Gutenberg finetune. It leads on creative writing (82.64 on Creative Writing V2) while keeping its scores on technical benchmarks such as IFEval and BBH, which suggests the merge combines strengths of both parent models without the degradation that merging often introduces.

Q: What are the recommended use cases?

The model is best suited to creative writing, such as storytelling and content creation, and also works for general-purpose text generation. It balances technical accuracy with a natural, human-like writing style.