L3-8B-Lunaris-v1
Property | Value |
---|---|
Parameter Count | 8.03B |
Model Type | LLaMA-3 Based Merge |
License | LLaMA-3 |
Tensor Type | BF16 |
Primary Language | English |
What is L3-8B-Lunaris-v1?
L3-8B-Lunaris-v1 is a sophisticated model merge based on LLaMA-3, specifically designed to balance creative capabilities with logical reasoning. Created by Sao10K, this model represents an evolution from the Stheno v3.2 architecture, incorporating multiple specialized models to achieve enhanced performance in both generalist and roleplaying tasks.
Implementation Details
The model utilizes a complex merging strategy combining five distinct models with carefully calibrated weights and densities. It employs the 'ties' merge method and operates with bfloat16 precision. The recommended configuration includes a temperature of 1.4 and min_p of 0.1, optimized for the Llama-3-Instruct context template.
- Base Model: Meta-Llama-3-8B-Instruct
- Integrated with specialized RP models and general knowledge enhancement through badger-iota
- Utilizes int8_mask and rescaling for optimization
- Implements density-weighted model merging ranging from 0.4 to 0.7
Core Capabilities
- Enhanced creative text generation and storytelling
- Improved logical reasoning compared to predecessor models
- Balanced performance in both generalist and roleplay scenarios
- Optimized for conversational and instruction-following tasks
Frequently Asked Questions
Q: What makes this model unique?
The model's distinctive feature is its carefully calibrated merge of multiple specialized models, creating a balance between creativity and logical reasoning. The developer's extensive experimentation with merge parameters and model selection has resulted in improved performance over its predecessor, Stheno v3.2.
Q: What are the recommended use cases?
Lunaris-v1 is particularly well-suited for creative writing, roleplaying scenarios, and general conversation tasks. Its balanced architecture makes it versatile for both narrative generation and logical reasoning tasks, with enhanced performance in instruction-following contexts.