Dolphin 2.0 Mistral 7B GPTQ

Property	Value
Base Model	Mistral 7B
License	Apache 2.0
Quantization	GPTQ (Multiple Options)
Training Time	48 hours on 4x A100s

What is dolphin-2.0-mistral-7B-GPTQ?

Dolphin 2.0 Mistral 7B GPTQ is a quantized version of Eric Hartford's Dolphin model, based on Microsoft's Orca approach. It's an uncensored, highly capable model that combines the Mistral architecture with specialized training for enhanced compliance and versatility. The model uses the ChatML format and offers multiple quantization options for different hardware configurations.

Implementation Details

The model implements GPTQ quantization with various configurations, ranging from 4-bit to 8-bit precision, with different group sizes (32g, 64g, 128g) and Act Order options. It's trained on a combination of the Dolphin dataset and Jon Durbin's Airoboros dataset for improved creativity and performance.

Multiple GPTQ quantization options for different hardware requirements
ChatML prompt format implementation
Compatible with AutoGPTQ, Transformers, and ExLlama
Provides 4-bit and 8-bit versions with varying group sizes

Core Capabilities

Enhanced compliance and response generation
Optimized for both commercial and non-commercial use
Efficient memory usage through quantization
Support for long context windows up to 4096 tokens

Frequently Asked Questions

Q: What makes this model unique?

This model combines the powerful Mistral architecture with specialized training for enhanced compliance, while offering multiple quantization options for different hardware setups. It's particularly notable for its uncensored nature and flexible deployment options.

Q: What are the recommended use cases?

The model is suitable for various applications requiring helpful AI assistance, from general text generation to specialized tasks. However, users should implement their own alignment layer before deploying it as a service due to its uncensored nature.