Dolphin 2.9 Llama3 8B GGUF

Property	Value
Parameter Count	8.03B
License	META LLAMA 3 COMMUNITY LICENSE
Base Model	Meta-Llama-3-8B
Context Length	8K (base model), 4K (fine-tuning)
Training Duration	2.5 days on 8x L40S

What is dolphin-2.9-llama3-8b-gguf?

Dolphin 2.9 is an advanced language model built on Meta's Llama 3 architecture, developed by Eric Hartford, Lucas Atkins, and Fernando Fernandes at Cognitive Computations. This uncensored model represents a significant advancement in conversational AI, trained using full-parameter fine-tuning with ChatML prompt formatting.

Implementation Details

The model underwent comprehensive training using 10 diverse datasets, including OpenHermes-2.5, CodeFeedback, and UltraChat-200k. It employs the ChatML template format for interactions and was trained using the Axolotl framework with specific optimizations like gradient checkpointing and flash attention.

Full-weight fine-tuning with 4K sequence length
Trained using adamw_8bit optimizer with cosine learning rate scheduler
Implements gradient checkpointing and flash attention for optimal performance
Uses ChatML prompt template for standardized interactions

Core Capabilities

Advanced conversational abilities with uncensored responses
Robust coding and programming support
Function calling capabilities
Initial agentic abilities
Instruction-following and task completion
Mathematical problem-solving capabilities

Frequently Asked Questions

Q: What makes this model unique?

This model stands out for its uncensored nature and comprehensive training across multiple domains. It combines conversational abilities with coding expertise and function-calling capabilities, all while maintaining compliance with user requests.

Q: What are the recommended use cases?

The model is suitable for programming assistance, conversational applications, and general task automation. However, users should implement their own alignment layer before deploying it as a service, given its uncensored nature.