Nous-Hermes-2-Mistral-7B-DPO

Maintained By
NousResearch

Nous-Hermes-2-Mistral-7B-DPO

PropertyValue
Parameter Count7.24B
Base ModelMistral-7B-v0.1
LicenseApache 2.0
Training MethodDPO (Direct Preference Optimization)
FormatChatML

What is Nous-Hermes-2-Mistral-7B-DPO?

Nous-Hermes-2-Mistral-7B-DPO represents a significant advancement in language model development, built upon the Mistral 7B architecture. This model is the result of applying Direct Preference Optimization (DPO) to the OpenHermes-2.5 model, trained on 1,000,000 high-quality instructions and chat interactions. It demonstrates improved performance across multiple benchmarks including AGIEval, BigBench Reasoning, GPT4All, and TruthfulQA.

Implementation Details

The model utilizes ChatML as its prompt format, enabling structured multi-turn dialogue with system-level instruction capabilities. It supports BF16 precision and can be efficiently run with 4-bit quantization requiring approximately 5GB of VRAM.

  • Trained on GPT-4 quality synthetic data
  • Implements ChatML format for enhanced dialogue control
  • Supports system prompts for better steerability
  • Compatible with OpenAI endpoint formatting

Core Capabilities

  • Strong performance on reasoning tasks (73.72% on GPT4All)
  • Advanced dialogue handling with multi-turn conversations
  • Improved truthfulness (56.42% on TruthfulQA MC2)
  • Flexible system-level instruction following
  • Efficient resource usage with quantization support

Frequently Asked Questions

Q: What makes this model unique?

The model's unique strength lies in its DPO training approach and comprehensive benchmark improvements over its predecessor. It combines high-quality instruction following with efficient resource usage, making it practical for both research and production deployments.

Q: What are the recommended use cases?

The model excels in conversational AI applications, instruction following, reasoning tasks, and general-purpose text generation. It's particularly well-suited for applications requiring structured dialogue management through its ChatML implementation.

The first platform built for prompt engineering