Nous-Hermes-Llama2-13b

Maintained By: NousResearch

Property           Value
Parameter Count    13B
License            MIT
Training Data      300,000+ instructions
Architecture       Llama-2 based
Sequence Length    4096 tokens

What is Nous-Hermes-Llama2-13b?

Nous-Hermes-Llama2-13b is a 13-billion-parameter language model from Nous Research. It is built on the Llama-2 architecture and fine-tuned on over 300,000 carefully curated instructions, most of them derived from GPT-4 outputs. Training was performed on an 8x A100 80GB DGX machine, with an emphasis on data quality and response performance.

Implementation Details

The fine-tuning data combines synthetic GPT-4 outputs with diverse datasets, including GPTeacher, roleplay datasets, code-instruct datasets, and Nous Instruct & PDACTL. The model follows the Alpaca prompt format (a sketch of the templates appears after the list below) and posts strong benchmark results, including top positions on the ARC-c, ARC-e, Hellaswag, and OpenBookQA benchmarks.

  • Extensive fine-tuning on 300k+ instructions
  • 4096 token sequence length capability
  • Supports both instruction-only and instruction-with-input formats
  • Built using the Axolotl framework
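
As a minimal sketch of the Alpaca-style prompt conventions mentioned above (assuming the standard "### Instruction / ### Input / ### Response" section headers; the helper name build_prompt is illustrative), the following shows both the instruction-only and instruction-with-input templates:

# Minimal sketch of the Alpaca-style templates, assuming the standard
# "### Instruction / ### Input / ### Response" section headers.

INSTRUCTION_ONLY = (
    "### Instruction:\n"
    "{instruction}\n\n"
    "### Response:\n"
)

INSTRUCTION_WITH_INPUT = (
    "### Instruction:\n"
    "{instruction}\n\n"
    "### Input:\n"
    "{context}\n\n"
    "### Response:\n"
)

def build_prompt(instruction, context=None):
    """Format a request in the Alpaca style the model was fine-tuned on."""
    if context:
        return INSTRUCTION_WITH_INPUT.format(instruction=instruction, context=context)
    return INSTRUCTION_ONLY.format(instruction=instruction)

print(build_prompt("Summarize the key features of the Llama-2 architecture."))

Leaving the text after "### Response:" empty signals the model to begin generating its answer at that point.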

Core Capabilities

  • Enhanced response length and detail
  • Reduced hallucination rate compared to similar models
  • Strong performance in reasoning and comprehension tasks
  • Versatile application from creative writing to technical instruction
  • Benchmark scores averaging 70.0 on GPT4All tests

Frequently Asked Questions

Q: What makes this model unique?

The model stands out for its combination of long-form responses, reduced hallucination rate, and absence of traditional censorship mechanisms. It achieves state-of-the-art performance on multiple benchmarks while maintaining versatility across different use cases.

Q: What are the recommended use cases?

The model excels in various applications including creative writing, technical documentation, coding assistance, and complex instruction following. It's particularly well-suited for tasks requiring detailed, accurate responses and can be integrated into chat interfaces through platforms like LM Studio.
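
For local experimentation outside such platforms, a minimal loading sketch with the Hugging Face transformers library might look like the following; the repository id NousResearch/Nous-Hermes-Llama2-13b and the generation settings are assumptions to verify against the official model card:

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_ID = "NousResearch/Nous-Hermes-Llama2-13b"  # assumed Hugging Face repo id

tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
model = AutoModelForCausalLM.from_pretrained(
    MODEL_ID,
    torch_dtype=torch.float16,  # 13B parameters in fp16 need roughly 26 GB of GPU memory
    device_map="auto",          # requires the accelerate package; spreads layers across devices
)

prompt = (
    "### Instruction:\n"
    "Explain what a 4096-token context window means in practice.\n\n"
    "### Response:\n"
)

inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output_ids = model.generate(
    **inputs,
    max_new_tokens=256,
    temperature=0.7,
    do_sample=True,
)
# Strip the prompt tokens so only the newly generated answer is printed.
print(tokenizer.decode(output_ids[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True))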
