Pygmalion-13B

Property	Value
Base Model	LLaMA-13B
Primary Use	Conversational AI
Language	English
License	Requires LLaMA access

What is Pygmalion-13B?

Pygmalion-13B is a sophisticated dialogue model built upon Meta's LLaMA-13B architecture. It represents version 1 of the model, fine-tuned using a carefully selected subset of data from Pygmalion-6B-v8-pt4. This model is specifically designed for generating natural, context-aware conversational responses.

Implementation Details

The model implements a unique deployment approach requiring XOR decoding with the original LLaMA weights. Users must obtain official LLaMA access from Meta and perform specific conversion steps using provided scripts. The model utilizes a persona-based prompting format for generating contextually appropriate responses.

Requires original LLaMA weights access
Custom XOR decoding implementation
Specialized persona-based prompting system
Trained using LoRA with custom configuration

Core Capabilities

Natural dialogue generation
Character persona maintenance
Context-aware responses
Automatic completion detection with end-of-text tokens

Frequently Asked Questions

Q: What makes this model unique?

The model's distinctive feature is its specialized conversational fine-tuning combined with a persona-based approach, allowing for more natural and contextually appropriate dialogue generation.

Q: What are the recommended use cases?

The model is specifically designed for fictional conversation and entertainment purposes. It's not intended for factual or safety-critical applications, as it may produce socially unacceptable or factually incorrect content.

pygmalion-13b