codellama-13b-oasst-sft-v10

Maintained By
OpenAssistant

CodeLlama-13B-OASST-SFT-v10

PropertyValue
Parameter Count13B
Model TypeCausal decoder-only transformer
LicenseLLAMA 2 Community License
Training Steps6123 (BS 64)
Tensor TypeBF16

What is codellama-13b-oasst-sft-v10?

CodeLlama-13B-OASST-SFT-v10 is an Open-Assistant fine-tuned version of Meta's CodeLlama 13B language model. This model represents a significant advancement in code-generation capabilities, incorporating the chatml standard prompt format for better compatibility with chat applications. It's trained on a carefully curated mix of datasets including OpenAssistant/oasst1 and specialized code datasets.

Implementation Details

The model utilizes a new RoPE Theta value of 1e6 (upgraded from 1e4) and requires loading with trust_remote_code=True. It's implemented using the epfLLM/Megatron-LLM trainer and supports multi-turn conversations through a structured prompt template system.

  • Trained for 6123 steps with batch size 64
  • Implements chatml standard prompt format
  • Supports multiple conversation turns between user and assistant
  • Includes integrated system message support

Core Capabilities

  • Advanced code generation and completion
  • Multi-turn conversation support
  • Compatibility with chat inference/frontend applications
  • Multilingual support with focus on European languages
  • Safe and ethical response generation

Frequently Asked Questions

Q: What makes this model unique?

This model combines CodeLlama's powerful code generation capabilities with Open-Assistant's conversational abilities, using the widely-adopted chatml format. The integration of multiple high-quality datasets and careful fine-tuning makes it particularly effective for both coding and general assistance tasks.

Q: What are the recommended use cases?

The model is well-suited for code generation, technical assistance, and general conversation tasks. It's particularly effective in scenarios requiring both programming expertise and natural language understanding, while maintaining ethical guidelines and safety considerations.

🍰 Interesting in building your own agents?
PromptLayer provides Huggingface integration tools to manage and monitor prompts with your whole team. Get started here.