Llama3 8B CPT Sahabat-AI v1 Instruct
Property | Value |
---|---|
Parameter Count | 8.03B |
Languages | English, Indonesian, Javanese, Sundanese |
License | Llama3 Community License |
Context Length | 8192 tokens |
Base Model | Llama3 |
What is llama3-8b-cpt-sahabatai-v1-instruct-GGUF?
Sahabat-AI v1 Instruct is a multilingual Large Language Model specifically optimized for Indonesian languages. Co-initiated by GoTo Group and Indosat Ooredoo Hutchison, it has been fine-tuned on 448,000 Indonesian instruction pairs, along with 96,000 Javanese and 98,000 Sundanese pairs, plus 129,000 English instruction pairs.
Implementation Details
The model uses the Llama3 architecture with 8B parameters and employs the default Llama-3-8B tokenizer. It has undergone extensive evaluation on multiple benchmarks including SEA HELM, IndoMMLU, and standard English tasks.
- Full parameter fine-tuning completed in 4 hours
- Alignment training conducted for 2 hours
- Trained on 8x H100-80GB GPUs
- Implements on-policy alignment and model merges
Core Capabilities
- Strong performance in Indonesian language tasks (57.221% on BHASA benchmark)
- Effective handling of Javanese (56.460%) and Sundanese (47.495%) content
- Competitive English language capabilities (24.43% average on standard benchmarks)
- 8192 token context length for handling longer conversations
Frequently Asked Questions
Q: What makes this model unique?
This model stands out for its specialized optimization for Indonesian languages and dialects, backed by major tech companies and extensive instruction tuning across multiple languages.
Q: What are the recommended use cases?
The model is well-suited for Indonesian language processing tasks, multilingual applications, and general instruction-following scenarios in Indonesian, Javanese, Sundanese, and English contexts.