code-millenials-34b

Maintained by: budecosystem

Code Millenials 34B

Property          Value
Parameter Count   33.7B
Model Type        Text Generation / Code Generation
License           LLaMA 2
Tensor Type       FP16
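
The parameter count and tensor type listed above give a rough idea of how much memory the weights alone require. The following is a back-of-the-envelope sketch, assuming 2 bytes per parameter for FP16 and ignoring activations, the KV cache, and framework overhead; the helper name `weight_footprint_gib` is hypothetical.

```python
# Rough estimate of weight memory for a 33.7B-parameter model stored in FP16.
# Assumption: every parameter is stored as a 2-byte FP16 value; runtime
# overhead (activations, KV cache, CUDA context) is NOT included.

PARAMS = 33.7e9            # parameter count from the table
BYTES_PER_PARAM_FP16 = 2   # FP16 uses 16 bits per value


def weight_footprint_gib(params: float, bytes_per_param: int) -> float:
    """Return the size of the raw weights in GiB (2**30 bytes)."""
    return params * bytes_per_param / 2**30


fp16_gib = weight_footprint_gib(PARAMS, BYTES_PER_PARAM_FP16)
print(f"FP16 weights: about {fp16_gib:.1f} GiB")
```

Under these assumptions the weights alone come to a little under 63 GiB, so inference in FP16 likely needs either a single large-memory accelerator or sharding across several devices.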

What is code-millenials-34b?

Code Millenials 34B is a code generation model with 33.7B parameters. It scores 80.48% on HumanEval pass@1, a result reported as higher than those of several proprietary models, including Gemini Ultra and Claude, which places it among the leading open-source options for code generation.

Implementation Details

The model was trained on 16 A100 80GB GPUs for approximately 50 hours, using the AdamW optimizer with a cosine learning rate schedule. It is used through the transformers library and supports FP16 precision for efficient inference. The main training hyperparameters were:

  • Training batch size: 16 per device
  • Learning rate: 2e-5 with cosine scheduling
  • Training duration: 3 epochs (2157 steps)
  • Warmup ratio: 0.1
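
The hyperparameters above can be turned into a concrete learning rate curve. The sketch below assumes linear warmup from zero followed by cosine decay to zero over the remaining steps, which is a common form of this schedule; the source does not state the exact variant used, and the names `lr_at` and `WARMUP_STEPS` are illustrative.

```python
import math

# Values taken from the list above.
PEAK_LR = 2e-5
TOTAL_STEPS = 2157
WARMUP_RATIO = 0.1

# Assumption: warmup length is the ratio applied to total steps, rounded up.
WARMUP_STEPS = math.ceil(TOTAL_STEPS * WARMUP_RATIO)


def lr_at(step: int) -> float:
    """Learning rate at a given optimizer step (linear warmup, cosine decay)."""
    if step < WARMUP_STEPS:
        # Linear ramp from 0 up to the peak learning rate.
        return PEAK_LR * step / WARMUP_STEPS
    # Fraction of the decay phase completed, in [0, 1].
    progress = (step - WARMUP_STEPS) / (TOTAL_STEPS - WARMUP_STEPS)
    return PEAK_LR * 0.5 * (1.0 + math.cos(math.pi * progress))


for step in (0, WARMUP_STEPS, TOTAL_STEPS // 2, TOTAL_STEPS):
    print(f"step {step:>4}: lr = {lr_at(step):.3e}")
```

With these numbers the warmup phase covers the first 216 steps, after which the rate decays smoothly from 2e-5 toward zero at step 2157.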

Core Capabilities

  • Code generation: 80.48% pass@1 on HumanEval
  • MBPP benchmark: 74.68%
  • General tasks: HellaSwag (75.09%), Winogrande (69.06%)
  • Reasoning and knowledge tasks: AI2 Reasoning Challenge (49.83%), MMLU (49.28%)
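
For readers unfamiliar with the pass@1 metric quoted above, the sketch below shows the commonly used unbiased pass@k estimator introduced with HumanEval. Whether the reported 80.48% was computed with this estimator over multiple samples or from a single greedy completion per problem is not stated in the source, so treat this as an illustration of the metric rather than the maintainers' evaluation code.

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased pass@k for one problem.

    n: completions sampled for the problem
    c: completions that passed all unit tests
    k: number of attempts the metric allows
    """
    if n - c < k:
        # Every size-k subset contains at least one passing completion.
        return 1.0
    return 1.0 - comb(n - c, k) / comb(n, k)


def benchmark_score(results: list[tuple[int, int]], k: int) -> float:
    """Average pass@k over (n, c) pairs, one pair per benchmark problem."""
    return sum(pass_at_k(n, c, k) for n, c in results) / len(results)


# Hypothetical results for three problems, 10 samples each.
print(benchmark_score([(10, 10), (10, 3), (10, 0)], k=1))
```

For k = 1 the estimator reduces to the fraction of sampled completions that pass, averaged over all problems.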

Frequently Asked Questions

Q: What makes this model unique?

Its main distinction is code generation performance: the 80.48% HumanEval pass@1 score places it among the top performers in the field and makes it competitive with proprietary models such as GPT-4.

Q: What are the recommended use cases?

The model is best suited to code generation work: software development assistance, code completion, and programming education. It can handle a range of programming languages and non-trivial coding tasks.
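
A minimal sketch of how code completion with this model might look through the transformers library in FP16. Several things here are assumptions: the repository id `budecosystem/code-millenials-34b` is inferred from the maintainer and model names, the helper `generate_code` is hypothetical, the prompt is passed as plain text because no prompt template is given in this document, and `device_map="auto"` requires the accelerate package.

```python
# Assumed Hugging Face repository id (maintainer name + model name).
MODEL_ID = "budecosystem/code-millenials-34b"


def generate_code(prompt: str, model_id: str = MODEL_ID, max_new_tokens: int = 128) -> str:
    """Load the model in FP16 and return a greedy completion for `prompt`."""
    # Imported lazily so that defining this helper stays cheap.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(
        model_id,
        torch_dtype=torch.float16,  # matches the FP16 tensor type of the checkpoint
        device_map="auto",          # spread layers across available devices
    )

    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens, do_sample=False)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `generate_code("def fibonacci(n):")` would download the full checkpoint on first use, so it is worth confirming available disk space and accelerator memory beforehand.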
