DistilBERT Base Turkish Cased
Property | Value |
---|---|
Parameter Count | 68.1M |
License | MIT |
Paper | View Paper |
Author | dbmdz |
Framework Support | PyTorch, TensorFlow |
What is distilbert-base-turkish-cased?
DistilBERT Base Turkish Cased is a lightweight, distilled version of the original BERTurk model, specifically designed for Turkish language processing. Developed by the MDZ Digital Library team at the Bavarian State Library, this model maintains case sensitivity while reducing the computational footprint of the original BERT architecture.
Implementation Details
The model was trained on 7GB of Turkish text data using knowledge distillation techniques, with the cased version of BERTurk serving as the teacher model. The training process took 5 days using 4 RTX 2080 TI GPUs, implementing the official Hugging Face distillation approach.
- Maintains case sensitivity for better handling of Turkish text
- Compatible with PyTorch-Transformers framework
- Achieves performance within 1.18% of the original BERTurk model
Core Capabilities
- Part-of-Speech (PoS) tagging with performance exceeding 24-layer XLM-RoBERTa
- Named Entity Recognition (NER)
- General Turkish language understanding tasks
- Efficient inference with reduced model size
Frequently Asked Questions
Q: What makes this model unique?
This model uniquely combines the efficiency of distillation techniques with specific optimization for Turkish language processing, achieving near-original performance while being significantly smaller and faster.
Q: What are the recommended use cases?
The model is particularly well-suited for Turkish language processing tasks including PoS tagging and NER, especially in environments where computational resources are limited but high performance is required.