readability-es-3class-paragraphs

Maintained By
somosnlp-hackathon-2022

readability-es-3class-paragraphs

PropertyValue
Parameter Count125M
LicenseCC-BY-4.0
LanguageSpanish
TaskText Classification
F1 Score0.7881

What is readability-es-3class-paragraphs?

This is a specialized Spanish language model designed to assess text readability across three complexity levels: Basic, Intermediate, and Advanced. Built on the BERTIN RoBERTa architecture, it specifically analyzes paragraph-level text to determine its complexity level, making it valuable for educational content creators and language learning applications.

Implementation Details

The model is implemented using the RoBERTa architecture and fine-tuned on BERTIN's Spanish base model. It processes paragraph-level inputs and was trained on a diverse dataset including coh-metrix-esp corpus, website scrapes, and proprietary datasets like newsela-es and simplext.

  • Architecture: RoBERTa-based BERTIN model
  • Training Data: Mixed dataset approach including public and private sources
  • Granularity: Paragraph-level analysis
  • Performance: 0.7881 F1 macro average score

Core Capabilities

  • Three-level classification of text complexity
  • Paragraph-level readability assessment
  • Spanish text processing optimization
  • Integration with Common European Framework of Reference for Languages

Frequently Asked Questions

Q: What makes this model unique?

This model stands out for its specific focus on Spanish text readability assessment at the paragraph level, offering a three-class classification system aligned with educational standards. It's particularly valuable for content adaptation and educational material development.

Q: What are the recommended use cases?

The model is ideal for educational content creators, language learning platforms, and content adaptation services that need to assess and categorize Spanish text difficulty levels. It can be used for content simplification, curriculum development, and reading material selection.

The first platform built for prompt engineering