es_text_neutralizer

Maintained By
somosnlp-hackathon-2022

es_text_neutralizer

PropertyValue
Parameter Count60.5M
Model TypeText-to-Text Generation
Base ArchitectureSpanish T5-small
LicenseApache 2.0
BLEU Score93.83

What is es_text_neutralizer?

es_text_neutralizer is a specialized language model designed to transform Spanish text into gender-neutral alternatives. Built on the spanish-t5-small architecture, it addresses the important social goal of promoting inclusive language by automatically suggesting gender-neutral expressions while maintaining the original meaning.

Implementation Details

The model is trained using a comprehensive dataset compiled from various Spanish institutional guidelines on inclusive language. It employs a text-to-text generation approach with 60.5M parameters and achieves impressive metrics including a 93.83 BLEU score and 0.98 BertScore.

  • Fine-tuned on spanish-t5-small base model
  • Trained with batch size of 32 over 10 epochs
  • Uses learning rate of 1e-04 with weight decay of 0.01
  • Implements custom DiffBleu metric for evaluation

Core Capabilities

  • Converts gendered Spanish expressions to neutral alternatives
  • Maintains semantic meaning while improving inclusivity
  • Supports real-time text transformation
  • Handles complex linguistic patterns

Frequently Asked Questions

Q: What makes this model unique?

The model's ability to understand and transform Spanish gender-specific language while maintaining context and meaning sets it apart. It's specifically designed to support UN's gender equality objectives through language modification.

Q: What are the recommended use cases?

The model is ideal for content writers, educational institutions, and organizations looking to make their Spanish communications more inclusive. It can be used for document translation, content creation, and as part of larger text processing pipelines.

The first platform built for prompt engineering