stsb-xlm-r-multilingual

Maintained By
sentence-transformers

stsb-xlm-r-multilingual

PropertyValue
Parameter Count278M
LicenseApache 2.0
PaperSentence-BERT Paper
Embedding Dimension768

What is stsb-xlm-r-multilingual?

stsb-xlm-r-multilingual is a powerful sentence embedding model built on XLM-RoBERTa architecture, designed to map sentences and paragraphs into a 768-dimensional dense vector space. This model is specifically optimized for multilingual semantic similarity tasks and can process text across multiple languages effectively.

Implementation Details

The model utilizes a two-component architecture: an XLM-RoBERTa transformer followed by a pooling layer. It supports a maximum sequence length of 128 tokens and implements mean pooling for generating sentence embeddings. The model can be easily integrated using either the sentence-transformers library or HuggingFace Transformers.

  • Built on XLM-RoBERTa base architecture
  • Implements mean pooling strategy
  • Supports multiple deep learning frameworks including PyTorch, TensorFlow, and ONNX
  • Optimized for cross-lingual semantic search and clustering

Core Capabilities

  • Multilingual sentence embedding generation
  • Semantic similarity comparison across languages
  • Text clustering and classification
  • Cross-lingual information retrieval

Frequently Asked Questions

Q: What makes this model unique?

This model stands out for its multilingual capabilities and optimization for the Semantic Textual Similarity Benchmark (STS-B). It combines the robust XLM-RoBERTa architecture with specialized training for semantic similarity tasks, making it particularly effective for cross-lingual applications.

Q: What are the recommended use cases?

The model is ideal for multilingual semantic search, document similarity comparison, clustering of text documents across languages, and building cross-lingual information retrieval systems. It's particularly useful when working with multilingual datasets where semantic understanding across languages is crucial.

The first platform built for prompt engineering