distilbert-base-nli-mean-tokens

Maintained By
sentence-transformers

| Property | Value |
|---|---|
| Parameter Count | 66.4M |
| License | Apache 2.0 |
| Paper | Sentence-BERT Paper |
| Downloads | 232,750 |

What is distilbert-base-nli-mean-tokens?

This is a Sentence-BERT model based on the DistilBERT architecture that maps sentences and paragraphs to a 768-dimensional dense vector space. Note, however, that this model is deprecated: it produces lower-quality sentence embeddings than newer alternatives.

Implementation Details

The model combines a DistilBERT base architecture with a mean token pooling strategy. It is implemented with the sentence-transformers framework and can be used through either the sentence-transformers or HuggingFace Transformers libraries.

  • Supports max sequence length of 128 tokens
  • Uses mean pooling over token embeddings
  • Outputs 768-dimensional embeddings
  • Stores weights as F32 (32-bit float) tensors
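The mean-pooling strategy listed above can be illustrated in isolation. The snippet below is a minimal sketch using toy token embeddings and an attention mask (random placeholders, not real model output): padded positions are masked out before averaging, which is how Sentence-BERT-style mean pooling is typically described.

```python
import numpy as np

def mean_pool(token_embeddings, attention_mask):
    """Average token embeddings, ignoring padded positions.

    token_embeddings: (seq_len, dim) array of per-token vectors
    attention_mask:   (seq_len,) array, 1 for real tokens, 0 for padding
    """
    mask = attention_mask[:, None].astype(float)       # (seq_len, 1)
    summed = (token_embeddings * mask).sum(axis=0)     # sum over real tokens only
    count = np.clip(mask.sum(), 1e-9, None)            # avoid divide-by-zero
    return summed / count

# Toy example: 4 token slots, 768 dimensions, last slot is padding
tokens = np.random.rand(4, 768)
mask = np.array([1, 1, 1, 0])
sentence_embedding = mean_pool(tokens, mask)
print(sentence_embedding.shape)  # (768,)
```

Because the mask zeroes out the padded slot, the result equals the plain mean of the three real token vectors.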

Core Capabilities

  • Sentence and paragraph embedding generation
  • Semantic similarity computation
  • Text clustering applications
  • Semantic search functionality

Frequently Asked Questions

Q: What makes this model unique?

This model combines DistilBERT's efficiency with mean token pooling, making it lighter than full BERT-based alternatives. However, it is deprecated and not recommended for new projects.

Q: What are the recommended use cases?

Although it was historically used for semantic search and text-similarity tasks, the newer sentence-transformer models listed on SBERT.net are recommended instead, given this model's known quality limitations.
