gte-reranker-modernbert-base

Maintained By
Alibaba-NLP

gte-reranker-modernbert-base

PropertyValue
Model Size149M parameters
Max Sequence Length8192 tokens
Primary LanguageEnglish
AuthorAlibaba-NLP (Tongyi Lab)
Model TypeText Reranker
BEIR Score56.19
LoCo Score90.68

What is gte-reranker-modernbert-base?

The gte-reranker-modernbert-base is an advanced text reranking model developed by Alibaba's Tongyi Lab. Built on the modernBERT architecture, it's specifically designed for improving search and retrieval systems by reranking candidate documents based on their relevance to a query. The model demonstrates exceptional performance across various benchmarks, particularly in long document retrieval (LoCo) and BEIR evaluation tasks.

Implementation Details

The model can be easily implemented using either the transformers or sentence-transformers libraries. It supports efficient Flash Attention 2 for improved performance and can handle sequences up to 8192 tokens in length. The model outputs relevance scores for query-document pairs, making it ideal for search result refinement and document ranking tasks.

  • Built on modernBERT foundation model architecture
  • Supports both transformers and sentence-transformers implementations
  • Compatible with Flash Attention 2 for enhanced efficiency
  • Outputs calibrated relevance scores for text pairs

Core Capabilities

  • Long document processing (up to 8192 tokens)
  • State-of-the-art performance on LoCo benchmark (90.68)
  • Strong BEIR evaluation scores (56.19)
  • Efficient cross-encoder architecture for precise relevance scoring
  • Robust performance across various retrieval tasks

Frequently Asked Questions

Q: What makes this model unique?

The model's key strengths lie in its ability to handle long documents effectively and its superior performance on reranking tasks. Built on the modernBERT architecture, it achieves impressive scores on LoCo (90.68) and BEIR (56.19) benchmarks while maintaining a relatively compact size of 149M parameters.

Q: What are the recommended use cases?

The model is ideal for improving search systems, document retrieval applications, and any scenario requiring precise ranking of text pairs. It's particularly effective for long document retrieval, question-answering systems, and information retrieval tasks where accurate relevance scoring is crucial.

🍰 Interesting in building your own agents?
PromptLayer provides Huggingface integration tools to manage and monitor prompts with your whole team. Get started here.