cocodr-large-msmarco

Maintained By
OpenMatch

COCO-DR Large MS MARCO

PropertyValue
Parameter Count335M
Model TypeDense Retrieval
LicenseMIT
PaperResearch Paper

What is cocodr-large-msmarco?

COCO-DR Large MS MARCO is an advanced dense retrieval model built on BERT-large architecture, specifically designed to address distribution shifts in zero-shot dense retrieval scenarios. The model combines contrastive and distributionally robust learning approaches to enhance retrieval performance across varying data distributions.

Implementation Details

The model utilizes a BERT-large backbone with 335M parameters and implements a two-stage training process: pretraining on the BEIR corpus followed by fine-tuning on the MS MARCO dataset. It can be easily integrated using the HuggingFace transformers library.

  • Built on BERT-large architecture
  • Pretrained on BEIR corpus
  • Fine-tuned on MS MARCO dataset
  • Implements contrastive and distributionally robust learning

Core Capabilities

  • Zero-shot dense retrieval
  • Handling distribution shifts in retrieval tasks
  • Text embedding generation
  • Document ranking and retrieval

Frequently Asked Questions

Q: What makes this model unique?

The model's unique strength lies in its ability to combat distribution shifts in zero-shot dense retrieval scenarios through a combination of contrastive and distributionally robust learning techniques. This makes it particularly effective for applications where the target data distribution differs from the training data.

Q: What are the recommended use cases?

This model is particularly well-suited for information retrieval tasks, document ranking, and search applications where robust performance across different data distributions is required. It's especially valuable in scenarios where zero-shot capabilities are needed.

The first platform built for prompt engineering