BioMistral-7B

Maintained By
BioMistral

BioMistral-7B

PropertyValue
Base ModelMistral-7B-Instruct-v0.1
LicenseApache 2.0
Supported Languages9 (English, French, German, Dutch, Spanish, Portuguese, Polish, Romanian, Italian)
PaperarXiv:2402.10373
Context Length2048 tokens

What is BioMistral-7B?

BioMistral-7B is an open-source large language model specifically designed for medical and biomedical applications. Built upon the Mistral-7B architecture, it has been further pre-trained on PubMed Central data to develop deep domain expertise in medical knowledge. The model represents a significant advancement in accessible medical AI, offering competitive performance against both open-source and proprietary medical language models.

Implementation Details

The model utilizes the Mistral architecture with advanced pre-training techniques and is available in multiple variants including DARE, TIES, and SLERP merged versions. It supports various quantization options for efficient deployment, with VRAM requirements ranging from 4.68GB to 15.02GB depending on the configuration.

  • Multiple quantization options (AWQ, BnB.4, BnB.8) for deployment flexibility
  • Comprehensive evaluation across 10 medical QA benchmarks
  • Multilingual support with evaluation in 8 languages
  • 2048 token context window

Core Capabilities

  • Medical question-answering with state-of-the-art performance
  • Multilingual medical knowledge processing
  • Biomedical literature understanding
  • Clinical knowledge graph comprehension
  • Medical genetics and anatomy expertise

Frequently Asked Questions

Q: What makes this model unique?

BioMistral-7B stands out for its specialized medical domain expertise, multilingual capabilities, and competitive performance against larger proprietary models. It achieved an average accuracy of 57.3% across medical benchmarks, with some variants reaching 59.4%.

Q: What are the recommended use cases?

While the model demonstrates strong capabilities in medical knowledge processing, it is recommended strictly for research purposes. The developers explicitly advise against using it in production environments or for professional medical applications without thorough testing and alignment for specific use cases.

The first platform built for prompt engineering