evo-1-131k-base

Property	Value
Parameter Count	6.45B
License	Apache 2.0
Context Length	131,072 tokens
Paper	Science Journal Publication

What is evo-1-131k-base?

evo-1-131k-base is a cutting-edge biological foundation model designed for long-context genomic sequence modeling and design. Built on the innovative StripedHyena architecture, it represents a significant advancement in processing biological sequences at single-nucleotide resolution. Trained on the OpenGenome dataset containing approximately 300 billion tokens, this model demonstrates exceptional capability in handling sequences up to 650,000 tokens in length.

Implementation Details

The model utilizes a hybrid architecture combining multi-head attention and gated convolutions arranged in Hyena blocks. This design enables near-linear scaling of compute and memory relative to context length, making it particularly efficient for long sequence processing.

Deep signal processing architecture with specialized layer classes
Supports multiple parametrization options for different workloads
Mixed precision implementation with float32 precision for poles and residues
Efficient autoregressive generation capability

Core Capabilities

Processing sequences at single-nucleotide, byte-level resolution
Generation of sequences exceeding 650k length on a single 80GB GPU
3x faster training and finetuning at 131k context length
Robust performance beyond traditional compute-optimal frontiers
Flexible parametrization for various inference and finetuning scenarios

Frequently Asked Questions

Q: What makes this model unique?

The model's StripedHyena architecture sets it apart by enabling efficient processing of extremely long biological sequences while maintaining linear scaling characteristics. Its hybrid approach combining attention and convolution mechanisms makes it particularly suited for genomic-scale applications.

Q: What are the recommended use cases?

The model is specifically designed for biological sequence modeling at the genome scale, making it ideal for tasks involving whole-genome analysis, sequence design, and other genomics applications requiring long-context understanding.

evo-1-131k-base

evo-1-131k-base

What is evo-1-131k-base?

Implementation Details

Core Capabilities

Frequently Asked Questions

Q: What makes this model unique?

Q: What are the recommended use cases?

Related Models

The first platform built for prompt engineering