nb-whisper-large

Maintained by: NbAiLab

NB-Whisper Large

Property          Value
Parameter Count   1.54B
License           Apache 2.0
Languages         Norwegian, Norwegian Bokmål, Norwegian Nynorsk, English
Training Data     66,000 hours of speech
Base Model        OpenAI Whisper Large

What is nb-whisper-large?

NB-Whisper Large is an automatic speech recognition (ASR) model developed by the National Library of Norway for Norwegian speech. It is built on OpenAI's Whisper architecture and, at 1.54B parameters, is the largest variant in the NB-Whisper series. It was trained on a dataset of 8 million samples totaling 66,000 hours of speech.

Implementation Details

The model uses a transformer-based architecture and was trained for 250,000 steps on diverse Norwegian speech data. It is distributed in several formats, including PyTorch, TensorFlow, and ONNX, so it can be deployed in different runtimes.

  • Supports both transcription and translation tasks
  • Handles long-form audio through chunk-based processing
  • Provides word-level and sentence-level timestamp capabilities
  • Compatible with WhisperX for speaker diarization
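
The chunk-based handling of long-form audio mentioned above can be illustrated with a minimal sketch. Whisper models operate on windows of about 30 seconds, so a long recording is split into windows that overlap slightly, which lets text at the boundaries be reconciled afterwards. The function name `chunk_windows` and the 5-second overlap are assumptions chosen for illustration; this is not the exact algorithm used by any particular runtime.

```python
from typing import List, Tuple


def chunk_windows(
    total_s: float,
    chunk_s: float = 30.0,   # Whisper's nominal input window, in seconds
    overlap_s: float = 5.0,  # assumed overlap between neighbouring windows
) -> List[Tuple[float, float]]:
    """Split the interval [0, total_s] into overlapping (start, end) windows."""
    if chunk_s <= 0 or not 0 <= overlap_s < chunk_s:
        raise ValueError("need chunk_s > 0 and 0 <= overlap_s < chunk_s")

    windows: List[Tuple[float, float]] = []
    step = chunk_s - overlap_s  # how far each new window advances
    start = 0.0
    while start < total_s:
        end = min(start + chunk_s, total_s)
        windows.append((start, end))
        if end >= total_s:  # the last window reached the end of the audio
            break
        start += step
    return windows


# A 70-second recording yields three windows.
print(chunk_windows(70.0))
# → [(0.0, 30.0), (25.0, 55.0), (50.0, 70.0)]
```

Each window would then be transcribed separately and the overlapping text merged. A larger overlap gives more context at the boundaries at the cost of more compute.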

Core Capabilities

  • Automatic Speech Recognition in Norwegian (Bokmål and Nynorsk)
  • English translation support
  • Real-time transcription capability through whisper.cpp
  • Flexible output formatting with semantic and verbatim versions
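
As a rough sketch of how these capabilities might be selected in practice, the snippet below builds call options for the Hugging Face `transformers` ASR pipeline. The repository id `NbAiLab/nb-whisper-large`, the helper names `build_call_kwargs` and `transcribe`, the language code `"no"`, and the 28-second chunk length are assumptions made for illustration. How the model responds to each task and language setting should be checked against the official model card.

```python
from typing import Any, Dict, Union

MODEL_ID = "NbAiLab/nb-whisper-large"  # assumed Hugging Face repository id


def build_call_kwargs(
    task: str = "transcribe",              # "transcribe" or "translate"
    language: str = "no",                  # assumed language code for Norwegian
    timestamps: Union[bool, str] = False,  # True for segments, "word" for words
    chunk_length_s: float = 28,            # assumed chunk length for long audio
) -> Dict[str, Any]:
    """Build keyword arguments for a call to the ASR pipeline (hypothetical helper)."""
    if task not in ("transcribe", "translate"):
        raise ValueError(f"unsupported task: {task!r}")
    kwargs: Dict[str, Any] = {
        "chunk_length_s": chunk_length_s,
        "generate_kwargs": {"task": task, "language": language},
    }
    if timestamps:
        kwargs["return_timestamps"] = timestamps
    return kwargs


def transcribe(audio_path: str, **options: Any) -> Dict[str, Any]:
    """Run the model on one audio file; requires `transformers` and a backend."""
    from transformers import pipeline  # imported lazily so the helper above stays dependency-free

    asr = pipeline("automatic-speech-recognition", model=MODEL_ID)
    return asr(audio_path, **build_call_kwargs(**options))


# Example: options for word-level timestamps on a Norwegian recording.
print(build_call_kwargs(timestamps="word"))
```

Calling `transcribe("speech.mp3", timestamps=True)` would download the model on first use. In a real application the pipeline object would normally be created once and reused rather than rebuilt on every call.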

Frequently Asked Questions

Q: What makes this model unique?

The model is specialized for Norwegian: it was trained on 66,000 hours of Norwegian speech data, which makes it particularly effective for Norwegian ASR tasks. On Norwegian speech it is intended to outperform general-purpose speech recognition models that were not tuned for the language.

Q: What are the recommended use cases?

The model is ideal for transcribing Norwegian speech in various contexts, including parliamentary speeches, broadcast media, audiobooks, and general conversational audio. It's particularly suitable for applications requiring high accuracy in Norwegian language understanding and transcription.
