ConvNeXt Nano Model

Property	Value
Parameter Count	15.6M
License	Apache 2.0
Framework	PyTorch (timm)
Paper	A ConvNet for the 2020s

What is convnext_nano.in12k_ft_in1k?

ConvNeXt Nano is a lightweight convolutional neural network designed for efficient image classification. Initially pre-trained on ImageNet-12k (a subset of ImageNet-22k with 11,821 classes) and fine-tuned on ImageNet-1k, it represents a modern approach to CNN architecture that combines efficiency with strong performance.

Implementation Details

The model features 15.6M parameters and requires 2.5 GMACs (billion multiply-accumulate operations) for inference. It operates on images with 224x224 resolution during training and 288x288 during testing, offering a good balance between computational efficiency and accuracy.

Training performed on TPUs through Google's TRC program
Fine-tuning completed on 8x GPU Lambda Labs cloud instances
Achieves 82.282% top-1 accuracy on ImageNet-1k
Processes 3926.52 samples per second with batch size 256

Core Capabilities

Image classification with 1000 classes
Feature map extraction at multiple scales
Image embedding generation
Efficient inference with F32 tensor type support

Frequently Asked Questions

Q: What makes this model unique?

This model represents an excellent balance between model size and performance, making it particularly suitable for resource-constrained applications while maintaining competitive accuracy. Its nano architecture is specifically optimized for efficiency while preserving the core benefits of the ConvNeXt architecture.

Q: What are the recommended use cases?

The model is well-suited for production environments where computational resources are limited but accurate image classification is required. It's particularly effective for mobile applications, edge devices, or scenarios requiring real-time inference with reasonable accuracy.

convnext_nano.in12k_ft_in1k