Brief-details: QwQ-R1984-32B quantized to 8-bit GGUF format, optimized for llama.cpp deployment. 32B parameter model with efficient local inference capabilities.
Brief-details: Advanced 32B parameter reasoning model with uncensored output capabilities and web search integration. Built on the Qwen series with an 8K context window.
Brief-details: A quantized 32B parameter LLM optimized for llama.cpp, converted from Qwen/QwQ-32B to GGUF format for efficient local deployment.
Brief-details: Qwen2.5-VL-32B-Instruct converted to GGUF format for efficient local deployment via llama.cpp, optimized for visual-language tasks with Q8 quantization.
Brief-details: A GGUF-formatted 32B parameter language model converted from Qwen/QwQ-32B, optimized for local deployment using llama.cpp with Q8_0 quantization.
Brief-details: A 49B parameter Llama-3 variant converted to GGUF format, optimized for efficient local deployment using llama.cpp, featuring Q4_K_M quantization.
Brief-details: A quantized GGUF version of Mistral's 24B instruction model, optimized for local deployment via llama.cpp with Q4_K_M compression.
Brief-details: Quantized 24B parameter Mistral instruction model optimized for llama.cpp, converted to GGUF format for efficient local deployment and inference. Suitable for both CLI and server applications.
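Several entries above describe GGUF models suitable for both CLI and server use with llama.cpp. A minimal deployment sketch, assuming a current llama.cpp build on PATH and a local GGUF file at `./model-Q4_K_M.gguf` (a hypothetical placeholder path):

```shell
# One-shot generation with llama.cpp's CLI.
# -m: path to the GGUF model; -n: max tokens to generate; -c: context size.
llama-cli -m ./model-Q4_K_M.gguf -p "Explain GGUF in one sentence." -n 128 -c 4096

# Or serve the same model over an OpenAI-compatible HTTP API.
llama-server -m ./model-Q4_K_M.gguf --port 8080 -c 4096

# Query the running server's chat completions endpoint.
curl http://localhost:8080/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{"messages": [{"role": "user", "content": "Hello"}]}'
```

The same GGUF file backs both modes; only the front-end binary changes.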
Brief-details: Quantized 24B parameter Mistral instruction model converted to GGUF format, optimized for local deployment via llama.cpp with Q8 precision.
Brief-details: 27B parameter Gemma model quantized to 6-bit, optimized for llama.cpp deployment with GGUF format conversion from VIDraft/Gemma-3-R1984-27B.
Brief-details: A Q8-quantized 27B parameter Gemma model optimized for llama.cpp, converted from VIDraft/Gemma-3-R1984-27B to GGUF format for efficient local deployment.
Brief-details: Qwen2.5-VL-32B-Instruct GGUF conversion: a 32B parameter multimodal LLM optimized for Q4 quantization, supporting visual and language tasks via llama.cpp.
Brief-details: AccVideo is a high-performance video diffusion model that achieves 8.5x faster inference than HunyuanVideo, utilizing synthetic datasets for efficient distillation.
Brief-details: A 32B parameter quantized GGUF model converted from VIDraft/QwQ-R1984-32B, optimized for llama.cpp deployment with Q4_K_M quantization.
Brief-details: Quantized 12B parameter Gemma model converted to GGUF format for efficient local inference using llama.cpp, optimized for Q6_K precision.
Brief-details: A GGUF-formatted 12B parameter Gemma model optimized for llama.cpp, featuring 8-bit quantization for efficient local inference.
Brief-details: Gemma3-R1984-12B is a 12B parameter multimodal AI platform built on Google's Gemma architecture, featuring web search integration, 8K token context, and secure local deployment capabilities.
Brief-details: A LoRA model for generating stylized images of Morgenstern with different personas, featuring high-resolution 26K output and customizable hair colors.
Brief-details: A LoRA model for generating varied Boris Yeltsin images with different hair colors and roles, such as Japanese president and police officer.
Brief-details: A specialized LoRA model for generating black string sandal images, built on FLUX.1-dev base model. Requires "SS" trigger word for optimal results.
Brief-details: A LoRA model specialized in generating images of Asian females wearing specific necklace designs (NL1), optimized for outdoor photography scenarios with natural lighting.
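The LoRA entries above follow a common pattern: load the adapter on top of a FLUX.1-dev base pipeline and activate it with its trigger word (e.g. "SS" for the sandal model). A minimal sketch using the diffusers library; the LoRA repo id `user/black-string-sandal-lora` is a hypothetical placeholder, and `generate()` requires a GPU plus a multi-gigabyte weight download, so it is defined but not invoked here:

```python
def build_prompt(trigger: str, description: str) -> str:
    """Prepend the LoRA's trigger word so the adapter's learned concept activates."""
    return f"{trigger}, {description}"


def generate() -> None:
    # Heavy imports kept inside the function: they need torch, diffusers,
    # a CUDA device, and the FLUX.1-dev weights to actually run.
    import torch
    from diffusers import FluxPipeline

    pipe = FluxPipeline.from_pretrained(
        "black-forest-labs/FLUX.1-dev", torch_dtype=torch.bfloat16
    ).to("cuda")
    # Hypothetical LoRA repo id; substitute the actual adapter checkpoint.
    pipe.load_lora_weights("user/black-string-sandal-lora")

    prompt = build_prompt("SS", "black string sandals on a white studio backdrop")
    image = pipe(prompt, num_inference_steps=28, guidance_scale=3.5).images[0]
    image.save("sandals.png")
```

Omitting the trigger word typically leaves the adapter's concept dormant, which is why these model cards call it out explicitly.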