FireFunction-v1

Property	Value
Parameter Count	46.7B
Model Type	Function Calling
License	Apache 2.0
Tensor Type	FP16

What is firefunction-v1?

FireFunction-v1 is a state-of-the-art function calling model developed by Fireworks AI, designed to achieve near GPT-4 level quality for structured information generation and routing decision-making. This model stands out for its exceptional speed, operating approximately 4x faster than GPT-4 when deployed on the Fireworks platform.

Implementation Details

The model is built on a transformer architecture and optimized for single-turn request routing and structured information extraction. It features API compatibility with OpenAI's function calling interface and supports up to 20 function specifications in its function pool.

46.7B parameter model with FP16 precision
Compatible with OpenAI function calling API
Unique support for "any" parameter in tool_choice
Optimized for high-speed inference

Core Capabilities

Single-turn request routing to functions
Structured information extraction
Fast inference processing
Function selection from large function pools
API compatibility with existing frameworks

Frequently Asked Questions

Q: What makes this model unique?

FireFunction-v1 stands out due to its combination of GPT-4 level quality, significantly faster inference speeds, and unique support for the "any" parameter in tool_choice. It's also one of the few models specifically optimized for function calling with a commercial-friendly license.

Q: What are the recommended use cases?

The model excels in two primary areas: single-turn request routing when choosing from up to 20 function specifications, and structured information extraction. It's not recommended for general multi-turn chat or parallel/nested function calls in a single response.

firefunction-v1