The Llama-4-Maverick-17B-128E-Instruct-FP8 AI model by Llama offers developers a powerful tool with a context window of 128,000 tokens and an output limit of 4,096 tokens. With capabilities for text input, vision processing, and function calling, this model excels in handling complex tasks requiring large-scale reasoning. Open weights and a pricing model of $0/M input and $0/M output tokens make this AI model a cost-effective choice for cutting-edge projects.

Key Specifications

Context Window

128K tokens

Max Output Tokens

4K tokens

Input Pricing

Free

per million tokens

Output Pricing

Free

per million tokens

Capabilities

Text Generation
Vision
Function Calling
File Attachments

Additional Details

Provider: Llama
Release Date: April 5, 2025
Supported Input Types: text, image

Compare with Similar Models

Llama-4-Maverick-17B-128E-Instruct-FP8 vs Llama-3.3-8B-Instruct

Compare specifications and pricing

Llama-4-Maverick-17B-128E-Instruct-FP8 vs Llama-3.3-70B-Instruct

Compare specifications and pricing

Llama-4-Maverick-17B-128E-Instruct-FP8 vs Kimi K2 0711

Compare specifications and pricing