Cracked AI Engineering

Llama-4-Maverick-17B-128E-Instruct-FP8

LlamaOpen Weights

The Llama-4-Maverick-17B-128E-Instruct-FP8 AI model by Llama offers developers a powerful tool with a context window of 128,000 tokens and an output limit of 4,096 tokens. With capabilities for text input, vision processing, and function calling, this model excels in handling complex tasks requiring large-scale reasoning. Open weights and a pricing model of $0/M input and $0/M output tokens make this AI model a cost-effective choice for cutting-edge projects.

Key Specifications

Context Window

128K tokens

Max Output Tokens

4K tokens

Input Pricing

Free

per million tokens

Output Pricing

Free

per million tokens

Capabilities

  • Text Generation
  • Vision
  • Function Calling
  • File Attachments

Additional Details

Provider
Llama
Release Date
April 5, 2025
Supported Input Types
text, image

Compare with Similar Models