Cracked AI Engineering

@cf/meta/llama-3.3-70b-instruct-fp8-fast

Cloudflare Workers AIOpen Weights

The @cf/meta/llama-3.3-70b-instruct-fp8-fast AI model by Cloudflare Workers AI offers developers a powerful tool with a context window of 24,000 tokens and an output limit of 24,000 tokens, enabling advanced text input and function calling capabilities. With open weights and competitive pricing at $0.29/M input and $2.25/M output tokens, this model is ideal for a wide range of AI applications requiring large-scale processing and reasoning abilities.

Key Specifications

Context Window

24K tokens

Max Output Tokens

24K tokens

Input Pricing

$0.29

per million tokens

Output Pricing

$2.25

per million tokens

Capabilities

  • Text Generation
  • Function Calling

Additional Details

Release Date
December 6, 2024
Supported Input Types
text

Compare with Similar Models