Cracked AI Engineering

@cf/meta/llama-3.3-70b-instruct-fp8-fast vs @cf/deepgram/aura-1

Compare @cf/meta/llama-3.3-70b-instruct-fp8-fast and @cf/deepgram/aura-1, both served by Cloudflare Workers AI. See detailed specifications, pricing, and capabilities side by side.

Quick Overview

@cf/meta/llama-3.3-70b-instruct-fp8-fast

Cloudflare Workers AI

24K-token context • $0.29 input / $2.25 output per 1M tokens

@cf/deepgram/aura-1

Cloudflare Workers AI

0-token context • $0.01 input / $0.01 output per 1M tokens

Detailed Comparison

| Specification | @cf/meta/llama-3.3-70b-instruct-fp8-fast | @cf/deepgram/aura-1 |
| --- | --- | --- |
| Provider | Cloudflare Workers AI | Cloudflare Workers AI |
| Context Window | 24K tokens | 0 tokens |
| Max Output Tokens | 24K tokens | 0 tokens |
| Input Pricing (per 1M tokens) | $0.29 | $0.01 |
| Output Pricing (per 1M tokens) | $2.25 | $0.01 |
| Release Date | Dec 2024 | Aug 2025 |
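
To make the per-1M-token prices concrete, here is a minimal sketch of how a request cost could be estimated from the listed input and output prices. The function name and the example token counts are hypothetical and chosen only for illustration; the prices are the ones listed above for @cf/meta/llama-3.3-70b-instruct-fp8-fast, and actual billing may differ.

```python
def estimate_cost_usd(input_tokens, output_tokens,
                      input_price_per_m, output_price_per_m):
    """Estimate the USD cost of one request from per-1M-token prices."""
    return (input_tokens * input_price_per_m
            + output_tokens * output_price_per_m) / 1_000_000


# Listed prices for @cf/meta/llama-3.3-70b-instruct-fp8-fast (USD per 1M tokens).
LLAMA_INPUT_PRICE = 0.29
LLAMA_OUTPUT_PRICE = 2.25

# Hypothetical workload: 10,000 input tokens and 2,000 output tokens.
cost = estimate_cost_usd(10_000, 2_000, LLAMA_INPUT_PRICE, LLAMA_OUTPUT_PRICE)
print(f"{cost:.4f}")
```

Because the output price is several times the input price, output-heavy workloads dominate the bill for this model, so the split between input and output tokens matters more than the total.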

Capabilities

Capability
@cf/meta/llama-3.3-70b-instruct-fp8-fast
@cf/deepgram/aura-1
Text Generation
Function Calling

Which Model Should You Choose?

Choose @cf/meta/llama-3.3-70b-instruct-fp8-fast if:

  • You need a larger context window

Choose @cf/deepgram/aura-1 if:

  • Cost efficiency is a priority