@cf/meta/llama-3.1-8b-instruct-fp8 vs Qwen-Omni Turbo
Qwen-Omni Turbo supports vision, @cf/meta/llama-3.1-8b-instruct-fp8 has open weights. Compare full specs, pricing, and choose the best model for your use case.
Quick Overview
@cf/meta/llama-3.1-8b-instruct-fp8
Cloudflare Workers AI
32K tokens context • $0.15 / $0.29 per 1M tokens
View full specifications →Detailed Comparison
Specification
@cf/meta/llama-3.1-8b-instruct-fp8
Qwen-Omni Turbo
Provider
Cloudflare Workers AI
Alibaba
Context Window
32K tokens
33K tokens
Max Output Tokens
32K tokens
2K tokens
Input Pricing (per 1M tokens)
$0.15
$0.07
Output Pricing (per 1M tokens)
$0.29
$0.27
Release Date
Jul 2024
Jan 2025
Capabilities
Capability
@cf/meta/llama-3.1-8b-instruct-fp8
Qwen-Omni Turbo
Text Generation
Function Calling
Vision
Audio Input
Video Understanding
Which Model Should You Choose?
Choose @cf/meta/llama-3.1-8b-instruct-fp8 if:
- • You prefer open weights
Choose Qwen-Omni Turbo if:
- • You need a larger context window
- • Cost efficiency is a priority