@cf/meta/llama-3.1-8b-instruct-fp8 vs @hf/thebloke/mistral-7b-instruct-v0.1-awq
Compare @cf/meta/llama-3.1-8b-instruct-fp8 by Cloudflare Workers AI and @hf/thebloke/mistral-7b-instruct-v0.1-awq by Cloudflare Workers AI. See detailed specifications, pricing, and capabilities side-by-side.
Quick Overview
@cf/meta/llama-3.1-8b-instruct-fp8
Cloudflare Workers AI
32K tokens context • $0.15 / $0.29 per 1M tokens
View full specifications →@hf/thebloke/mistral-7b-instruct-v0.1-awq
Cloudflare Workers AI
4K tokens context • Free
View full specifications →Detailed Comparison
Specification
@cf/meta/llama-3.1-8b-instruct-fp8
@hf/thebloke/mistral-7b-instruct-v0.1-awq
Provider
Cloudflare Workers AI
Cloudflare Workers AI
Context Window
32K tokens
4K tokens
Max Output Tokens
32K tokens
4K tokens
Input Pricing (per 1M tokens)
$0.15
Free
Output Pricing (per 1M tokens)
$0.29
Free
Release Date
Jul 2024
Sep 2023
Capabilities
Capability
@cf/meta/llama-3.1-8b-instruct-fp8
@hf/thebloke/mistral-7b-instruct-v0.1-awq
Text Generation
Function Calling
Which Model Should You Choose?
Choose @cf/meta/llama-3.1-8b-instruct-fp8 if:
- • You need a larger context window
Choose @hf/thebloke/mistral-7b-instruct-v0.1-awq if:
- • Cost efficiency is a priority