Cracked AI Engineering

Qwen 3 Embedding 4B vs Google Gemma 3

Google Gemma 3 offers a 125K-token context window versus 32K tokens for Qwen 3 Embedding 4B, and Google Gemma 3 supports vision. Compare full specs and pricing to choose the best model for your use case.

Quick Overview

Qwen 3 Embedding 4B

Inference

32K tokens context • $0.01 input / Free output per 1M tokens

Google Gemma 3

Inference

125K tokens context • $0.15 input / $0.30 output per 1M tokens

Detailed Comparison

Specification                    Qwen 3 Embedding 4B   Google Gemma 3
Provider                         Inference             Inference
Context Window                   32K tokens            125K tokens
Max Output Tokens                2K tokens             4K tokens
Input Pricing (per 1M tokens)    $0.01                 $0.15
Output Pricing (per 1M tokens)   Free                  $0.30
Release Date                     Jan 2025              Jan 2025
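
To make the pricing rows concrete, here is a minimal sketch that estimates the cost of a workload from the per-1M-token prices listed above. The function name `estimate_cost` and the `PRICES` dictionary are hypothetical helpers introduced for illustration, and the sketch assumes "Free" output means $0.00 per token.

```python
# Prices in USD per 1M tokens, copied from the comparison table.
# Assumption: "Free" output pricing is treated as 0.0.
PRICES = {
    "Qwen 3 Embedding 4B": {"input": 0.01, "output": 0.0},
    "Google Gemma 3": {"input": 0.15, "output": 0.30},
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one workload for the given model."""
    price = PRICES[model]
    total = input_tokens * price["input"] + output_tokens * price["output"]
    return total / 1_000_000  # prices are quoted per 1M tokens


if __name__ == "__main__":
    for name in PRICES:
        cost = estimate_cost(name, input_tokens=2_000_000, output_tokens=500_000)
        print(f"{name}: ${cost:.2f}")
```

For a workload of 2M input tokens and 500K output tokens, this works out to about $0.02 for Qwen 3 Embedding 4B and about $0.45 for Google Gemma 3 at the listed prices.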

Capabilities

Capability          Qwen 3 Embedding 4B   Google Gemma 3
Text Generation
Vision                                    Yes
Function Calling
File Attachments

Which Model Should You Choose?

Choose Qwen 3 Embedding 4B if:

  • Cost efficiency is a priority

Choose Google Gemma 3 if:

  • You need a larger context window
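
The two recommendations above can be sketched as a simple selection rule: pick the cheapest model whose context window fits the request. This is only an illustration under stated assumptions. The `MODELS` dictionary and `pick_model` function are hypothetical, "K" is assumed to mean 1,000 tokens, and the rule ignores capabilities such as vision.

```python
from typing import Optional

# Context windows (tokens) and input prices (USD per 1M tokens) from the comparison.
# Assumption: "32K" and "125K" are read as 32,000 and 125,000 tokens.
MODELS = {
    "Qwen 3 Embedding 4B": {"context": 32_000, "input_price": 0.01},
    "Google Gemma 3": {"context": 125_000, "input_price": 0.15},
}


def pick_model(required_context: int) -> Optional[str]:
    """Return the cheapest model whose context window fits, or None if none fits."""
    fitting = [
        name for name, spec in MODELS.items()
        if spec["context"] >= required_context
    ]
    if not fitting:
        return None
    # Among models that fit, prefer the lowest input price.
    return min(fitting, key=lambda name: MODELS[name]["input_price"])


if __name__ == "__main__":
    for tokens in (10_000, 60_000, 200_000):
        print(tokens, "->", pick_model(tokens))
```

With these figures, a 10,000-token request selects Qwen 3 Embedding 4B on cost, a 60,000-token request needs Google Gemma 3, and a 200,000-token request fits neither model.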