Cracked AI Engineering

Llama-3.2-11B-Vision-Instruct

GitHub ModelsOpen Weights

The Llama-3.2-11B-Vision-Instruct AI model by GitHub Models is a powerful tool for developers seeking advanced reasoning and multimodal capabilities. With a context window of 128,000 tokens and the ability to process text, vision, audio, and function calling, this model offers unparalleled flexibility for a wide range of applications. Open weights, affordable pricing, and a training data cutoff of 2023 make it an accessible and practical choice for developers looking to harness cutting-edge AI technology.

Key Specifications

Context Window

128K tokens

Max Output Tokens

8K tokens

Input Pricing

Free

per million tokens

Output Pricing

Free

per million tokens

Capabilities

  • Text Generation
  • Vision
  • Audio Input
  • Function Calling
  • Advanced Reasoning

Additional Details

Release Date
September 25, 2024
Advanced Reasoning
Supported
Supported Input Types
text, image, audio

Compare with Similar Models