Llama-3.2-11B-Vision-Instruct
GitHub ModelsOpen Weights
The Llama-3.2-11B-Vision-Instruct AI model by GitHub Models is a powerful tool for developers seeking advanced reasoning and multimodal capabilities. With a context window of 128,000 tokens and the ability to process text, vision, audio, and function calling, this model offers unparalleled flexibility for a wide range of applications. Open weights, affordable pricing, and a training data cutoff of 2023 make it an accessible and practical choice for developers looking to harness cutting-edge AI technology.
Key Specifications
Context Window
128K tokens
Max Output Tokens
8K tokens
Input Pricing
Free
per million tokens
Output Pricing
Free
per million tokens
Capabilities
- Text Generation
- Vision
- Audio Input
- Function Calling
- Advanced Reasoning
Additional Details
- Provider
- GitHub Models
- Release Date
- September 25, 2024
- Advanced Reasoning
- Supported
- Supported Input Types
- text, image, audio