Llama 4 documentation. See the following sections for details about the meta. 1 405B— the first frontier-level open source AI model. Amazon Bedrock supports a variety of tiers including Standard, Flex, Priority, and Reserved tiers. - ollama/ollama. Output: multilingual text, code Models Llama 4 Scout ollama run llama4:scout 109B parameter MoE model with 17B active parameters Llama 4 Maverick ollama run llama4:maverick 400B parameter MoE model with 17B active parameters Intended Use Intended Use Cases: Llama 4 is intended for commercial and research use in multiple languages. The models have a knowledge cutoff of August 2024. Python bindings for llama. LangChain provides a prebuilt agent architecture and model integrations to help you get started quickly and seamlessly incorporate LLMs into your agents and applications. Amazon Bedrock offers select foundation models (FMs) from leading AI providers like Anthropic, Meta, Mistral AI, and Amazon for batch inference at a 50% lower price compared to on-demand inference pricing. Feb 25, 2026 ยท The Llama 4 models leverage a Mixture of Experts (MoE) architecture, enabling efficient and powerful processing capabilities.