Llama 4 is Meta's fourth generation of open-weight large language models, released in April 2025. Every Llama 4 model uses a Mixture-of-Experts (MoE) architecture: the flagship released model, Llama 4 Maverick, has roughly 400B total parameters but activates only 17B per token (128 experts), which lets it run on a single NVIDIA H100 DGX host despite its size. Meta reports that Maverick outperforms GPT-4o and Gemini 2.0 Flash across a broad range of benchmarks, including coding and reasoning. The released family comprises Llama 4 Scout (109B total / 17B active, 16 experts, able to fit on a single H100 GPU with Int4 quantization) and Maverick, with the much larger Llama 4 Behemoth previewed but not released at launch. Both released models are natively multimodal, officially support 12 languages, and offer context windows of up to 10M tokens (Scout) and 1M tokens (Maverick). Community fine-tunes began appearing on Hugging Face within days of release.
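To make the "400B total, 17B active" arithmetic concrete, here is a minimal sketch of top-1 MoE routing: a learned router scores every expert per token, and only the winning expert's feed-forward weights are applied, so most parameters sit idle on any given token. This is a toy PyTorch illustration, not Llama 4's actual routing code; the real design (shared expert, load balancing, expert parallelism) is an assumption left out here.

```python
import torch
import torch.nn as nn

class ToyMoELayer(nn.Module):
    """Toy mixture-of-experts feed-forward layer with top-1 routing.

    Illustrative only: expert count, gating, and balancing details
    differ from Llama 4's production implementation.
    """

    def __init__(self, d_model: int, d_ff: int, n_experts: int):
        super().__init__()
        # Router produces one score per expert for each token.
        self.router = nn.Linear(d_model, n_experts, bias=False)
        self.experts = nn.ModuleList(
            nn.Sequential(
                nn.Linear(d_model, d_ff), nn.SiLU(), nn.Linear(d_ff, d_model)
            )
            for _ in range(n_experts)
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (n_tokens, d_model)
        logits = self.router(x)                        # (n_tokens, n_experts)
        weights, idx = logits.softmax(dim=-1).max(-1)  # top-1 gate per token
        out = torch.zeros_like(x)
        for e, expert in enumerate(self.experts):
            mask = idx == e
            if mask.any():
                # Only the selected expert's parameters touch these tokens;
                # the other experts contribute no compute for them.
                out[mask] = weights[mask, None] * expert(x[mask])
        return out

layer = ToyMoELayer(d_model=64, d_ff=256, n_experts=8)
tokens = torch.randn(10, 64)
print(layer(tokens).shape)  # torch.Size([10, 64])
```

Because each token routes to one expert's feed-forward block, per-token compute scales with the active parameter count rather than the total, which is why a 400B-parameter MoE model can serve tokens at roughly the cost of a 17B dense feed-forward pass.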