Large Language Models (LLMs) are neural networks trained on vast corpora of text to predict and generate human language. The transformer architecture, introduced in the 2017 paper "Attention Is All You Need", underpins virtually every modern LLM, from GPT to Claude to LLaMA. LLMs learn patterns, facts, reasoning chains, and writing styles from their training data, enabling them to answer questions, write code, summarize documents, translate languages, and more. The field is advancing rapidly: model sizes have grown from billions to trillions of parameters, context windows have expanded from 4K to 2M tokens, and capabilities now stretch beyond text to image, audio, and video.
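The core operation of the transformer mentioned above is scaled dot-product attention, softmax(QK^T / sqrt(d_k))V. Here is a minimal single-head sketch in NumPy; the matrix shapes and random values are illustrative assumptions, not anything from the paper:

```python
import numpy as np

def scaled_dot_product_attention(Q, K, V):
    """Single-head attention: softmax(Q K^T / sqrt(d_k)) V."""
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)               # pairwise query-key similarity
    scores -= scores.max(axis=-1, keepdims=True)  # shift for numerical stability
    weights = np.exp(scores)
    weights /= weights.sum(axis=-1, keepdims=True)  # softmax over keys
    return weights @ V                            # weighted mix of value vectors

# Toy setup: 3 tokens with 4-dimensional projections (illustrative only)
rng = np.random.default_rng(0)
Q = rng.standard_normal((3, 4))
K = rng.standard_normal((3, 4))
V = rng.standard_normal((3, 4))
out = scaled_dot_product_attention(Q, K, V)
print(out.shape)  # one contextualized vector per token
```

Each output row blends all value vectors, weighted by how strongly that token's query matches every key; real models run many such heads in parallel over learned projections of the token embeddings.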