#GPT-4.1

All articles tagged with "GPT-4.1"

Lens is a 3.8B-parameter text-to-image model that uses one-fifth the compute of comparable models.
It relies on 800M image-text pairs with detailed GPT-4.1 captions (100 words on average).
An ablation study shows that detailed captions outperform short or mixed captions.
The architecture includes a semantic VAE from FLUX.2 and a GPT-OSS text encoder.

DT Editorial Team·Jun 9, 2026·via the-decoder.com

DT Editorial Team·Apr 16, 2026·via livescience.com