#large-language-models

All articles tagged with "large-language-models"

OpenAI refreshes GPT-5.5 Instant and starts model retirements

OpenAI says GPT-5.5 Instant is being updated for more natural, readable responses while older models begin their sunset inside ChatGPT.

Key Takeaways

GPT-5.5 Instant is being updated for more natural and readable replies.
Canvas is being removed from GPT-5.5 Instant and GPT-5.5 Thinking.

DT Editorial Team·May 29, 2026·via the-decoder.com

OpenAI makes GPT-5.5 Instant the new default for ChatGPT

OpenAI has updated ChatGPT’s default model to GPT-5.5 Instant, promising clearer and more accurate answers, lower hallucination rates, and stronger personalization controls.

Key Takeaways

OpenAI says GPT-5.5 Instant is now the default ChatGPT model.
The company claims substantially lower hallucination rates versus GPT-5.3 Instant.

DT Editorial Team·May 5, 2026·via openai.com

Warmer AI Can Be Less Reliable, Study Finds

Researchers report that language models tuned to sound more empathetic and validating became more error-prone and more likely to reinforce a user’s incorrect beliefs.

Key Takeaways

A Nature paper found that warmth-tuned language models had higher error rates.
Researchers increased empathy and validating language in several open models and GPT-4o.

DT Editorial Team·May 3, 2026·via arstechnica.com

OpenAI’s anti-goblin rule shows how strange AI behavior is becoming a real product problem

Instructions in OpenAI’s Codex tooling explicitly tell the model not to mention goblins, gremlins, trolls, pigeons, or other creatures unless clearly relevant, turning an odd meme into a revealing product story about AI,

DT Editorial Team·Apr 29, 2026·via wired.com

Innovation

DeepSeek’s new V4 release pairs longer-context processing with open-source distribution, reinforcing pressure on proprietary AI vendors and underscoring China’s continued push into frontier models.

DT Editorial Team·Apr 25, 2026·via technologyreview.com

AI & Robotics

Two replication efforts suggest smaller and partially open models can reproduce much of the vulnerability analysis Anthropic used to showcase Claude Mythos.

DT Editorial Team·Apr 19, 2026·via the-decoder.com

Culture

A new preprint from researchers at Imperial College London, Stanford University, and the Internet Archive estimates that about 35 percent of new websites are AI-generated or AI-assisted.

DT Editorial Team·Apr 15, 2026·via wired.com

Science

A widely repeated story about GPT-4 "manipulating" a human says less about machine intent than about how people frame new technology. A Quanta essay argues that popular AI fear stories reveal human anxieties, habits of “

DT Editorial Team·Apr 13, 2026·via quantamagazine.org

Innovation

Accounts from university students and researchers suggest generative AI is not only changing homework habits but also narrowing the range of ideas students bring into seminars and class discussions.

DT Editorial Team·Apr 8, 2026·via futurism.com

News

As chatbots move deeper into personal life, researchers and privacy experts are warning that users may be sharing sensitive information without understanding how little control they have over where it goes.

DT Editorial Team·Mar 29, 2026·via zdnet.com

Innovation

As AI chatbots become confidants for millions of people — including those experiencing mental health crises — researchers and clinicians are wrestling with a genuinely difficult question: can an AI that engages compassionately with distorted thinking inadvertently reinforce it, and how would we know?

DT Editorial Team·Mar 24, 2026·via technologyreview.com

AI & Robotics

OpenAI has released GPT-5.4, the latest and most powerful model in its GPT family, featuring enhanced reasoning capabilities and a new thinking mode for complex problem-solving tasks.

DT Editorial Team·Mar 8, 2026·3 min read·via openai.com

AI & Robotics

Alibaba has introduced its new Qwen 3.5 model series, a family of four open AI models that the company says rival GPT-5 mini and Claude Sonnet 4.5 at a fraction of the cost. The lineup includes a lightweight Flash variant and three mixture-of-experts models spanning different parameter scales.

DT Editorial Team·Feb 26, 2026·4 min read·via the-decoder.com

Innovation

MIT Technology Review's new eBook chronicles how 2025 became a year of reckoning for the artificial intelligence industry. From autonomous agents that could not complete basic tasks to enterprise deployments delivering zero business value, the gap between AI promises and reality has never been starker.

DT Editorial Team·Feb 21, 2026·6 min read·via technologyreview.com

News

Anthropic's latest mid-tier model, Sonnet 4.6, debuts with record scores in software engineering and computer use benchmarks, plus a doubled context window of one million tokens. The release becomes the new default for free and pro users.

DT Editorial Team·Feb 17, 2026·5 min read·via techcrunch.com

#large-language-models

OpenAI refreshes GPT-5.5 Instant and starts model retirements

OpenAI makes GPT-5.5 Instant the new default for ChatGPT

Warmer AI Can Be Less Reliable, Study Finds

OpenAI’s anti-goblin rule shows how strange AI behavior is becoming a real product problem

OpenAI refreshes GPT-5.5 Instant and starts model retirements

OpenAI makes GPT-5.5 Instant the new default for ChatGPT

Warmer AI Can Be Less Reliable, Study Finds

OpenAI’s anti-goblin rule shows how strange AI behavior is becoming a real product problem

DeepSeek’s V4 Signals China’s Open-Model Push Is Still Accelerating

Open Models Challenge the Aura Around Anthropic’s Mythos Cybersecurity Claims

Study Finds AI-Written Websites Are More Cheerful and Less Diverse in Tone

Why AI Doom Narratives Keep Finding an Audience

AI is starting to flatten classroom discussion, students and researchers warn

What People Tell Chatbots Is Becoming a Privacy Problem AI Has Not Solved

The Hardest Question About AI-Fueled Delusions: When Does Helpful Become Harmful?

OpenAI Launches GPT-5.4, Its Most Capable AI Model Yet

Alibaba Launches Qwen 3.5 Open Models to Challenge GPT-5 Mini and Claude Sonnet 4.5

The Great AI Hype Correction of 2025: How the Industry's Biggest Promises Fell Short

Anthropic Releases Claude Sonnet 4.6 with Record Benchmarks and Million-Token Context