Microsoft

Cluster

Microsoft Signal

Research theregister.com May 11, 9:00 PM UTC

Microsoft researchers reveal frontier AI models degrade on long-running tasks

LLMs like Gemini 3.1 Pro, Claude 4.6 Opus, and GPT-5.4 lose up to 50% accuracy over repeated interactions, corrupting documents in most cases. Agents with tools perform even worse.

Developing signal: Microsoft LLM Long Task Degradation

65 Avg Signal

2 Verified

100% Linked

techcrunch.com 34x · 14/20

cnbc.com 16x · 8/20

nytimes.com 15x · 8/20

fortune.com 11x · 8/20

finance.yahoo.com 11x · 8/20

AI Pulse

68/100

bullish

AI infrastructure and chip stocks see modest gains on sustained datacenter demand

NVDA +1.8%

NVIDIA chips

Recurring Movers

NVDA 11 hits · +1.8%

MSFT 3 hits · +2.9%

AMD 3 hits · -2.8%

CBRS 2 hits · +70%
