METR, which runs the benchmark measuring how well models can complete long-duration tasks, found that Claude Mythos Preview ...
Anthropic says AI development may need to slow as systems approach recursive self-improvement. The company proposes global ...