Large Language Model Training

OpenAI unveils its new GPT-4.5 large language model

OpenAI released a new base model on Thursday called GPT-4.5, which the company said is its best and smartest model for chat yet. It’s not a reasoning model like OpenAI’s o1 and o3 models, but it can ...

The Conversation

Large language models: how the AI behind the likes of ChatGPT actually works

Mark Stevenson has previously received funding from Google. The arrival of AI systems called large language models (LLMs), like OpenAI’s ChatGPT chatbot, has been heralded as the start of a new ...

Nvidia’s new technique cuts LLM reasoning costs by 8x without losing accuracy

Nvidia researchers developed dynamic memory sparsification (DMS), a technique that compresses the KV cache in large language models by up to 8x while maintaining reasoning accuracy — and it can be ...

The Economist

Forget DeepSeek. Large language models are getting cheaper still

As recently as 2022, just building a large language model (LLM) was a feat at the cutting edge of artificial-intelligence (AI) engineering. Three years on, experts are harder to impress. To really ...

Virtualization Review

Large Language Model Selection -- Why the Parameter Count Isn't Everything

When choosing a large language model (LLM) for use in a particular task, one of the first things that people often look at is the model's parameter count. A vendor might offer several different ...

Neuroscience News

Cognitive Illusion: Why AI Still Can’t Think Like a Human

Forget the hype about AI "solving" human cognition, new research suggests unified models like Centaur are just overfitted ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results