February, is rumored to outperform ChatGPT and Claude in long-context coding, targeting elite-level programming tasks.
DeepSeek founder Liang Wenfeng has published a new paper with a research team from Peking University, outlining key technical ...
DeepSeek has released a new AI training method that analysts say is a "breakthrough" for scaling large language models.
DeepSeek's new research enables retrieval using computational memory, not neural computation, freeing up GPUs.
Another Chinese quantitative trading firm has entered the race to develop large language models (LLMs), unveiling systems it ...
DeepSeek has released new research showing that a promising but fragile neural network design can be stabilised at scale, ...
Chinese and Western large language models are reshaping global information power, embedding political world views into the ...
The Chinese AI lab may have just found a way to train advanced LLMs in a manner that's practical and scalable, even for more cash-strapped developers.
DeepSeek's latest technical paper, co-authored by the firm's founder and CEO Liang Wenfeng, has been cited as a potential ...
Chinese AI startup DeepSeek is expected to launch its next-generation AI model that features strong coding capabilities in ...
DeepSeek's proposed "mHC" architecture could transform the training of large language models (LLMs) - the technology behind ...
DeepSeek continues to push the frontier of generative AI... in this case, in terms of affordability. The company has unveiled its latest experimental large language model (LLM), DeepSeek-V3.2-Exp, that ...