
2025-06-27 04:01:07

There's a new AI player in town, and you might want to pay attention to this one.

On Monday, Chinese artificial intelligence company DeepSeek launched a new, open-source large language model called DeepSeek R1.

According to DeepSeek, R1 beats other popular large language models (LLMs), such as those from OpenAI, in several important benchmarks, and it's especially good with mathematical, coding, and reasoning tasks.

DeepSeek R1 is actually a refinement of DeepSeek R1 Zero, which is an LLM that was trained without a conventionally used method called supervised fine-tuning. This made it very capable in certain tasks, but as DeepSeek itself puts it, Zero had "poor readability and language mixing." Enter R1, which fixes these issues by incorporating "multi-stage training and cold-start data" before it was trained with reinforcement learning.


Arcane technical language aside (the details are online if you're interested), there are several key things you should know about DeepSeek R1. First, it's open source, meaning it's up for scrutiny from experts, which should alleviate concerns about privacy and security. Second, it's free to use as a web app, while API access is very cheap ($0.14 per one million input tokens, compared to $7.50 for OpenAI's most powerful reasoning model, o1).
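To put those per-token prices in perspective, here is a minimal Python sketch of the arithmetic. The two prices are the ones quoted above; the monthly workload of 250 million input tokens and the helper name `input_cost` are made up for illustration, and real bills also depend on output tokens and other pricing details not covered here.

```python
# Prices in USD per one million input tokens, as quoted in the article.
DEEPSEEK_R1_PER_MILLION = 0.14
OPENAI_O1_PER_MILLION = 7.50


def input_cost(tokens: int, price_per_million: float) -> float:
    """Return the USD cost of sending `tokens` input tokens."""
    return tokens / 1_000_000 * price_per_million


# Hypothetical workload (an assumption, not a figure from the article).
monthly_tokens = 250_000_000

r1_cost = input_cost(monthly_tokens, DEEPSEEK_R1_PER_MILLION)
o1_cost = input_cost(monthly_tokens, OPENAI_O1_PER_MILLION)

print(f"DeepSeek R1: ${r1_cost:,.2f}")
print(f"OpenAI o1:   ${o1_cost:,.2f}")
print(f"o1 costs roughly {o1_cost / r1_cost:.0f}x as much for input tokens")
```

At the quoted prices, the ratio stays the same whatever the workload, since both costs scale linearly with the number of tokens.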


Most importantly, this thing is very, very capable. To test it out, I immediately threw it into deep waters, asking it to code a fairly complex web app which needed to parse publicly available data, and create a dynamic website with travel and weather information for tourists. Amazingly, DeepSeek produced completely acceptable HTML code right away, and was able to further refine the site based on my input while improving and optimizing the code on its own along the way.

I'll do all of that... tomorrow. Credit: Stan Schroeder / Mashable / DeepSeek

I also asked it to improve my chess skills in five minutes, to which it replied with a number of neatly organized and very useful tips (my chess skills did not improve, but only because I was too lazy to actually go through with DeepSeek's suggestions).

I then asked DeepSeek to prove how smart it is in exactly three sentences. Bad move by me, as I, the human, am not nearly smart enough to verify or even fully understand any of the three sentences. Notice, in the screenshot below, that you can see DeepSeek's "thought process" as it figures out the answer, which is perhaps even more fascinating than the answer itself.

We get it, you're smart. Credit: Stan Schroeder / Mashable / DeepSeek

It's impressive to use. But as ZDNET noted, in the background of all this are training costs that are orders of magnitude lower than for some competing models, as well as chips that aren't as powerful as those at the disposal of U.S. AI companies. DeepSeek thus shows that extremely clever AI with reasoning ability doesn't have to be extremely expensive to train, or to use.

