IQ Times Media – Smart News for a Smarter You
AI

DeepSeek’s distilled new R1 AI model can run on a single GPU

By IQ TIMES MEDIA, May 29, 2025


DeepSeek’s updated R1 reasoning AI model might be getting the bulk of the AI community’s attention this week. But the Chinese AI lab also released a smaller, “distilled” version of its new R1, DeepSeek-R1-0528-Qwen3-8B, that DeepSeek claims beats comparably sized models on certain benchmarks.

The smaller updated R1, which was built on the Qwen3-8B model Alibaba launched in late April, performs better than Google’s Gemini 2.5 Flash on AIME 2025, a collection of challenging math questions.

DeepSeek-R1-0528-Qwen3-8B also nearly matches Microsoft’s recently released Phi-4-reasoning-plus model on another math skills test, HMMT.

So-called distilled models like DeepSeek-R1-0528-Qwen3-8B are generally less capable than their full-sized counterparts. On the plus side, they’re far less computationally demanding. According to the cloud platform NodeShift, Qwen3-8B requires a GPU with 40GB to 80GB of memory to run (e.g., an Nvidia H100). The full-sized new R1 needs around a dozen 80GB GPUs.
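
For a rough sense of scale, the memory taken up by a model’s weights alone can be estimated as parameter count times bytes per parameter. The sketch below is a back-of-the-envelope illustration, not NodeShift’s method: it assumes “8B” means exactly 8 billion parameters, the helper name `weight_memory_gb` is made up for this example, and it ignores activations, the KV cache, and framework overhead, all of which push real requirements higher.

```python
def weight_memory_gb(num_params: float, bytes_per_param: float) -> float:
    """Approximate memory for model weights alone, in gigabytes (10^9 bytes)."""
    return num_params * bytes_per_param / 1e9


# Assumption: "8B" is treated as exactly 8 billion parameters.
PARAMS_8B = 8e9

# Common numeric precisions and their storage cost per parameter.
PRECISIONS = [("FP32", 4), ("BF16/FP16", 2), ("INT8", 1), ("4-bit", 0.5)]

for label, nbytes in PRECISIONS:
    print(f"{label:>9}: ~{weight_memory_gb(PARAMS_8B, nbytes):.0f} GB")
```

At 16-bit precision the weights come to roughly 16 GB, which is why the remaining headroom on a single large GPU goes to everything else the model needs at inference time.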

DeepSeek trained DeepSeek-R1-0528-Qwen3-8B by taking text generated by the updated R1 and using it to fine-tune Qwen3-8B. On a dedicated web page for the model on the AI dev platform Hugging Face, DeepSeek describes DeepSeek-R1-0528-Qwen3-8B as “for both academic research on reasoning models and industrial development focused on small-scale models.”
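
The data side of that recipe can be pictured with a toy sketch: a larger “teacher” model answers a set of prompts, and the resulting prompt and answer pairs become the fine-tuning set for the smaller “student.” This is not DeepSeek’s actual pipeline. The names `build_distillation_set` and `toy_teacher` are hypothetical, and the teacher here is a stand-in function rather than a real model call.

```python
import json
from typing import Callable, Dict, List


def build_distillation_set(
    prompts: List[str], teacher: Callable[[str], str]
) -> List[Dict[str, str]]:
    """Pair each prompt with the teacher's generated text."""
    return [{"prompt": p, "completion": teacher(p)} for p in prompts]


def toy_teacher(prompt: str) -> str:
    # Hypothetical stand-in for the large model; a real setup would
    # call the teacher model here and return its generated text.
    return f"Reasoning about '{prompt}' step by step, then an answer."


examples = build_distillation_set(["What is 2 + 2?"], toy_teacher)

# Fine-tuning tools commonly accept one JSON object per line (JSONL).
jsonl = "\n".join(json.dumps(ex) for ex in examples)
print(jsonl)
```

The student is then trained with ordinary supervised fine-tuning on these pairs, so it learns to imitate the teacher’s outputs rather than learning from scratch.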

DeepSeek-R1-0528-Qwen3-8B is available under a permissive MIT license, meaning it can be used commercially without restriction. Several hosts, including LM Studio, already offer the model through an API.


