JHB News

DeepSeek’s new AI model can generate 200K pages of training data daily on a single GPU

October 21, 2025
Some of DeepSeek's statements about its development costs and the technology it used have been questioned by U.S. companies and officials. (Image: Reuters)

Chinese AI startup DeepSeek has launched a new multimodal AI model, which it said is capable of processing large and complex documents using significantly fewer tokens.

The Hangzhou-based company said that DeepSeek-OCR uses visual perception as a medium to compress text for large language models (LLMs) more efficiently. Both the source code and the weights of the model are publicly available via the online developer platforms Hugging Face and GitHub. In its research, DeepSeek found that using “vision encoders” to compress text for LLMs would enable them to process massive amounts of text at lower computing costs.

“Through DeepSeek-OCR, we demonstrate that vision-text compression can achieve significant token reduction (7-20×) for different historical context stages, offering a promising direction for addressing long-context challenges in large language models,” the company said in a technical paper accompanying the model’s release.

I quite like the new DeepSeek-OCR paper. It’s a good OCR model (maybe a bit worse than dots), and yes data collection etc., but anyway it doesn’t matter.

The more interesting part for me (esp as a computer vision at heart who is temporarily masquerading as a natural language… https://t.co/AxRXBdoO0F

— Andrej Karpathy (@karpathy) October 20, 2025

The launch of DeepSeek-OCR reflects the company’s continued focus on improving the efficiency of LLMs while driving down the costs of building and using them. The company is said to have taken a similar approach in developing its breakthrough open-weight models V3 and R1, which made waves across the tech industry for achieving performance comparable to cutting-edge models like OpenAI’s o1 at only a fraction of the cost.


Technical specifications

With DeepSeek-OCR, the company aims to address a key limitation of LLMs: handling long contexts without running into memory limits. Its core hypothesis is that processing text as images can be more computationally efficient than processing raw digital text. The new OCR model serves as a proof of concept for this idea.

The model comprises two parts: a 380-million-parameter DeepEncoder, used to analyse each image and produce a compressed version of it; and a text generator with 570 million active parameters, built on top of a three-billion-parameter mixture-of-experts (MoE) language model.

DeepSeek’s researchers said that they trained the OCR model on 30 million PDF pages in roughly 100 languages, including 25 million in Chinese and English, along with 10 million synthetic diagrams, 5 million chemical formulae, and one million geometric figures.

Performance on benchmarks

The OCR model is capable of compressing text by up to a factor of ten while retaining 97 per cent of the original information, as per the technical paper. It can be used to process a wide range of document types including plain text, diagrams, chemical formulae, and geometric figures, while being able to preserve the original formatting, output plain text, and even provide general image descriptions. However, the number of ‘vision tokens’ required is likely to vary with the document size and image resolution.
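To put the compression figures in concrete terms, the short sketch below works out how many vision tokens a page would need at a given compression ratio. The 1,000-token page is a hypothetical figure chosen for illustration, not a number from the paper, and the helper name `vision_tokens_needed` is likewise an assumption; the ratios of 7, 10 and 20 come from the range and the tenfold figure reported above.

```python
import math


def vision_tokens_needed(text_tokens: int, compression_ratio: float) -> int:
    """Vision tokens needed to represent `text_tokens` at a given compression ratio."""
    if compression_ratio <= 0:
        raise ValueError("compression_ratio must be positive")
    # Round up: a fractional token still occupies a whole slot.
    return math.ceil(text_tokens / compression_ratio)


# Hypothetical page length, used only for illustration.
page_text_tokens = 1000

for ratio in (7, 10, 20):
    tokens = vision_tokens_needed(page_text_tokens, ratio)
    print(f"{ratio}x compression: {tokens} vision tokens")
```

At the tenfold ratio, the illustrative 1,000-token page would occupy about 100 vision tokens. The calculation says nothing about accuracy, which the paper reports separately.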


In sum, DeepSeek-OCR can generate training data for LLMs and vision language models (VLMs) at a scale of more than 200,000 pages per day while running on a single Nvidia A100 GPU.
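As a rough back-of-the-envelope check, the daily figure can be converted into a per-second rate. The sketch below assumes the GPU runs continuously for 24 hours, which the article does not state, and the function name `pages_per_second` is illustrative.

```python
SECONDS_PER_DAY = 24 * 60 * 60  # assumes uninterrupted, round-the-clock operation


def pages_per_second(pages_per_day: float) -> float:
    """Convert a daily page count into an average per-second rate."""
    return pages_per_day / SECONDS_PER_DAY


rate = pages_per_second(200_000)
print(f"{rate:.2f} pages per second, {1 / rate:.3f} seconds per page")
```

Under that assumption, 200,000 pages a day works out to a little over two pages per second on average.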

The OCR model was evaluated on two benchmarks: OmniDocBench, which is used to evaluate a model’s document parsing capabilities, and the Fox benchmark, which is used to evaluate the focusing capabilities of vision language models on dense PDF documents.

“On OmniDocBench, it surpasses GOT-OCR2.0 (256 tokens/page) using only 100 vision tokens, and outperforms MinerU2.0 (6000+ tokens per page on average) while utilising fewer than 800 vision tokens,” the paper read.
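The token counts in that quote can be turned into simple ratios. The sketch below is illustrative only: the helper name `token_saving` is an assumption, and because the paper quotes "6000+" and "fewer than 800", the second ratio is a lower bound, not an exact figure.

```python
def token_saving(baseline_tokens: int, vision_tokens: int) -> float:
    """How many times more tokens the baseline uses per page."""
    if vision_tokens <= 0:
        raise ValueError("vision_tokens must be positive")
    return baseline_tokens / vision_tokens


# GOT-OCR2.0 (256 tokens/page) against DeepSeek-OCR at 100 vision tokens.
print(token_saving(256, 100))

# MinerU2.0 (6000+ tokens/page) against DeepSeek-OCR at under 800 vision tokens.
# Both quoted figures are bounds, so the true ratio is at least this large.
print(token_saving(6000, 800))
```

These ratios compare token budgets only; the benchmark scores themselves are reported in the paper.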


