In a milestone for Manhattan, a pair of coyotes has made Central Park their home (smithsonianmag.com)

Sounds like a good use of "spare" time to me and not that different from many a lab I've been part of: someone gets a hunch, sets up an experiment to follow it, proves poor disproves whatever they were after, pulls down the experiment, rinse, repeat.

UrineSqueegee · 3h ago

they have reduced the token output by 20% and the benchmark scores have decreased by 10% of the original model.

yorwba · 2h ago

The 20% output reduction is relative to R1, the 10% benchmark score reduction is relative to R1-0528.

It produces 60% fewer output tokens than R1-0528 and scores about 10% higher on their benchmark than R1.

So it's a way to turn R1-0528, which is better than R1 but slower, into a model that's worse than R1-0528 but better and faster than R1.

saubeidl · 2h ago

Yup, you can see it well on the graph here: https://venturebeat.com/wp-content/uploads/2025/07/Gu4d8kzWo...

randomNumber7 · 3h ago

From the hugginface model card:

"Due to the strict new guidelines of the EU AI Act that take effect on August 2nd 2025, we recommend that each R1T/R1T2 user in the EU either familiarizes themselves with these requirements and assess their compliance, or ceases using the model in the EU after August 1st, 2025."

Doesn't the deepseek licence completely forbid any use in the EU already? How can a german company legally build this in the first place (which they presumably did)?

qwertox · 3h ago

> Doesn't the deepseek licence completely forbid any use in the EU already?

Care to explain?

https://deepseeklicense.github.io/

https://github.com/deepseek-ai/DeepSeek-Coder/blob/main/LICE...

akreal · 3h ago

Probably a mix-up with the recently released Huawei model:

https://news.ycombinator.com/item?id=44441447

_ache_ · 1h ago

Is 200% a way to say *3 quicker ? The little 10% reasoning performance decrease seems worth it.

loherj · 1h ago

Yes. If you look at the diagram that plots the performance vs the amount of output tokens, you can see that R1T2 uses about 1/3 of the output tokens that R1-0528 uses.

Keep in mind, the speed improvement doesn’t come from the model running any faster (it’s the exact same architecture as R1, after all) but from using less output tokens while still achieving very good results.

MangoToupe · 1h ago

> The little 10% reasoning performance decrease seems worth it

We need about three orders of magnitude more tests to make these numbers meaningful.

loherj · 1h ago

Fair point. More benchmarks are definitely good but I’m optimistic that they will show similar results.

Anecdotally, I can say that my personal experience with the model is in line with what the benchmarks claim: It’s a bit smarter than R1, a bit faster than R1, much faster than R1-0528, but not quite as smart. (Faster meaning less output tokens). For me, it’s at a sweet spot and I use it as daily driver.

ipsum2 · 3h ago

tl;dr: faster but worse; i.e. on the pareto frontier.

konsalexee · 1h ago

It is always about the trade-off between those two parameters.

Of course an increase in both is the optimal, but a small sacrifice in performance/accuracy for being 200% faster is worth noting. Around 10% drop in accuracy for 200% speed-up, some would take it!

d1sxeyes · 1h ago

Also that “speed up” is actually hiding “less compute used” which is a proxy for cost. Assuming this is 200% faster purely because it needs less compute, that should mean it costs roughly 1/3 as much to run for a 10% decrease in quality of output.

konsalexee · 40m ago

↑

How AI on Microcontrollers Works: Operators and Kernels (danielmangum.com)

The messy reality of SIMD (vector) functions (johnnysswlab.com)

Being too ambitious is a clever form of self-sabotage (maalvika.substack.com)

Learn to love the moat of low status (usefulfictions.substack.com)

Mini NASes marry NVMe to Intel's efficient chip (jeffgeerling.com)

You're All CTO Now (jamie.ideasasylum.com)

N-Back – A Minimal, Adaptive Dual N-Back Game for Brain Training (n-back.net)

The History of Electronic Music in 476 Tracks (1937–2001) (openculture.com)

A 37-year-old wanting to learn computer science (initcoder.com)

EverQuest (filfre.net)

Incapacitating Google Tag Manager (2022) (backlit.neocities.org)

Telli (YC F24) Is Hiring Engineers [On-Site Berlin] (hi.telli.com)

Scientists capture slow-motion earthquake in action (phys.org)

Why I left my tech job to work on chronic pain (sailhealth.substack.com)

Baba Is Eval (fi-le.net)

Nvidia is full of shit (blog.sebin-nyshkim.net)

Impact of PCIe 5.0 Bandwidth on GPU Content Creation and LLM Performance (pugetsystems.com)

In a milestone for Manhattan, a pair of coyotes has made Central Park their home (smithsonianmag.com)

Go, PET, Let Hen - Curious adventures in (Commodore) BASIC tokenizing (masswerk.at)

ADXL345 (2024) (tinytransistors.net)

Show HN: I AI-coded a tower defense game and documented the whole process (github.com)

The story behind Caesar salad (nationalgeographic.com)

Wind Knitting Factory (merelkarhof.nl)

Writing a Game Boy Emulator in OCaml (2022) (linoscope.github.io)

OBBB signed: Reinstates immediate expensing for U.S.-based R&D (kbkg.com)

Robots move Shanghai city block [video] (youtube.com)

Why AO3 Was Down (reddit.com)

The ITTAGE indirect branch predictor (blog.nelhage.com)

Bcachefs may be headed out of the kernel (lwn.net)

Kepler.gl (kepler.gl)

Show HN: AirBending – Hand gesture based macOS app MIDI controller (nanassound.com)

How to Use JSONPath to Query and Extract JSON Data Efficiently (postpilot.dev)

Compression Dictionary Transport (developer.mozilla.org)

I'm Losing All Trust in the AI Industry (thealgorithmicbridge.com)

Ask HN: How did Soham Parekh get so many jobs?

Open Source and FPGA Maker Board for Networking (privateisland.tech)

Sleeping beauty Bitcoin wallets wake up after 14 years to the tune of $2B (marketwatch.com)

The Amiga 3000 Unix and Sun Microsystems: Deal or No Deal? (datagubbe.se)

Everything around LLMs is still magical and wishful thinking (dmitriid.com)

Gremllm (github.com)

Lens: Lenses, Folds and Traversals (hackage.haskell.org)

Show HN: MCP-123, a 2-line MCP server/client (Windows-friendly) (github.com)

European Commission presents Roadmap for lawful access to data (home-affairs.ec.europa.eu)

Launch HN: K-Scale Labs (YC W24) – Open-Source Humanoid Robots

ChatGPT creates phisher's paradise by serving the wrong URLs for major companies (theregister.com)

Why did not numpy copy the J rank concept?

Killer whales groom each other with pieces of kelp (science.org)

Can Large Language Models Play Text Games Well? (2023) (arxiv.org)

Larry (cat) (en.wikipedia.org)

A new, faster DeepSeek R1-0528 variant appears from German lab (venturebeat.com)

A new, faster DeepSeek R1-0528 variant appears from German lab

Comments (17)