Ask HN: How Does DeepSeek "Think"?

1 point by JPLeRouzic · 6/26/2025, 8:12:03 AM · 3 comments
DeepSeek has a useful feature that isn't present in other commercial LLMs: it displays its internal "thinking" process. I wonder what technological aspect makes this possible. Do several LLMs communicate with each other before providing a solution? Do the LLMs play different roles, with some proposing solutions and others contradicting them, offering alternative viewpoints, or pointing out overlooked aspects?

Comments (3)

123yawaworht456 · 8h ago
>Do several LLMs communicate with each other before providing a solution?

no

>I wonder what technological aspect makes this possible.

one of its training datasets (somehow prioritized over the rest) contains a large number of examples that emulate a thinking process inside <think></think> tags before giving the final answer. the model then reproduces that pattern at inference time.
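
concretely, a frontend just has to split the completion on those tags to render the "thinking" pane. a toy sketch (the tag name is from the parent comment; the sample string and display logic are made up, not DeepSeek's actual code):

    # split a DeepSeek-R1-style completion into "thinking" and answer.
    # the <think></think> tags are emitted by the model itself;
    # everything else here is hypothetical, for illustration only.
    import re

    raw = ("<think>the user asks for 12*13. 12*13 = 12*10 + 12*3 "
           "= 120 + 36 = 156.</think>12 x 13 = 156.")

    m = re.match(r"<think>(.*?)</think>(.*)", raw, flags=re.S)
    reasoning, answer = m.group(1), m.group(2).strip()

    print("thinking:", reasoning)  # shown in the collapsible "thinking" pane
    print("answer:  ", answer)     # shown as the normal reply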

JPLeRouzic · 3h ago
Thank you for taking the time to answer. However, I am not sure the answer is "no", because DeepSeek uses a particular technique in its architecture. To quote this blog [0]:

"Modern large language models (LLMs) started introducing a layer called “Mixture of Experts” (MoE) in their Transformer blocks to scale parameter count without linearly increasing compute. This is typically done through top-k (often k=2) “expert routing”, where each token is dispatched to two specialized feed-forward networks (experts) out of a large pool.

A naive GPU cluster implementation would be to place each expert on a separate device and have the router dispatch to the selected experts during inference. But this would have all the non-active experts idle on the expensive GPUs.

GShard, 2021 introduced the concept of sharding these feed-forward (FF) experts across multiple devices, so that each device [...]"

[0] https://www.kernyan.com/hpc,/cuda/2025/02/26/Deepseek_V3_R1_...
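
For reference, the top-k routing the blog describes looks roughly like this. This is only a toy sketch with made-up sizes, not DeepSeek's actual implementation:

    # toy Mixture-of-Experts layer with top-2 expert routing.
    # sizes, names, and the dense loop over experts are illustrative;
    # real systems dispatch tokens to experts sharded across devices.
    import torch
    import torch.nn as nn

    class ToyMoE(nn.Module):
        def __init__(self, d_model=64, n_experts=8, k=2):
            super().__init__()
            self.k = k
            self.router = nn.Linear(d_model, n_experts)  # gating network
            self.experts = nn.ModuleList(
                nn.Sequential(nn.Linear(d_model, 4 * d_model),
                              nn.ReLU(),
                              nn.Linear(4 * d_model, d_model))
                for _ in range(n_experts)
            )

        def forward(self, x):                  # x: (tokens, d_model)
            scores = self.router(x)            # (tokens, n_experts)
            top_w, top_i = scores.topk(self.k, dim=-1)
            top_w = top_w.softmax(dim=-1)      # normalize the k gate weights
            out = torch.zeros_like(x)
            for slot in range(self.k):         # each token visits k experts
                for e, expert in enumerate(self.experts):
                    mask = top_i[:, slot] == e
                    if mask.any():
                        out[mask] += top_w[mask, slot, None] * expert(x[mask])
            return out

    moe = ToyMoE()
    y = moe(torch.randn(5, 64))  # 5 tokens, each routed to 2 of 8 experts

Note that even here it is a single model doing a single forward pass; the experts are just alternative feed-forward blocks inside one Transformer layer.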

123yawaworht456 · 16m ago
any model, MoE or not, can be sharded across multiple devices (separate GPUs in a single machine, or separate machines over a network), yeah. but your question was "Do several LLMs communicate with each other before providing a solution?", and in this context (DeepSeek's thinking), the answer is definitely "no".
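
as a toy illustration of sharding (naive layer-wise pipeline parallelism, not MoE expert sharding; device names are hypothetical and it falls back to CPU so it runs anywhere):

    # place two halves of a model on different devices; activations
    # hop between them during the forward pass. illustrative only.
    import torch
    import torch.nn as nn

    dev0 = "cuda:0" if torch.cuda.device_count() > 0 else "cpu"
    dev1 = "cuda:1" if torch.cuda.device_count() > 1 else "cpu"

    layer0 = nn.Linear(64, 64).to(dev0)  # first half lives on device 0
    layer1 = nn.Linear(64, 64).to(dev1)  # second half lives on device 1

    x = torch.randn(3, 64).to(dev0)
    h = layer0(x).to(dev1)               # activations move across devices
    y = layer1(h)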

models can communicate with one another via tool calling, sure, and there are hypothetical workflows where agents delegate tasks to other agents (with inference being done on different models), but that simply isn't the case here.
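
for completeness, that kind of delegation would look roughly like this (everything here is hypothetical, just to show the shape of a multi-model workflow):

    # one "planner" model delegates a subtask to a "worker" model via a
    # tool call. call_model is a stand-in for a real inference API.
    def call_model(model_name: str, prompt: str) -> str:
        return f"[{model_name} answers: {prompt!r}]"  # fake response

    def planner(task: str) -> str:
        plan = call_model("planner-llm", f"break down: {task}")
        result = call_model("worker-llm", plan)       # delegation step
        return call_model("planner-llm", f"summarize: {result}")

    print(planner("estimate shipping costs"))

again, nothing like this happens inside DeepSeek's thinking output - it's one model generating tokens, <think> ones included.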