Reproducing the deep double descent paper
15 points by stpn on 6/5/2025, 6:34:23 PM | 4 comments | stpn.bearblog.dev
I was curious about this idea (that the larger model is effectively just reducing itself to a smaller one), since it kind of makes sense, but here are a few reasons why I don't think that's the case:
- In the 10% label-noise case at least, the second descent eventually finds a minimum that's better than the original local minimum, which suggests to me the model really is finding a better fit rather than just reducing itself to a similar smaller model (see the label-noise sketch after this list).
- If that were the case, I think we'd also expect the error of larger models to converge to the performance of smaller models, but instead they converge to a lower, better error.
- I checked the gradient histograms I had logged for the runs. While I'm still learning how to interpret them, I didn't see signs of vanishing gradients in which dead neurons late in the model prevented earlier layers from learning. Gradients do get smaller over time, but that seems expected, and there are no big waves of neurons dying, which is what I'd expect if the larger network were converging to the size of the smaller one (a sketch of this kind of dead-neuron check follows the list).
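For context on the 10% noise case: the deep double descent experiments corrupt a fixed fraction of training labels. Here is a minimal sketch of that corruption, assuming uniform random relabeling (the paper's exact procedure may differ in details; `add_label_noise` is a name I made up for illustration):

```python
import numpy as np

def add_label_noise(labels: np.ndarray, num_classes: int,
                    p: float = 0.10, seed: int = 0) -> np.ndarray:
    """Return a copy of `labels` with a fraction `p` replaced by random classes."""
    rng = np.random.default_rng(seed)
    noisy = labels.copy()
    flip = rng.random(len(labels)) < p                  # pick ~p of the examples
    noisy[flip] = rng.integers(0, num_classes, size=int(flip.sum()))
    return noisy
```

With noise like this, the first minimum comes from fitting the clean signal, and the second descent has to happen despite the model also interpolating the corrupted labels, which is why a better second minimum is informative.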
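And here is a minimal sketch of the kind of dead-neuron check described in the last bullet. This is not the author's actual logging code; `model` and `batch` are placeholders, and it assumes a PyTorch classifier built from `nn.Linear` layers, treating a unit as "dead" on a batch if all of its incoming-weight gradients are approximately zero:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

def dead_neuron_fractions(model: nn.Module, batch, eps: float = 1e-12) -> dict:
    """Per-layer fraction of units whose incoming-weight grads are ~zero on one batch."""
    x, y = batch
    model.zero_grad()
    F.cross_entropy(model(x), y).backward()

    fractions = {}
    for name, module in model.named_modules():
        if isinstance(module, nn.Linear) and module.weight.grad is not None:
            g = module.weight.grad                      # shape: (out_features, in_features)
            dead = g.abs().max(dim=1).values < eps      # every incoming grad ~zero
            fractions[name] = dead.float().mean().item()
    return fractions
```

If the "large model shrinks to a small one" story were right, you'd expect these fractions to climb substantially as training enters the second descent; the logged histograms described above didn't show that.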