Language Models Improve When Pretraining Data Matches Target Tasks
neehao · 8/4/2025, 11:48:16 PM · arxiv.org ↗
Comments (1)
shallowNuralNet · 34m ago
Isn't this just benchmarks? Even if the authors claim the gains generalize, it still seems like they're optimizing for benchmark scores, which aren't all there is. Unless those scores correlate strongly with real-world performance (and they likely don't), it's not clear this helps much in practice.