Shifts in diatom and dinoflagellate biomass in the North Atlantic over 6 decades (journals.plos.org)

  "Some evolutionary algorithms keep only the best performers in the population, on the assumption that progress moves endlessly forward. DGMs, however, keep them all, in case an innovation that initially fails actually holds the key to a later breakthrough when further tweaked. It’s a form of “open-ended exploration,” not closing any paths to progress. (DGMs do prioritize higher scorers when selecting progenitors.)"

Kenneth Stanley[0], the creator of the NEAT[1]/HyperNEAT (Picbreeder) algorithms wrote an entire book about open-ended exploration, "Why Greatness Cannot Be Planned: The Myth of the Objective".

[0]: https://en.wikipedia.org/wiki/Kenneth_Stanley

[1]: https://en.wikipedia.org/wiki/Neuroevolution_of_augmenting_t...

paulluuk · 7h ago

It's really a choice: do you want to waste compute or do you want to waste potential?

While prioritizing higher scorers for selecting progenitors will initially mitigate some of the problems, you will eventually end up with hundreds of thousands of agents that only learned to repeat the letter "a" a million times in a row, which is a huge waste of processing.

tmaly · 4h ago

I followed his work on NEAT at the time. It was really cool. But I never imagined we would get to where we are today with AI.

spwa4 · 1h ago

The same could be said of transformers, that only started to perform when scaled up to an absolutely ridiculous degree. I would argue most researchers are of the opinion that any learning system, scaled up enough, would work.

I think the limits of machine learning are related to the fact that all ML "knowledge" is secondhand, except talking to humans and already to a much smaller extent programming. Getting AIs to interact with, say, cars, during training is the way forward.

pvg · 7h ago

Discussion of the paper https://news.ycombinator.com/item?id=44135369

achrono · 7h ago

I wish an org like IEEE would be way more rigorous than what's revealed with the first paragraph:

>In April, Microsoft’s CEO said that artificial intelligence now wrote close to a third of the company’s code. Last October, Google’s CEO put their number at around a quarter. Other tech companies can’t be far off.

Take a moment to reflect -- a third of the company's code? Generative AI capable enough to write reasonable code has arguably not been around longer than 5 years. In the 50 years of Microsoft, have the last 5 years contributed to a third of the total code base? This itself would require that not a single engineer write a single line of code in these 5 years.

Okay, maybe Microsoft meant to say new/incremental code?

No, because Satya is reported to have said, "I’d say maybe 20%, 30% of the code that is inside of our repos today [...] written by software".

zack6849 · 7h ago

I'm pretty sure they meant 1/3rd of newly written code, obviously they don't mean a third of all their code that exists was written by AI

achrono · 7h ago

That's a reasonable interpretation, but that is not what Microsoft has said. Satya talks of "30% of the code that is inside of our repos today".

Source: https://www.cnbc.com/2025/04/29/satya-nadella-says-as-much-a...

wiz21c · 7h ago

as a regular human he may just have hallucinated :-)

bwfan123 · 7h ago

If a third of microsoft's code looks like this copilot generated PR [1] the company is going to go down the tubes soon. And I hope this happens, so, these corporate chiefs learn a harsh lesson when they are ejected for forcing stupidity across the org.

[1] https://github.com/dotnet/runtime/pull/115762

rvnx · 7h ago

https://www.google.com/search?q=msft+stock

They never did so well

The issue with Copilot is that it is running GPT-4o and GPT-4o-mini, and both models are not good at programming.

davidmurdoch · 7h ago

They clearly mean "new" code. Meaning on any recent day, that amount of code is authored by AI.

achrono · 7h ago

No, because Satya's claim is about "30% of the code that is inside of our repos today".

Source: https://www.cnbc.com/2025/04/29/satya-nadella-says-as-much-a...

davidmurdoch · 7h ago

Very clearly not what he meant.

bee_rider · 7h ago

> “I’d say maybe 20%, 30% of the code that is inside of our repos today and some of our projects are probably all written by software,” Nadella said during a conversation before a live audience with Meta CEO Mark Zuckerberg.

I think there’s some quibble to be made about “written by software” vs “generated by AI,” but it doesn’t seem like he’s talking about new code, right? He goes out of his way to phrase it “code that is inside of our repos.”

It doesn’t really make any sense, but it does seem to be what he said. Maybe there was some context, maybe it is about some specific repos that was not included in the quote.

But, it isn’t “very clear” in any case.

davidmurdoch · 5h ago

It is "very clear" because it's not possible. He didn't go "out of his way"... it's live conversation, not something he could proof read and edit.

bee_rider · 4h ago

It is in the video “Welcome to LlamaCon 2025 - Closing Session!” posted in full on YouTube, around minute 44.

Zuck asks: “in terms of the coding, and how it improves that, do you have a sense of how much of the code, like what percent of the code that’s being written in Microsoft at this point is written by AI as opposed to by engineers?”

Nadella: “yeah so there’s two sort of thing we’re tracking, one is the accept rate itself, that’s around 30-40, it’s going up monotonically, [talks for a bit about the fact that it works well with Python, less so for C++, from context he’s talking about code completions here, now back to…] the place where the agentic code still, it’s ver-it’s sort of nascent—for new greenfield it’s very very high—but as I said nothing is greenfield in many cases and so therefore I’d say maybe at this point the PR—oh by, the way the code reviews are very high, and so, the agents we have for reviewing code [makes a happy expression], so that usage has increased—and so I’ll say, [this is around 45:00] maybe 20-30 percent of the code that is inside of our repos and some of our projects are probably all, uh, written by software. […]

I dunno. In conclusion having listened to it a couple times and done my best at transcribing it fairly close to what he actually said, I’m still confused as to what he meant. I was going to try and make some point with this, but I lost track of it while writing out the quote. At least folks have the full thing to argue about here, more or less!

achrono · 7h ago

You know that because you're in tech. But the average person who is reading the news, and is likely to participate in a large protest in the next few years, would not naturally make a distinction between incremental and extant code.

AlienRobot · 7h ago

LLM's next breakthrough will be removing 30% of the code of a codebase.

throwawayoldie · 7h ago

It's not his job to accurately report numbers, or really to do anything that involves technical acumen. His job is more akin to being a cheerleader, or a carnival barker.

kordlessagain · 7h ago

Are we really sitting here dissecting what he's saying as if it means anything at all for the future? 20% or 30% today is 100% tomorrow. That much is certain.

AnimalMuppet · 7h ago

100%? Certain? I disagree, strongly.

cimi_ · 7h ago

They probably mean new code not the entire codebase, but even so I think those numbers are ridiculous given my experience.

Is there any evidence of this (anywhere, not just MS or Google)?

paulluuk · 7h ago

I'm not sure if it's ridiculous if you factor in something like copilot. Heck, even just your IDE's built-in autocomplete (which only finishes the current variable name) can get close to being responsible for 20% of your code, with tools like copilot I think you can even more easily hit that target.

SoftTalker · 7h ago

I've always interpreted that as "a third of the company’s (new) code" though I guess it would be nice of them to make that clear.

seydor · 7h ago

newly writte code. But the consensus is that this is inflated numbers that don't involve the revisions that this code needs. Would be interesting for them to tell us what % of the LLM generated code gets thrown away .

bee_rider · 7h ago

I mean… it is objectively the truth to quote the CEO of MS as saying what he said, whether or not he is lying or using a misleading metric. The only questionable things about the quote, imo, are

> Other tech companies can’t be far off.

First, MS and Google are working on coding assistants so I’d expect them to be quite ahead of the curve in terms of what their CEOs report. Both in terms of what they are actually doing (since they have a bunch of people working there who are interested in AI coding assistants, surely they are using them). And in terms of that the head advertisers for these products, the CEOs are willing to say (although I should be clear, I’m not even necessarily saying he’s lying or being misleading. He’s in charge of a company that is advertising some AI tool, maybe all his reports are also emphasizing how good the dogfood is).

Second and relatedly, quoting a AI tool salesman on how much of his company’s code is written by AI… eh, it is a big company, the CEO of MS is a known figure. But maybe they should be explicitly skeptical toward him. As you note, I wouldn’t be surprised if MS was itself far off from what he said in the quote, let alone other companies…

Although, if he says:

> "I’d say maybe 20%, 30% of the code that is inside of our repos today [...] written by software".

Depending on how you look at it, that doesn’t necessarily preclude, like, classic macros and other classic code generation tools, so actually I have no idea what it even means. If an AI touches a JavaScript minifier, does it get credit for all the JavaScript that gets generated by it? Haha.

exe34 · 7h ago

Maybe they've had LLMs for a very long time, given the quality of their code...

mucha · 7h ago

What Satya says: “I’d say maybe 20%, 30% of the code that is inside of our repos today and some of our projects are probably all written by software,”

First line from the article: In April, Microsoft’s CEO said that artificial intelligence now wrote close to a third of the company’s code.

Software != AI

Source: https://www.cnbc.com/2025/04/29/satya-nadella-says-as-much-a...

CNBC misquotes Satya in the same article with his actual quote.

datameta · 7h ago

I think this is interesting enough for a post in and of itself: https://arxiv.org/abs/2505.22954

wiz21c · 7h ago

From the article abstract: "All experiments were done with safety precautions (e.g., sandboxing, human oversight)."

Do the authors really believe "safety" is necessary, that is, there is a risk that somethign goes wrong ? What kind of risk ?

datameta · 7h ago

From what I understand, alignment and interpretability were rewarded as part of the optimization function. I think it is prudent that we bake in these "guardrails" early on.

catoc · 7h ago

Number of lines of code… airplane weight… etc

AlphaGenome: AI for better understanding the genome (deepmind.google)

Launch HN: Issen (YC F24) – Personal AI language tutor

Memory safety is table stakes (usenix.org)

Kea 3.0, our first LTS version (isc.org)

Robots that learn (openai.com)

The time is right for a DOM templating API (justinfagnani.com)

Alternative Layout System (alternativelayoutsystem.com)

Matrix v1.15 (matrix.org)

"Why is the Rust compiler so slow?" (sharnoff.io)

A Review of Aerospike Nozzles: Current Trends in Aerospace Applications (mdpi.com)

Introducing Gemma 3n (developers.googleblog.com)

Show HN: Magnitude – Open-source AI browser automation framework (github.com)

SigNoz (YC W21, Open Source Datadog) Is Hiring DevRel Engineers (Remote)(US) (ycombinator.com)

Snow - Classic Macintosh emulator (snowemu.com)

Shifts in diatom and dinoflagellate biomass in the North Atlantic over 6 decades (journals.plos.org)

Show HN: I built an AI dataset generator (github.com)

How much slower is random access, really? (samestep.com)

A new pyramid-like shape always lands the same side up (quantamagazine.org)

Low Overhead Allocation Sampling in a Garbage Collected Virtual Machine (arxiv.org)

Puerto Rico's Solar Microgrids Beat Blackout (spectrum.ieee.org)

Starcloud says 1 launch, $8M but ISS tech says 17 launches, $850M+ (angadh.com)

Typr – TUI typing test with a word selection algorithm inspired by keybr (github.com)

Lateralized sleeping positions in domestic cats (cell.com)

The Business of Betting on Catastrophe (thereader.mitpress.mit.edu)

US economy shrank 0.5% in the first quarter, worse than earlier estimates (apnews.com)

Muvera: Making multi-vector retrieval as fast as single-vector search (research.google)

Access BMC UART on Supermicro X11SSH (github.com)

FLUX.1 Kontext [Dev] – Open Weights for Image Editing (bfl.ai)

Ambient Garden (ambient.garden)

Writing a basic Linux device driver when you know nothing about Linux drivers (crescentro.se)

Better Auth, by a self-taught Ethiopian dev, raises $5M from Peak XV, YC (techcrunch.com)

Learnings from building AI agents (cubic.dev)

Structured Output with LangChain and Llamafile (blog.brakmic.com)

What Problems to Solve (1966) (genius.cat-v.org)

Modeling the World in 280 Characters (tympanus.net)

Senators reintroduce App Store bill to rein in 'gatekeeper power' (9to5mac.com)

Build and Host AI-Powered Apps with Claude – No Deployment Needed (anthropic.com)

America’s incarceration rate is in decline (theatlantic.com)

LLM code generation may lead to an erosion of trust (jaysthoughts.com)

Box Combinators (mmapped.blog)

Show HN: I built a JSON-RPC library for Zig with compile time reflection (github.com)

Nintendo Switch 2 Slammed for Poor Display–120 Hz with 30 FPS Response Times (techpowerup.com)

AllCiphers – An online toolkit to encrypt/decrypt popular ciphers (allciphers.com)

MCP in LM Studio (lmstudio.ai)

The Offline Club (theoffline-club.com)

Some bits on malloc(0) in C being allowed to return NULL (utcc.utoronto.ca)

Define policy forbidding use of AI code generators (github.com)

I built an ADHD app with interactive coping tools, noise mixer and self-test (adhdhelp.app)

Researchers develop a battery cathode material that does it all (arstechnica.com)

Real-world performance comparison of ebtree/cebtree/rbtree (wtarreau.blogspot.com)

AI Improves at Improving Itself Using an Evolutionary Trick

Comments (33)