AI Improves at Improving Itself Using an Evolutionary Trick

26 pseudolus 33 6/26/2025, 1:41:57 PM spectrum.ieee.org ↗

Comments (33)

gavinray · 7h ago
There's an interesting parallel to be drawn here from prior RL research:

  "Some evolutionary algorithms keep only the best performers in the population, on the assumption that progress moves endlessly forward. DGMs, however, keep them all, in case an innovation that initially fails actually holds the key to a later breakthrough when further tweaked. It’s a form of “open-ended exploration,” not closing any paths to progress. (DGMs do prioritize higher scorers when selecting progenitors.)"
Kenneth Stanley[0], the creator of the NEAT[1]/HyperNEAT (Picbreeder) algorithms wrote an entire book about open-ended exploration, "Why Greatness Cannot Be Planned: The Myth of the Objective".

[0]: https://en.wikipedia.org/wiki/Kenneth_Stanley

[1]: https://en.wikipedia.org/wiki/Neuroevolution_of_augmenting_t...

paulluuk · 7h ago
It's really a choice: do you want to waste compute or do you want to waste potential?

While prioritizing higher scorers for selecting progenitors will initially mitigate some of the problems, you will eventually end up with hundreds of thousands of agents that only learned to repeat the letter "a" a million times in a row, which is a huge waste of processing.

tmaly · 4h ago
I followed his work on NEAT at the time. It was really cool. But I never imagined we would get to where we are today with AI.
spwa4 · 1h ago
The same could be said of transformers, that only started to perform when scaled up to an absolutely ridiculous degree. I would argue most researchers are of the opinion that any learning system, scaled up enough, would work.

I think the limits of machine learning are related to the fact that all ML "knowledge" is secondhand, except talking to humans and already to a much smaller extent programming. Getting AIs to interact with, say, cars, during training is the way forward.

pvg · 7h ago
achrono · 7h ago
I wish an org like IEEE would be way more rigorous than what's revealed with the first paragraph:

>In April, Microsoft’s CEO said that artificial intelligence now wrote close to a third of the company’s code. Last October, Google’s CEO put their number at around a quarter. Other tech companies can’t be far off.

Take a moment to reflect -- a third of the company's code? Generative AI capable enough to write reasonable code has arguably not been around longer than 5 years. In the 50 years of Microsoft, have the last 5 years contributed to a third of the total code base? This itself would require that not a single engineer write a single line of code in these 5 years.

Okay, maybe Microsoft meant to say new/incremental code?

No, because Satya is reported to have said, "I’d say maybe 20%, 30% of the code that is inside of our repos today [...] written by software".

zack6849 · 7h ago
I'm pretty sure they meant 1/3rd of newly written code, obviously they don't mean a third of all their code that exists was written by AI
achrono · 7h ago
That's a reasonable interpretation, but that is not what Microsoft has said. Satya talks of "30% of the code that is inside of our repos today".

Source: https://www.cnbc.com/2025/04/29/satya-nadella-says-as-much-a...

wiz21c · 7h ago
as a regular human he may just have hallucinated :-)
bwfan123 · 7h ago
If a third of microsoft's code looks like this copilot generated PR [1] the company is going to go down the tubes soon. And I hope this happens, so, these corporate chiefs learn a harsh lesson when they are ejected for forcing stupidity across the org.

[1] https://github.com/dotnet/runtime/pull/115762

rvnx · 7h ago
https://www.google.com/search?q=msft+stock

They never did so well

The issue with Copilot is that it is running GPT-4o and GPT-4o-mini, and both models are not good at programming.

davidmurdoch · 7h ago
They clearly mean "new" code. Meaning on any recent day, that amount of code is authored by AI.
achrono · 7h ago
No, because Satya's claim is about "30% of the code that is inside of our repos today".

Source: https://www.cnbc.com/2025/04/29/satya-nadella-says-as-much-a...

davidmurdoch · 7h ago
Very clearly not what he meant.
bee_rider · 7h ago
> “I’d say maybe 20%, 30% of the code that is inside of our repos today and some of our projects are probably all written by software,” Nadella said during a conversation before a live audience with Meta CEO Mark Zuckerberg.

I think there’s some quibble to be made about “written by software” vs “generated by AI,” but it doesn’t seem like he’s talking about new code, right? He goes out of his way to phrase it “code that is inside of our repos.”

It doesn’t really make any sense, but it does seem to be what he said. Maybe there was some context, maybe it is about some specific repos that was not included in the quote.

But, it isn’t “very clear” in any case.

davidmurdoch · 5h ago
It is "very clear" because it's not possible. He didn't go "out of his way"... it's live conversation, not something he could proof read and edit.
bee_rider · 4h ago
It is in the video “Welcome to LlamaCon 2025 - Closing Session!” posted in full on YouTube, around minute 44.

Zuck asks: “in terms of the coding, and how it improves that, do you have a sense of how much of the code, like what percent of the code that’s being written in Microsoft at this point is written by AI as opposed to by engineers?”

Nadella: “yeah so there’s two sort of thing we’re tracking, one is the accept rate itself, that’s around 30-40, it’s going up monotonically, [talks for a bit about the fact that it works well with Python, less so for C++, from context he’s talking about code completions here, now back to…] the place where the agentic code still, it’s ver-it’s sort of nascent—for new greenfield it’s very very high—but as I said nothing is greenfield in many cases and so therefore I’d say maybe at this point the PR—oh by, the way the code reviews are very high, and so, the agents we have for reviewing code [makes a happy expression], so that usage has increased—and so I’ll say, [this is around 45:00] maybe 20-30 percent of the code that is inside of our repos and some of our projects are probably all, uh, written by software. […]

I dunno. In conclusion having listened to it a couple times and done my best at transcribing it fairly close to what he actually said, I’m still confused as to what he meant. I was going to try and make some point with this, but I lost track of it while writing out the quote. At least folks have the full thing to argue about here, more or less!

achrono · 7h ago
You know that because you're in tech. But the average person who is reading the news, and is likely to participate in a large protest in the next few years, would not naturally make a distinction between incremental and extant code.
AlienRobot · 7h ago
LLM's next breakthrough will be removing 30% of the code of a codebase.
throwawayoldie · 7h ago
It's not his job to accurately report numbers, or really to do anything that involves technical acumen. His job is more akin to being a cheerleader, or a carnival barker.
kordlessagain · 7h ago
Are we really sitting here dissecting what he's saying as if it means anything at all for the future? 20% or 30% today is 100% tomorrow. That much is certain.
AnimalMuppet · 7h ago
100%? Certain? I disagree, strongly.
cimi_ · 7h ago
They probably mean new code not the entire codebase, but even so I think those numbers are ridiculous given my experience.

Is there any evidence of this (anywhere, not just MS or Google)?

paulluuk · 7h ago
I'm not sure if it's ridiculous if you factor in something like copilot. Heck, even just your IDE's built-in autocomplete (which only finishes the current variable name) can get close to being responsible for 20% of your code, with tools like copilot I think you can even more easily hit that target.
SoftTalker · 7h ago
I've always interpreted that as "a third of the company’s (new) code" though I guess it would be nice of them to make that clear.
seydor · 7h ago
newly writte code. But the consensus is that this is inflated numbers that don't involve the revisions that this code needs. Would be interesting for them to tell us what % of the LLM generated code gets thrown away .
bee_rider · 7h ago
I mean… it is objectively the truth to quote the CEO of MS as saying what he said, whether or not he is lying or using a misleading metric. The only questionable things about the quote, imo, are

> Other tech companies can’t be far off.

First, MS and Google are working on coding assistants so I’d expect them to be quite ahead of the curve in terms of what their CEOs report. Both in terms of what they are actually doing (since they have a bunch of people working there who are interested in AI coding assistants, surely they are using them). And in terms of that the head advertisers for these products, the CEOs are willing to say (although I should be clear, I’m not even necessarily saying he’s lying or being misleading. He’s in charge of a company that is advertising some AI tool, maybe all his reports are also emphasizing how good the dogfood is).

Second and relatedly, quoting a AI tool salesman on how much of his company’s code is written by AI… eh, it is a big company, the CEO of MS is a known figure. But maybe they should be explicitly skeptical toward him. As you note, I wouldn’t be surprised if MS was itself far off from what he said in the quote, let alone other companies…

Although, if he says:

> "I’d say maybe 20%, 30% of the code that is inside of our repos today [...] written by software".

Depending on how you look at it, that doesn’t necessarily preclude, like, classic macros and other classic code generation tools, so actually I have no idea what it even means. If an AI touches a JavaScript minifier, does it get credit for all the JavaScript that gets generated by it? Haha.

exe34 · 7h ago
Maybe they've had LLMs for a very long time, given the quality of their code...
mucha · 7h ago
What Satya says: “I’d say maybe 20%, 30% of the code that is inside of our repos today and some of our projects are probably all written by software,”

First line from the article: In April, Microsoft’s CEO said that artificial intelligence now wrote close to a third of the company’s code.

Software != AI

Source: https://www.cnbc.com/2025/04/29/satya-nadella-says-as-much-a...

CNBC misquotes Satya in the same article with his actual quote.

datameta · 7h ago
I think this is interesting enough for a post in and of itself: https://arxiv.org/abs/2505.22954
wiz21c · 7h ago
From the article abstract: "All experiments were done with safety precautions (e.g., sandboxing, human oversight)."

Do the authors really believe "safety" is necessary, that is, there is a risk that somethign goes wrong ? What kind of risk ?

datameta · 7h ago
From what I understand, alignment and interpretability were rewarded as part of the optimization function. I think it is prudent that we bake in these "guardrails" early on.
catoc · 7h ago
Number of lines of code… airplane weight… etc