Ask HN: Do you think differently about working on open source these days?

29 gillyb 49 8/11/2025, 2:13:02 PM
Wondering if anyone here has changed their opinion on open sourcing their projects or contributing to open source now that LLM's are a thing...

I don't know yet what I think, but my latest side project I decided to create privately on github.

Comments (49)

data-ottawa · 1h ago
No, when I licensed my open source code as MIT I essentially stated to not care what happens to it.

I am bothered that I was able to reproduce code from my blog through an LLM (suggesting exact same default values). That was not licensed for permissive use.

I still contribute to open source because I still use a lot of it. In my mind I owe it to the community to contribute back, and if nobody did the same my workflow would be a lot worse.

tcdent · 2h ago
I owe my entire 20 year career to open source, so if anything, I plan to increase the number of contributions I make as time goes on; it's one of the few areas in life where I feel indebted.
ProofHouse · 1h ago
Same
nosignono · 1h ago
Open Source was always a collection of anarchist experiments in different ways of working. It turns out that it's quite effective and popular, and people will organize themselves naturally around projects they find interesting.

That said, capital has always been squeezing open source. Whether it was the Embrace; Extend; Extinguish mantra of Microsoft, Amazon's hosting of Open Source in AWS to control the market for it, or Oracle's litigiousness about trademarks and patents. To say nothing of all the companies who profit from it and give nothing back in return.

LLMs being trained on Open Source software is nothing new with respect to capital attempting to consume it and profit from it but not giving anything back in exchange.

So no, I'm not worried, I'm not going to change anything. I expect maybe we see a license that says you cannot use it as AI training material at some point in the future, and the lawyers will fight over that for a decade or two.

koolba · 1h ago
An oft overlooked part of being involved in open source is the networking and mentorship, both technical and personal. Being a part of one or more well run projects lets you interact with some incredibly smart people discussing technical topics well outside your day job. Not all of that will pan out to something directly useful, but over time it adds up to both a wider skillset at actual development and dealing with a distributed team. The evolution of your written communication skills alone are probably worth it.
majora2007 · 2h ago
It hasn't changed anything for me. I don't really care if AI trains on my code or not. I write open source to share the code for other developers and give back as a way of pay it forward, to all the people's libraries I've relied on before.

Are people open sourcing their works in hopes to make money and that's their concern? I've never heard of that from people involved in open source.

lordkrandel · 2h ago
Aaahahah people tend to forget that GPL exists. If they use your software to build another, then you can sue and make all their software opensource. Also, you dont have to use github where your source is easily pray of LLM. You can host your own.
gillyb · 2h ago
I don't think GPL licenses really stop companies from scraping the code and using it to train their LLM's. Also, how would you prove they actually used your code ?
lordkrandel · 1h ago
Nothing. You protect yourself with hiding your code away from Github. Anyway, when software will be produced WITH LLMs, that will inevitably repeat your code, then you will sue. Opensource is MADE to be copied. It's meant to be. You WANT it. You just want it to benefit EVERYONE and not corporations who make money out of it, and there are defenses to that. Opensource, free software, is a philosophy, not a business opportunity per se. Business is just tangential to it.
stavros · 1h ago
Do companies train LLMs with anything any more? All the code we write now is written by LLMs, they can't really train on that.
fsflover · 2h ago
And AGPL is even better.
e3bc54b2 · 1h ago
Yes.

I always licensed my projects under GPL variants. That contract was broken by LLM vendors. So now I'm taking my toys and going home.

All my new projects are hosted on Sourcehut. I trust Drew when he says they are not letting LLM bots have at it.

Its not just the dev either. I'm no longer posting any content on blogs. Almost all of my other online interactions have moved to private channels and closed forums. I'm no longer giving my work away for free, unless you've passed the entry tests.

msgodel · 1h ago
I understand why people initially feel this way but the destruction of copyright is a realization of the end goal of the GPL, furthermore the way LLMs do it doesn't seem to impinge on the way it's used practically: preventing corporate administrators from mishandling source.

I wish pro-copy-left people could see this better. The future is brighter than you think.

e3bc54b2 · 53m ago
LLMs did not destroy copyright. They inky destroyed copyright for the little guy. As I understand it, neither of the FAMANG have put their main codebas3 in training dataset. If I train an LLM on the leaked sources I'll get sued right down to my undies. But all the LLM vendors took my code, betrayed the license term about attribution/viral licensing of derived content, but I am told to get stuffed.

I am assuming you are commenting in good faith, but it does tingle my gaslight-senses.

WJW · 1h ago
No, it hasn't changed my approach. I open source things precisely because I don't care if other people profit from it or not. In fact, I *hope* the users of my code gain a better life because of it, just like how I hope the readers of my blog learn something (however small, sometimes).

Whether that process is intermediated by a LLM or not is not really relevant.

sitkack · 2h ago
In the grand scheme of things, even GPLing your code doesn't matter as the AI is still trained on it. As AI increases in capabilities, it can just read and rewrite your code. And all code is useful in some way.

As mentioned in the comment, private on GH has no bearing, it is still in full sight of the AI.

From what I can tell, OSS submissions are on the rise as people embrace AI to work on things they could not previously.

throwaway290 · 2h ago
> From what I can tell, OSS submissions are on the rise as people embrace AI to work on things they could not previously

https://news.ycombinator.com/item?id=44729461

whs · 1h ago
I kinda want to quit making my code publicly available. It is my believe that the model may output license contaminated code and any license I put into my code will not be accurate, so my policy is I don't use AI coding on my publicly available open source code. However, this also means that I'm working for free fueling the AI industry.

However, it is quite fun to remove the boring part in programming with AI, so any hobby code I write I won't be making them public.

Currently I'm working on a way to use models trained from MIT-licensed code (eg. Comma) by using normal commercial model to supervise it. I believe this make the output code only be tainted with permissive code, and so I can now slowly use AI to write open source code again.

JohnFen · 1h ago
I've drifted away from the OSS subculture years ago, before LLMs. However, I did continue to publicly release source code under permissive licenses. That has absolutely changed because of genAI crawlers. I've removed my sources and have no plans to publicly release any new ones yet. I'll start making my code available again if/when I feel that I can adequately protect it.
jononor · 1h ago
No, the open source code I make is there to be used freely and for all. LLMs do not change this for me.

I think that open source libraries (but maybe not applications) may be even more relevant now than ever. More application code will be written, and there is still a need for correct and reliable components.

throwawa14223 · 1h ago
Specifically I'm looking for licenses that bar loading the code into LLMs. Github is feeding open source code into Copilot and selling it to people and they need to be stopped.
aatd86 · 1h ago
They also provide free hosting. Besides, compute is not free and alledgedly llms require a lot of it. Personally, I would look at the numbers before complaining.

Where you are right though is that it does lower the barrier for people to copy simpler projects. For large ones, the llms are still not up to par.

Unless you want people to pay a llm plugin per-project which is akin to having a subscription service for netflix, hulu, disney plus, amazon prime, etc. Just bad UX. I think there is no fighting it.

robertlagrant · 1h ago
Could you move to Gitlab? Or do they do it too?
1gn15 · 2h ago
Opinion has not changed. I still think information wants to be free, which is why I'm still publishing open source code and encouraging ~~piracy~~ distillations of closed models.
monster_truck · 1h ago
Everything I do differently wrt open source is because of humans and the way they act.

Now that we have LLMs I can make them tell off the people harassing me and generate features that do the opposite of what they asked for

haskman · 1h ago
Yes, I use to throw all my random experiments on github. Now I host my own code repos, block bots as much as possible, and keep most repositories private. Those repos that I do open source (because I want to share with people I know), I release under AGPL
esafak · 2h ago
Yes, differently: I contribute more now that I don't have to spend time getting ramped up on the code base. I'm fixing issues left and right.
aizk · 1h ago
Without my open source projects, my career would not have taken off. Simple as that.
danieldk · 2h ago
For me, I think the value of sharing with other people still outweighs the 'leeching' by proprietary for-profit vendors. Ideally, I'd like to be able to set an option on a repo: no LLM training, only training material for open weight models, and training material for all models. Though sadly, I do not think we could trust all vendors to respect such a flag, so now it is all or nothing.

Personally, I would select only training material for open models. Pandora's box is already open, but if we are going to have LLMs, I want them to be available to everyone and not gated by a small number of companies. Since open models generally have less of a benefit of scale in terms of GPUs, I want them to have the benefit of more available/higher quality training data.

donatj · 2h ago
I've got mixed feelings.

"A little copying is better than a little dependency" becomes much easier to fulfil with AI, in a literal sense, and I support a lot of "little dependencies"

ronsor · 1h ago
I really don't care about LLM training. If I don't want code to be used, I won't share it.
fellowniusmonk · 2h ago
Open Source is one of the most humanistic, progress driving cultures people have ever come up with.

I mean I am fucking shocked that people don't get this, our whole fucking modern world, all the parts that make stuff work, every last bit of it is built on top of or dependent on OSS.

There isn't a single lab, company, person or country that doesn't use and benefit from open source. Whether they know it or not.

It is what has supported widespread fractal improvement starting at the individual level. It's the greatest grassroots story movement ever and it's still driven by grassroot adoption.

It's like programatic peer review writ large with no gate keeping journals and its changed humanity forever, if we ever deflect that asteroid headed towards earth or if we make it to the stars, or if we figure out how to avoid the heat death, it's because open source got us there.

walterbell · 1h ago
> supported widespread fractal improvement starting at the individual level

Fitting that open-source was popularized by people who worked with nanotechnology, https://www.facesofopensource.com/christine-peterson/

throwaway290 · 2h ago
Me I open source less and 100% keep off github all new code unless it's for pay for a client who wants to use github.
incomingpain · 43m ago
I have public open source projects on githubs that collect stars.

I'm currently working on a new project(my first big one from scratch using LLM coding), and using a few open source library. 9x under MIT. 2x BSD 3 clause, and 1 apache 2.0.

None of them are copyleft? I didnt do that intentionally. I dont know what i plan to license it; I typically go gpl. It's private until I decide i guess.

My big 'think differently' is that i gain a bunch of responsibility for the project. Do I want that? Am i ready for this long term commitment?

znpy · 1h ago
Not due to llms.

Put simply… Too much drama, too little technical joy.

I had my best technical achievement writing proprietary software at one faang, if i have to be honest.

calvinmorrison · 1h ago
Long before LLMs, I had code on github as a portfolio, none of it was licensed to be copied or modified. People still forked my code.
nosioptar · 2h ago
I've not released some stuff because I don't want the hassle of dealing with slop in PRs. If someone can't be bothered to write their code, I'm not wasting time reviewing it.

I don't love the idea of my work training an ai without compensation either.

majora2007 · 2h ago
> I don't love the idea of my work training an ai without compensation either.

How is it different then if a Company uses your code without paying you due to a permissive license? It sounds like Open Source doesn't fit your mindset.

nosioptar · 2h ago
I don't use permissive licenses for just that reason. If someone wants to make money off my work, they can cut me in on it or find a different piece if code.
noelwelsh · 2h ago
LLMs have not changed my opinion on contributing to open source. Slop PRs are quite easy to deal with, and I'm not too concerned about LLMs being trained on the fairly niche things I work on. If my projects were more popular I might have a different opinion.
behohippy · 2h ago
Sure, all the slop code projects I produce get MIT licensed on public repos. It wasn't mine to begin with, so I wouldn't prevent anyone from using it.
nisegami · 2h ago
I consider it an honor to have my work be a part of large language models.
fsflover · 3h ago
https://news.ycombinator.com/item?id=44766771

> privately on github

So Microsoft and their AI will have access to your code but people won't.

gillyb · 2h ago
Yeah, I'm aware, but didn't bother yet to go further... I have been thinking about hosting my own git server to work on.

I'm not a prolific open source contributor anyways, but i'm wondering if the tides are changing in regards to open source.

I can imaging that many people in the open source community - even if they're giving away their code for free under the most permissive licenses, still somewhere enjoy the fact that people know they're looking at their code on github or wherever it may be hosted. But now that slowly people shifting to only looking at code via LLM chat apps, then they might want to contirbute less to open source... just wondering... ?

autotune · 2h ago
It's impossible to develop open source software without having a credit/debit card and cell phone present. Feels more like a scam to me than not but I understand it from an operator's perspective; you have to prevent fraud. From a contributor's perspective who had their life derailed for more than a year, it's a pain to get everything back up and running as it was before.
JohnFen · 1h ago
> It's impossible to develop open source software without having a credit/debit card and cell phone present.

This is absolutely not true

01HNNWZ0MV43FF · 2h ago
I don't think I needed a credit card or cell phone to register with codeberg. I did need an email address.

In theory if we wanted to be very inclusive we could do things based on web of trust or P2P tech or in-person sneakernet. It'd be fun, but it'd be slow.

autotune · 2h ago
I'm writing this from a DevOps perspective. I suppose branding all OSS as "impossible to develop without credit/debit card and cell phone" isn't in good taste on my part from a more generalized perspective.