Show HN: Inspect and extract files from MSI installers directly in your browser (pymsi.readthedocs.io)

Earlier this year I led our migration off AWS to European cloud (Hetzner + OVHcloud), driven by cost (we cut 90%) and data sovereignty (GDPR + CLOUD Act concerns).

We rebuilt key AWS features ourselves using Terraform for VPS provisioning, and Ansible for everything from hardening (auditd, ufw, SSH policies) to rolling deployments (with Cloudflare integration). Our Prometheus + Alertmanager + Blackbox setup monitors infra, apps, and SSL expiry, with ISO 27001-aligned alerts. Loki + Grafana Agent handle logs to S3-compatible object storage.

The stack includes: • Ansible roles for PostgreSQL (with automated s3cmd backups + Prometheus metrics) • Hardening tasks (auditd rules, ufw, SSH lockdown, chrony for clock sync) • Rolling web app deploys with rollback + Cloudflare draining • Full monitoring with Prometheus, Alertmanager, Grafana Agent, Loki, and exporters • TLS automation via Certbot in Docker + Ansible

I wrote up the architecture, challenges, and lessons learned: https://medium.com/@accounts_73078/goodbye-aws-how-we-kept-i...

I’m happy to share insights, diagrams, or snippets if people are interested — or answer questions on pitfalls, compliance, or cost modeling.

Comments (24)

jillesvangurp · 3m ago

> We rebuilt key AWS features ourselves

At what cost? People usually exclude the cost of DIY style hosting. Which usually is the most expensive part. Providing 24x7 support for the stuff that you've home grown alone is probably going to make large dent into any savings you got by not outsourcing that to amazon.

> $24,000 annual bill felt disproportionate

That's around 1-2 months of time for a decent devops freelancer. If you underpay your devs, about 1/3rd of an FTE per year. And you are not going to get 24x7 support with such a budget.

This still could make sense. But you aren't telling the full story here. And I bet it's a lot less glamorous when you factor in development time for this.

Don't get me wrong; I'm actually considering making a similar move but more for business reasons (some of our German customers really don't like US hosting companies) than for cost savings. But this will raise cost and hassle for us and I probably will need some re-enforcements on my team. As the CTO, my time is a very scarce commodity. So, the absolute worst use of my time would be doing this myself. My focus should be making our company and product better. Your techstack is fine. Been there done that. IMHO Terraform is overkill for small setups like this; fits solidly in the YAGNI category. But I like Ansible.

Keyframe · 1h ago

I think the most often mentioned problems mentioned are pollution of Hetzner addresses by shady people (might be addressed with "exits" from AWS / Cloudflare) and you are running on hardware which does tend to fail / needs upgrades. Were there some concerns on those from you?

Also, Loki! How do you handle memory hunger on loki reader for those pesky long range queries, and are there alternatives?

sksjvsla · 1h ago

Pollution: We front everything user-facing through Cloudflare, so external users (and bots) don’t interact directly with our Hetzner/OVH IPs. We lock down our IPs at the firewall (ufw + Cloudflare IP allowlisting) so only trusted sources can even connect at L4.

Failures/upgrades: We provision with Terraform, so spinning up replacements or adding capacity is fast and deterministic.

We monitor hardware metrics via Prometheus and node exporter to get early warnings. So far (9 months in) no hardware failure, but it’s a risk we offset through this automation + design.

Apps are mostly data-less and we have (frequently tested) disaster recovery for the database.

Loki: We’re handling the memory hunger by

• Distinguishing retention limits and index retention

• Tuning query concurrency and max memory usage via Loki'’'s config + systemd resource limits.

• Use Promtail-style labels + structured logging so queries can filter early rather than regex the whole log content.

• Where we need true deep history search, we offload to object store access tools or simple grep of backups — we treat Loki as operational logs + nearline, not as an archive search engine.

Keyframe · 1h ago

Thanks for thorough answer! Seems like you've platformized(!) yourself to an extent, have you considered going full on with k8s on top of metal (their machines) to offset some of the concerns about hardware?

sksjvsla · 1h ago

Thanks for the compliment.

We used AWS EKS in the old days and we never liked the extreme complexity of it.

With two Spring Boot apps, a database and Redis running across Ubuntu servers, we found simpler tools to distribute and scale workloads.

Since compute is dirt cheap, we over-provision and sleep well.

We have live alerts and quarterly reviews (just looking at a dashboard!) to assess if we balance things well.

K8s on EKS was not pleasant, I wanna make sure I never learn how much worse it can get across European VPS providers.

sksjvsla · 1h ago

A good alternatives for Loki is Victoria. Popular, way more performant and reputable but we went with Loki because of the relative size and diversity of maintainers between the two projects. Your points are super valid and we worked around it as mentioned above.

TZubiri · 1h ago

https://en.wikipedia.org/wiki/Sybil_attack

One of the advantages of more expensive providers seems to be that they have good reputation due to a de facto PoW mechanism.

sksjvsla · 1h ago

Depends on the use case, right? I don’t accept traffic from random Hetzner IPs — only Cloudflare’s IPs are allowed.

The only potential indirect risks is if your Hetzner VPS IP range gets blacklisted (because some Hetzner clients abuse it for Sybil attacks or spam).

Or if Hetzner infrastructure was heavily abused, their upstream or internal networking could (in theory) experience congestion or IP reputation problems — but this is very unlikely to affect your individual VPS performance.

This depends on what you are doing on Hetzner and how you restrict access but for an ISO-27001 certified enterprise app, I believe this is extremely unlikely.

saltysalt · 1h ago

I love Hetzner, I run my Internet search engine from there: bare metal FTW.

ArtTimeInvestor · 45m ago

How did you decide on Hetzner and OVH and why do you need both?

Have you looked into others as well, like IONOS and Scaleway?

sksjvsla · 38m ago

Great question. Technically speaking I might not need both, but I have a gut feeling that one of these cloud providers might not be as hardened as the hyperscalers, and that Russia is just waiting to put one of these two services down. So for maximal resiliency I chose to design from a multi-cloud setup from the beginning.

Scaleway came up but is more expensive. IONOS did not come up in our research.

Part of what we tried to do was to make ourselves independent from traditional cloud services and be really good at doing stuff on a VPS. Once you start doing that, you can actually allow yourself to look more at uptimes and at costs. Also, since we wanted everything to be fully automated, Terraform support was important for us, and OVHcloud and Hetzner had that.

I'm sure there's many great cloud providers out in Europe, but it's hard to vet them to understand if they can meet demand and if they are financially stable. We would want not to keep switching cloud providers. So picking two of the major ones seemed like a safe choice.

handfuloflight · 10m ago

What would Russia's interests be in putting these ISPs down, specifically?

sksjvsla · 5m ago

Without making it too political and speculating on things I don't know, I, like many other Europeans, have seen plenty of cases of Russia ruining infrastructure projects in Europe, everything from internet cables on the ocean bed, telcos, water supplies, railways and more. Authorities are asking civilians in Scandinavia to be prepare their hiused with. Good and water and are actively hardening security around critical infrastructure, including their software. I won't comment more on this because it's gonna derail this discussion.

jordanbeiber · 1h ago

Same here, but Azure. About 90% saved, with a very similar stack.

It is a great big cloud play to make enterprises reliant on the competency in their weird service abstractions, which is slowly draining the quite simple ops story an enterprise usually needs.

ed_mercer · 59m ago

Can you please elaborate how Azure is cheaper?

jordanbeiber · 40m ago

”Same here” meaning moving to Hetzner, but from Azure - could’ve made it less ambiguous!

Might throw together a post on it eventually:

https://news.ycombinator.com/context?id=43216847

miyuru · 41m ago

I think the parent meant that they moved from Azure to Hetzner.

sokoloff · 1h ago

Might be interesting, but doesn’t seem to be a valid “Show HN”

* - https://news.ycombinator.com/showhn.html

nopakos · 1h ago

I think a European CloudFlare would be nice to exist.

abc123abc123 · 1h ago

No problem! https://bunny.net/about/ Enjoy!

miyuru · 39m ago

bunny still don't support IPv6 to origin, or else I would have switched.

sksjvsla · 1h ago

Yes, it would be nice. Given Cloudflare's dev-friendly branding for some reason, I did not mind keeping it.

louwrentius · 52m ago

I'm involved with a cloud migration myself so I like the topic, but the Medium article contains less information than this "Shown HN" post.

The Medium post is mostly fluff and a lead generator.

sksjvsla · 48m ago

The Medium post is more of a high-level case study for a mixed audience (including non-technical decision makers). I intentionally kept the details lighter there, partly to avoid overwhelming readers and partly because the real “meat” (like our Ansible/Terraform patterns, Prometheus config, etc.) is harder to convey in that format without turning it into a giant technical appendix.

I’m happy to share specific configs, diagrams, or lessons learned here on HN if people want — and actually I’m finding this thread a much better forum for that kind of deep dive.

I'll dive into other aspects elsewhere: You can't doubt that given what I am sharing here.

Any particular area you’d like me to expand on? (e.g. how we structured Terraform modules, Ansible hardening, Prometheus alerting, Loki tuning?)

Show HN: We moved from AWS to Hetzner, saved 90%, kept ISO 27001 with Ansible (medium.com)

Show HN: A color name API that maps hex to the closest human-readable name (meodai.github.io)

Show HN: MMOndrian (mmondrian.com)

Show HN: Nxtscape – an open-source agentic browser (github.com)

Show HN: Inspect and extract files from MSI installers directly in your browser (pymsi.readthedocs.io)

Show HN: SnapQL – Desktop app to query Postgres with AI (github.com)

Show HN: I Built a Site That Curates Weird YouTube Rabbit Holes Daily (yourabbit.com)

Show HN: I wrote a new BitTorrent tracker in Elixir (github.com)

Show HN: EnrichMCP – A Python ORM for Agents (github.com)

Show HN: Ts-SSH – SSH over Tailscale without running the daemon (github.com)

Show HN: SecureBuild – Zero-CVE Images That Pay OSS Projects (securebuild.com)

Show HN: Unregistry – “docker push” directly to servers without a registry (github.com)

Show HN: RM2000 Tape Recorder, an audio sampler for macOS (rm2000.app)

Show HN: Workout.cool – Open-source fitness coaching platform (github.com)

Show HN: A DOS-like hobby OS written in Rust and x86 assembly (github.com)

Show HN: MCP to log your AI agents progress (taskerio.com)

Show HN: Tool to Automatically Create Organized Commits for PRs (github.com)

Show HN: Claude Code Usage Monitor – real-time tracker to dodge usage cut-offs (github.com)

Show HN: TypeScript MCP Server (github.com)

Show HN: Tree-hugger-JS: CSS selectors for JavaScript AST analysis and MCP

Show HN: Vpuna AI Search – A semantic search platform (aisearch.vpuna.com)

Show HN: Sexprs – Lisp dialect written in Rust (github.com)

Show HN: TrendFi – I built AI trading signals that self-optimize (trend.fi)

Show HN: Turbine – 16-bit CPU Architecture and Emulator built in C (errorcodezero.dev)

Show HN: I built a tensor library from scratch in C++/CUDA (github.com)

Show HN: Pickaxe – a TypeScript library for building AI agents (github.com)

Show HN: Onri, the Google Map for micro-learning (onri.ai)

Show HN: wasque – Lightweight Cloudlare Warp Proxy Container for Linux (github.com)

Show HN: Lstr – A modern, interactive tree command written in Rust (github.com)

Show HN: Free local security checks for AI coding in VSCode, Cursor and Windsurf

Show HN: Brisqi – privacy-first offline Kanban board that works locally (brisqi.com)

Show HN: Semantic search and ask your Gmail using Local LLMs (github.com)

Show HN: I made an online Unicode Cuneiform digital clock (oisinmoran.com)

Show HN: Nexus.js - Fabric.js for 3D (punk.cam)

Show HN: Kichan.ai, Chrome extension generates JavaScript to augment any website (kichan.ai)

Show HN: Trieve CLI – Terminal-based LLM agent loop with search tool for PDFs (github.com)

Show HN: ConfigKit – in browser Mac defaults creator (explosion-scratch.github.io)

Show HN: Picomatch – A tiny C library for evaluating regular expressions (github.com)

Show HN: Pomodoro Plaza – multi-timer Pomodoro app with activity heatmap (github.com)

Show HN: I made an app that lets you save audio to your Cameral Roll (justsendrecord.com)

Show HN: Chawan TUI web browser (chawan.net)

Show HN: Canine – A Heroku alternative built on Kubernetes (github.com)

Show HN: Relix: A Unix-like OS based on MIT's xv6 (github.com)

Show HN: Atomic – A to-do app inspired by GitHub squares and Atomic Habits (github.com)

Show HN: PlutoBook – Fast, lightweight C++ library for generating PDF from HTML (github.com)

Show HN: Gifty – A real-world gift hunt you play with your feet (gifty-en.vercel.app)

Show HN: Delve, an open source (AGPL) enterprise-grade data analytics platform (github.com)

Show HN: Tiny Hoare logic verifier using SMT (github.com)

Show HN: VS Code extension to share code snippets instantly (snippetshare.dev)

Show HN: dk – A script runner and cross-compiler, written in OCaml (diskuv.com)

Show HN: We moved from AWS to Hetzner, saved 90%, kept ISO 27001 with Ansible

Comments (24)