Show HN: PyTorch K-Means GPU-friendly, single-file, hierarchical and resampling
I was working on dataset sampling and approximate nearest neighbor search, and tried several existing libraries for large-scale K-Means. I couldn't find something that was fast, simple, and would run comfortably on my own workstation without hitting memory limits. Maybe I missed an existing solution, but I ended up writing one that fit my needs.
The core insight: Keep your data on CPU (where you have more RAM) and intelligently move only the necessary chunks to GPU for computation during the iterative steps. Results always come back to CPU for easy post-processing. (Note: For K-Means++ initialization when computing on GPU, the full dataset still needs to fit on the GPU.)
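To make the chunking idea concrete, here is a minimal sketch of the assignment step under that pattern. This is an illustration of the general technique, not `pt_kmeans`'s actual code; the function name, chunk size, and signature are hypothetical.

```python
import torch

def assign_chunked(x_cpu, centers, device="cuda", chunk_size=65536):
    # Centers are small, so they can live on the accelerator for the whole pass.
    centers_dev = centers.to(device)
    labels = torch.empty(x_cpu.shape[0], dtype=torch.long)
    for start in range(0, x_cpu.shape[0], chunk_size):
        # Move only the current slice to the device...
        chunk = x_cpu[start:start + chunk_size].to(device)
        # ...compute pairwise L2 distances there (chunk_size x k matrix)...
        d = torch.cdist(chunk, centers_dev)
        # ...and bring only the per-point labels back to CPU.
        labels[start:start + chunk_size] = d.argmin(dim=1).cpu()
    return labels
```

Peak device memory is bounded by the chunk and the distance matrix rather than the full dataset, which is what keeps large inputs from triggering OOM.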
It offers a few practical features:
- Chunked Computations: Memory-efficient processing of large datasets by only moving necessary data chunks to the GPU, preventing Out-Of-Memory errors
- Cluster splitting: Refine existing clusters by splitting a single cluster into multiple sub-clusters
- Zero Dependencies: Single file, only requires PyTorch. Copy-paste into any project
- Advanced Clustering: Hierarchical K-Means with optional resampling (following recent research)
- Device Flexibility: Explicit device control - data can live anywhere, computation happens where you specify (any accelerator PyTorch supports)
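To illustrate the cluster-splitting idea from the list above: take the members of one cluster and run K-Means on just those points. A self-contained sketch (again, not the library's implementation; it uses deterministic farthest-point initialization for simplicity and assumes the cluster is non-degenerate):

```python
import torch

def split_cluster(x, labels, cluster_id, k=2, iters=20):
    # Refine one existing cluster by running plain Lloyd's K-Means
    # on its member points only.
    pts = x[labels == cluster_id]
    # Farthest-point initialization keeps this sketch deterministic.
    centers = pts[0:1]
    for _ in range(k - 1):
        d = torch.cdist(pts, centers).min(dim=1).values
        centers = torch.cat([centers, pts[d.argmax()].unsqueeze(0)])
    for _ in range(iters):
        assign = torch.cdist(pts, centers).argmin(dim=1)
        # Assumes every sub-cluster stays non-empty (true for the simple init above
        # on well-separated data); production code would guard this.
        centers = torch.stack([pts[assign == j].mean(dim=0) for j in range(k)])
    return centers, assign
```

The returned sub-cluster labels can then be remapped into the global label space to replace the original cluster.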
Future plans:
- Add support for memory-mapped files to handle even bigger datasets
- Explore PyTorch distributed for multi-node K-Means
The implementation handles both L2 and cosine distances and includes K-Means++ initialization. It's available on PyPI (`pip install pt_kmeans`), and the full implementation is at: https://gitlab.com/hassonofer/pt_kmeans
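On the cosine-distance support: a common way to implement it (not necessarily how `pt_kmeans` does) is to L2-normalize both points and centers, after which nearest-by-cosine reduces to a single matrix product:

```python
import torch
import torch.nn.functional as F

def cosine_assign(x, centers):
    # After row-wise L2 normalization, cosine similarity is a plain dot product.
    xn = F.normalize(x, dim=1)
    cn = F.normalize(centers, dim=1)
    sim = xn @ cn.T            # (n, k) cosine similarities in [-1, 1]
    return sim.argmax(dim=1)   # nearest center = highest similarity
```

This also composes naturally with the chunked CPU-to-GPU pattern, since the normalization can be done per chunk.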
Would love feedback on the approach and any use cases I might have missed!