(ROCm) AMD-GPU-Boost: Unlock Full Performance on Consumer AMD GPUs
I've been frustrated with AMD GPU performance in AI/ML applications: my RX 6800 XT was delivering only about 25% of its potential in PyTorch. The issue? ROCm was designed around AMD's MI-series enterprise GPUs and severely underdetects consumer GPU capabilities.
On the RX 6800 XT, ROCm reports only 36 compute units instead of 72, and uses a warp (wavefront) size of 32 instead of the optimal 64 for RDNA2/3. The same underdetection affects the entire RX 6000/7000 series.
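You can see the underdetection with plain PyTorch (this is a generic inspection snippet, not part of AMD-GPU-Boost; the warp_size attribute only exists in newer PyTorch builds, hence the getattr fallback):

    import torch

    # Print what PyTorch's ROCm backend detects for the first GPU
    # (ROCm builds expose the GPU through the torch.cuda namespace).
    props = torch.cuda.get_device_properties(0)
    print("device:", props.name)
    print("compute units:", props.multi_processor_count)     # 36 instead of 72
    print("warp size:", getattr(props, "warp_size", "n/a"))  # added in recent PyTorch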
AMD-GPU-Boost fixes this at runtime by monkey-patching PyTorch's device detection (a minimal sketch of the idea follows below). Results:

- 4x performance improvement in inference
- "NVIDIA-only" apps now run perfectly on AMD
- Works with ComfyUI, Stable Diffusion, WAN 2.1, etc.
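To give a feel for the approach, here's a minimal sketch of the monkey-patching idea. It's illustrative only, not the tool's actual code: the 72/64 values are hardwired for an RX 6800 XT, whereas the tool itself maps values per GPU model.

    import torch

    # Illustrative sketch only, not AMD-GPU-Boost's actual implementation.
    # Values hardwired for an RX 6800 XT; the real tool keeps a per-model table.
    CORRECTED = {"multi_processor_count": 72, "warp_size": 64}

    _real_get_device_properties = torch.cuda.get_device_properties

    class _PatchedProps:
        """Proxy over the real (read-only) properties object,
        overriding the underdetected fields."""
        def __init__(self, props):
            self._props = props
        def __getattr__(self, name):
            if name in CORRECTED:
                return CORRECTED[name]
            return getattr(self._props, name)

    def _patched_get_device_properties(*args, **kwargs):
        return _PatchedProps(_real_get_device_properties(*args, **kwargs))

    # Swap in the patched function so later callers see corrected values.
    torch.cuda.get_device_properties = _patched_get_device_properties

The main caveat with this pattern is ordering: the patch has to land before frameworks query and cache the device properties, which is why a runtime fix like this needs to hook in as early as possible.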
The tool includes a GUI installer for easy Pinokio integration and supports 18+ GPU models from RX 6400 to RX 7900 XTX.
This has been a major pain point for the AMD AI community. Curious what the HN crowd thinks about runtime hardware detection fixes vs. proper driver/framework solutions.
Demo: my RX 6800 XT went from 1152 threads (36×32) to 4608 threads (72×64), exactly what the hardware specs promise.
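If you want to reproduce those numbers, the thread count is just reported compute units × warp size (again assuming a PyTorch build that exposes warp_size):

    import torch

    props = torch.cuda.get_device_properties(0)
    # 36 * 32 = 1152 unpatched; 72 * 64 = 4608 with the patch applied
    print(props.multi_processor_count * props.warp_size)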