AMD's Freshly-Baked MI350: An Interview with the Chief Architect

55 pella 23 6/20/2025, 9:20:46 PM chipsandcheese.com ↗

Comments (23)

pella · 5h ago

FP6:

  "Alan: Sure, yep, so one of the things that we felt like on MI350 in this  timeframe, that it's going into the market and the current state of AI... we felt like that FP6 is a format that has potential to not only be used for inferencing, but potentially for training. And so we wanted to make sure that the capabilities for FP6 were class-leading relative to... what others maybe would have been implementing, or have implemented. And so, as you know, it's a long lead time to design hardware, so we were thinking about this years ago and wanted to make sure that MI350 had leadership in FP6 performance. So we made a decision to implement the FP6 data path at the same throughput as the FP4 data path. Of course, we had to take on a little bit more hardware in order to do that. FP6 has a few more bits, obviously, that's why it's called FP6. But we were able to do that within the area of constraints that we had in the matrix engine, and do that in a very power- and area-efficient way.

treesciencebot · 1h ago

the main question is going to be software stack. NVIDIA is already shipping NVFP4 kernels and perf is looking good. It took a really long time after MI300X's that the FP8 kernels were OK (not even good, compared to almost perfect FP8 support in NVIDIA side of things).

I will doubt that they will be able to reach %60-70 of the FLOPs in majority of the workloads (unless they hand craft and tune a specific GEMM kernel for their benchmark shape). But would be happy to be proven wrong, and go buy a bunch of them

pella · 1h ago

(related)

Tinygrad:

  "We've been negotiating a $2M contract to get AMD on MLPerf, but one of the sticking points has been confidentiality. Perhaps posting the deliverables on X will help legal to get in the spirit of open source!"

   "Contract is signed! No confidentiality, AMD has leadership that's capable of acting. Let's make this training run happen, we work in public on our Discord.

" https://x.com/__tinygrad__/status/1935364905949110532

kristianp · 3h ago

Will 1.58 bits be in the MI400? Or is it not established as a widely useful technology yet?

See https://arxiv.org/abs/2402.17764

teleforce · 2h ago

This 8-combo MI350 is a beauty with 2304 GB VRAM of HMB3E memory on each UBB [1].

[1] This is the AMD Instinct MI350:

https://www.servethehome.com/this-is-the-amd-instinct-mi350/

behnamoh · 3h ago

Does this also ship only in x8 batches? I really liked MI300 and could afford one of them for my research, but they only come in batches of x8 in a server rack, so I decided to buy an RTX Pro 6000.

jiggawatts · 2h ago

Of course not.

AMD stubbornly refuses to recognise the huge numbers of low- or medium- budget researchers, hobbyists, and open source developers.

This ignorance of how software development is done has resulted in them losing out on a multi-trillion-dollar market.

It's incredible to me how obstinate certain segments of the industry (such as hardware design) can be.

rfv6723 · 1h ago

These ppl are very loud online, but they don't make decisions for hyperscalers which are biggest spenders on AI chips.

AMD is doing just fine, Oracle just announced an AI cluster with up to 131,072 of AMD's new MI355X GPUs.

AMD needs to focus on bringing rack-scale mi400 as quickly as possible to market, rather than those hobbyists always find something to complain instead of spending money.

behnamoh · 1h ago

> these people

we're talking about the majority of open source developers (I'm one of them). if researchers don't get access to hardware X, they write their paper using hardware Y (Nvidia). AMD isn't doing fine because most low level research on AI is done purely on CUDA.

qualifiedeephd · 1h ago

Serious researchers use HPC clusters not desktop workstations. Currently the biggest HPC cluster in North America has AMD GPUs. I think it'll be fine.

almostgotcaught · 1h ago

> These ppl are very loud online, but they don't make decisions for hyperscalers which are biggest spenders on AI chips.

this guy gets it - absolutely no one cares about the hobby market because it's absolutely not how software development is done (nor is it how software is paid for).

pstuart · 16m ago

The hobby market should be considered as a pipeline to future customers. It doesn't mean AMD should drop everything and cater specifically to them, but they'd be foolish to ignore them altogether.

gdiamos · 2h ago

startups and researchers are broke, just like Geoff Hinton in 2006 - https://blog.waqasrana.me/assets/papers/hinton2006.pdf

AbuAssar · 1h ago

If MI350 employs CDNA, which is based on the VEGA (GCN) architecture, does that imply that MI400, when introduced next year, will skip the 2020 GCN and directly transition to RDNA 5 equivalent?

pella · 1h ago

2026 - MI400X - CDNA 5 - UALink/IF - Helios - HBM Bandwidth: 1,400 TB/s

https://www.tomshardware.com/pc-components/gpus/amd-says-ins...

rfv6723 · 44m ago

RDNA is a dead-end.

AMD went down the wrong path by focusing on traditional rendering instead of machine learning.

I think future AMD consumer GPUs would go back to GCN.

almostgotcaught · 1h ago

it's all just called gcn now

https://llvm.org/docs/AMDGPUUsage.html#id38

tedunangst · 3h ago

A solid 40% of George's questions were deemed great. (Not counting some fluff like what's your job.)

deadbabe · 3h ago

Will AMD catch up to Nvidia?

mobilio · 2h ago

If they improve software quality and providing some low budget versions then - Yes.

wmf · 3h ago

Yes, if they can ship on time.

zombiwoof · 2h ago

They don’t care to catch up.

jonfromsf · 2h ago

NVDAs advantage is software, not just hardware. Would be amazing to have a competitive market but better hardware won't be enough to make it happen.

Ask HN: What do you think about app native vs. portable look-and-feel?

Free Virtual CS Classes and Tutoring

Ask HN: What newspaper are you paying for these days?

Ask HN: What is the equivalent to Win32 on Linux

Ask HN: Tips for hiring? It has been difficult

Ask HN: Advice about transitioning to remote role?

Stripe alternative for India for payment processing

Ask HN: Is AI 'context switching' exhausting?

Ask HN: AI agents and the future of UI/UX design. Opinions?

Ask HN: Data engineers, What suck when working on exploratory data-related task?

BMW ConnectedDrive lets me control my returned rental car (Sixt)

Ask HN: What cool skill or project interests you, but feels out of reach?

Ask HN: Tech people who are self employed. How do you do it?

Is there a way to run an LLM as a better local search engine?

Ask HN: Is cloud infra making us forget the local file system and memory?

I would enjoy an HN chat. Is there one?

Ask HN: For a team experienced with LLMs – Any concrete reason to use LangGraph?

Tell HN: Help restore the tax deduction for software dev in the US (Section 174)

Ask HN: How do I give back to people helped me when I was young and had nothing?

Khalifa University and Knowledge E to Organise AI Futures Summit in Abu Dhabi

Ask HN: How should I spend 10 weeks delving into AI?

Ask HN: What happens post ESOPs vesting period is over at a startup?

Ask HN: What is your fallback job if AI takes away your career?

Is GitHub Down?

Ask HN: How to learn CUDA to professional level

Ask HN: How to Deal with a Bad Manager?

Ask HN: In a guide to inner work for founders and engs, what topics to cover?

"A Crowd-Driven Platform That Lets People Vote

AMD's Freshly-Baked MI350: An Interview with the Chief Architect

Comments (23)