
28 points by addaon | 1 comment | source
tuananh ◴[] No.42190811[source]
it's 16TB of DDR5 btw
replies(1): >>42190905 #
metadat ◴[] No.42190905[source]
Yes, 128x128.

Good for a database, maybe.

What else?
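
(For anyone checking the arithmetic, a quick Python sketch, assuming "128x128" means 128 DIMMs of 128 GB each — my reading of the comment, not something the poster confirmed.)

    # Capacity check under the assumption of 128 DIMMs x 128 GB each.
    dimm_count = 128
    dimm_size_gb = 128
    total_gb = dimm_count * dimm_size_gb            # 16,384 GB
    print(f"{total_gb} GB = {total_gb / 1024} TB")  # 16384 GB = 16.0 TB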

replies(6): >>42191283 #>>42191285 #>>42191559 #>>42191737 #>>42191960 #>>42192376 #
HeatrayEnjoyer ◴[] No.42191960[source]
A half dozen GPT-4 instances
replies(1): >>42195659 #
metadat ◴[] No.42195659[source]
LLM inference processors (GPUs) don't use DDR; they use special, costly stacked HBM RAM mounted directly on the GPU package.

I tested running Llama on a 512GB machine; it's rather slow and inefficient, maybe 1 token/sec.
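
(Back-of-envelope on why DDR-based inference crawls — a sketch under assumptions, not a benchmark: dense-model generation is roughly memory-bandwidth bound, since every generated token has to stream all the weights out of RAM. The model size and bandwidth figures below are illustrative guesses, not specs for this machine.)

    # tokens/sec ~= usable memory bandwidth / bytes of weights read per token
    def tokens_per_sec(model_size_gb: float, bandwidth_gb_s: float) -> float:
        return bandwidth_gb_s / model_size_gb

    model_size_gb = 140  # e.g. a ~70B-parameter model at fp16 (assumed)

    # Ballpark bandwidths (assumed): server DDR5 roughly 100-400 GB/s per
    # socket, HBM-class GPU memory roughly 2000-3000 GB/s.
    print(tokens_per_sec(model_size_gb, 100))   # ~0.7 tok/s -- consistent with "maybe 1 token/sec"
    print(tokens_per_sec(model_size_gb, 3000))  # ~21 tok/s on HBM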