7

An NVIDIA CUDA core is 1 FP32 ALU, not 1/32 of an ALU. An ALU processes one FP32...

 2 years ago
source link: https://twitter.com/RyanSmithAT/status/1450216681326796800
Go to the source link to view the article. You can view the picture content, updated content and better typesetting reading experience. If the link is broken, please click the button below to view the snapshot at that time.
Don’t miss what’s happening
People on Twitter are the first to know.

Tweet

See new Tweets

Conversation

gpus are vector processors. each instruction in a shader gets executed on multiple pixels (32 at a time for nvidia, 24 for apple). 3080 has 68 sm, 272 shader units, and 8704 "cores", 3080ti has 80:320:10240. m1 max has 32:4096:98304 but much lower clock?

Apple's current GPU architecture offers 128 FP32 ALUs per "core", which is similar to an Ampere SM. So M1 Max is powerful at 4096 ALUs, but that's still well under a high-end NV GPU.

But again, ALUs are not 1:1 comparable in real-world performance

again, the number that nvidia is calling a core.. is 1/32 of an execution unit or alu. it seriously looks by all the numbers that apple has -more- raw hardware by threads and transistors.. and is just running it at a fraction of the ghz for power efficiency.

An NVIDIA CUDA core is 1 FP32 ALU, not 1/32 of an ALU. An ALU processes one FP32 operation per cycle.

And you can ignore the thread number. That's for developers, who need to know how many threads they can switch between

but i am a developer? gpu threads are not at all like cpu/os threads, because -the same instruction- is run simultaneously on all 32 threads in the "shader processor"/alu/execution unit. a 3090 has 328 shader processors, each run 1 instruction on 32 fragments = 10240 "cores".
The scheduling has become successively more free in later gens. It's still precisely that, an FP32 ALU that is partially run in sync with 31 other FP32 ALUs. Apple has 32 cores, 16 EUs, 8 ALUs per EU = 4,096 for full chip.
Show replies
New to Twitter?
Sign up now to get your own personalized timeline!
Sign up with Apple
By signing up, you agree to the Terms of Service and Privacy Policy, including Cookie Use.
Relevant people

science and computing nerd : she/her : 🏳️‍⚧️ : black lives matter

(ex-mercurial/linux/facebook/cisco/etc)

Amateur scholar of First World War -- start of the worst conflict human kind has ever seen. Tongue-hero. Lousy ideologue. Aspiring hesher.

About Joyk


Aggregate valuable and interesting links.
Joyk means Joy of geeK