Next.js App Router + React Server Components Demo

new
past
show
ask
show
jobs
submit

▲Decoupling Compute and Memory for Async GPUs

8 points by yiyingzhang 21 hours ago | 2 comments

bobbyzhu2008 21 hours ago [-]

67% less kernel code is the more interesting number here — Hopper's async capabilities have been underutilized largely because the programming model is painful. Curious how it handles cases where compute and memory phases aren't cleanly separable.

jhap 19 hours ago [-]

This seems like a better version of CUDA, for Hopper GPUs?

preetham_rangu 7 hours ago [-]

[dead]

Rendered at 14:24:41 GMT+0000 (Coordinated Universal Time) with Vercel.