Hangrui Cao.
Notes

Recent writing

Drafts and notes on ML systems, LLM inference, and other work I find worth writing about. Updated occasionally, not on a schedule.

01

NCCL vs NVSHMEM: Two Answers to the Same Question

Read essay →
02

Auto-tuning vLLM in Production: A Field Report

Coming soon →
03

Notes from a Non-CUDA Accelerator

Coming soon →