Guanxiong Luo
  • generative models
  • diffusion models
  • linear algebra
  • computational imaging
  • inverse problems
  • tools
  • coding

  • Tensor Parallel + Sequence Parallel — A Deep Dive

    17 min read   ·   April 30, 2026

    2026

  • ZeRO-1 Distributed Optimizer - A Deep Dive

    19 min read   ·   April 30, 2026

    2026

  • Let agent play with agent in shell script

    7 min read   ·   April 28, 2026

    2026

  • Optimizing softmax on GPU

    7 min read   ·   December 29, 2025

    2025   ·   cuda   self-attention   numerical computation   coding  

  • The details of flash attention - algorithm

    5 min read   ·   December 17, 2025

    2025   ·   generative models   self-attention   coding  

© Copyright 2026 Guanxiong Luo. Powered by Jekyll with al-folio theme. Hosted by GitHub Pages. Last updated: May 12, 2026.