Publications

Publications by categories in reversed chronological order.

2026

  1. SANA-Video: Efficient Video Generation with Block Linear Diffusion Transformer
    Junsong Chen*Yuyang Zhao*Jincheng Yu*Ruihang Chu, and 16 more authors
    In ICLR 2026 Oral
  2. LongLive: Real-time Interactive Long Video Generation
    Shuai YangWei HuangRuihang ChuYicheng Xiao, and 8 more authors
    In ICLR 2026
  3. StreamDiffusionV2: A Streaming System for Dynamic and Interactive Video Generation
    Tianrui FengZhi LiShuo YangHaocheng Xi, and 10 more authors
    In MLSys 2026

2025

  1. Radial Attention: O(nlogn) Sparse Attention with Energy Decay for Long Video Generation
    Xingyang Li*Muyang Li*Tianle CaiHaocheng Xi, and 10 more authors
    In NeurIPS 2025
  2. Sparse VideoGen2: Accelerate Video Generation with Sparse Attention via Semantic-Aware Permutation
    Shuo Yang*Haocheng Xi*Yilong ZhaoMuyang Li, and 10 more authors
    In NeurIPS 2025 Spotlight
  3. Frame Context Packing and Drift Prevention in Next-Frame-Prediction Video Diffusion Models
    Lvmin ZhangShengqu CaiMuyang LiGordon Wetzstein, and 1 more author
    NeurIPS 2025 Spotlight
  4. Sparse VideoGen: Accelerating Video Diffusion Transformers with Spatial-Temporal Sparsity
    Haocheng Xi*Shuo Yang*Yilong ZhaoChenfeng Xu, and 10 more authors
    In ICML 2025
  5. SANA 1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer
    Enze Xie*Junsong Chen*Yuyang ZhaoJincheng Yu, and 10 more authors
    In ICML 2025
  6. STORM: Token-Efficient Long Video Understanding for Multimodal LLMs
    Jindong Jiang*Xiuyu Li*Zhijian LiuMuyang Li, and 12 more authors
    In ICCV CLVL 2025
  7. SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
    Muyang Li*Yujun Lin*Zhekai Zhang*Tianle Cai, and 6 more authors
    In ICLR 2025 Spotlight
  8. Sana: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
    Enze Xie*Junsong Chen*Junyu ChenHan Cai, and 7 more authors
    In ICLR 2025 Oral
  9. Deep Compression Autoencoder for Efficient High-Resolution Diffusion Models
    Junyu Chen*Han Cai*Junsong ChenEnze Xie, and 5 more authors
    In ICLR 2025

2024

  1. Condition-Aware Neural Network for Controlled Image Generation
    Han CaiMuyang LiZhuoyang ZhangQinsheng Zhang, and 2 more authors
    In CVPR 2024
  2. DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models
    Muyang Li*Tianle Cai*Jiaxin CaoQinsheng Zhang, and 6 more authors
    In CVPR 2024 Highlight

2022

  1. Efficient Spatially Sparse Inference for Conditional GANs and Diffusion Models
    Muyang LiJi LinChenlin MengStefano Ermon, and 2 more authors
    In T-PAMI & NeurIPS 2022
  2. Lite Pose: Efficient Architecture Design for 2d Human Pose Estimation
    Yihan WangMuyang LiHan CaiWei-Ming Chen, and 1 more author
    In CVPR 2022

2020

  1. GAN Compression: Efficient Architectures for Interactive Conditional GANs
    Muyang LiJi LinYaoyao DingZhijian Liu, and 2 more authors
    In T-PAMI & CVPR 2020