Jintao Zhang jt-zhang

A PhD student at Tsinghua University, focusing on efficient training and inference of large models.

198 followers · 64 following

@thu-ml, Tsinghua University
Beijing, China
https://jt-zhang.github.io/

jt-zhang/README.md

Hi 😊

I am a first-year PhD student in the CS Dept. at Tsinghua University, focusing on efficient training and inference of large models.

🏠 My Homepage.

WeChat ID: Zjt_Tete

Pinned

thu-ml/SageAttention

Quantized attention that achieves speedups of 2-5x and 3-11x over FlashAttention and xformers, respectively, without losing end-to-end metrics across language, image, and video models.

CUDA · 2.3k stars · 206 forks

thu-ml/SpargeAttn

SpargeAttention: training-free sparse attention that can accelerate inference for any model.

CUDA · 696 stars · 55 forks

CardinalityEstimationTestbed

CardinalityEstimationTestbed

Python · 47 stars · 14 forks

Sparse_SageAttention_API

Python · 51 stars · 6 forks

attention-survey/Efficient_Attention_Survey

A Survey of Efficient Attention Methods: Hardware-efficient, Sparse, Compact, and Linear Attention

170 stars · 4 forks