Quansheng GU

古权胜

Building toward impactful AI infrastructure.

I am passionate about high-performance computing and AI infrastructure, currently exploring research opportunities and industry internships, aiming to build impactful systems for efficient inference. FLOPS IS ALL YOUR NEED!!!

AI infrastructure

Efficient inference engines

High-performance computing

Portrait

Current Work

Sparse-vLLM

with Jitai Hao

Sparse-vLLM logo

Project

Sparse-vLLM

A sparse-first inference engine that rethinks KV cache layout, controller flow, and kernels, while also including DeltaKV compressor training and evaluation tooling.

Education

Education

Harbin Institute of Technology, Shenzhen · Computer Science and Technology

Harbin Institute of Technology logo

School

Harbin Institute of Technology, Shenzhen

Major

Computer Science and Technology

Status

Sophomore

Campus

Campus Experience

HPC team work and technical exploration around AI systems.

Special Experience

The 15th National Games Volunteer

Awarded Outstanding Volunteer for event service.

Interests

AI infraEfficient inference enginesHPC systems

Competitions

Competition Experience

Supercomputing and ICT competition records.

Tech Stack

Technical Stack

Languages, accelerators, frameworks, profiling tools, and HPC systems.

Languages

PythonC++

Hardware

NVIDIA GPUAscend NPUKunpeng CPU

Frameworks

PyTorchTensorRTCUDA Graphs

Tools / Systems

Nsight SystemsPyTorch ProfilerApptainerMPISlurm

Contact

Contact