Picture for Anne Ouyang

Anne Ouyang

CuTeGen: An LLM-Based Agentic Framework for Generation and Optimization of High-Performance GPU Kernels using CuTe

Add code
Apr 01, 2026
Viaarxiv icon

Astra: A Multi-Agent System for GPU Kernel Performance Optimization

Add code
Sep 09, 2025
Viaarxiv icon

Accelerating Training and Inference of Graph Neural Networks with Fast Sampling and Pipelining

Add code
Oct 16, 2021
Figure 1 for Accelerating Training and Inference of Graph Neural Networks with Fast Sampling and Pipelining
Figure 2 for Accelerating Training and Inference of Graph Neural Networks with Fast Sampling and Pipelining
Figure 3 for Accelerating Training and Inference of Graph Neural Networks with Fast Sampling and Pipelining
Figure 4 for Accelerating Training and Inference of Graph Neural Networks with Fast Sampling and Pipelining
Viaarxiv icon