[go: up one dir, main page]

Follow
Guyue Huang
Guyue Huang
Verified email at nvidia.com - Homepage
Title
Cited by
Cited by
Year
Machine learning for electronic design automation: A survey
G Huang, J Hu, Y He, J Liu, M Ma, Z Shen, J Wu, Y Xu, H Zhang, K Zhong, ...
ACM Transactions on Design Automation of Electronic Systems (TODAES) 26 (5 …, 2021
4222021
GE-SpMM: General-purpose Sparse Matrix-Matrix Multiplication on GPUs for Graph Neural Networks
G Huang, G Dai, W Yu, Y Huazhong
Proceedings of the International Conference for High Performance Computing …, 2020
1982020
{TC-GNN}: Bridging Sparse {GNN} Computation and Dense Tensor Cores on {GPUs}
Y Wang, B Feng, Z Wang, G Huang, Y Ding
2023 USENIX Annual Technical Conference (USENIX ATC 23), 149-164, 2023
902023
Understanding GNN Computational Graph: A Coordinated Computation, IO, and Memory Perspective
H Zhang, Z Yu, G Dai, G Huang, Y Ding, Y Xie, Y Wang
Proceedings of Machine Learning and Systems 4, 467-484, 2022
812022
Lightseq2: Accelerated training for transformer-based models on gpus
X Wang, Y Wei, Y Xiong, G Huang, X Qian, Y Ding, M Wang, L Li
SC22: International Conference for High Performance Computing, Networking …, 2022
492022
Mixdq: Memory-efficient few-step text-to-image diffusion models with metric-decoupled mixed precision quantization
T Zhao, X Ning, T Fang, E Liu, G Huang, Z Lin, S Yan, G Dai, Y Wang
European Conference on Computer Vision, 285-302, 2024
352024
Heuristic adaptability to input dynamics for spmm on gpus
G Dai, G Huang, S Yang, Z Yu, H Zhang, Y Ding, Y Xie, H Yang, Y Wang
Proceedings of the 59th ACM/IEEE Design Automation Conference, 595-600, 2022
352022
Alcop: Automatic load-compute pipelining in deep learning compiler for ai-gpus
G Huang, Y Bai, L Liu, Y Wang, B Yu, Y Ding, Y Xie
Proceedings of Machine Learning and Systems 5, 680-694, 2023
302023
Shfl-BW: accelerating deep neural network inference with tensor-core aware weight pruning
G Huang, H Li, M Qin, F Sun, Y Ding, Y Xie
Proceedings of the 59th ACM/IEEE Design Automation Conference, 1153-1158, 2022
222022
RM-STC: Row-Merge Dataflow Inspired GPU Sparse Tensor Core for Energy-Efficient Sparse Acceleration
G Huang, Z Wang, PA Tsai, C Zhang, Y Ding, Y Xie
Proceedings of the 56th Annual IEEE/ACM International Symposium on …, 2023
202023
Exploiting Online Locality and Reduction Parallelism for Sampled Dense Matrix Multiplication on GPUs
Z Yu, G Dai, G Huang, Y Wang, H Yang
2021 IEEE 39th International Conference on Computer Design (ICCD), 567-574, 2021
142021
{OPER}:{Optimality-Guided} Embedding Table Parallelization for Large-scale Recommendation Model
Z Wang, Y Wang, B Feng, G Huang, D Mudigere, B Muthiah, A Li, Y Ding
2024 USENIX Annual Technical Conference (USENIX ATC 24), 667-682, 2024
52024
Efficient Sparse Matrix Kernels based on Adaptive Workload-Balancing and Parallel-Reduction
G Huang, G Dai, Y Wang, Y Ding, Y Xie
arXiv preprint arXiv:2106.16064, 2021
52021
Enabling Efficient Sparse Multiplications on GPUs with Heuristic Adaptability
J Xu, S Huang, J Li, G Huang, Y Xie, Y Wang, G Dai
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 2024
32024
Low-power in-pixel buffer circuit for smart image sensor
Q Li, H Zhu, G Huang, Z Yu, F Qiao, Q Wei, X Liu, H Yang
Sensor Review 40 (5), 585-590, 2020
12020
TRACI: Network Acceleration of Input-Dynamic Communication for Large-Scale Deep Learning Recommendation Model
G Huang, H Li, L Qin, J Huang, Y Kang, Y Ding, Y Xie
Proceedings of the 52nd Annual International Symposium on Computer …, 2025
2025
{GMI-DRL}: Empowering {Multi-GPU}{DRL} with {Adaptive-Grained} Parallelism
Y Wang, B Feng, Z Wang, G Huang, TT Geng, A Li, Y Ding
2025 USENIX Annual Technical Conference (USENIX ATC 25), 89-103, 2025
2025
Warp execution method and associated GPU
Y Gao, F Sun, H Li, G Huang, C Zhang, R Zhong
US Patent 12,100,064, 2024
2024
High-Performance Deep Learning Systems via DL Sparsity and DL Compiler
G Huang
University of California, Santa Barbara, 2024
2024
Systems and methods for neural network training with weight sparsity
F Sun, M Qin, H Li, G Zhu, Y Gao, G Huang, Y Zhang
US Patent App. 17/866,194, 2023
2023
The system can't perform the operation now. Try again later.
Articles 1–20