[go: up one dir, main page]

Follow
Jingwen Leng
Jingwen Leng
Verified email at cs.sjtu.edu.cn - Homepage
Title
Cited by
Cited by
Year
GPUWattch: Enabling energy optimizations in GPGPUs
J Leng, T Hetherington, A ElTantawy, S Gilani, NS Kim, TM Aamodt, ...
ACM SIGARCH computer architecture news 41 (3), 487-498, 2013
8172013
Olive: Accelerating large language models via hardware-friendly outlier-victim pair quantization
C Guo, J Tang, W Hu, J Leng, C Zhang, F Yang, Y Liu, M Guo, Y Zhu
Proceedings of the 50th Annual International Symposium on Computer …, 2023
1812023
A locality-aware memory hierarchy for energy-efficient GPU architectures
M Rhu, M Sullivan, J Leng, M Erez
Proceedings of the 46th Annual IEEE/ACM International Symposium on …, 2013
1812013
Dual-side sparse tensor core
Y Wang, C Zhang, Z Xie, C Guo, Y Liu, J Leng
2021 ACM/IEEE 48th Annual International Symposium on Computer Architecture …, 2021
1262021
Accelerating sparse dnn models without hardware-support via tile-wise sparsity
C Guo, BY Hsueh, J Leng, Y Qiu, Y Guan, Z Wang, X Jia, X Li, M Guo, ...
SC20: International Conference for High Performance Computing, Networking …, 2020
1212020
Safe limits on voltage reduction efficiency in GPUs: A direct measurement approach
J Leng, A Buyuktosunoglu, R Bertran, P Bose, VJ Reddi
Proceedings of the 48th International Symposium on Microarchitecture, 294-307, 2015
1182015
Ant: Exploiting adaptive numerical data type for low-bit deep neural network quantization
C Guo, C Zhang, J Leng, Z Liu, F Yang, Y Liu, M Guo, Y Zhu
2022 55th IEEE/ACM International Symposium on Microarchitecture (MICRO …, 2022
1072022
Squant: On-the-fly data-free quantization via diagonal hessian approximation
C Guo, Y Qiu, J Leng, X Gao, C Zhang, Y Liu, F Yang, Y Zhu, M Guo
arXiv preprint arXiv:2202.07471, 2022
962022
Enable simultaneous DNN services based on deterministic operator overlap and precise latency prediction
W Cui, H Zhao, Q Chen, N Zheng, J Leng, J Zhao, Z Song, T Ma, Y Yang, ...
Proceedings of the International Conference for High Performance Computing …, 2021
752021
Adaptive guardband scheduling to improve system-level efficiency of the POWER7+
Y Zu, CR Lefurgy, J Leng, M Halpern, MS Floyd, VJ Reddi
Proceedings of the 48th International Symposium on Microarchitecture, 308-321, 2015
752015
Adversarial defense through network profiling based path extraction
Y Qiu, J Leng, C Guo, Q Chen, C Li, M Guo, Y Zhu
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019
692019
Architectural implications of graph neural networks
Z Zhang, J Leng, L Ma, Y Miao, C Li, M Guo
IEEE Computer architecture letters 19 (1), 59-62, 2020
682020
Characterizing and demystifying the implicit convolution algorithm on commercial matrix-multiplication accelerators
Y Zhou, M Yang, C Guo, J Leng, Y Liang, Q Chen, M Guo, Y Zhu
2021 IEEE International Symposium on Workload Characterization (IISWC), 214-225, 2021
652021
GPU voltage noise: Characterization and hierarchical smoothing of spatial and temporal voltage noise interference in GPU architectures
J Leng, Y Zu, VJ Reddi
2015 IEEE 21st International Symposium on High Performance Computer …, 2015
652015
Dubhe: Towards data unbiasedness with homomorphic encryption in federated learning client selection
S Zhang, Z Li, Q Chen, W Zheng, J Leng, M Guo
Proceedings of the 50th International Conference on Parallel Processing, 1-10, 2021
632021
VELTAIR: towards high-performance multi-tenant deep learning services via adaptive compilation and scheduling
Z Liu, J Leng, Z Zhang, Q Chen, C Li, M Guo
Proceedings of the 27th ACM International Conference on Architectural …, 2022
622022
Chimera: An analytical optimizing framework for effective compute-intensive operators fusion
S Zheng, S Chen, P Song, R Chen, X Li, S Yan, D Lin, J Leng, Y Liang
2023 IEEE International Symposium on High-Performance Computer Architecture …, 2023
522023
Transkimmer: Transformer learns to layer-wise skim
Y Guan, Z Li, J Leng, Z Lin, M Guo
arXiv preprint arXiv:2205.07324, 2022
492022
GPUVolt: Modeling and characterizing voltage noise in GPU architectures
J Leng, Y Zu, M Rhu, M Gupta, VJ Reddi
Proceedings of the 2014 international symposium on Low power electronics and …, 2014
492014
SALO: an efficient spatial accelerator enabling hybrid sparse attention mechanisms for long sequences
G Shen, J Zhao, Q Chen, J Leng, C Li, M Guo
Proceedings of the 59th ACM/IEEE Design Automation Conference, 571-576, 2022
432022
The system can't perform the operation now. Try again later.
Articles 1–20