Address
:
[go:
up one dir
,
main page
]
Include Form
Remove Scripts
Accept Cookies
Show Images
Show Referer
Rotate13
Base64
Strip Meta
Strip Title
Session Cookies
Loading...
The system can't perform the operation now. Try again later.
Citations per year
Duplicate citations
The following articles are merged in Scholar. Their
combined citations
are counted only for the first article.
Merged citations
This "Cited by" count includes citations to the following articles in Scholar. The ones marked
*
may be different from the article in the profile.
Add co-authors
Co-authors
Follow
New articles by this author
New citations to this author
New articles related to this author's research
Email address for updates
Done
My profile
My library
Metrics
Alerts
Settings
Sign in
Sign in
Get my own profile
Cited by
All
Since 2021
Citations
258
258
h-index
8
8
i10-index
7
7
0
140
70
35
105
2021
2022
2023
2024
2025
2026
2
8
27
81
136
3
Public access
View all
View all
5 articles
1 article
available
not available
Based on funding mandates
Follow
Zheng Wang
Ph.D. student,
University of California, San Diego
Verified email at ucsd.edu -
Homepage
Machine Learning
System for ML
Articles
Cited by
Public access
Title
Sort
Sort by citations
Sort by year
Sort by title
Cited by
Cited by
Year
{TC-GNN}: Bridging Sparse {GNN} Computation and Dense Tensor Cores on {GPUs}
Y Wang, B Feng, Z Wang, G Huang, Y Ding
2023 USENIX Annual Technical Conference (USENIX ATC 23), 149-164
, 2023
90
2023
{MGG}: Accelerating Graph Neural Networks with {Fine-Grained}{Intra-Kernel}{Communication-Computation} Pipelining on {Multi-GPU} Platforms
Y Wang, B Feng, Z Wang, T Geng, K Barker, A Li, Y Ding
17th USENIX Symposium on Operating Systems Design and Implementation (OSDI …
, 2023
55
2023
EL-Rec: efficient large-scale recommendation model training via tensor-train embedding table
Z Wang, Y Wang, B Feng, D Mudigere, B Muthiah, Y Ding
SC22: International Conference for High Performance Computing, Networking …
, 2022
33
2022
ECSSD: Hardware/Data Layout Co-Designed In-Storage-Computing Architecture for Extreme Classification
S Li, F Tu, L Liu, J Lin, Z Wang, Y Kang, Y Ding, Y Xie
Proceedings of the 50th Annual International Symposium on Computer …
, 2023
18
2023
ZENO: A Type-based Optimization Framework for Zero Knowledge Neural Network Inference
B Feng, Z Wang, Y Wang, S Yang, Y Ding
Proceedings of the 29th ACM International Conference on Architectural …
, 2024
15
2024
RAP: Resource-aware Automated GPU Sharing for Multi-GPU Recommendation Model Training and Input Preprocessing
Z Wang, Y Wang, J Deng, D Zheng, A Li, Y Ding
Proceedings of the 29th ACM International Conference on Architectural …
, 2024
15
2024
WLB-LLM: Workload-Balanced 4D Parallelism for Large Language Model Training
Z Wang, A Cai, X Xie, Z Pan, Y Guan, W Chu, J Wang, S Li, J Huang, ...
arXiv preprint arXiv:2503.17924
, 2025
12
2025
Uncertainty-aware attention graph neural network for defending adversarial attacks
B Feng, Y Wang, Z Wang, Y Ding
arXiv preprint arXiv:2009.10235
, 2020
8
2020
FastTree: Optimizing Attention Kernel and Runtime for Tree-Structured LLM Inference
Z Pan, Y Ding, Y Guan, Z Wang, Z Yu, X Tang, Y Wang, Y Ding
Eighth Conference on Machine Learning and Systems
, 0
6
*
{OPER}:{Optimality-Guided} Embedding Table Parallelization for Large-scale Recommendation Model
Z Wang, Y Wang, B Feng, G Huang, D Mudigere, B Muthiah, A Li, Y Ding
2024 USENIX Annual Technical Conference (USENIX ATC 24), 667-682
, 2024
5
2024
Faith: An Efficient Framework for Transformer Verification on {GPUs}
B Feng, T Tang, Y Wang, Z Chen, Z Wang, S Yang, Y Xie, Y Ding
2022 USENIX Annual Technical Conference (USENIX ATC 22), 167-182
, 2022
1
2022
Yggdrasil: Bridging Dynamic Speculation and Static Runtime for Latency-Optimal Tree-Based LLM Decoding
Y Guan, C Yu, S Fang, W Hu, Z Pan, Z Wang, Z Liu, Y Zhou, Y Ding, ...
arXiv preprint arXiv:2512.23858
, 2025
2025
GMI-DRL: Empowering Multi-GPU Deep Reinforcement Learning with GPU Spatial Multiplexing
Y Wang, B Feng, Z Wang, T Geng, A Li, Y Ding
arXiv preprint arXiv:2206.08482
, 2022
2022
The system can't perform the operation now. Try again later.
Articles 1–13
Show more
Privacy
Terms
Help
About Scholar
Search help