Hongyuan Liu
Title
Cited by
Year
Why GPUs are Slow at Executing NFAs and How to Make them Faster
H Liu, S Pai, A Jog
25th International Conference on Architectural Support for Programming …, 2020
Cited by: 42 | Year: 2020
Architectural Support for Efficient Large-scale Automata Processing
H Liu, M Ibrahim, O Kayiran, S Pai, A Jog
51st International Symposium on Microarchitecture (MICRO), 908-920, 2018
Cited by: 32 | Year: 2018
On-GPU Thread-data Remapping for Branch Divergence Reduction
H Lin, CL Wang, H Liu
ACM Transactions on Architecture and Code Optimization (TACO) 15 (3), 1-24, 2018
Cited by: 27 | Year: 2018
Analyzing and Leveraging Remote-core Bandwidth for Enhanced Performance in GPUs
MA Ibrahim, H Liu, O Kayiran, A Jog
28th International Conference on Parallel Architectures and Compilation …, 2019
Cited by: 26 | Year: 2019
Asynchronous Automata Processing on GPUs
H Liu, S Pai, A Jog
Proceedings of the ACM on Measurement and Analysis of Computing Systems 7 (1 …, 2023
Cited by: 15 | Year: 2023
Accelerating DNN Architecture Search at Scale Using Selective Weight Transfer
H Liu, B Nicolae, S Di, F Cappello, A Jog
2021 IEEE International Conference on Cluster Computing (CLUSTER), 2021
Cited by: 13 | Year: 2021
ngAP: Non-blocking Large-scale Automata Processing on GPUs
T Ge, T Zhang, H Liu
Proceedings of the 29th ACM International Conference on Architectural …, 2024
Cited by: 9 | Year: 2024
Dissecting the NVIDIA Hopper Architecture through Microbenchmarking and Multiple Level Analysis
W Luo, R Fan, Z Li, D Du, H Liu, Q Wang, X Chu
arXiv preprint arXiv:2501.12084, 2025
Cited by: 8 | Year: 2025
Lightweight Dependency Checking for Parallelizing Loops with Non-Deterministic Dependency on GPU
H Liu, KT Lam, H Lin, CL Wang, J Ma
22nd International Conference on Parallel and Distributed Systems (ICPADS …, 2016
Cited by: 5 | Year: 2016
gHyPart: GPU-friendly End-to-End Hypergraph Partitioner
Z Wu, H Zhao, H Liu, W Wen, J Li
ACM Transactions on Architecture and Code Optimization 22 (1), 1-25, 2025
Cited by: 4 | Year: 2025
Method and Apparatus for Detecting Inter-instruction Data Dependency
H Liu, CL Wang, KT Lam, H Lin, B Zhang, J Ma
US Patent 10,684,834, 2020
Cited by: 2 | Year: 2020
Interleaved Bitstream Execution for Multi-Pattern Regex Matching on GPUs
T Ge, X Chu, H Liu
Proceedings of the 58th IEEE/ACM International Symposium on …, 2025
Cited by: 1 | Year: 2025
Towards Scalable and Non-blocking Automata Processing on GPUs with ngAP
T Ge, T Zhang, H Liu
ACM Transactions on Computer Systems, 2025
Cited by: 1 | Year: 2025
Advancing Matrix Operations for High-Performance and Memory-Efficient Automata Processing on GPUs
Z Wu, T Ge, J Li, X Chen, H Liu
ACM Transactions on Architecture and Code Optimization 22 (4), 1-26, 2025
Year: 2025
VESTA: A Secure and Efficient FHE-based Three-Party Vectorized Evaluation System for Tree Aggregation Models
H Zhao, J Huang, Z Chen, K Zhu, D Chen, Z Ji, H Liu
Proceedings of the ACM on Measurement and Analysis of Computing Systems …, 2025
Year: 2025
Efficient Point Cloud Analytics on Edge Devices
K Zhu, Z Wu, H Liu
IEEE 30th International Conference on Parallel and Distributed Systems …, 2024
Year: 2024
Techniques for Accelerating Large-Scale Automata Processing
H Liu
The College of William and Mary, 2022
Year: 2022