Zhewen Yu
Verified email at imperial.ac.uk
Title
Cited by
Year
A parameterisable FPGA-tailored architecture for YOLOv3-tiny
Z Yu, CS Bouganis
International Symposium on Applied Reconfigurable Computing, 330-344, 2020
Cited by: 72; Year: 2020
SATAY: A streaming architecture toolflow for accelerating YOLO models on FPGA devices
A Montgomerie-Corcoran, P Toupas, Z Yu, CS Bouganis
2023 International Conference on Field Programmable Technology (ICFPT), 179-187, 2023
Cited by: 38; Year: 2023
SAMO: Optimised mapping of convolutional neural networks to streaming architectures
A Montgomerie-Corcoran, Z Yu, CS Bouganis
2022 32nd International Conference on Field-Programmable Logic and …, 2022
Cited by: 16; Year: 2022
SVD-NAS: Coupling low-rank approximation and neural architecture search
Z Yu, CS Bouganis
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2023
Cited by: 12; Year: 2023
PASS: Exploiting post-activation sparsity in streaming architectures for CNN acceleration
A Montgomerie-Corcoran, Z Yu, J Cheng, CS Bouganis
2023 33rd International Conference on Field-Programmable Logic and …, 2023
Cited by: 5; Year: 2023
StreamSVD: Low-rank approximation and streaming accelerator co-design
Z Yu, CS Bouganis
2021 International Conference on Field-Programmable Technology (ICFPT), 1-9, 2021
Cited by: 5; Year: 2021
A Dataflow Compiler for Efficient LLM Inference using Custom Microscaling Formats
J Cheng, C Zhang, Z Yu, CS Bouganis, GA Constantinides, Y Zhao
arXiv preprint arXiv:2307.15517, 2023
Cited by: 4; Year: 2023
MASE: An Efficient Representation for Software-Defined ML Hardware System Exploration
C Zhang, J Cheng, Z Yu, Y Zhao
NeurIPS 2023 Workshop on Machine Learning Systems (MLSys), 2023
Cited by: 4; Year: 2023
Auto WS: Automate Weights Streaming in Layer-Wise Pipelined DNN Accelerators
Z Yu, CS Bouganis
2024 Design, Automation & Test in Europe Conference & Exhibition (DATE), 1-6, 2024
Cited by: 3; Year: 2024
SMOF: Streaming Modern CNNs on FPGAs with Smart Off-Chip Eviction
P Toupas, Z Yu, CS Bouganis, D Tzovaras
2024 IEEE 32nd Annual International Symposium on Field-Programmable Custom …, 2024
Cited by: 2; Year: 2024
Fast prototyping next-generation accelerators for new ML models using MASE: ML accelerator system exploration
J Cheng, C Zhang, Z Yu, A Montgomerie-Corcoran, C Xiao, CS Bouganis, ...
CoRR, 2023
Cited by: 2; Year: 2023
From Loop Nests to Silicon: Mapping AI Workloads onto AMD NPUs with MLIR-AIR
E Wang, S Bayliss, A Bisca, Z Blair, S Chowdhary, K Denolf, J Fifield, ...
arXiv preprint arXiv:2510.14871, 2025
Cited by: 1; Year: 2025
HASS: Hardware-Aware Sparsity Search for Dataflow DNN Accelerator
Z Yu, S Sreeram, K Agrawal, J Wu, A Montgomerie-Corcoran, C Zhang, ...
Cited by: 1; Year: 2024
ITERA-LLM: Boosting Sub-8-Bit Large Language Model Inference via Iterative Tensor Decomposition
Y Huang, K Zheng, Z Yu, CS Bouganis
2025 IEEE 33rd Annual International Symposium on Field-Programmable Custom …, 2025
Year: 2025
Mixed-TD: Efficient Neural Network Accelerator with Layer-Specific Tensor Decomposition
Z Yu, CS Bouganis
2023 33rd International Conference on Field-Programmable Logic and …, 2023
Year: 2023