Zhewen Yu - Google Scholar


Cited by

	All	Since 2021
Citations	165	164
h-index	5	5
i10-index	4	4

Citations per year: 2020: 1, 2021: 7, 2022: 20, 2023: 24, 2024: 43, 2025: 68, 2026: 2

Zhewen Yu

PhD, Imperial College London

Verified email at imperial.ac.uk

Machine learning, FPGA


Title	Cited by	Year
A parameterisable FPGA-tailored architecture for YOLOv3-tiny Z Yu, CS Bouganis International Symposium on Applied Reconfigurable Computing, 330-344, 2020	72	2020
SATAY: A streaming architecture toolflow for accelerating YOLO models on FPGA devices A Montgomerie-Corcoran, P Toupas, Z Yu, CS Bouganis 2023 International Conference on Field Programmable Technology (ICFPT), 179-187, 2023	38	2023
SAMO: Optimised mapping of convolutional neural networks to streaming architectures A Montgomerie-Corcoran, Z Yu, CS Bouganis 2022 32nd International Conference on Field-Programmable Logic and …, 2022	16	2022
SVD-NAS: Coupling low-rank approximation and neural architecture search Z Yu, CS Bouganis Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2023	12	2023
PASS: Exploiting post-activation sparsity in streaming architectures for CNN acceleration A Montgomerie-Corcoran, Z Yu, J Cheng, CS Bouganis 2023 33rd International Conference on Field-Programmable Logic and …, 2023	5	2023
StreamSVD: Low-rank approximation and streaming accelerator co-design Z Yu, CS Bouganis 2021 International Conference on Field-Programmable Technology (ICFPT), 1-9, 2021	5	2021
A Dataflow Compiler for Efficient LLM Inference using Custom Microscaling Formats J Cheng, C Zhang, Z Yu, CS Bouganis, GA Constantinides, Y Zhao arXiv preprint arXiv:2307.15517, 2023	4	2023
MASE: An Efficient Representation for Software-Defined ML Hardware System Exploration C Zhang, J Cheng, Z Yu, Y Zhao NeurIPS 2023 Workshop on Machine Learning Systems (MLSys), 2023	4	2023
Auto WS: Automate Weights Streaming in Layer-Wise Pipelined DNN Accelerators Z Yu, CS Bouganis 2024 Design, Automation & Test in Europe Conference & Exhibition (DATE), 1-6, 2024	3	2024
SMOF: Streaming Modern CNNs on FPGAs with Smart Off-Chip Eviction P Toupas, Z Yu, CS Bouganis, D Tzovaras 2024 IEEE 32nd Annual International Symposium on Field-Programmable Custom …, 2024	2	2024
Fast prototyping next-generation accelerators for new ML models using MASE: ML accelerator system exploration J Cheng, C Zhang, Z Yu, A Montgomerie-Corcoran, C Xiao, CS Bouganis, ... CoRR, 2023	2	2023
From Loop Nests to Silicon: Mapping AI Workloads onto AMD NPUs with MLIR-AIR E Wang, S Bayliss, A Bisca, Z Blair, S Chowdhary, K Denolf, J Fifield, ... arXiv preprint arXiv:2510.14871, 2025	1	2025
HASS: Hardware-Aware Sparsity Search for Dataflow DNN Accelerator Z Yu, S Sreeram, K Agrawal, J Wu, A Montgomerie-Corcoran, C Zhang, ...	1	2024
ITERA-LLM: Boosting Sub-8-Bit Large Language Model Inference via Iterative Tensor Decomposition Y Huang, K Zheng, Z Yu, CS Bouganis 2025 IEEE 33rd Annual International Symposium on Field-Programmable Custom …, 2025		2025
Mixed-TD: Efficient Neural Network Accelerator with Layer-Specific Tensor Decomposition Z Yu, CS Bouganis 2023 33rd International Conference on Field-Programmable Logic and …, 2023		2023

