Yufei Cui
Verified email at mail.mcgill.ca
Title
Cited by
Year
Large language model (LLM) for telecommunications: A comprehensive survey on principles, key techniques, and opportunities
H Zhou, C Hu, Y Yuan, Y Cui, Y Jin, C Chen, H Wu, D Yuan, L Jiang, ...
IEEE Communications Surveys & Tutorials 27 (3), 1955-2005, 2024
Cited by: 260; Year: 2024
Retrieval-augmented generation for natural language processing: A survey
S Wu, Y Xiong, Y Cui, H Wu, C Chen, Y Yuan, L Huang, X Liu, TW Kuo, ...
arXiv preprint arXiv:2407.13193, 2024
Cited by: 144; Year: 2024
TRACE: A fast transformer-based general-purpose lossless compressor
Y Mao, Y Cui, TW Kuo, CJ Xue
Proceedings of the ACM Web Conference 2022, 1829-1838, 2022
Cited by: 63; Year: 2022
Exploiting asymmetric errors for LDPC decoding optimization on 3D NAND flash memory
Q Li, L Shi, Y Cui, CJ Xue
IEEE Transactions on Computers 69 (4), 475-488, 2019
Cited by: 55; Year: 2019
Shaving retries with sentinels for fast read over high-density 3D flash
Q Li, M Ye, Y Cui, L Shi, X Li, TW Kuo, CJ Xue
2020 53rd Annual IEEE/ACM International Symposium on Microarchitecture …, 2020
Cited by: 51; Year: 2020
Bayes-MIL: A new probabilistic perspective on attention-based multiple instance learning for whole slide images
Y Cui, Z Liu, X Liu, X Liu, C Wang, TW Kuo, CJ Xue, AB Chan
11th International Conference on Learning Representations (ICLR 2023), 2023
Cited by: 40; Year: 2023
NFL: Robust Learned Index via Distribution Transformation
S Wu, Y Cui, J Yu, X Sun, TW Kuo, CJ Xue
Proceedings of the VLDB Endowment 15 (10), 2188-2200, 2022
Cited by: 37; Year: 2022
CacheSifter: Sifting cache files for boosted mobile performance and lifetime
Y Liang, R Pan, T Ren, Y Cui, R Ausavarungnirun, X Chen, C Li, TW Kuo, ...
20th USENIX Conference on File and Storage Technologies (FAST 22), 445-459, 2022
Cited by: 32; Year: 2022
Accelerating general-purpose lossless compression via simple and scalable parameterization
Y Mao, Y Cui, TW Kuo, CJ Xue
Proceedings of the 30th ACM International Conference on Multimedia, 3205-3213, 2022
Cited by: 22; Year: 2022
FlashEmbedding: storing embedding tables in SSD for large-scale recommender systems
H Wan, X Sun, Y Cui, CL Yang, TW Kuo, CJ Xue
Proceedings of the 12th ACM SIGOPS Asia-Pacific Workshop on Systems, 9-16, 2021
Cited by: 21; Year: 2021
Faster and stronger lossless compression with optimized autoregressive framework
Y Mao, J Li, Y Cui, JC Xue
2023 60th ACM/IEEE Design Automation Conference (DAC), 1-6, 2023
Cited by: 20; Year: 2023
FinSage: A Multi-aspect RAG System for Financial Filings Question Answering
X Wang, J Chi, Z Tai, TST Kwok, H He, Z Li, Y Hua, M Li, P Lu, S Wang, ...
Proceedings of the 34th ACM International Conference on Information and …, 2025
Cited by: 19; Year: 2025
Accelerating Monte Carlo Bayesian prediction via approximating predictive uncertainty over the simplex
Y Cui, W Yao, Q Li, AB Chan, CJ Xue
IEEE Transactions on Neural Networks and Learning Systems, 2020
Cited by: 18; Year: 2020
Fully Nested Neural Network for Adaptive Compression and Quantization
Y Cui, Z Liu, W Yao, Q Li, AB Chan, T Kuo, CJ Xue
IJCAI, 2080-2087, 2020
Cited by: 18; Year: 2020
Pruning deep reinforcement learning for dual user experience and storage lifetime improvement on mobile devices
C Wu, Y Cui, C Ji, TW Kuo, CJ Xue
IEEE Transactions on Computer-Aided Design of Integrated Circuits and …, 2020
Cited by: 17; Year: 2020
Sentinel cells enabled fast read for NAND flash
Q Li, M Ye, Y Cui, L Shi, X Li, CJ Xue
11th USENIX Workshop on Hot Topics in Storage and File Systems (HotStorage 19), 2019
Cited by: 17; Year: 2019
Improve generalization and robustness of neural networks via weight scale shifting invariant regularizations
Z Liu, Y Cui, AB Chan
arXiv preprint arXiv:2008.02965, 2020
Cited by: 15; Year: 2020
The Pitfalls and Promise of Conformal Inference Under Adversarial Attacks
Z Liu, Y Cui, Y Yan, Y Xu, X Ji, X Liu, AB Chan
International Conference on Machine Learning (ICML 2024), 2024
Cited by: 13; Year: 2024
A hardware-accelerated solution for hierarchical index-based merge-join
Z Zhou, C Yu, S Nutanong, Y Cui, C Fu, CJ Xue
IEEE Transactions on Knowledge and Data Engineering 31 (1), 91-104, 2018
Cited by: 13; Year: 2018
RAEE: A training-free retrieval-augmented early exiting framework for efficient inference
L Huang, S Wu, Y Cui, Y Xiong, X Liu, TW Kuo, N Guan, CJ Xue
arXiv preprint arXiv:2405.15198, 2024
Cited by: 12; Year: 2024