| The global landscape of neural networks: An overview R Sun, D Li, S Liang, T Ding, R Srikant IEEE Signal Processing Magazine 37 (5), 95-108, 2020 | 127 | 2020 |
| Sparsity learning-based multiuser detection in grant-free massive-device multiple access T Ding, X Yuan, SC Liew IEEE Transactions on Wireless Communications 18 (7), 3569-3582, 2019 | 115 | 2019 |
| Why transformers need adam: A hessian perspective Y Zhang, C Chen, T Ding, Z Li, R Sun, Z Luo Advances in neural information processing systems 37, 131786-131823, 2024 | 95 | 2024 |
| Adam-mini: Use fewer learning rates to gain more Y Zhang, C Chen, Z Li, T Ding, C Wu, DP Kingma, Y Ye, ZQ Luo, R Sun arXiv preprint arXiv:2406.16793, 2024 | 85 | 2024 |
| On the benefit of width for neural networks: Disappearance of basins D Li, T Ding, R Sun SIAM Journal on Optimization 32 (3), 1728-1758, 2022 | 84* | 2022 |
| Suboptimal local minima exist for wide neural networks with smooth activations T Ding, D Li, R Sun Mathematics of Operations Research 47 (4), 2784-2814, 2022 | 58* | 2022 |
| Federated learning with lossy distributed source coding: Analysis and optimization H Yang, T Ding, X Yuan IEEE Transactions on Communications 71 (8), 4561-4576, 2023 | 16 | 2023 |
| Pdhg-unrolled learning-to-optimize method for large-scale linear programming B Li, L Yang, Y Chen, S Wang, Q Chen, H Mao, Y Ma, A Wang, T Ding, ... arXiv preprint arXiv:2406.01908, 2024 | 15 | 2024 |
| Enabling scalable oversight via self-evolving critic Z Tang, Z Li, Z Xiao, T Ding, R Sun, B Wang, D Liu, F Huang, T Liu, B Yu, ... arXiv preprint arXiv:2501.05727, 2025 | 13 | 2025 |
| On the degrees of freedom of the symmetric multi-relay MIMO Y channel T Ding, X Yuan, SC Liew IEEE Transactions on Wireless Communications 16 (9), 5673-5688, 2017 | 13 | 2017 |
| Network-coded fronthaul transmission for cache-aided C-RAN T Ding, X Yuan, SC Liew 2017 IEEE International Symposium on Information Theory (ISIT), 1182-1186, 2017 | 11 | 2017 |
| CoRT: Code-integrated Reasoning within Thinking C Li, Z Tang, Z Li, M Xue, K Bao, T Ding, R Sun, B Wang, X Wang, J Lin, ... arXiv preprint arXiv:2506.09820, 2025 | 9 | 2025 |
| Realcritic: Towards effectiveness-driven evaluation of language model critiques Z Tang, Z Li, Z Xiao, T Ding, R Sun, B Wang, D Liu, F Huang, T Liu, B Yu, ... arXiv preprint arXiv:2501.14492, 2025 | 9 | 2025 |
| Knapsack rl: Unlocking exploration of llms via optimizing budget allocation Z Li, C Chen, T Yang, T Ding, R Sun, G Zhang, W Huang, ZQ Luo arXiv preprint arXiv:2509.25849, 2025 | 8 | 2025 |
| Algorithmic beamforming design for MIMO multiway relay channel with clustered full data exchange T Ding, X Yuan, SC Liew IEEE Transactions on Vehicular Technology 67 (10), 10081-10086, 2018 | 7 | 2018 |
| On the power of small-size graph neural networks for linear programming Q Li, T Ding, L Yang, M Ouyang, Q Shi, R Sun Advances in Neural Information Processing Systems 37, 38695-38719, 2024 | 5 | 2024 |
| On representing convex quadratically constrained quadratic programs via graph neural networks C Wu, Q Chen, A Wang, T Ding, R Sun, W Yang, Q Shi arXiv preprint arXiv:2411.13805, 2024 | 4 | 2024 |
| Unlocking black-box prompt tuning efficiency via zeroth-order optimization H Zhan, C Chen, T Ding, Z Li, R Sun Findings of the Association for Computational Linguistics: EMNLP 2024, 14825 …, 2024 | 4 | 2024 |
| Mofo: Momentum-filtered optimizer for mitigating forgetting in llm fine-tuning Y Chen, S Wang, Y Zhang, Z Lin, H Zhang, W Sun, T Ding, R Sun arXiv preprint arXiv:2407.20999, 2024 | 4 | 2024 |
| Bridging formal language with chain-of-thought reasoning to geometry problem solving T Yang, Y Li, Z Li, Z Lin, R Sun, T Ding arXiv preprint arXiv:2508.09099, 2025 | 2 | 2025 |