| Global versus localized generative adversarial nets GJ Qi, L Zhang, H Hu, M Edraki, J Wang, XS Hua Proceedings of the IEEE conference on computer vision and pattern …, 2018 | 101 | 2018 |
| State-frequency memory recurrent neural networks H Hu, GJ Qi International Conference on Machine Learning, 1568-1577, 2017 | 69 | 2017 |
| Reason for future, act for now: A principled framework for autonomous llm agents with provable sample efficiency Z Liu, H Hu, S Zhang, H Guo, S Ke, B Liu, Z Wang arXiv preprint arXiv:2309.17382, 2023 | 67 | 2023 |
| Learning compact features for human activity recognition via probabilistic first-take-all J Ye, GJ Qi, N Zhuang, H Hu, KA Hua IEEE transactions on pattern analysis and machine intelligence 42 (1), 126-139, 2018 | 59 | 2018 |
| Offline reinforcement learning with value-based episodic memory X Ma, Y Yang, H Hu, Q Liu, J Yang, C Zhang, Q Zhao, B Liang arXiv preprint arXiv:2110.09796, 2021 | 57 | 2021 |
| Generalizable episodic memory for deep reinforcement learning H Hu, J Ye, G Zhu, Z Ren, C Zhang arXiv preprint arXiv:2103.06469, 2021 | 53 | 2021 |
| What is essential for unseen goal generalization of offline goal-conditioned rl? R Yang, L Yong, X Ma, H Hu, C Zhang, T Zhang International Conference on Machine Learning, 39543-39571, 2023 | 42 | 2023 |
| Metacure: Meta reinforcement learning with empowerment-driven exploration J Zhang, J Wang, H Hu, T Chen, Y Chen, C Fan, C Zhang International Conference on Machine Learning, 12600-12610, 2021 | 42 | 2021 |
| Semantic neighbor graph hashing for multimodal retrieval L Jin, K Li, H Hu, GJ Qi, J Tang IEEE Transactions on Image Processing 27 (3), 1405-1417, 2017 | 40 | 2017 |
| Maximize to explore: One objective function fusing estimation, planning, and exploration Z Liu, M Lu, W Xiong, H Zhong, H Hu, S Zhang, S Zheng, Z Yang, Z Wang Advances in Neural Information Processing Systems 36, 22151-22165, 2023 | 37 | 2023 |
| On the estimation bias in double q-learning Z Ren, G Zhu, H Hu, B Han, J Chen, C Zhang Advances in Neural Information Processing Systems 34, 10246-10259, 2021 | 35 | 2021 |
| On the role of discount factor in offline reinforcement learning H Hu, Y Yang, Q Zhao, C Zhang International conference on machine learning, 9072-9098, 2022 | 33 | 2022 |
| One-step hydrothermal method synthesized pH-dependent carbon dots for multistage anti-counterfeiting X Mao, X Zhao, H Hu, Z Li, W Xiong, Y Wei, W Gao Spectrochimica Acta Part A: Molecular and Biomolecular Spectroscopy 303, 123257, 2023 | 28 | 2023 |
| The provable benefits of unsupervised data sharing for offline reinforcement learning H Hu, Y Yang, Q Zhao, C Zhang arXiv preprint arXiv:2302.13493, 2023 | 21 | 2023 |
| Flow to control: Offline reinforcement learning with lossless primitive discovery Y Yang, H Hu, W Li, S Li, J Yang, Q Zhao, C Zhang Proceedings of the AAAI Conference on Artificial Intelligence 37 (9), 10843 …, 2023 | 19 | 2023 |
| Reason for future, act for now: A principled architecture for autonomous llm agents Z Liu, H Hu, S Zhang, H Guo, S Ke, B Liu, Z Wang Forty-first International Conference on Machine Learning, 2024 | 18 | 2024 |
| One objective to rule them all: A maximization objective fusing estimation and planning for exploration Z Liu, M Lu, W Xiong, H Zhong, H Hu, S Zhang, S Zheng, Z Yang, Z Wang CoRR, 2023 | 16 | 2023 |
| A temporal order modeling approach to human action recognition from multimodal sensor data J Ye, H Hu, GJ Qi, KA Hua ACM Transactions on Multimedia Computing, Communications, and Applications …, 2017 | 16 | 2017 |
| Unsupervised behavior extraction via random intent priors H Hu, Y Yang, J Ye, Z Mai, C Zhang Advances in Neural Information Processing Systems 36, 51491-51514, 2023 | 15 | 2023 |
| Learning to adaptively scale recurrent neural networks H Hu, L Wang, GJ Qi Proceedings of the AAAI Conference on Artificial Intelligence 33 (01), 3822-3829, 2019 | 15 | 2019 |