| AdaAttN: Revisit attention mechanism in arbitrary neural style transfer S Liu, T Lin, D He, F Li, M Wang, X Li, Z Sun, Q Li, E Ding Proceedings of the IEEE/CVF international conference on computer vision …, 2021 | 527 | 2021 |
| Multi-label classification with label graph superimposing Y Wang, D He, F Li, X Long, Z Zhou, J Ma, S Wen Proceedings of the AAAI conference on artificial intelligence 34 (07), 12265 …, 2020 | 255 | 2020 |
| DOLG: Single-stage image retrieval with deep orthogonal fusion of local and global features M Yang, D He, M Fan, B Shi, X Xue, F Li, E Ding, J Huang Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021 | 216 | 2021 |
| Read, watch, and move: Reinforcement learning for temporally grounding natural language descriptions in videos D He, X Zhao, J Huang, F Li, X Liu, S Wen Proceedings of the AAAI Conference on Artificial Intelligence 33 (01), 8393-8400, 2019 | 190 | 2019 |
| StNet: Local and global spatial-temporal modeling for action recognition D He, Z Zhou, C Gan, F Li, X Liu, Y Li, L Wang, S Wen Proceedings of the AAAI conference on artificial intelligence 33 (01), 8401-8408, 2019 | 185 | 2019 |
| Image inpainting by end-to-end cascaded refinement with mask awareness M Zhu, D He, X Li, C Li, F Li, X Liu, E Ding, Z Zhang IEEE Transactions on Image Processing 30, 4855-4866, 2021 | 172 | 2021 |
| Multimodal keyless attention fusion for video classification X Long, C Gan, G Melo, X Liu, Y Li, F Li, S Wen Proceedings of the AAAI conference on artificial intelligence 32 (1), 2018 | 159 | 2018 |
| Drafting and revision: Laplacian pyramid network for fast high-quality artistic style transfer T Lin, Z Ma, F Li, D He, X Li, E Ding, N Wang, J Li, X Gao Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2021 | 130 | 2021 |
| Paint Transformer: Feed forward neural painting with stroke prediction S Liu, T Lin, D He, F Li, R Deng, X Li, E Ding, H Wang Proceedings of the IEEE/CVF international conference on computer vision …, 2021 | 123 | 2021 |
| MVFNet: Multi-view fusion network for efficient video recognition W Wu, D He, T Lin, F Li, C Gan, E Ding Proceedings of the AAAI conference on artificial intelligence 35 (4), 2943-2951, 2021 | 97 | 2021 |
| Learning semantic person image generation by region-adaptive normalization Z Lv, X Li, X Li, F Li, T Lin, D He, W Zuo Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021 | 94 | 2021 |
| VideoGen: A reference-guided latent diffusion approach for high definition text-to-video generation X Li, W Chu, Y Wu, W Yuan, F Liu, Q Zhang, F Li, H Feng, E Ding, J Wang arXiv preprint arXiv:2309.00398, 2023 | 80 | 2023 |
| Temporal modeling approaches for large-scale YouTube-8M video understanding F Li, C Gan, X Liu, Y Bian, X Long, Y Li, Z Li, J Zhou, S Wen arXiv preprint arXiv:1707.04555, 2017 | 76 | 2017 |
| Predict, prevent, and evaluate: Disentangled text-driven image manipulation empowered by pre-trained vision-language model Z Xu, T Lin, H Tang, F Li, D He, N Sebe, R Timofte, L Van Gool, E Ding Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 59 | 2022 |
| Revisiting the effectiveness of off-the-shelf temporal modeling approaches for large-scale video classification Y Bian, C Gan, X Liu, F Li, X Long, Y Li, H Qi, J Zhou, S Wen, Y Lin arXiv preprint arXiv:1708.03805, 2017 | 56 | 2017 |
| DeltaEdit: Exploring text-free training for text-driven image manipulation Y Lyu, T Lin, F Li, D He, J Dong, T Tan arXiv preprint arXiv:2303.06285, 2023 | 50 | 2023 |
| CODER: Coupled diversity-sensitive momentum contrastive learning for image-text retrieval H Wang, D He, W Wu, B Xia, M Yang, F Li, Y Yu, Z Ji, E Ding, J Wang European Conference on Computer Vision, 700-716, 2022 | 47 | 2022 |
| AIM 2022 challenge on super-resolution of compressed image and video: Dataset, methods and results R Yang, R Timofte, X Li, Q Zhang, L Zhang, F Liu, D He, F Li, H Zheng, ... European Conference on Computer Vision, 174-202, 2022 | 44 | 2022 |
| LMR: a large-scale multi-reference dataset for reference-based super-resolution L Zhang, X Li, D He, F Li, E Ding, Z Zhang Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 38 | 2023 |
| Deep concept-wise temporal convolutional networks for action localization X Li, T Lin, X Liu, W Zuo, C Li, X Long, D He, F Li, S Wen, C Gan Proceedings of the 28th ACM International Conference on Multimedia, 4004-4012, 2020 | 37 | 2020 |