| Crossvit: Cross-attention multi-scale vision transformer for image classification CFR Chen, Q Fan, R Panda Proceedings of the IEEE/CVF international conference on computer vision, 357-366, 2021 | 2639 | 2021 |
| A unified multi-scale deep convolutional neural network for fast object detection Z Cai, Q Fan, RS Feris, N Vasconcelos European conference on computer vision, 354-370, 2016 | 2008 | 2016 |
| Moments in time dataset: one million videos for event understanding M Monfort, A Andonian, B Zhou, K Ramakrishnan, SA Bargal, T Yan, ... IEEE transactions on pattern analysis and machine intelligence 42 (2), 502-508, 2019 | 784 | 2019 |
| Adversarial t-shirt! evading person detectors in a physical world K Xu, G Zhang, S Liu, Q Fan, M Sun, H Chen, PY Chen, Y Wang, X Lin European conference on computer vision, 665-681, 2020 | 490 | 2020 |
| Regionvit: Regional-to-local attention for vision transformers CF Chen, R Panda, Q Fan arXiv preprint arXiv:2106.02689, 2021 | 307 | 2021 |
| A closer look at Faster R-CNN for vehicle detection Q Fan, L Brown, J Smith 2016 IEEE intelligent vehicles symposium (IV), 124-129, 2016 | 305 | 2016 |
| Curve matching, time warping, and light fields: New algorithms for computing similarity between curves A Efrat, Q Fan, S Venkatasubramanian Journal of mathematical imaging and vision 27 (3), 203-216, 2007 | 268 | 2007 |
| Structured adversarial attack: Towards general implementation and better interpretability K Xu, S Liu, P Zhao, PY Chen, H Zhang, Q Fan, D Erdogmus, Y Wang, ... arXiv preprint arXiv:1808.01664, 2018 | 202 | 2018 |
| Deep analysis of cnn-based spatio-temporal representations for action recognition CFR Chen, R Panda, K Ramakrishnan, R Feris, J Cohn, A Oliva, Q Fan Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2021 | 169 | 2021 |
| More is less: Learning efficient video representations by big-little network and depthwise temporal aggregation Q Fan, CFR Chen, H Kuehne, M Pistoia, D Cox Advances in Neural Information Processing Systems 32, 2019 | 165 | 2019 |
| Big-little net: An efficient multi-scale feature representation for visual and speech recognition CF Chen, Q Fan, N Mallinar, T Sercu, R Feris arXiv preprint arXiv:1807.03848, 2018 | 126 | 2018 |
| Temporal sequence modeling for video event detection Y Cheng, Q Fan, S Pankanti, A Choudhary Proceedings of the IEEE conference on computer vision and pattern …, 2014 | 115 | 2014 |
| Multi-moments in time: Learning and interpreting models for multi-action video understanding M Monfort, B Pan, K Ramakrishnan, A Andonian, BA McNamara, ... IEEE Transactions on Pattern Analysis and Machine Intelligence 44 (12), 9434 …, 2021 | 101 | 2021 |
| Attribute-based alert ranking for alert adjudication Q Fan, SU Pankanti US Patent 9,020,190, 2015 | 99 | 2015 |
| Adamml: Adaptive multi-modal learning for efficient video recognition R Panda, CFR Chen, Q Fan, X Sun, K Saenko, A Oliva, R Feris Proceedings of the IEEE/CVF international conference on computer vision …, 2021 | 83 | 2021 |
| Generating adversarial computer programs using optimized obfuscations S Srikant, S Liu, T Mitrovska, S Chang, Q Fan, G Zhang, UM O'Reilly arXiv preprint arXiv:2103.11882, 2021 | 72 | 2021 |
| Modeling of temporarily static objects in surveillance video data RP Bobbitt, Q Fan, Z Lu, J Pan, SU Pankanti US Patent 8,744,123, 2014 | 71 | 2014 |
| Random laplace feature maps for semigroup kernels on histograms J Yang, V Sindhwani, Q Fan, H Avron, MW Mahoney Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2014 | 69 | 2014 |
| Using detailed process information at a point of sale RP Bobbitt, Q Fan, A Hampapur, F Kjeldsen, SU Pankanti, A Yanagawa, ... US Patent 7,962,365, 2011 | 69 | 2011 |
| Sequential event detection from video RP Bobbitt, L Ding, Q Fan, S Miyazawa, SU Pankanti, Y Zhai US Patent 8,548,203, 2013 | 68 | 2013 |