| Averaging weights leads to wider optima and better generalization P Izmailov, D Podoprikhin, T Garipov, D Vetrov, AG Wilson Uncertainty in Artificial Intelligence (UAI), 2018 | 2262 | 2018 |
| GPyTorch: Blackbox Matrix-Matrix Gaussian Process Inference with GPU Acceleration JR Gardner, G Pleiss, D Bindel, KQ Weinberger, AG Wilson Advances in Neural Information Processing Systems (NIPS), 2018 | 1872 | 2018 |
| BoTorch: A framework for efficient Monte-Carlo Bayesian optimization M Balandat, B Karrer, D Jiang, S Daulton, B Letham, AG Wilson, E Bakshy Advances in neural information processing systems 33, 21524-21538, 2020 | 1559 | 2020 |
| Deep kernel learning AG Wilson, Z Hu, R Salakhutdinov, EP Xing Artificial Intelligence and Statistics (AISTATS), 2016 | 1351 | 2016 |
| A simple baseline for Bayesian uncertainty in deep learning W Maddox, T Garipov, P Izmailov, D Vetrov, AG Wilson Advances in Neural Information Processing Systems (NeurIPS), 2019 | 1164 | 2019 |
| Bayesian deep learning and a probabilistic perspective of generalization AG Wilson, P Izmailov Advances in Neural Information Processing Systems (NeurIPS), 2020 | 1022 | 2020 |
| Loss surfaces, mode connectivity, and fast ensembling of DNNs T Garipov, P Izmailov, D Podoprikhin, DP Vetrov, AG Wilson Advances in Neural Information Processing Systems (NIPS), 2018 | 1016 | 2018 |
| Gaussian process kernels for pattern discovery and extrapolation AG Wilson, RP Adams Proceedings of the 30th International Conference on Machine Learning (ICML …, 2013 | 946 | 2013 |
| Large language models are zero-shot time series forecasters N Gruver, M Finzi, S Qiu, AG Wilson Advances in Neural Information Processing Systems 36, 19622-19635, 2023 | 833 | 2023 |
| Simple black-box adversarial attacks C Guo, JR Gardner, Y You, AG Wilson, KQ Weinberger International Conference on Machine Learning (ICML), 2019 | 830 | 2019 |
| Kernel interpolation for scalable structured Gaussian processes (KISS-GP) AG Wilson, H Nickisch Proceedings of the 32nd International Conference on Machine Learning (ICML …, 2015 | 740 | 2015 |
| Chronos: Learning the language of time series AF Ansari, L Stella, C Turkmen, X Zhang, P Mercado, H Shen, O Shchur, ... arXiv preprint arXiv:2403.07815, 2024 | 739 | 2024 |
| What Are Bayesian Neural Network Posteriors Really Like? P Izmailov, S Vikram, MD Hoffman, AG Wilson International Conference on Machine Learning, 2021 | 615 | 2021 |
| A Cookbook of Self-Supervised Learning R Balestriero, M Ibrahim, V Sobal, A Morcos, S Shekhar, T Goldstein, ... arXiv preprint arXiv:2304.12210, 2023 | 544 | 2023 |
| Last layer re-training is sufficient for robustness to spurious correlations P Kirichenko, P Izmailov, AG Wilson arXiv preprint arXiv:2204.02937, 2022 | 485 | 2022 |
| Generalizing convolutional neural networks for equivariance to lie groups on arbitrary continuous data M Finzi, S Stanton, P Izmailov, AG Wilson International Conference on Machine Learning (ICML), 2020 | 443 | 2020 |
| Why normalizing flows fail to detect out-of-distribution data P Kirichenko, P Izmailov, AG Wilson Advances in Neural Information Processing Systems (NeurIPS), 2020 | 403 | 2020 |
| Cyclical stochastic gradient MCMC for Bayesian deep learning R Zhang, C Li, J Zhang, C Chen, AG Wilson International Conference on Learning Representations (ICLR), 2019 | 390 | 2019 |
| Stochastic variational deep kernel learning AG Wilson, Z Hu, RR Salakhutdinov, EP Xing Advances in Neural Information Processing Systems (NIPS) 29, 2586-2594, 2016 | 381 | 2016 |
| Bayesian optimization with gradients J Wu, M Poloczek, AG Wilson, PI Frazier Advances in Neural Information Processing Systems (NIPS) 30, 2017 | 355 | 2017 |