| Measuring and characterizing generalization in deep reinforcement learning S Witty, JK Lee, E Tosch, A Atrey, K Clary, ML Littman, D Jensen Applied AI Letters 2 (4), e45, 2021 | 79 | 2021 |
| Let's play again: Variability of deep reinforcement learning agents in atari environments K Clary, E Tosch, J Foley, D Jensen arXiv preprint arXiv:1904.06312, 2019 | 42 | 2019 |
| Creating conversational characters using question generation tools X Yao, E Tosch, G Chen, E Nouri, R Artstein, A Leuski, K Sagae, D Traum Dialogue & Discourse 3 (2), 125-146, 2012 | 25 | 2012 |
| Planalyzer: Assessing threats to the validity of online experiments E Tosch, E Bakshy, ED Berger, DD Jensen, JEB Moss Proceedings of the ACM on Programming Languages 3 (OOPSLA), 1-30, 2019 | 12 | 2019 |
| Evaluating Conversational Characters Created through Question Generation. G Chen, E Tosch, R Artstein, A Leuski, DR Traum FLAIRS, 2011 | 11 | 2011 |
| Toybox: a suite of environments for experimental evaluation of deep reinforcement learning E Tosch, K Clary, J Foley, D Jensen arXiv preprint arXiv:1905.02825, 2019 | 9 | 2019 |
| SurveyMan: Programming and Automatically Debugging Surveys E Tosch, ED Berger Proceedings of the 2014 ACM International Conference on Object Oriented …, 2014 | 9 | 2014 |
| Compositional autoconstructive dynamics K Harrington, E Tosch, L Spector, J Pollack Proc. of the 8th Intl. Conf. on Complex Systems, 856-870, 2011 | 9 | 2011 |
| Toybox: Better atari environments for testing reinforcement learning agents J Foley, E Tosch, K Clary, D Jensen arXiv preprint arXiv:1812.02850, 2018 | 8 | 2018 |
| Privacy Policies on the Fediverse: A Case Study of Mastodon Instances E Tosch, L Garcia, C Li, C Martens Proceedings on Privacy Enhancing Technologies, 2024 | 5 | 2024 |
| Exploring Consequences of Privacy Policies with Narrative Generation via Answer Set Programming C Dabral, E Tosch, C Martens arXiv preprint arXiv:2212.06719, 2022 | 3 | 2022 |
| PlanAlyzer: assessing threats to the validity of online experiments E Tosch, E Bakshy, ED Berger, DD Jensen, JEB Moss Communications of the ACM 64 (9), 108-116, 2021 | 3 | 2021 |
| Stick It to The Man: Correcting for Non-Cooperative Behavior of Subjects in Experiments on Social Networks K Clary, E Tosch, J Onaolapo, DD Jensen USENIX Security, 2022 | 1 | 2022 |
| Achieving COSMOS: A metric for determining when to give up and when to reach for the stars E Tosch, L Spector Proceedings of the 14th annual conference companion on Genetic and …, 2012 | 1 | 2012 |
| Helical: A High Level Language Framework for Specifying Hypotheses and Experiments E Tosch, G Lincroft Proceedings of the 3rd ACM Conference on Reproducibility and Replicability …, 2025 | | 2025 |
| System Design for Digital Experimentation and Explanation Generation E Tosch | | 2020 |
| Generalization in Deep Reinforcement Learning S Witty, JK Lee, E Tosch, A Atrey, M Littman, D Jensen Critiquing and Correcting Trends in Machine Learning Workshop at NeurIPS, 2018 | | 2018 |
| Evaluating Conversational Characters Created through Question Generation. E Tosch, A Leuski, D Traum | | 2011 |
| DEMO Lab, Brandeis University, Waltham, MA pollack@ brandeis. edu K Harrington, E Tosch, L Spector | | |
| PLDI: G: Programming and Debugging Surveys E Tosch | | |