| Data cleaning: Overview and emerging challenges X Chu, IF Ilyas, S Krishnan, J Wang Proceedings of the 2016 international conference on management of data, 2201 …, 2016 | 910 | 2016 |
| Crowder: Crowdsourcing entity resolution J Wang, T Kraska, MJ Franklin, J Feng arXiv preprint arXiv:1208.1927, 2012 | 773 | 2012 |
| Crowdsourced data management: A survey G Li, J Wang, Y Zheng, MJ Franklin IEEE Transactions on Knowledge and Data Engineering 28 (9), 2296-2319, 2016 | 410 | 2016 |
| Activeclean: Interactive data cleaning for statistical modeling S Krishnan, J Wang, E Wu, MJ Franklin, K Goldberg Proceedings of the VLDB Endowment 9 (12), 948-959, 2016 | 379 | 2016 |
| Can we beat the prefix filtering? An adaptive framework for similarity join and search J Wang, G Li, J Feng Proceedings of the 2012 ACM SIGMOD international conference on management of …, 2012 | 304 | 2012 |
| Leveraging transitive relations for crowdsourced joins J Wang, G Li, T Kraska, MJ Franklin, J Feng Proceedings of the 2013 ACM SIGMOD International Conference on Management of …, 2013 | 272 | 2013 |
| Pass-join: A partition-based method for similarity joins G Li, D Deng, J Wang, J Feng arXiv preprint arXiv:1111.7171, 2011 | 254 | 2011 |
| QASCA: A quality-aware task assignment system for crowdsourcing applications Y Zheng, J Wang, G Li, R Cheng, J Feng Proceedings of the 2015 ACM SIGMOD international conference on management of …, 2015 | 242 | 2015 |
| Fast-join: An efficient method for fuzzy token matching based string similarity join J Wang, G Li, J Feng 2011 IEEE 27th International Conference on Data Engineering, 458-469, 2011 | 224 | 2011 |
| Trie-join: Efficient trie-based string similarity joins with edit-distance constraints J Wang, J Feng, G Li Proceedings of the VLDB Endowment 3 (1-2), 1219-1230, 2010 | 196 | 2010 |
| Entity matching: How similar is similar J Wang, G Li, JX Yu, J Feng Proceedings of the VLDB Endowment 4 (10), 622-633, 2011 | 194 | 2011 |
| Are we ready for learned cardinality estimation? X Wang, C Qu, W Wu, J Wang, Q Zhou arXiv preprint arXiv:2012.06743, 2020 | 179 | 2020 |
| A Sample-and-Clean Framework for Fast and Accurate Query Processing on Dirty Data J Wang, S Krishnan, MJ Franklin, K Goldberg, T Milo, T Kraska SIGMOD, 2014 | 172 | 2014 |
| Massjoin: A mapreduce-based method for scalable string similarity joins D Deng, G Li, S Hao, J Wang, J Feng 2014 IEEE 30th International Conference on Data Engineering, 340-351, 2014 | 161 | 2014 |
| Towards dependable data repairing with fixing rules J Wang, N Tang Proceedings of the 2014 ACM SIGMOD international conference on Management of …, 2014 | 160 | 2014 |
| Crowdsourced data management: Overview and challenges G Li, Y Zheng, J Fan, J Wang, R Cheng Proceedings of the 2017 ACM international conference on Management of Data …, 2017 | 132 | 2017 |
| Learning accurate kinematic control of cable-driven surgical robots using data cleaning and gaussian process regression J Mahler, S Krishnan, M Laskey, S Sen, A Murali, B Kehoe, S Patil, ... 2014 IEEE international conference on automation science and engineering …, 2014 | 105 | 2014 |
| Activeclean: An interactive data cleaning framework for modern machine learning S Krishnan, MJ Franklin, K Goldberg, J Wang, E Wu Proceedings of the 2016 international conference on management of data, 2117 …, 2016 | 102 | 2016 |
| Aqp++: Connecting approximate query processing with aggregate precomputation for interactive analytics J Peng, D Zhang, J Wang, J Pei Proceedings of the 2018 International Conference on Management of Data, 1477 …, 2018 | 90 | 2018 |
| Clamshell: Speeding up crowds for low-latency data labeling D Haas, J Wang, E Wu, MJ Franklin arXiv preprint arXiv:1509.05969, 2015 | 84 | 2015 |