| Tokenization as the initial phase in NLP JJ Webster, C Kit COLING 1992 volume 4: The 14th international conference on computational …, 1992 | 719 | 1992 |
| Short and sparse text topic modeling via self-aggregation X Quan, C Kit, Y Ge, SJ Pan 24th international joint conference on artificial intelligence, IJCAI 2015 …, 2015 | 286 | 2015 |
| Unsupervised segmentation helps supervised learning of character tagging for word segmentation and named entity recognition H Zhao, C Kit Proceedings of the Sixth SIGHAN Workshop on Chinese Language Processing, 2008 | 152 | 2008 |
| Extending machine translation evaluation metrics with lexical cohesion to document level BTM Wong, C Kit Proceedings of the 2012 joint conference on empirical methods in natural …, 2012 | 117 | 2012 |
| Measuring mono-word termhood by rank difference via corpus comparison C Kit, X Liu Terminology. International Journal of Theoretical and Applied Issues in …, 2008 | 105 | 2008 |
| Unsupervised learning of word boundary with description length gain C Kit, Y Wilks | 105 | 1999 |
| On methods of Chinese automatic word segmentation C Kit, Y Liu, N Liang Journal of Chinese Information Processing 3 (1), 1-32, 1989 | 102* | 1989 |
| Darwin series: Domain specific large language models for natural science T Xie, Y Wan, W Huang, Z Yin, Y Liu, S Wang, Q Linghu, C Kit, C Grazian, ... arXiv preprint arXiv:2308.13565, 2023 | 87 | 2023 |
| Comparative evaluation of online machine translation systems with legal texts C Kit, TM Wong Law Libr. J. 100, 299, 2008 | 75 | 2008 |
| Multilingual dependency learning: A huge feature engineering method to semantic dependency parsing H Zhao, W Chen, C Kit, G Zhou 13th Conference on Computational Natural Language Learning, CoNLL 2009, 55-60, 2009 | 72 | 2009 |
| Integrating unsupervised and supervised word segmentation: The role of goodness measures H Zhao, C Kit Information Sciences 181 (1), 163-183, 2011 | 71 | 2011 |
| An empirical comparison of goodness measures for unsupervised Chinese word segmentation with a unified framework H Zhao, C Kit 3rd International Joint Conference on Natural Language Processing (IJCNLP …, 2008 | 64 | 2008 |
| Chinese word segmentation as morpheme-based lexical chunking G Fu, C Kit, JJ Webster Information Sciences 178 (9), 2282-2296, 2008 | 60 | 2008 |
| Cross language dependency parsing using a bilingual lexicon H Zhao, Y Song, C Kit, G Zhou Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL …, 2009 | 59 | 2009 |
| Example-based machine translation: A new paradigm K Chunyu, P Haihua, JJ Webster Translation and information technology 57, 2002 | 54 | 2002 |
| Parsing syntactic and semantic dependencies with two single-stage maximum entropy models H Zhao, C Kit CoNLL 2008: Proceedings of the Twelfth Conference on Computational Natural …, 2008 | 51 | 2008 |
| A query-focused multi-document summarizer based on lexical chains J Li, L Sun, C Kit, J Webster Proc. of Document Understanding Conference, 2007 | 49 | 2007 |
| Unsupervised segmentation of Chinese corpus using accessor variety H Feng, K Chen, C Kit, X Deng International Conference on Natural Language Processing, 694-703, 2004 | 48 | 2004 |
| Semantic dependency parsing of NomBank and PropBank: An efficient integrated approach via a large-scale feature selection H Zhao, W Chen, C Kit Proceedings of the 2009 Conference on Empirical Methods in Natural Language …, 2009 | 42 | 2009 |
| Incorporating global information into supervised learning for Chinese word segmentation H Zhao, C Kit Proceedings of the 10th Conference of the Pacific Association for …, 2007 | 40 | 2007 |