The power and potential of deep learning models attract many researchers to design advanced and sophisticated architectures. Nevertheless, the reported progress is sometimes illusory, for various possible reasons. In this work, through an astonishing example, we argue that more effort should be made to verify the progress claimed when developing a new deep learning method. For XML-CNN, a highly influential multi-label text classification method, we show that the superior performance claimed in the original paper was mainly due to some unbelievable coincidences. We re-examine XML-CNN and provide a re-implementation that reveals findings contradicting the claims in the original paper. Our study suggests suitable baselines for multi-label text classification tasks and confirms that progress on a new architecture cannot be confidently justified without a cautious investigation.
CITATION STYLE
Chen, S. A., Liu, J. J., Yang, T. H., Lin, H. T., & Lin, C. J. (2022). Even the Simplest Baseline Needs Careful Re-investigation: A Case Study on XML-CNN. In NAACL 2022 - 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference (pp. 1987–2000). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2022.naacl-main.145