Automatic personality recognition from source code is a scarcely explored problem. We propose personality recognition with handcrafted features, based on lexical, syntactic and semantic properties of source code. Out of 35 proposed features, 22 features are completely novel. We also show that n-gram features are simple but surprisingly good predictors of personality and present results arising from joint usage of both handcrafted and baseline features. Additionally we compare our results with scores obtained within the Personality Recognition in SOurce COde track during Forum for Information Retrieval Evaluation 2016 and set up state-of-the-art results for conscientiousness and neuroticism traits.
CITATION STYLE
Biel, M., Kuta, M., & Kitowski, J. (2020). Personality recognition from source code based on lexical, syntactic and semantic features. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 12138 LNCS, pp. 351–363). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-030-50417-5_26
Mendeley helps you to discover research relevant for your work.