We quantified several elements that reflect the writing styles of scientific papers in four related disciplines: physics, astrophysics, mathematics, and computer science. Text descriptors that are not based on the words used in the papers, such as the use of punctuation characters, upper case letters, and quotations, were extracted from each document. Based on these features alone, an automatic classifier was able to identify the discipline of a paper with accuracy much higher than mere chance, showing that disciplines can be differentiated by their writing styles without directly using their content as reflected by the common words used in the papers. The study showed statistically significant differences between the disciplines in features such as the use of acronyms, sentence length, and word length, among others. Our findings also show changes in writing styles in specific disciplines over time. For instance, mathematicians and computer scientists began to use fewer acronyms starting in 2006, and there is a dramatic decrease in the average use of punctuation characters in mathematics papers. These observations suggest that even closely related disciplines differ in the scientific communication expressed through writing styles, demonstrating the existence of a “signature” writing style developed in each discipline. These findings should also be taken into account when a multidisciplinary group of collaborators assigns writing duties on a joint scientific manuscript.
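As a rough illustration of what content-independent text descriptors can look like, the following minimal Python sketch computes a handful of them for a single document. It is not the authors' feature set or implementation: the function name `style_features`, the choice of descriptors, the naive sentence splitter, and the rule that an acronym is any all-uppercase word of two or more letters are all assumptions made for this example.

```python
import re
import string


def style_features(text):
    """Compute a few word-independent style descriptors for one document.

    Hypothetical helper for illustration only; the descriptor definitions
    below are assumptions, not the ones used in the study.
    """
    # Alphabetic tokens only, so digits and symbols do not count as words.
    words = re.findall(r"[A-Za-z]+", text)
    # Naive sentence split on runs of terminal punctuation.
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]

    # Guard against division by zero on empty input.
    n_chars = max(len(text), 1)
    n_words = max(len(words), 1)
    n_sentences = max(len(sentences), 1)

    return {
        # Fraction of characters that are ASCII punctuation.
        "punct_per_char": sum(c in string.punctuation for c in text) / n_chars,
        # Fraction of characters that are upper case letters.
        "upper_per_char": sum(c.isupper() for c in text) / n_chars,
        # Fraction of characters that are straight or curly double quotes.
        "quotes_per_char": sum(c in '"“”' for c in text) / n_chars,
        # Assumed acronym rule: all-uppercase words with at least two letters.
        "acronyms_per_word": sum(len(w) >= 2 and w.isupper() for w in words) / n_words,
        # Average word length in letters.
        "mean_word_len": sum(len(w) for w in words) / n_words,
        # Average sentence length in words.
        "mean_sentence_words": len(words) / n_sentences,
    }


if __name__ == "__main__":
    sample = "We use SVM and CNN models. They work!"
    for name, value in style_features(sample).items():
        print(f"{name}: {value:.4f}")
```

Feature vectors of this kind, computed per paper and labeled by discipline, could then be passed to any standard classifier; which classifier and which descriptors work best would have to be checked against real data.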
CITATION STYLE
Alluqmani, A., & Shamir, L. (2018). Writing styles in different scientific disciplines: a data science approach. Scientometrics, 115(2), 1071–1085. https://doi.org/10.1007/s11192-018-2688-8